Matches in SemOpenAlex for { <https://semopenalex.org/work/W2949203910> ?p ?o ?g. }
- W2949203910 abstract "The vanishing (and exploding) gradient effect is a common problem for recurrent neural networks with nonlinear activation functions that use backpropagation to calculate derivatives. Deep feedforward neural networks with many hidden layers also suffer from this effect. In this paper we propose a novel universal technique that keeps the norm of the gradient in a suitable range. We construct a way to estimate the contribution of each training example to the norm of the long-term components of the target function's gradient. Using this subroutine we can construct mini-batches for stochastic gradient descent (SGD) training that lead to high performance and accuracy of the trained network even for very complex tasks. We provide a straightforward mathematical estimation of a mini-batch's impact on the gradient norm and prove its correctness theoretically. To check our framework experimentally we use special synthetic benchmarks that test the ability of RNNs to capture long-term dependencies. Our network can detect links between events in a (temporal) sequence at a range of approximately 100 and longer." @default.
- W2949203910 created "2019-06-27" @default.
- W2949203910 creator A5006823112 @default.
- W2949203910 creator A5039085915 @default.
- W2949203910 date "2016-06-24" @default.
- W2949203910 modified "2023-09-23" @default.
- W2949203910 title "Sampling-based Gradient Regularization for Capturing Long-Term Dependencies in Recurrent Neural Networks" @default.
- W2949203910 cites W1408639475 @default.
- W2949203910 cites W1492673282 @default.
- W2949203910 cites W1591801644 @default.
- W2949203910 cites W1604264128 @default.
- W2949203910 cites W179875071 @default.
- W2949203910 cites W1815076433 @default.
- W2949203910 cites W1905882502 @default.
- W2949203910 cites W2016931393 @default.
- W2949203910 cites W2064675550 @default.
- W2949203910 cites W2107878631 @default.
- W2949203910 cites W2108563286 @default.
- W2949203910 cites W2143612262 @default.
- W2949203910 cites W2171800554 @default.
- W2949203910 cites W2172140247 @default.
- W2949203910 cites W2252143850 @default.
- W2949203910 cites W2557283755 @default.
- W2949203910 cites W2941943687 @default.
- W2949203910 cites W581956982 @default.
- W2949203910 cites W194249466 @default.
- W2949203910 cites W3020834653 @default.
- W2949203910 hasPublicationYear "2016" @default.
- W2949203910 type Work @default.
- W2949203910 sameAs 2949203910 @default.
- W2949203910 citedByCount "1" @default.
- W2949203910 countsByYear W29492039102018 @default.
- W2949203910 crossrefType "posted-content" @default.
- W2949203910 hasAuthorship W2949203910A5006823112 @default.
- W2949203910 hasAuthorship W2949203910A5039085915 @default.
- W2949203910 hasConcept C111919701 @default.
- W2949203910 hasConcept C11413529 @default.
- W2949203910 hasConcept C115680565 @default.
- W2949203910 hasConcept C121332964 @default.
- W2949203910 hasConcept C127413603 @default.
- W2949203910 hasConcept C133731056 @default.
- W2949203910 hasConcept C147168706 @default.
- W2949203910 hasConcept C153258448 @default.
- W2949203910 hasConcept C154945302 @default.
- W2949203910 hasConcept C155032097 @default.
- W2949203910 hasConcept C158622935 @default.
- W2949203910 hasConcept C159985019 @default.
- W2949203910 hasConcept C17744445 @default.
- W2949203910 hasConcept C191795146 @default.
- W2949203910 hasConcept C192562407 @default.
- W2949203910 hasConcept C199360897 @default.
- W2949203910 hasConcept C199539241 @default.
- W2949203910 hasConcept C204323151 @default.
- W2949203910 hasConcept C206688291 @default.
- W2949203910 hasConcept C2776135515 @default.
- W2949203910 hasConcept C2780801425 @default.
- W2949203910 hasConcept C38365724 @default.
- W2949203910 hasConcept C38858127 @default.
- W2949203910 hasConcept C41008148 @default.
- W2949203910 hasConcept C50644808 @default.
- W2949203910 hasConcept C55439883 @default.
- W2949203910 hasConcept C61797465 @default.
- W2949203910 hasConcept C62520636 @default.
- W2949203910 hasConcept C96147967 @default.
- W2949203910 hasConceptScore W2949203910C111919701 @default.
- W2949203910 hasConceptScore W2949203910C11413529 @default.
- W2949203910 hasConceptScore W2949203910C115680565 @default.
- W2949203910 hasConceptScore W2949203910C121332964 @default.
- W2949203910 hasConceptScore W2949203910C127413603 @default.
- W2949203910 hasConceptScore W2949203910C133731056 @default.
- W2949203910 hasConceptScore W2949203910C147168706 @default.
- W2949203910 hasConceptScore W2949203910C153258448 @default.
- W2949203910 hasConceptScore W2949203910C154945302 @default.
- W2949203910 hasConceptScore W2949203910C155032097 @default.
- W2949203910 hasConceptScore W2949203910C158622935 @default.
- W2949203910 hasConceptScore W2949203910C159985019 @default.
- W2949203910 hasConceptScore W2949203910C17744445 @default.
- W2949203910 hasConceptScore W2949203910C191795146 @default.
- W2949203910 hasConceptScore W2949203910C192562407 @default.
- W2949203910 hasConceptScore W2949203910C199360897 @default.
- W2949203910 hasConceptScore W2949203910C199539241 @default.
- W2949203910 hasConceptScore W2949203910C204323151 @default.
- W2949203910 hasConceptScore W2949203910C206688291 @default.
- W2949203910 hasConceptScore W2949203910C2776135515 @default.
- W2949203910 hasConceptScore W2949203910C2780801425 @default.
- W2949203910 hasConceptScore W2949203910C38365724 @default.
- W2949203910 hasConceptScore W2949203910C38858127 @default.
- W2949203910 hasConceptScore W2949203910C41008148 @default.
- W2949203910 hasConceptScore W2949203910C50644808 @default.
- W2949203910 hasConceptScore W2949203910C55439883 @default.
- W2949203910 hasConceptScore W2949203910C61797465 @default.
- W2949203910 hasConceptScore W2949203910C62520636 @default.
- W2949203910 hasConceptScore W2949203910C96147967 @default.
- W2949203910 hasOpenAccess W2949203910 @default.
- W2949203910 hasRelatedWork W1839868949 @default.
- W2949203910 hasRelatedWork W2557389081 @default.
- W2949203910 hasRelatedWork W2767206870 @default.
- W2949203910 hasRelatedWork W2787442904 @default.
- W2949203910 hasRelatedWork W2898324627 @default.
- W2949203910 hasRelatedWork W2899748887 @default.