Matches in SemOpenAlex for { <https://semopenalex.org/work/W2473934411> ?p ?o ?g. }
- W2473934411 abstract "Many sequential processing tasks require complex nonlinear transition functions from one step to the next. However, recurrent neural networks with 'deep' transition functions remain difficult to train, even when using Long Short-Term Memory (LSTM) networks. We introduce a novel theoretical analysis of recurrent networks based on Gersgorin's circle theorem that illuminates several modeling and optimization issues and improves our understanding of the LSTM cell. Based on this analysis we propose Recurrent Highway Networks, which extend the LSTM architecture to allow step-to-step transition depths larger than one. Several language modeling experiments demonstrate that the proposed architecture results in powerful and efficient models. On the Penn Treebank corpus, solely increasing the transition depth from 1 to 10 improves word-level perplexity from 90.6 to 65.4 using the same number of parameters. On the larger Wikipedia datasets for character prediction (text8 and enwik8), RHNs outperform all previous results and achieve an entropy of 1.27 bits per character." @default.
- W2473934411 created "2016-07-22" @default.
- W2473934411 creator A5044629673 @default.
- W2473934411 creator A5068199087 @default.
- W2473934411 creator A5072320253 @default.
- W2473934411 creator A5081580957 @default.
- W2473934411 date "2016-07-12" @default.
- W2473934411 modified "2023-09-23" @default.
- W2473934411 title "Recurrent Highway Networks" @default.
- W2473934411 cites W1509672901 @default.
- W2473934411 cites W1526990717 @default.
- W2473934411 cites W1632114991 @default.
- W2473934411 cites W1689711448 @default.
- W2473934411 cites W1771459135 @default.
- W2473934411 cites W179875071 @default.
- W2473934411 cites W1810943226 @default.
- W2473934411 cites W1971129545 @default.
- W2473934411 cites W1999965501 @default.
- W2473934411 cites W2018435387 @default.
- W2473934411 cites W2036317923 @default.
- W2473934411 cites W2046578104 @default.
- W2473934411 cites W2064675550 @default.
- W2473934411 cites W2076063813 @default.
- W2473934411 cites W2089947415 @default.
- W2473934411 cites W2108677974 @default.
- W2473934411 cites W2115121720 @default.
- W2473934411 cites W2136848157 @default.
- W2473934411 cites W2157331557 @default.
- W2473934411 cites W2194775991 @default.
- W2473934411 cites W2212703438 @default.
- W2473934411 cites W2259472270 @default.
- W2473934411 cites W2325237720 @default.
- W2473934411 cites W2493544825 @default.
- W2473934411 cites W2553303224 @default.
- W2473934411 cites W2613634265 @default.
- W2473934411 cites W2949242443 @default.
- W2473934411 cites W2950621961 @default.
- W2473934411 cites W2951559648 @default.
- W2473934411 cites W2953061907 @default.
- W2473934411 cites W2964084166 @default.
- W2473934411 cites W581956982 @default.
- W2473934411 hasPublicationYear "2016" @default.
- W2473934411 type Work @default.
- W2473934411 sameAs 2473934411 @default.
- W2473934411 citedByCount "73" @default.
- W2473934411 countsByYear W24739344112016 @default.
- W2473934411 countsByYear W24739344112017 @default.
- W2473934411 countsByYear W24739344112018 @default.
- W2473934411 countsByYear W24739344112019 @default.
- W2473934411 countsByYear W24739344112020 @default.
- W2473934411 countsByYear W24739344112021 @default.
- W2473934411 crossrefType "posted-content" @default.
- W2473934411 hasAuthorship W2473934411A5044629673 @default.
- W2473934411 hasAuthorship W2473934411A5068199087 @default.
- W2473934411 hasAuthorship W2473934411A5072320253 @default.
- W2473934411 hasAuthorship W2473934411A5081580957 @default.
- W2473934411 hasConcept C100279451 @default.
- W2473934411 hasConcept C104317684 @default.
- W2473934411 hasConcept C123657996 @default.
- W2473934411 hasConcept C137293760 @default.
- W2473934411 hasConcept C142362112 @default.
- W2473934411 hasConcept C147168706 @default.
- W2473934411 hasConcept C153349607 @default.
- W2473934411 hasConcept C154945302 @default.
- W2473934411 hasConcept C185592680 @default.
- W2473934411 hasConcept C186644900 @default.
- W2473934411 hasConcept C194232998 @default.
- W2473934411 hasConcept C204321447 @default.
- W2473934411 hasConcept C206134035 @default.
- W2473934411 hasConcept C2524010 @default.
- W2473934411 hasConcept C2780861071 @default.
- W2473934411 hasConcept C33923547 @default.
- W2473934411 hasConcept C41008148 @default.
- W2473934411 hasConcept C50644808 @default.
- W2473934411 hasConcept C55493867 @default.
- W2473934411 hasConcept C80444323 @default.
- W2473934411 hasConcept C90805587 @default.
- W2473934411 hasConceptScore W2473934411C100279451 @default.
- W2473934411 hasConceptScore W2473934411C104317684 @default.
- W2473934411 hasConceptScore W2473934411C123657996 @default.
- W2473934411 hasConceptScore W2473934411C137293760 @default.
- W2473934411 hasConceptScore W2473934411C142362112 @default.
- W2473934411 hasConceptScore W2473934411C147168706 @default.
- W2473934411 hasConceptScore W2473934411C153349607 @default.
- W2473934411 hasConceptScore W2473934411C154945302 @default.
- W2473934411 hasConceptScore W2473934411C185592680 @default.
- W2473934411 hasConceptScore W2473934411C186644900 @default.
- W2473934411 hasConceptScore W2473934411C194232998 @default.
- W2473934411 hasConceptScore W2473934411C204321447 @default.
- W2473934411 hasConceptScore W2473934411C206134035 @default.
- W2473934411 hasConceptScore W2473934411C2524010 @default.
- W2473934411 hasConceptScore W2473934411C2780861071 @default.
- W2473934411 hasConceptScore W2473934411C33923547 @default.
- W2473934411 hasConceptScore W2473934411C41008148 @default.
- W2473934411 hasConceptScore W2473934411C50644808 @default.
- W2473934411 hasConceptScore W2473934411C55493867 @default.
- W2473934411 hasConceptScore W2473934411C80444323 @default.
- W2473934411 hasConceptScore W2473934411C90805587 @default.
- W2473934411 hasOpenAccess W2473934411 @default.
- W2473934411 hasRelatedWork W1522301498 @default.