Matches in SemOpenAlex for { <https://semopenalex.org/work/W2955227499> ?p ?o ?g. }
- W2955227499 abstract "Transformer networks have lead to important progress in language modeling and machine translation. These models include two consecutive modules, a feed-forward layer and a self-attention layer. The latter allows the network to capture long term dependencies and are often regarded as the key ingredient in the success of Transformers. Building upon this intuition, we propose a new model that solely consists of attention layers. More precisely, we augment the self-attention layers with persistent memory vectors that play a similar role as the feed-forward layer. Thanks to these vectors, we can remove the feed-forward layer without degrading the performance of a transformer. Our evaluation shows the benefits brought by our model on standard character and word level language modeling benchmarks." @default.
- W2955227499 created "2019-07-12" @default.
- W2955227499 creator A5033867133 @default.
- W2955227499 creator A5041693326 @default.
- W2955227499 creator A5054371148 @default.
- W2955227499 creator A5060255128 @default.
- W2955227499 creator A5069316249 @default.
- W2955227499 date "2019-07-02" @default.
- W2955227499 modified "2023-10-01" @default.
- W2955227499 title "Augmenting Self-attention with Persistent Memory" @default.
- W2955227499 cites W1514535095 @default.
- W2955227499 cites W1558797106 @default.
- W2955227499 cites W1591801644 @default.
- W2955227499 cites W1793121960 @default.
- W2955227499 cites W179875071 @default.
- W2955227499 cites W1815076433 @default.
- W2955227499 cites W2025653905 @default.
- W2955227499 cites W2064675550 @default.
- W2955227499 cites W2095705004 @default.
- W2955227499 cites W2111305191 @default.
- W2955227499 cites W2132339004 @default.
- W2955227499 cites W2146502635 @default.
- W2955227499 cites W2194775991 @default.
- W2955227499 cites W2259472270 @default.
- W2955227499 cites W2571859396 @default.
- W2955227499 cites W2626778328 @default.
- W2955227499 cites W2792376130 @default.
- W2955227499 cites W2908336025 @default.
- W2955227499 cites W2911109671 @default.
- W2955227499 cites W2940744433 @default.
- W2955227499 cites W2946567085 @default.
- W2955227499 cites W2950527759 @default.
- W2955227499 cites W2962784628 @default.
- W2955227499 cites W2962964385 @default.
- W2955227499 cites W2963034893 @default.
- W2955227499 cites W2963088785 @default.
- W2955227499 cites W2963341956 @default.
- W2955227499 cites W2963347649 @default.
- W2955227499 cites W2963430354 @default.
- W2955227499 cites W2963448850 @default.
- W2955227499 cites W2963494889 @default.
- W2955227499 cites W2963631907 @default.
- W2955227499 cites W2963735467 @default.
- W2955227499 cites W2963925437 @default.
- W2955227499 cites W2963970792 @default.
- W2955227499 cites W2963983719 @default.
- W2955227499 cites W2964019776 @default.
- W2955227499 cites W2964269252 @default.
- W2955227499 cites W2964308564 @default.
- W2955227499 cites W36903255 @default.
- W2955227499 hasPublicationYear "2019" @default.
- W2955227499 type Work @default.
- W2955227499 sameAs 2955227499 @default.
- W2955227499 citedByCount "38" @default.
- W2955227499 countsByYear W29552274992019 @default.
- W2955227499 countsByYear W29552274992020 @default.
- W2955227499 countsByYear W29552274992021 @default.
- W2955227499 crossrefType "posted-content" @default.
- W2955227499 hasAuthorship W2955227499A5033867133 @default.
- W2955227499 hasAuthorship W2955227499A5041693326 @default.
- W2955227499 hasAuthorship W2955227499A5054371148 @default.
- W2955227499 hasAuthorship W2955227499A5060255128 @default.
- W2955227499 hasAuthorship W2955227499A5069316249 @default.
- W2955227499 hasConcept C113775141 @default.
- W2955227499 hasConcept C119599485 @default.
- W2955227499 hasConcept C119857082 @default.
- W2955227499 hasConcept C127413603 @default.
- W2955227499 hasConcept C132010649 @default.
- W2955227499 hasConcept C137293760 @default.
- W2955227499 hasConcept C154945302 @default.
- W2955227499 hasConcept C15744967 @default.
- W2955227499 hasConcept C165801399 @default.
- W2955227499 hasConcept C188147891 @default.
- W2955227499 hasConcept C41008148 @default.
- W2955227499 hasConcept C66322947 @default.
- W2955227499 hasConceptScore W2955227499C113775141 @default.
- W2955227499 hasConceptScore W2955227499C119599485 @default.
- W2955227499 hasConceptScore W2955227499C119857082 @default.
- W2955227499 hasConceptScore W2955227499C127413603 @default.
- W2955227499 hasConceptScore W2955227499C132010649 @default.
- W2955227499 hasConceptScore W2955227499C137293760 @default.
- W2955227499 hasConceptScore W2955227499C154945302 @default.
- W2955227499 hasConceptScore W2955227499C15744967 @default.
- W2955227499 hasConceptScore W2955227499C165801399 @default.
- W2955227499 hasConceptScore W2955227499C188147891 @default.
- W2955227499 hasConceptScore W2955227499C41008148 @default.
- W2955227499 hasConceptScore W2955227499C66322947 @default.
- W2955227499 hasLocation W29552274991 @default.
- W2955227499 hasOpenAccess W2955227499 @default.
- W2955227499 hasPrimaryLocation W29552274991 @default.
- W2955227499 hasRelatedWork W2101105183 @default.
- W2955227499 hasRelatedWork W2194775991 @default.
- W2955227499 hasRelatedWork W2613718673 @default.
- W2955227499 hasRelatedWork W2908336025 @default.
- W2955227499 hasRelatedWork W2940744433 @default.
- W2955227499 hasRelatedWork W2946567085 @default.
- W2955227499 hasRelatedWork W2963341956 @default.
- W2955227499 hasRelatedWork W2963403868 @default.
- W2955227499 hasRelatedWork W2963631907 @default.
- W2955227499 hasRelatedWork W2963925437 @default.