Matches in SemOpenAlex for { <https://semopenalex.org/work/W3128847475> ?p ?o ?g. }
- W3128847475 abstract "Attention is a powerful component of modern neural networks across a wide variety of domains. In this paper, we seek to quantify the regularity (i.e. the amount of smoothness) of the attention operation. To accomplish this goal, we propose a new mathematical framework that uses measure theory and integral operators to model attention. We show that this framework is consistent with the usual definition, and that it captures the essential properties of attention. Then we use this framework to prove that, on compact domains, the attention operation is Lipschitz continuous and provide an estimate of its Lipschitz constant. Additionally, by focusing on a specific type of attention, we extend these Lipschitz continuity results to non-compact domains. We also discuss the effects regularity can have on NLP models, and applications to invertible and infinitely-deep networks." @default.
- W3128847475 created "2021-02-15" @default.
- W3128847475 creator A5059210995 @default.
- W3128847475 creator A5064841098 @default.
- W3128847475 creator A5080042640 @default.
- W3128847475 date "2021-02-10" @default.
- W3128847475 modified "2023-10-01" @default.
- W3128847475 title "On the Regularity of Attention." @default.
- W3128847475 cites W1595781024 @default.
- W3128847475 cites W1676820704 @default.
- W3128847475 cites W1793121960 @default.
- W3128847475 cites W2130942839 @default.
- W3128847475 cites W2211925278 @default.
- W3128847475 cites W2342070830 @default.
- W3128847475 cites W2766453196 @default.
- W3128847475 cites W2894384847 @default.
- W3128847475 cites W2948981900 @default.
- W3128847475 cites W2950527759 @default.
- W3128847475 cites W2951815760 @default.
- W3128847475 cites W2963341956 @default.
- W3128847475 cites W2963403868 @default.
- W3128847475 cites W2963540976 @default.
- W3128847475 cites W2964282829 @default.
- W3128847475 cites W2964308564 @default.
- W3128847475 cites W2970900903 @default.
- W3128847475 cites W2973525135 @default.
- W3128847475 cites W2988160852 @default.
- W3128847475 cites W3006096721 @default.
- W3128847475 cites W3033357972 @default.
- W3128847475 cites W3034995113 @default.
- W3128847475 cites W3035314023 @default.
- W3128847475 cites W3036369012 @default.
- W3128847475 cites W3036495721 @default.
- W3128847475 cites W3037798801 @default.
- W3128847475 cites W3112479704 @default.
- W3128847475 cites W391985582 @default.
- W3128847475 hasPublicationYear "2021" @default.
- W3128847475 type Work @default.
- W3128847475 sameAs 3128847475 @default.
- W3128847475 citedByCount "2" @default.
- W3128847475 countsByYear W31288474752021 @default.
- W3128847475 countsByYear W31288474752022 @default.
- W3128847475 crossrefType "posted-content" @default.
- W3128847475 hasAuthorship W3128847475A5059210995 @default.
- W3128847475 hasAuthorship W3128847475A5064841098 @default.
- W3128847475 hasAuthorship W3128847475A5080042640 @default.
- W3128847475 hasConcept C102634674 @default.
- W3128847475 hasConcept C121332964 @default.
- W3128847475 hasConcept C124101348 @default.
- W3128847475 hasConcept C134306372 @default.
- W3128847475 hasConcept C136197465 @default.
- W3128847475 hasConcept C154945302 @default.
- W3128847475 hasConcept C168167062 @default.
- W3128847475 hasConcept C18903297 @default.
- W3128847475 hasConcept C199360897 @default.
- W3128847475 hasConcept C202444582 @default.
- W3128847475 hasConcept C22324862 @default.
- W3128847475 hasConcept C2777027219 @default.
- W3128847475 hasConcept C2777299769 @default.
- W3128847475 hasConcept C2780009758 @default.
- W3128847475 hasConcept C33923547 @default.
- W3128847475 hasConcept C41008148 @default.
- W3128847475 hasConcept C80444323 @default.
- W3128847475 hasConcept C86803240 @default.
- W3128847475 hasConcept C96442724 @default.
- W3128847475 hasConcept C97355855 @default.
- W3128847475 hasConceptScore W3128847475C102634674 @default.
- W3128847475 hasConceptScore W3128847475C121332964 @default.
- W3128847475 hasConceptScore W3128847475C124101348 @default.
- W3128847475 hasConceptScore W3128847475C134306372 @default.
- W3128847475 hasConceptScore W3128847475C136197465 @default.
- W3128847475 hasConceptScore W3128847475C154945302 @default.
- W3128847475 hasConceptScore W3128847475C168167062 @default.
- W3128847475 hasConceptScore W3128847475C18903297 @default.
- W3128847475 hasConceptScore W3128847475C199360897 @default.
- W3128847475 hasConceptScore W3128847475C202444582 @default.
- W3128847475 hasConceptScore W3128847475C22324862 @default.
- W3128847475 hasConceptScore W3128847475C2777027219 @default.
- W3128847475 hasConceptScore W3128847475C2777299769 @default.
- W3128847475 hasConceptScore W3128847475C2780009758 @default.
- W3128847475 hasConceptScore W3128847475C33923547 @default.
- W3128847475 hasConceptScore W3128847475C41008148 @default.
- W3128847475 hasConceptScore W3128847475C80444323 @default.
- W3128847475 hasConceptScore W3128847475C86803240 @default.
- W3128847475 hasConceptScore W3128847475C96442724 @default.
- W3128847475 hasConceptScore W3128847475C97355855 @default.
- W3128847475 hasLocation W31288474751 @default.
- W3128847475 hasOpenAccess W3128847475 @default.
- W3128847475 hasPrimaryLocation W31288474751 @default.
- W3128847475 hasRelatedWork W118607236 @default.
- W3128847475 hasRelatedWork W1671910847 @default.
- W3128847475 hasRelatedWork W1786935204 @default.
- W3128847475 hasRelatedWork W2026749963 @default.
- W3128847475 hasRelatedWork W2162216049 @default.
- W3128847475 hasRelatedWork W2334720777 @default.
- W3128847475 hasRelatedWork W2567717468 @default.
- W3128847475 hasRelatedWork W2769660485 @default.
- W3128847475 hasRelatedWork W2889596539 @default.
- W3128847475 hasRelatedWork W2951406160 @default.
- W3128847475 hasRelatedWork W3006052099 @default.