Matches in SemOpenAlex for { <https://semopenalex.org/work/W3039502308> ?p ?o ?g. }
- W3039502308 abstract "Attention is a powerful component of modern neural networks across a wide variety of domains. However, despite its ubiquity in machine learning, there is a gap in our understanding of attention from a theoretical point of view. We propose a framework to fill this gap by building a mathematically equivalent model of attention using measure theory. With this model, we are able to interpret self-attention as a system of self-interacting particles, we shed light on self-attention from a maximum entropy perspective, and we show that attention is actually Lipschitz-continuous (with an appropriate metric) under suitable assumptions. We then apply these insights to the problem of mis-specified input data; infinitely-deep, weight-sharing self-attention networks; and more general Lipschitz estimates for a specific type of attention studied in concurrent work." @default.
- W3039502308 created "2020-07-10" @default.
- W3039502308 creator A5059210995 @default.
- W3039502308 creator A5064841098 @default.
- W3039502308 creator A5080042640 @default.
- W3039502308 date "2020-07-06" @default.
- W3039502308 modified "2023-09-27" @default.
- W3039502308 title "A Mathematical Theory of Attention" @default.
- W3039502308 cites W1511986666 @default.
- W3039502308 cites W1793121960 @default.
- W3039502308 cites W1971713783 @default.
- W3039502308 cites W2011000015 @default.
- W3039502308 cites W2026653933 @default.
- W3039502308 cites W2028316825 @default.
- W3039502308 cites W2103496339 @default.
- W3039502308 cites W2105148255 @default.
- W3039502308 cites W2143722962 @default.
- W3039502308 cites W2211925278 @default.
- W3039502308 cites W2342070830 @default.
- W3039502308 cites W2766453196 @default.
- W3039502308 cites W2894384847 @default.
- W3039502308 cites W2946417913 @default.
- W3039502308 cites W2948981900 @default.
- W3039502308 cites W2950527759 @default.
- W3039502308 cites W2951815760 @default.
- W3039502308 cites W2962845550 @default.
- W3039502308 cites W2963341956 @default.
- W3039502308 cites W2963403868 @default.
- W3039502308 cites W2964308564 @default.
- W3039502308 cites W2972324944 @default.
- W3039502308 cites W2973525135 @default.
- W3039502308 cites W2988160852 @default.
- W3039502308 cites W3033357972 @default.
- W3039502308 cites W3034995113 @default.
- W3039502308 cites W3035314023 @default.
- W3039502308 cites W3036369012 @default.
- W3039502308 cites W3036495721 @default.
- W3039502308 cites W3037798801 @default.
- W3039502308 cites W3112479704 @default.
- W3039502308 cites W3146803896 @default.
- W3039502308 hasPublicationYear "2020" @default.
- W3039502308 type Work @default.
- W3039502308 sameAs 3039502308 @default.
- W3039502308 citedByCount "2" @default.
- W3039502308 countsByYear W30395023082020 @default.
- W3039502308 countsByYear W30395023082021 @default.
- W3039502308 crossrefType "posted-content" @default.
- W3039502308 hasAuthorship W3039502308A5059210995 @default.
- W3039502308 hasAuthorship W3039502308A5064841098 @default.
- W3039502308 hasAuthorship W3039502308A5080042640 @default.
- W3039502308 hasConcept C106301342 @default.
- W3039502308 hasConcept C121332964 @default.
- W3039502308 hasConcept C12713177 @default.
- W3039502308 hasConcept C136197465 @default.
- W3039502308 hasConcept C154945302 @default.
- W3039502308 hasConcept C162324750 @default.
- W3039502308 hasConcept C168167062 @default.
- W3039502308 hasConcept C176217482 @default.
- W3039502308 hasConcept C202444582 @default.
- W3039502308 hasConcept C21547014 @default.
- W3039502308 hasConcept C22324862 @default.
- W3039502308 hasConcept C2524010 @default.
- W3039502308 hasConcept C28719098 @default.
- W3039502308 hasConcept C33923547 @default.
- W3039502308 hasConcept C41008148 @default.
- W3039502308 hasConcept C62520636 @default.
- W3039502308 hasConcept C80444323 @default.
- W3039502308 hasConcept C97355855 @default.
- W3039502308 hasConceptScore W3039502308C106301342 @default.
- W3039502308 hasConceptScore W3039502308C121332964 @default.
- W3039502308 hasConceptScore W3039502308C12713177 @default.
- W3039502308 hasConceptScore W3039502308C136197465 @default.
- W3039502308 hasConceptScore W3039502308C154945302 @default.
- W3039502308 hasConceptScore W3039502308C162324750 @default.
- W3039502308 hasConceptScore W3039502308C168167062 @default.
- W3039502308 hasConceptScore W3039502308C176217482 @default.
- W3039502308 hasConceptScore W3039502308C202444582 @default.
- W3039502308 hasConceptScore W3039502308C21547014 @default.
- W3039502308 hasConceptScore W3039502308C22324862 @default.
- W3039502308 hasConceptScore W3039502308C2524010 @default.
- W3039502308 hasConceptScore W3039502308C28719098 @default.
- W3039502308 hasConceptScore W3039502308C33923547 @default.
- W3039502308 hasConceptScore W3039502308C41008148 @default.
- W3039502308 hasConceptScore W3039502308C62520636 @default.
- W3039502308 hasConceptScore W3039502308C80444323 @default.
- W3039502308 hasConceptScore W3039502308C97355855 @default.
- W3039502308 hasLocation W30395023081 @default.
- W3039502308 hasOpenAccess W3039502308 @default.
- W3039502308 hasPrimaryLocation W30395023081 @default.
- W3039502308 hasRelatedWork W1581068350 @default.
- W3039502308 hasRelatedWork W1599935199 @default.
- W3039502308 hasRelatedWork W1750840729 @default.
- W3039502308 hasRelatedWork W196636731 @default.
- W3039502308 hasRelatedWork W2072896212 @default.
- W3039502308 hasRelatedWork W2141109002 @default.
- W3039502308 hasRelatedWork W2251952542 @default.
- W3039502308 hasRelatedWork W2624997958 @default.
- W3039502308 hasRelatedWork W2766346350 @default.
- W3039502308 hasRelatedWork W2952892016 @default.
- W3039502308 hasRelatedWork W2970247852 @default.