Matches in SemOpenAlex for { <https://semopenalex.org/work/W3209374680> ?p ?o ?g. }
- W3209374680 abstract "A central goal of sequence modeling is designing a single principled model that can address sequence data across a range of modalities and tasks, particularly on long-range dependencies. Although conventional models including RNNs, CNNs, and Transformers have specialized variants for capturing long dependencies, they still struggle to scale to very long sequences of $10000$ or more steps. A promising recent approach proposed modeling sequences by simulating the fundamental state space model (SSM) ( x'(t) = Ax(t) + Bu(t), y(t) = Cx(t) + Du(t) ), and showed that for appropriate choices of the state matrix ( A ), this system could handle long-range dependencies mathematically and empirically. However, this method has prohibitive computation and memory requirements, rendering it infeasible as a general sequence modeling solution. We propose the Structured State Space sequence model (S4) based on a new parameterization for the SSM, and show that it can be computed much more efficiently than prior approaches while preserving their theoretical strengths. Our technique involves conditioning ( A ) with a low-rank correction, allowing it to be diagonalized stably and reducing the SSM to the well-studied computation of a Cauchy kernel. S4 achieves strong empirical results across a diverse range of established benchmarks, including (i) 91% accuracy on sequential CIFAR-10 with no data augmentation or auxiliary losses, on par with a larger 2-D ResNet, (ii) substantially closing the gap to Transformers on image and language modeling tasks, while performing generation $60times$ faster (iii) SoTA on every task from the Long Range Arena benchmark, including solving the challenging Path-X task of length 16k that all prior work fails on, while being as efficient as all competitors." @default.
- W3209374680 created "2021-11-08" @default.
- W3209374680 creator A5025386668 @default.
- W3209374680 creator A5063897942 @default.
- W3209374680 creator A5070267604 @default.
- W3209374680 date "2021-10-30" @default.
- W3209374680 modified "2023-09-23" @default.
- W3209374680 title "Efficiently Modeling Long Sequences with Structured State Spaces" @default.
- W3209374680 cites W1489355995 @default.
- W3209374680 cites W1815076433 @default.
- W3209374680 cites W2064675550 @default.
- W3209374680 cites W2130809717 @default.
- W3209374680 cites W2792764867 @default.
- W3209374680 cites W2894295011 @default.
- W3209374680 cites W2908336025 @default.
- W3209374680 cites W2940744433 @default.
- W3209374680 cites W2949382160 @default.
- W3209374680 cites W2954108053 @default.
- W3209374680 cites W2962832549 @default.
- W3209374680 cites W2963016848 @default.
- W3209374680 cites W2963042606 @default.
- W3209374680 cites W2963212650 @default.
- W3209374680 cites W2963241221 @default.
- W3209374680 cites W2963403868 @default.
- W3209374680 cites W2963430354 @default.
- W3209374680 cites W2963605062 @default.
- W3209374680 cites W2963631907 @default.
- W3209374680 cites W2963910749 @default.
- W3209374680 cites W2963970792 @default.
- W3209374680 cites W2964115871 @default.
- W3209374680 cites W2964122153 @default.
- W3209374680 cites W2964347220 @default.
- W3209374680 cites W2964348070 @default.
- W3209374680 cites W2970783931 @default.
- W3209374680 cites W2971278153 @default.
- W3209374680 cites W2971306341 @default.
- W3209374680 cites W3034297959 @default.
- W3209374680 cites W3034573343 @default.
- W3209374680 cites W3035332875 @default.
- W3209374680 cites W3094502228 @default.
- W3209374680 cites W3096986660 @default.
- W3209374680 cites W3120383865 @default.
- W3209374680 cites W3121592593 @default.
- W3209374680 cites W3125056032 @default.
- W3209374680 cites W3128694157 @default.
- W3209374680 cites W3157506437 @default.
- W3209374680 cites W3167470318 @default.
- W3209374680 cites W3167805910 @default.
- W3209374680 cites W3099512283 @default.
- W3209374680 doi "https://doi.org/10.48550/arxiv.2111.00396" @default.
- W3209374680 hasPublicationYear "2021" @default.
- W3209374680 type Work @default.
- W3209374680 sameAs 3209374680 @default.
- W3209374680 citedByCount "0" @default.
- W3209374680 crossrefType "posted-content" @default.
- W3209374680 hasAuthorship W3209374680A5025386668 @default.
- W3209374680 hasAuthorship W3209374680A5063897942 @default.
- W3209374680 hasAuthorship W3209374680A5070267604 @default.
- W3209374680 hasBestOaLocation W32093746801 @default.
- W3209374680 hasConcept C11413529 @default.
- W3209374680 hasConcept C118615104 @default.
- W3209374680 hasConcept C154945302 @default.
- W3209374680 hasConcept C159985019 @default.
- W3209374680 hasConcept C192562407 @default.
- W3209374680 hasConcept C204323151 @default.
- W3209374680 hasConcept C205711294 @default.
- W3209374680 hasConcept C2778112365 @default.
- W3209374680 hasConcept C33923547 @default.
- W3209374680 hasConcept C41008148 @default.
- W3209374680 hasConcept C45374587 @default.
- W3209374680 hasConcept C54355233 @default.
- W3209374680 hasConcept C74193536 @default.
- W3209374680 hasConcept C80444323 @default.
- W3209374680 hasConcept C86803240 @default.
- W3209374680 hasConceptScore W3209374680C11413529 @default.
- W3209374680 hasConceptScore W3209374680C118615104 @default.
- W3209374680 hasConceptScore W3209374680C154945302 @default.
- W3209374680 hasConceptScore W3209374680C159985019 @default.
- W3209374680 hasConceptScore W3209374680C192562407 @default.
- W3209374680 hasConceptScore W3209374680C204323151 @default.
- W3209374680 hasConceptScore W3209374680C205711294 @default.
- W3209374680 hasConceptScore W3209374680C2778112365 @default.
- W3209374680 hasConceptScore W3209374680C33923547 @default.
- W3209374680 hasConceptScore W3209374680C41008148 @default.
- W3209374680 hasConceptScore W3209374680C45374587 @default.
- W3209374680 hasConceptScore W3209374680C54355233 @default.
- W3209374680 hasConceptScore W3209374680C74193536 @default.
- W3209374680 hasConceptScore W3209374680C80444323 @default.
- W3209374680 hasConceptScore W3209374680C86803240 @default.
- W3209374680 hasLocation W32093746801 @default.
- W3209374680 hasOpenAccess W3209374680 @default.
- W3209374680 hasPrimaryLocation W32093746801 @default.
- W3209374680 hasRelatedWork W1513831164 @default.
- W3209374680 hasRelatedWork W1534842550 @default.
- W3209374680 hasRelatedWork W1975616635 @default.
- W3209374680 hasRelatedWork W1988122740 @default.
- W3209374680 hasRelatedWork W2130391181 @default.
- W3209374680 hasRelatedWork W2158137312 @default.
- W3209374680 hasRelatedWork W2329972207 @default.
- W3209374680 hasRelatedWork W2354062721 @default.