Matches in SemOpenAlex for { <https://semopenalex.org/work/W4312658081> ?p ?o ?g. }
- W4312658081 abstract "Video understanding requires reasoning at multiple spatiotemporal resolutions – from short fine-grained motions to events taking place over longer durations. Although transformer architectures have recently advanced the state-of-the-art, they have not explicitly modelled different spatiotemporal resolutions. To this end, we present Multiview Transformers for Video Recognition (MTV). Our model consists of separate encoders to represent different views of the input video with lateral connections to fuse information across views. We present thorough ablation studies of our model and show that MTV consistently performs better than single-view counterparts in terms of accuracy and computational cost across a range of model sizes. Furthermore, we achieve state-of-the-art results on six standard datasets, and improve even further with large-scale pretraining. Code and checkpoints are available at: https://github.com/google-research/scenic." @default.
- W4312658081 created "2023-01-05" @default.
- W4312658081 creator A5013074509 @default.
- W4312658081 creator A5027687139 @default.
- W4312658081 creator A5035108037 @default.
- W4312658081 creator A5036897639 @default.
- W4312658081 creator A5045217258 @default.
- W4312658081 creator A5060145891 @default.
- W4312658081 creator A5088146048 @default.
- W4312658081 date "2022-06-01" @default.
- W4312658081 modified "2023-10-12" @default.
- W4312658081 title "Multiview Transformers for Video Recognition" @default.
- W4312658081 cites W1522734439 @default.
- W4312658081 cites W2016053056 @default.
- W4312658081 cites W2024868105 @default.
- W4312658081 cites W2068611653 @default.
- W4312658081 cites W2097117768 @default.
- W4312658081 cites W2108598243 @default.
- W4312658081 cites W2119799051 @default.
- W4312658081 cites W2147800946 @default.
- W4312658081 cites W2151103935 @default.
- W4312658081 cites W2161969291 @default.
- W4312658081 cites W2162915993 @default.
- W4312658081 cites W2183341477 @default.
- W4312658081 cites W2194775991 @default.
- W4312658081 cites W2331143823 @default.
- W4312658081 cites W2531409750 @default.
- W4312658081 cites W2560023338 @default.
- W4312658081 cites W2565639579 @default.
- W4312658081 cites W2618530766 @default.
- W4312658081 cites W2625366777 @default.
- W4312658081 cites W2948048211 @default.
- W4312658081 cites W2955874753 @default.
- W4312658081 cites W2962711930 @default.
- W4312658081 cites W2962843773 @default.
- W4312658081 cites W2963091558 @default.
- W4312658081 cites W2963155035 @default.
- W4312658081 cites W2963524571 @default.
- W4312658081 cites W2964191259 @default.
- W4312658081 cites W2980037812 @default.
- W4312658081 cites W2984008963 @default.
- W4312658081 cites W2984287396 @default.
- W4312658081 cites W2990152177 @default.
- W4312658081 cites W2990503944 @default.
- W4312658081 cites W3034572008 @default.
- W4312658081 cites W3035104321 @default.
- W4312658081 cites W3035303837 @default.
- W4312658081 cites W3035682985 @default.
- W4312658081 cites W3108241103 @default.
- W4312658081 cites W3109304426 @default.
- W4312658081 cites W3131500599 @default.
- W4312658081 cites W3138516171 @default.
- W4312658081 cites W3151130473 @default.
- W4312658081 cites W3173621652 @default.
- W4312658081 cites W3207758636 @default.
- W4312658081 cites W4214482673 @default.
- W4312658081 cites W4214516465 @default.
- W4312658081 cites W4214612132 @default.
- W4312658081 cites W4214614183 @default.
- W4312658081 cites W4231697575 @default.
- W4312658081 cites W4237434670 @default.
- W4312658081 doi "https://doi.org/10.1109/cvpr52688.2022.00333" @default.
- W4312658081 hasPublicationYear "2022" @default.
- W4312658081 type Work @default.
- W4312658081 citedByCount "57" @default.
- W4312658081 countsByYear W43126580812022 @default.
- W4312658081 countsByYear W43126580812023 @default.
- W4312658081 crossrefType "proceedings-article" @default.
- W4312658081 hasAuthorship W4312658081A5013074509 @default.
- W4312658081 hasAuthorship W4312658081A5027687139 @default.
- W4312658081 hasAuthorship W4312658081A5035108037 @default.
- W4312658081 hasAuthorship W4312658081A5036897639 @default.
- W4312658081 hasAuthorship W4312658081A5045217258 @default.
- W4312658081 hasAuthorship W4312658081A5060145891 @default.
- W4312658081 hasAuthorship W4312658081A5088146048 @default.
- W4312658081 hasBestOaLocation W43126580812 @default.
- W4312658081 hasConcept C111919701 @default.
- W4312658081 hasConcept C118505674 @default.
- W4312658081 hasConcept C119599485 @default.
- W4312658081 hasConcept C127413603 @default.
- W4312658081 hasConcept C141353440 @default.
- W4312658081 hasConcept C154945302 @default.
- W4312658081 hasConcept C165801399 @default.
- W4312658081 hasConcept C31972630 @default.
- W4312658081 hasConcept C41008148 @default.
- W4312658081 hasConcept C66322947 @default.
- W4312658081 hasConceptScore W4312658081C111919701 @default.
- W4312658081 hasConceptScore W4312658081C118505674 @default.
- W4312658081 hasConceptScore W4312658081C119599485 @default.
- W4312658081 hasConceptScore W4312658081C127413603 @default.
- W4312658081 hasConceptScore W4312658081C141353440 @default.
- W4312658081 hasConceptScore W4312658081C154945302 @default.
- W4312658081 hasConceptScore W4312658081C165801399 @default.
- W4312658081 hasConceptScore W4312658081C31972630 @default.
- W4312658081 hasConceptScore W4312658081C41008148 @default.
- W4312658081 hasConceptScore W4312658081C66322947 @default.
- W4312658081 hasLocation W43126580811 @default.
- W4312658081 hasLocation W43126580812 @default.
- W4312658081 hasOpenAccess W4312658081 @default.
- W4312658081 hasPrimaryLocation W43126580811 @default.