Matches in SemOpenAlex for { <https://semopenalex.org/work/W3197364345> ?p ?o ?g. }
- W3197364345 endingPage "1379" @default.
- W3197364345 startingPage "1367" @default.
- W3197364345 abstract "Spatiotemporal attention learning for video question answering (VideoQA) has always been a challenging task, where existing approaches treat the attention parts and the nonattention parts in isolation. In this work, we propose to enforce the correlation between the attention parts and the nonattention parts as a distance constraint for discriminative spatiotemporal attention learning. Specifically, we first introduce a novel attention-guided erasing mechanism in the traditional spatiotemporal attention to obtain multiple aggregated attention features and nonattention features and then learn to separate the attention and the nonattention features with an appropriate distance. The distance constraint is enforced by a metric learning loss, without increasing the inference complexity. In this way, the model can learn to produce more discriminative spatiotemporal attention distribution on videos, thus enabling more accurate question answering. In order to incorporate the multiscale spatiotemporal information that is beneficial for video understanding, we additionally develop a pyramid variant on basis of the proposed approach. Comprehensive ablation experiments are conducted to validate the effectiveness of our approach, and state-of-the-art performance is achieved on several widely used datasets for VideoQA." @default.
- W3197364345 created "2021-09-13" @default.
- W3197364345 creator A5035282947 @default.
- W3197364345 creator A5048279362 @default.
- W3197364345 creator A5051332325 @default.
- W3197364345 creator A5056659804 @default.
- W3197364345 date "2023-03-01" @default.
- W3197364345 modified "2023-10-13" @default.
- W3197364345 title "Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering" @default.
- W3197364345 cites W102708294 @default.
- W3197364345 cites W1927052826 @default.
- W3197364345 cites W1933349210 @default.
- W3197364345 cites W1950117310 @default.
- W3197364345 cites W1975517671 @default.
- W3197364345 cites W2064675550 @default.
- W3197364345 cites W2083897630 @default.
- W3197364345 cites W2108598243 @default.
- W3197364345 cites W2127589108 @default.
- W3197364345 cites W2145287260 @default.
- W3197364345 cites W2157331557 @default.
- W3197364345 cites W2194775991 @default.
- W3197364345 cites W2250539671 @default.
- W3197364345 cites W2425121537 @default.
- W3197364345 cites W2560730294 @default.
- W3197364345 cites W2561529111 @default.
- W3197364345 cites W2565656701 @default.
- W3197364345 cites W2582558662 @default.
- W3197364345 cites W2597425697 @default.
- W3197364345 cites W2600144439 @default.
- W3197364345 cites W2606982687 @default.
- W3197364345 cites W2607037079 @default.
- W3197364345 cites W2737435850 @default.
- W3197364345 cites W2741903908 @default.
- W3197364345 cites W2745461083 @default.
- W3197364345 cites W2765716052 @default.
- W3197364345 cites W2798590501 @default.
- W3197364345 cites W2798786641 @default.
- W3197364345 cites W2808124938 @default.
- W3197364345 cites W2832876791 @default.
- W3197364345 cites W2904291752 @default.
- W3197364345 cites W2904452845 @default.
- W3197364345 cites W2952524542 @default.
- W3197364345 cites W2952620298 @default.
- W3197364345 cites W2954199749 @default.
- W3197364345 cites W2962798895 @default.
- W3197364345 cites W2962949233 @default.
- W3197364345 cites W2963150162 @default.
- W3197364345 cites W2963176022 @default.
- W3197364345 cites W2963383024 @default.
- W3197364345 cites W2963717374 @default.
- W3197364345 cites W2963954913 @default.
- W3197364345 cites W2964067226 @default.
- W3197364345 cites W2964220823 @default.
- W3197364345 cites W2964274719 @default.
- W3197364345 cites W2964303913 @default.
- W3197364345 cites W2964760297 @default.
- W3197364345 cites W2966683369 @default.
- W3197364345 cites W2976187852 @default.
- W3197364345 cites W2981582341 @default.
- W3197364345 cites W2981963155 @default.
- W3197364345 cites W2981985547 @default.
- W3197364345 cites W2997344006 @default.
- W3197364345 cites W2997805943 @default.
- W3197364345 cites W2998166190 @default.
- W3197364345 cites W3003991937 @default.
- W3197364345 cites W3034730770 @default.
- W3197364345 cites W3099206234 @default.
- W3197364345 cites W3105204788 @default.
- W3197364345 cites W3161559655 @default.
- W3197364345 doi "https://doi.org/10.1109/tnnls.2021.3105280" @default.
- W3197364345 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/34464265" @default.
- W3197364345 hasPublicationYear "2023" @default.
- W3197364345 type Work @default.
- W3197364345 sameAs 3197364345 @default.
- W3197364345 citedByCount "2" @default.
- W3197364345 countsByYear W31973643452022 @default.
- W3197364345 countsByYear W31973643452023 @default.
- W3197364345 crossrefType "journal-article" @default.
- W3197364345 hasAuthorship W3197364345A5035282947 @default.
- W3197364345 hasAuthorship W3197364345A5048279362 @default.
- W3197364345 hasAuthorship W3197364345A5051332325 @default.
- W3197364345 hasAuthorship W3197364345A5056659804 @default.
- W3197364345 hasConcept C119857082 @default.
- W3197364345 hasConcept C120665830 @default.
- W3197364345 hasConcept C121332964 @default.
- W3197364345 hasConcept C127413603 @default.
- W3197364345 hasConcept C142575187 @default.
- W3197364345 hasConcept C154945302 @default.
- W3197364345 hasConcept C162324750 @default.
- W3197364345 hasConcept C176217482 @default.
- W3197364345 hasConcept C187736073 @default.
- W3197364345 hasConcept C21547014 @default.
- W3197364345 hasConcept C2776036281 @default.
- W3197364345 hasConcept C2776214188 @default.
- W3197364345 hasConcept C2780451532 @default.
- W3197364345 hasConcept C41008148 @default.
- W3197364345 hasConcept C44291984 @default.
- W3197364345 hasConcept C78519656 @default.