Matches in SemOpenAlex for { <https://semopenalex.org/work/W4312551640> ?p ?o ?g. }
Showing items 1 to 89 of
89
with 100 items per page.
- W4312551640 endingPage "346" @default.
- W4312551640 startingPage "330" @default.
- W4312551640 abstract "The ability to model intra-modal and inter-modal interactions is fundamental in multimodal machine learning. The current state-of-the-art models usually adopt deep learning models with fixed structures. They can achieve exceptional performances on specific tasks, but face a particularly challenging problem of modality mismatch because of diversity of input modalities and their fixed structures. In this paper, we present Switch-BERT for joint vision and language representation learning to address this problem. Switch-BERT extends BERT architecture by introducing learnable layer-wise and cross-layer interactions. It learns to optimize attention from a set of attention modes representing these interactions. One specific property of the model is that it learns to attend outputs from various depths, therefore mitigates the modality mismatch problem. We present extensive experiments on visual question answering, image-text retrieval and referring expression comprehension experiments. Results confirm that, whereas alternative architectures including ViLBERT and UNITER may excel in particular tasks, Switch-BERT can consistently achieve better or comparable performances than the current state-of-the-art models in these tasks. Ablation studies indicate that the proposed model achieves superior performances due to its ability in learning task-specific multimodal interactions." @default.
- W4312551640 created "2023-01-05" @default.
- W4312551640 creator A5034946345 @default.
- W4312551640 creator A5047666676 @default.
- W4312551640 creator A5065688486 @default.
- W4312551640 date "2022-01-01" @default.
- W4312551640 modified "2023-09-23" @default.
- W4312551640 title "Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input" @default.
- W4312551640 cites W1773149199 @default.
- W4312551640 cites W1933349210 @default.
- W4312551640 cites W2251512949 @default.
- W4312551640 cites W2277195237 @default.
- W4312551640 cites W2560730294 @default.
- W4312551640 cites W2745461083 @default.
- W4312551640 cites W2886641317 @default.
- W4312551640 cites W2946417913 @default.
- W4312551640 cites W2964345792 @default.
- W4312551640 cites W2970231061 @default.
- W4312551640 cites W2973154008 @default.
- W4312551640 cites W2981851019 @default.
- W4312551640 cites W2997591391 @default.
- W4312551640 cites W2998356391 @default.
- W4312551640 cites W3035635319 @default.
- W4312551640 cites W3090449556 @default.
- W4312551640 cites W3091588028 @default.
- W4312551640 cites W3159619744 @default.
- W4312551640 cites W3201264086 @default.
- W4312551640 doi "https://doi.org/10.1007/978-3-031-20059-5_19" @default.
- W4312551640 hasPublicationYear "2022" @default.
- W4312551640 type Work @default.
- W4312551640 citedByCount "0" @default.
- W4312551640 crossrefType "book-chapter" @default.
- W4312551640 hasAuthorship W4312551640A5034946345 @default.
- W4312551640 hasAuthorship W4312551640A5047666676 @default.
- W4312551640 hasAuthorship W4312551640A5065688486 @default.
- W4312551640 hasConcept C111472728 @default.
- W4312551640 hasConcept C119857082 @default.
- W4312551640 hasConcept C138885662 @default.
- W4312551640 hasConcept C154945302 @default.
- W4312551640 hasConcept C162324750 @default.
- W4312551640 hasConcept C177264268 @default.
- W4312551640 hasConcept C17744445 @default.
- W4312551640 hasConcept C187736073 @default.
- W4312551640 hasConcept C189950617 @default.
- W4312551640 hasConcept C199360897 @default.
- W4312551640 hasConcept C199539241 @default.
- W4312551640 hasConcept C2776359362 @default.
- W4312551640 hasConcept C2780226545 @default.
- W4312551640 hasConcept C2780451532 @default.
- W4312551640 hasConcept C2780660688 @default.
- W4312551640 hasConcept C41008148 @default.
- W4312551640 hasConcept C59404180 @default.
- W4312551640 hasConcept C94625758 @default.
- W4312551640 hasConceptScore W4312551640C111472728 @default.
- W4312551640 hasConceptScore W4312551640C119857082 @default.
- W4312551640 hasConceptScore W4312551640C138885662 @default.
- W4312551640 hasConceptScore W4312551640C154945302 @default.
- W4312551640 hasConceptScore W4312551640C162324750 @default.
- W4312551640 hasConceptScore W4312551640C177264268 @default.
- W4312551640 hasConceptScore W4312551640C17744445 @default.
- W4312551640 hasConceptScore W4312551640C187736073 @default.
- W4312551640 hasConceptScore W4312551640C189950617 @default.
- W4312551640 hasConceptScore W4312551640C199360897 @default.
- W4312551640 hasConceptScore W4312551640C199539241 @default.
- W4312551640 hasConceptScore W4312551640C2776359362 @default.
- W4312551640 hasConceptScore W4312551640C2780226545 @default.
- W4312551640 hasConceptScore W4312551640C2780451532 @default.
- W4312551640 hasConceptScore W4312551640C2780660688 @default.
- W4312551640 hasConceptScore W4312551640C41008148 @default.
- W4312551640 hasConceptScore W4312551640C59404180 @default.
- W4312551640 hasConceptScore W4312551640C94625758 @default.
- W4312551640 hasLocation W43125516401 @default.
- W4312551640 hasOpenAccess W4312551640 @default.
- W4312551640 hasPrimaryLocation W43125516401 @default.
- W4312551640 hasRelatedWork W2108201743 @default.
- W4312551640 hasRelatedWork W2952745240 @default.
- W4312551640 hasRelatedWork W2959445501 @default.
- W4312551640 hasRelatedWork W3087493185 @default.
- W4312551640 hasRelatedWork W4213012905 @default.
- W4312551640 hasRelatedWork W4285606578 @default.
- W4312551640 hasRelatedWork W4301143707 @default.
- W4312551640 hasRelatedWork W4312395240 @default.
- W4312551640 hasRelatedWork W4312851439 @default.
- W4312551640 hasRelatedWork W4362598752 @default.
- W4312551640 isParatext "false" @default.
- W4312551640 isRetracted "false" @default.
- W4312551640 workType "book-chapter" @default.