Matches in SemOpenAlex for { <https://semopenalex.org/work/W4308222497> ?p ?o ?g. }
- W4308222497 abstract "Multimodal sentiment analysis (MSA), which aims to improve text-based sentiment analysis with associated acoustic and visual modalities, is an emerging research area due to its potential applications in Human-Computer Interaction (HCI). However, existing research observes that the acoustic and visual modalities contribute much less than the textual modality, a situation termed text-predominant. In this work, we therefore focus on making non-verbal cues matter for the MSA task. Firstly, from the resource perspective, we present the CH-SIMS v2.0 dataset, an extension and enhancement of CH-SIMS. Compared with the original dataset, CH-SIMS v2.0 doubles its size with another 2121 refined video segments containing both unimodal and multimodal annotations, and collects 10161 unlabelled raw video segments with rich acoustic and visual emotion-bearing context to highlight non-verbal cues for sentiment prediction. Secondly, from the model perspective, we propose the Acoustic Visual Mixup Consistent (AV-MC) framework, which benefits from the unimodal annotations and the unlabelled data in CH-SIMS v2.0. The designed modality mixup module can be regarded as an augmentation that mixes the acoustic and visual modalities from different videos. By drawing unobserved multimodal contexts along with the text, the model can learn to be aware of different non-verbal contexts for sentiment prediction. Our evaluations demonstrate that both CH-SIMS v2.0 and the AV-MC framework enable further research on discovering emotion-bearing acoustic and visual cues and pave the way to interpretable end-to-end HCI applications for real-world scenarios. The full dataset and code are available for use at https://github.com/thuiar/ch-sims-v2." @default.
- W4308222497 created "2022-11-09" @default.
- W4308222497 creator A5012506420 @default.
- W4308222497 creator A5013271085 @default.
- W4308222497 creator A5014238437 @default.
- W4308222497 creator A5033915643 @default.
- W4308222497 creator A5057456638 @default.
- W4308222497 creator A5065294248 @default.
- W4308222497 creator A5075349120 @default.
- W4308222497 creator A5079656283 @default.
- W4308222497 creator A5086201515 @default.
- W4308222497 creator A5088936558 @default.
- W4308222497 date "2022-11-07" @default.
- W4308222497 modified "2023-10-16" @default.
- W4308222497 title "Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module" @default.
- W4308222497 cites W1520861770 @default.
- W4308222497 cites W2064675550 @default.
- W4308222497 cites W2085662862 @default.
- W4308222497 cites W2122563357 @default.
- W4308222497 cites W2510170536 @default.
- W4308222497 cites W2556418146 @default.
- W4308222497 cites W2787581402 @default.
- W4308222497 cites W2883409523 @default.
- W4308222497 cites W2924126491 @default.
- W4308222497 cites W2963104701 @default.
- W4308222497 cites W2964010806 @default.
- W4308222497 cites W2964051877 @default.
- W4308222497 cites W2964216663 @default.
- W4308222497 cites W2964346351 @default.
- W4308222497 cites W3034266838 @default.
- W4308222497 cites W3034849760 @default.
- W4308222497 cites W3035542229 @default.
- W4308222497 cites W3037611961 @default.
- W4308222497 cites W3100921325 @default.
- W4308222497 cites W3101998545 @default.
- W4308222497 cites W3103167052 @default.
- W4308222497 cites W3159769479 @default.
- W4308222497 cites W3174311454 @default.
- W4308222497 cites W3206008172 @default.
- W4308222497 cites W3209458476 @default.
- W4308222497 cites W3214432797 @default.
- W4308222497 cites W4221155339 @default.
- W4308222497 cites W4307823382 @default.
- W4308222497 cites W4312639100 @default.
- W4308222497 doi "https://doi.org/10.1145/3536221.3556630" @default.
- W4308222497 hasPublicationYear "2022" @default.
- W4308222497 type Work @default.
- W4308222497 citedByCount "1" @default.
- W4308222497 countsByYear W43082224972023 @default.
- W4308222497 crossrefType "proceedings-article" @default.
- W4308222497 hasAuthorship W4308222497A5012506420 @default.
- W4308222497 hasAuthorship W4308222497A5013271085 @default.
- W4308222497 hasAuthorship W4308222497A5014238437 @default.
- W4308222497 hasAuthorship W4308222497A5033915643 @default.
- W4308222497 hasAuthorship W4308222497A5057456638 @default.
- W4308222497 hasAuthorship W4308222497A5065294248 @default.
- W4308222497 hasAuthorship W4308222497A5075349120 @default.
- W4308222497 hasAuthorship W4308222497A5079656283 @default.
- W4308222497 hasAuthorship W4308222497A5086201515 @default.
- W4308222497 hasAuthorship W4308222497A5088936558 @default.
- W4308222497 hasBestOaLocation W43082224971 @default.
- W4308222497 hasConcept C107457646 @default.
- W4308222497 hasConcept C111370547 @default.
- W4308222497 hasConcept C12713177 @default.
- W4308222497 hasConcept C144024400 @default.
- W4308222497 hasConcept C151730666 @default.
- W4308222497 hasConcept C154945302 @default.
- W4308222497 hasConcept C2779343474 @default.
- W4308222497 hasConcept C2779903281 @default.
- W4308222497 hasConcept C2780226545 @default.
- W4308222497 hasConcept C28490314 @default.
- W4308222497 hasConcept C36289849 @default.
- W4308222497 hasConcept C41008148 @default.
- W4308222497 hasConcept C66402592 @default.
- W4308222497 hasConcept C86803240 @default.
- W4308222497 hasConceptScore W4308222497C107457646 @default.
- W4308222497 hasConceptScore W4308222497C111370547 @default.
- W4308222497 hasConceptScore W4308222497C12713177 @default.
- W4308222497 hasConceptScore W4308222497C144024400 @default.
- W4308222497 hasConceptScore W4308222497C151730666 @default.
- W4308222497 hasConceptScore W4308222497C154945302 @default.
- W4308222497 hasConceptScore W4308222497C2779343474 @default.
- W4308222497 hasConceptScore W4308222497C2779903281 @default.
- W4308222497 hasConceptScore W4308222497C2780226545 @default.
- W4308222497 hasConceptScore W4308222497C28490314 @default.
- W4308222497 hasConceptScore W4308222497C36289849 @default.
- W4308222497 hasConceptScore W4308222497C41008148 @default.
- W4308222497 hasConceptScore W4308222497C66402592 @default.
- W4308222497 hasConceptScore W4308222497C86803240 @default.
- W4308222497 hasFunder F4320321001 @default.
- W4308222497 hasFunder F4320322163 @default.
- W4308222497 hasLocation W43082224971 @default.
- W4308222497 hasLocation W43082224972 @default.
- W4308222497 hasOpenAccess W4308222497 @default.
- W4308222497 hasPrimaryLocation W43082224971 @default.
- W4308222497 hasRelatedWork W2004831463 @default.
- W4308222497 hasRelatedWork W2110287964 @default.
- W4308222497 hasRelatedWork W2168054807 @default.
- W4308222497 hasRelatedWork W2383394264 @default.
- W4308222497 hasRelatedWork W3125968744 @default.
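
The listing above is the result of matching every predicate and object attached to one work IRI. As a minimal sketch, the same pattern can be written as a SPARQL SELECT query. The helper name `build_work_query` and the `limit` parameter are hypothetical, introduced only for illustration. The graph variable `?g` is omitted on the assumption that every row sits in the default graph, as the `@default` column suggests. The code only builds the query string; no endpoint URL is assumed and nothing is sent over the network.

```python
def build_work_query(work_iri: str, limit: int = 200) -> str:
    """Return a SPARQL SELECT query listing predicate/object pairs for one IRI.

    Hypothetical helper for illustration; it is not part of any SemOpenAlex API.
    """
    # Reject characters that are not allowed inside a SPARQL IRI reference,
    # so the IRI cannot break out of the angle brackets.
    forbidden = set('<>"{}|^`\\') | {" ", "\n", "\t"}
    if any(ch in forbidden for ch in work_iri):
        raise ValueError(f"not a valid IRI for a SPARQL query: {work_iri!r}")
    if limit <= 0:
        raise ValueError("limit must be a positive integer")

    return (
        "SELECT ?p ?o WHERE {\n"
        f"  <{work_iri}> ?p ?o .\n"
        "}\n"
        f"LIMIT {limit}"
    )


# Build the query for the work shown in this listing.
query = build_work_query("https://semopenalex.org/work/W4308222497")
print(query)
```

The resulting string can be submitted to any SPARQL 1.1 endpoint that serves this data. Rows such as `cites`, `creator`, and `hasConcept` would then come back as bindings of `?p` and `?o`.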