Matches in SemOpenAlex for { <https://semopenalex.org/work/W3108764796> ?p ?o ?g. }
- W3108764796 abstract "In this paper, we study the task of multimodal sequence analysis, which aims to draw inferences from visual, language and acoustic sequences. Most existing works focus on aligned fusion of the three modalities, mostly at the word level, to accomplish this task, which is impractical in real-world scenarios. To overcome this issue, we seek to address the task of multimodal sequence analysis on unaligned modality sequences, which is still relatively underexplored and also more challenging. Recurrent neural networks (RNNs) and their variants are widely used in multimodal sequence analysis, but they are susceptible to vanishing/exploding gradients and high time complexity due to their recurrent nature. Therefore, we propose a novel model, termed Multimodal Graph, to investigate the effectiveness of graph neural networks (GNNs) in modeling multimodal sequential data. The graph-based structure enables parallel computation in the time dimension and can learn longer temporal dependencies in long unaligned sequences. Specifically, our Multimodal Graph is hierarchically structured to cater to two stages, i.e., intra- and inter-modal dynamics learning. In the first stage, a graph convolutional network is employed for each modality to learn intra-modal dynamics. In the second stage, given that the multimodal sequences are unaligned, the commonly considered word-level fusion does not apply. To this end, we devise a graph pooling fusion network to automatically learn the associations between various nodes from different modalities. Additionally, we define multiple ways to construct the adjacency matrix for sequential data. Experimental results suggest that our graph-based model reaches state-of-the-art performance on two benchmark datasets." @default.
- W3108764796 created "2020-12-07" @default.
- W3108764796 creator A5005411874 @default.
- W3108764796 creator A5010270301 @default.
- W3108764796 creator A5051867844 @default.
- W3108764796 creator A5056953478 @default.
- W3108764796 creator A5085190039 @default.
- W3108764796 date "2020-11-27" @default.
- W3108764796 modified "2023-09-25" @default.
- W3108764796 title "Analyzing Unaligned Multimodal Sequence via Graph Convolution and Graph Pooling Fusion." @default.
- W3108764796 cites W1679826675 @default.
- W3108764796 cites W2029996593 @default.
- W3108764796 cites W2064675550 @default.
- W3108764796 cites W2079725295 @default.
- W3108764796 cites W2085789144 @default.
- W3108764796 cites W2095176743 @default.
- W3108764796 cites W2107878631 @default.
- W3108764796 cites W2116341502 @default.
- W3108764796 cites W2127141656 @default.
- W3108764796 cites W2139906443 @default.
- W3108764796 cites W2149359396 @default.
- W3108764796 cites W2157331557 @default.
- W3108764796 cites W2168465881 @default.
- W3108764796 cites W2250539671 @default.
- W3108764796 cites W2465534249 @default.
- W3108764796 cites W2533262878 @default.
- W3108764796 cites W2546919788 @default.
- W3108764796 cites W2556418146 @default.
- W3108764796 cites W2583643061 @default.
- W3108764796 cites W2619383789 @default.
- W3108764796 cites W2740550900 @default.
- W3108764796 cites W2767249564 @default.
- W3108764796 cites W2788919350 @default.
- W3108764796 cites W2792764867 @default.
- W3108764796 cites W2811124557 @default.
- W3108764796 cites W2883409523 @default.
- W3108764796 cites W2886193235 @default.
- W3108764796 cites W2924719072 @default.
- W3108764796 cites W2937484199 @default.
- W3108764796 cites W2946218857 @default.
- W3108764796 cites W2949391930 @default.
- W3108764796 cites W2958722525 @default.
- W3108764796 cites W2962711740 @default.
- W3108764796 cites W2962718314 @default.
- W3108764796 cites W2962767366 @default.
- W3108764796 cites W2962931510 @default.
- W3108764796 cites W2963016848 @default.
- W3108764796 cites W2963063161 @default.
- W3108764796 cites W2963128932 @default.
- W3108764796 cites W2963317470 @default.
- W3108764796 cites W2963403868 @default.
- W3108764796 cites W2963685106 @default.
- W3108764796 cites W2963710346 @default.
- W3108764796 cites W2963757395 @default.
- W3108764796 cites W2963858333 @default.
- W3108764796 cites W2964010806 @default.
- W3108764796 cites W2964015378 @default.
- W3108764796 cites W2964051877 @default.
- W3108764796 cites W2964121744 @default.
- W3108764796 cites W2964216663 @default.
- W3108764796 cites W2964265128 @default.
- W3108764796 cites W2964266095 @default.
- W3108764796 cites W2970972665 @default.
- W3108764796 cites W2971050617 @default.
- W3108764796 cites W2972493869 @default.
- W3108764796 cites W2996091850 @default.
- W3108764796 cites W2997573100 @default.
- W3108764796 cites W3007282427 @default.
- W3108764796 cites W3034897750 @default.
- W3108764796 cites W3087647883 @default.
- W3108764796 hasPublicationYear "2020" @default.
- W3108764796 type Work @default.
- W3108764796 sameAs 3108764796 @default.
- W3108764796 citedByCount "4" @default.
- W3108764796 countsByYear W31087647962021 @default.
- W3108764796 countsByYear W31087647962022 @default.
- W3108764796 crossrefType "posted-content" @default.
- W3108764796 hasAuthorship W3108764796A5005411874 @default.
- W3108764796 hasAuthorship W3108764796A5010270301 @default.
- W3108764796 hasAuthorship W3108764796A5051867844 @default.
- W3108764796 hasAuthorship W3108764796A5056953478 @default.
- W3108764796 hasAuthorship W3108764796A5085190039 @default.
- W3108764796 hasConcept C132525143 @default.
- W3108764796 hasConcept C147168706 @default.
- W3108764796 hasConcept C154945302 @default.
- W3108764796 hasConcept C180356752 @default.
- W3108764796 hasConcept C41008148 @default.
- W3108764796 hasConcept C50644808 @default.
- W3108764796 hasConcept C70437156 @default.
- W3108764796 hasConcept C80444323 @default.
- W3108764796 hasConceptScore W3108764796C132525143 @default.
- W3108764796 hasConceptScore W3108764796C147168706 @default.
- W3108764796 hasConceptScore W3108764796C154945302 @default.
- W3108764796 hasConceptScore W3108764796C180356752 @default.
- W3108764796 hasConceptScore W3108764796C41008148 @default.
- W3108764796 hasConceptScore W3108764796C50644808 @default.
- W3108764796 hasConceptScore W3108764796C70437156 @default.
- W3108764796 hasConceptScore W3108764796C80444323 @default.
- W3108764796 hasLocation W31087647961 @default.
- W3108764796 hasOpenAccess W3108764796 @default.