Matches in SemOpenAlex for { <https://semopenalex.org/work/W3118593445> ?p ?o ?g. }
- W3118593445 abstract "Active speaker detection requires a solid integration of multi-modal cues. While individual modalities can approximate a solution, accurate predictions can only be achieved by explicitly fusing the audio and visual features and modeling their temporal progression. Despite its inherent muti-modal nature, current methods still focus on modeling and fusing short-term audiovisual features for individual speakers, often at frame level. In this paper we present a novel approach to active speaker detection that directly addresses the multi-modal nature of the problem, and provides a straightforward strategy where independent visual features from potential speakers in the scene are assigned to a previously detected speech event. Our experiments show that, an small graph data structure built from a single frame, allows to approximate an instantaneous audio-visual assignment problem. Moreover, the temporal extension of this initial graph achieves a new state-of-the-art on the AVA-ActiveSpeaker dataset with a mAP of 88.8%." @default.
- W3118593445 created "2021-01-18" @default.
- W3118593445 creator A5022774523 @default.
- W3118593445 creator A5024763828 @default.
- W3118593445 creator A5026143507 @default.
- W3118593445 creator A5082397682 @default.
- W3118593445 date "2021-01-11" @default.
- W3118593445 modified "2023-09-26" @default.
- W3118593445 title "MAAS: Multi-modal Assignation for Active Speaker Detection" @default.
- W3118593445 cites W2016053056 @default.
- W3118593445 cites W2038101708 @default.
- W3118593445 cites W2081074144 @default.
- W3118593445 cites W2108598243 @default.
- W3118593445 cites W2117671523 @default.
- W3118593445 cites W2138621090 @default.
- W3118593445 cites W2138761194 @default.
- W3118593445 cites W2154636774 @default.
- W3118593445 cites W2159591770 @default.
- W3118593445 cites W2163973301 @default.
- W3118593445 cites W2184188583 @default.
- W3118593445 cites W2194775991 @default.
- W3118593445 cites W2287407690 @default.
- W3118593445 cites W2302255633 @default.
- W3118593445 cites W2519887557 @default.
- W3118593445 cites W2547701628 @default.
- W3118593445 cites W2579549467 @default.
- W3118593445 cites W2603203130 @default.
- W3118593445 cites W2604379605 @default.
- W3118593445 cites W2638067502 @default.
- W3118593445 cites W2726515241 @default.
- W3118593445 cites W2749694333 @default.
- W3118593445 cites W2753324760 @default.
- W3118593445 cites W2776622059 @default.
- W3118593445 cites W2808631503 @default.
- W3118593445 cites W2810482788 @default.
- W3118593445 cites W2886970679 @default.
- W3118593445 cites W2889385246 @default.
- W3118593445 cites W2896538040 @default.
- W3118593445 cites W2897924318 @default.
- W3118593445 cites W2899771611 @default.
- W3118593445 cites W2906848589 @default.
- W3118593445 cites W2918342466 @default.
- W3118593445 cites W2947078341 @default.
- W3118593445 cites W2948124424 @default.
- W3118593445 cites W2954458766 @default.
- W3118593445 cites W2962788625 @default.
- W3118593445 cites W2962918445 @default.
- W3118593445 cites W2962960500 @default.
- W3118593445 cites W2963076818 @default.
- W3118593445 cites W2963165299 @default.
- W3118593445 cites W2963184176 @default.
- W3118593445 cites W2963470929 @default.
- W3118593445 cites W2963801643 @default.
- W3118593445 cites W2963887950 @default.
- W3118593445 cites W2964052309 @default.
- W3118593445 cites W2964121744 @default.
- W3118593445 cites W2979750740 @default.
- W3118593445 cites W2990045899 @default.
- W3118593445 cites W2990280855 @default.
- W3118593445 cites W3011519842 @default.
- W3118593445 cites W3016098309 @default.
- W3118593445 cites W3034623254 @default.
- W3118593445 cites W3034702511 @default.
- W3118593445 cites W3035649237 @default.
- W3118593445 cites W3038871978 @default.
- W3118593445 cites W3048065599 @default.
- W3118593445 hasPublicationYear "2021" @default.
- W3118593445 type Work @default.
- W3118593445 sameAs 3118593445 @default.
- W3118593445 citedByCount "1" @default.
- W3118593445 countsByYear W31185934452021 @default.
- W3118593445 crossrefType "posted-content" @default.
- W3118593445 hasAuthorship W3118593445A5022774523 @default.
- W3118593445 hasAuthorship W3118593445A5024763828 @default.
- W3118593445 hasAuthorship W3118593445A5026143507 @default.
- W3118593445 hasAuthorship W3118593445A5082397682 @default.
- W3118593445 hasConcept C120665830 @default.
- W3118593445 hasConcept C121332964 @default.
- W3118593445 hasConcept C126042441 @default.
- W3118593445 hasConcept C132525143 @default.
- W3118593445 hasConcept C154945302 @default.
- W3118593445 hasConcept C185592680 @default.
- W3118593445 hasConcept C188027245 @default.
- W3118593445 hasConcept C192209626 @default.
- W3118593445 hasConcept C28490314 @default.
- W3118593445 hasConcept C3017588708 @default.
- W3118593445 hasConcept C31972630 @default.
- W3118593445 hasConcept C41008148 @default.
- W3118593445 hasConcept C49774154 @default.
- W3118593445 hasConcept C71139939 @default.
- W3118593445 hasConcept C76155785 @default.
- W3118593445 hasConcept C80444323 @default.
- W3118593445 hasConceptScore W3118593445C120665830 @default.
- W3118593445 hasConceptScore W3118593445C121332964 @default.
- W3118593445 hasConceptScore W3118593445C126042441 @default.
- W3118593445 hasConceptScore W3118593445C132525143 @default.
- W3118593445 hasConceptScore W3118593445C154945302 @default.
- W3118593445 hasConceptScore W3118593445C185592680 @default.
- W3118593445 hasConceptScore W3118593445C188027245 @default.
- W3118593445 hasConceptScore W3118593445C192209626 @default.