Matches in SemOpenAlex for { <https://semopenalex.org/work/W3203466265> ?p ?o ?g. }
Showing items 1 to 83 of
83
with 100 items per page.
- W3203466265 endingPage "1203" @default.
- W3203466265 startingPage "1193" @default.
- W3203466265 abstract "Successful active speaker detection requires a three-stage pipeline: (i) audio-visual encoding for all speakers in the clip, (ii) inter-speaker relation modeling between a reference speaker and the background speakers within each frame, and (iii) temporal modeling for the reference speaker. Each stage of this pipeline plays an important role for the final performance of the created architecture. Based on a series of controlled experiments, this work presents several practical guidelines for audio-visual active speaker detection. Correspondingly, we present a new architecture called ASDNet, which achieves a new state-of-the-art on the AVA-ActiveSpeaker dataset with a mAP of 93.5% outperforming the second best with a large margin of 4.7%. Our code and pretrained models are publicly available." @default.
- W3203466265 created "2021-10-11" @default.
- W3203466265 creator A5021837066 @default.
- W3203466265 creator A5039092855 @default.
- W3203466265 creator A5066848553 @default.
- W3203466265 date "2021-10-01" @default.
- W3203466265 modified "2023-09-26" @default.
- W3203466265 title "How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild." @default.
- W3203466265 hasPublicationYear "2021" @default.
- W3203466265 type Work @default.
- W3203466265 sameAs 3203466265 @default.
- W3203466265 citedByCount "0" @default.
- W3203466265 crossrefType "proceedings-article" @default.
- W3203466265 hasAuthorship W3203466265A5021837066 @default.
- W3203466265 hasAuthorship W3203466265A5039092855 @default.
- W3203466265 hasAuthorship W3203466265A5066848553 @default.
- W3203466265 hasConcept C119857082 @default.
- W3203466265 hasConcept C123657996 @default.
- W3203466265 hasConcept C125411270 @default.
- W3203466265 hasConcept C126042441 @default.
- W3203466265 hasConcept C133892786 @default.
- W3203466265 hasConcept C142362112 @default.
- W3203466265 hasConcept C149838564 @default.
- W3203466265 hasConcept C153349607 @default.
- W3203466265 hasConcept C154945302 @default.
- W3203466265 hasConcept C177264268 @default.
- W3203466265 hasConcept C199360897 @default.
- W3203466265 hasConcept C2776760102 @default.
- W3203466265 hasConcept C28490314 @default.
- W3203466265 hasConcept C3017588708 @default.
- W3203466265 hasConcept C41008148 @default.
- W3203466265 hasConcept C43521106 @default.
- W3203466265 hasConcept C49774154 @default.
- W3203466265 hasConcept C76155785 @default.
- W3203466265 hasConcept C774472 @default.
- W3203466265 hasConceptScore W3203466265C119857082 @default.
- W3203466265 hasConceptScore W3203466265C123657996 @default.
- W3203466265 hasConceptScore W3203466265C125411270 @default.
- W3203466265 hasConceptScore W3203466265C126042441 @default.
- W3203466265 hasConceptScore W3203466265C133892786 @default.
- W3203466265 hasConceptScore W3203466265C142362112 @default.
- W3203466265 hasConceptScore W3203466265C149838564 @default.
- W3203466265 hasConceptScore W3203466265C153349607 @default.
- W3203466265 hasConceptScore W3203466265C154945302 @default.
- W3203466265 hasConceptScore W3203466265C177264268 @default.
- W3203466265 hasConceptScore W3203466265C199360897 @default.
- W3203466265 hasConceptScore W3203466265C2776760102 @default.
- W3203466265 hasConceptScore W3203466265C28490314 @default.
- W3203466265 hasConceptScore W3203466265C3017588708 @default.
- W3203466265 hasConceptScore W3203466265C41008148 @default.
- W3203466265 hasConceptScore W3203466265C43521106 @default.
- W3203466265 hasConceptScore W3203466265C49774154 @default.
- W3203466265 hasConceptScore W3203466265C76155785 @default.
- W3203466265 hasConceptScore W3203466265C774472 @default.
- W3203466265 hasLocation W32034662651 @default.
- W3203466265 hasOpenAccess W3203466265 @default.
- W3203466265 hasPrimaryLocation W32034662651 @default.
- W3203466265 hasRelatedWork W1559425193 @default.
- W3203466265 hasRelatedWork W1978607478 @default.
- W3203466265 hasRelatedWork W2003579603 @default.
- W3203466265 hasRelatedWork W2015293542 @default.
- W3203466265 hasRelatedWork W2033291042 @default.
- W3203466265 hasRelatedWork W2044306518 @default.
- W3203466265 hasRelatedWork W2133042973 @default.
- W3203466265 hasRelatedWork W2591265425 @default.
- W3203466265 hasRelatedWork W270444782 @default.
- W3203466265 hasRelatedWork W2787692317 @default.
- W3203466265 hasRelatedWork W2805503960 @default.
- W3203466265 hasRelatedWork W2808706139 @default.
- W3203466265 hasRelatedWork W2896704015 @default.
- W3203466265 hasRelatedWork W2906786027 @default.
- W3203466265 hasRelatedWork W2992483973 @default.
- W3203466265 hasRelatedWork W2999503024 @default.
- W3203466265 hasRelatedWork W3015783745 @default.
- W3203466265 hasRelatedWork W3016098309 @default.
- W3203466265 hasRelatedWork W3109950060 @default.
- W3203466265 hasRelatedWork W3172472082 @default.
- W3203466265 isParatext "false" @default.
- W3203466265 isRetracted "false" @default.
- W3203466265 magId "3203466265" @default.
- W3203466265 workType "article" @default.