Matches in SemOpenAlex for { <https://semopenalex.org/work/W2906848589> ?p ?o ?g. }
- W2906848589 abstract "Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings, speech enhancement, and human-robot interaction. The absence of a large, carefully labeled audio-visual dataset for this task has constrained algorithm evaluations with respect to data diversity, environments, and accuracy. This has made comparisons and improvements difficult. In this paper, we present the AVA Active Speaker detection dataset (AVA-ActiveSpeaker) that will be released publicly to facilitate algorithm development and enable comparisons. The dataset contains temporally labeled face tracks in video, where each face instance is labeled as speaking or not, and whether the speech is audible. This dataset contains about 3.65 million human-labeled frames or about 38.5 hours of face tracks, and the corresponding audio. We also present a new audio-visual approach for active speaker detection, and analyze its performance, demonstrating both its strength and the contributions of the dataset." @default.
- W2906848589 created "2019-01-11" @default.
- W2906848589 creator A5019766344 @default.
- W2906848589 creator A5033832936 @default.
- W2906848589 creator A5041852716 @default.
- W2906848589 creator A5044666169 @default.
- W2906848589 creator A5045217258 @default.
- W2906848589 creator A5062133033 @default.
- W2906848589 creator A5065761531 @default.
- W2906848589 creator A5068559933 @default.
- W2906848589 creator A5074995400 @default.
- W2906848589 creator A5083515346 @default.
- W2906848589 creator A5084384753 @default.
- W2906848589 date "2019-01-05" @default.
- W2906848589 modified "2023-09-27" @default.
- W2906848589 title "AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection" @default.
- W2906848589 cites W1569447338 @default.
- W2906848589 cites W163811496 @default.
- W2906848589 cites W1872883209 @default.
- W2906848589 cites W1934410531 @default.
- W2906848589 cites W1975879668 @default.
- W2906848589 cites W2015293542 @default.
- W2906848589 cites W2064194796 @default.
- W2906848589 cites W2066348332 @default.
- W2906848589 cites W2071828724 @default.
- W2906848589 cites W2093153344 @default.
- W2906848589 cites W2098923380 @default.
- W2906848589 cites W2099403067 @default.
- W2906848589 cites W2100561338 @default.
- W2906848589 cites W2104804886 @default.
- W2906848589 cites W2106488367 @default.
- W2906848589 cites W2106793713 @default.
- W2906848589 cites W2118847468 @default.
- W2906848589 cites W2147520277 @default.
- W2906848589 cites W2163973301 @default.
- W2906848589 cites W2168996682 @default.
- W2906848589 cites W2250641567 @default.
- W2906848589 cites W2287407690 @default.
- W2906848589 cites W2293467699 @default.
- W2906848589 cites W2296073425 @default.
- W2906848589 cites W2316138215 @default.
- W2906848589 cites W2330149154 @default.
- W2906848589 cites W2547701628 @default.
- W2906848589 cites W2604379605 @default.
- W2906848589 cites W2612445135 @default.
- W2906848589 cites W2621109248 @default.
- W2906848589 cites W2726515241 @default.
- W2906848589 cites W2749694333 @default.
- W2906848589 cites W2759799350 @default.
- W2906848589 cites W2883383043 @default.
- W2906848589 cites W2885307078 @default.
- W2906848589 cites W2962960500 @default.
- W2906848589 cites W2963082324 @default.
- W2906848589 cites W2964161785 @default.
- W2906848589 cites W2964171275 @default.
- W2906848589 cites W2964327384 @default.
- W2906848589 cites W3042359118 @default.
- W2906848589 cites W3105099157 @default.
- W2906848589 cites W3123318516 @default.
- W2906848589 hasPublicationYear "2019" @default.
- W2906848589 type Work @default.
- W2906848589 sameAs 2906848589 @default.
- W2906848589 citedByCount "13" @default.
- W2906848589 countsByYear W29068485892019 @default.
- W2906848589 countsByYear W29068485892020 @default.
- W2906848589 countsByYear W29068485892021 @default.
- W2906848589 crossrefType "posted-content" @default.
- W2906848589 hasAuthorship W2906848589A5019766344 @default.
- W2906848589 hasAuthorship W2906848589A5033832936 @default.
- W2906848589 hasAuthorship W2906848589A5041852716 @default.
- W2906848589 hasAuthorship W2906848589A5044666169 @default.
- W2906848589 hasAuthorship W2906848589A5045217258 @default.
- W2906848589 hasAuthorship W2906848589A5062133033 @default.
- W2906848589 hasAuthorship W2906848589A5065761531 @default.
- W2906848589 hasAuthorship W2906848589A5068559933 @default.
- W2906848589 hasAuthorship W2906848589A5074995400 @default.
- W2906848589 hasAuthorship W2906848589A5083515346 @default.
- W2906848589 hasAuthorship W2906848589A5084384753 @default.
- W2906848589 hasConcept C133892786 @default.
- W2906848589 hasConcept C144024400 @default.
- W2906848589 hasConcept C149838564 @default.
- W2906848589 hasConcept C153180895 @default.
- W2906848589 hasConcept C154945302 @default.
- W2906848589 hasConcept C162324750 @default.
- W2906848589 hasConcept C187736073 @default.
- W2906848589 hasConcept C204201278 @default.
- W2906848589 hasConcept C2779304628 @default.
- W2906848589 hasConcept C2780451532 @default.
- W2906848589 hasConcept C28490314 @default.
- W2906848589 hasConcept C3017588708 @default.
- W2906848589 hasConcept C36289849 @default.
- W2906848589 hasConcept C41008148 @default.
- W2906848589 hasConcept C49774154 @default.
- W2906848589 hasConcept C61328038 @default.
- W2906848589 hasConceptScore W2906848589C133892786 @default.
- W2906848589 hasConceptScore W2906848589C144024400 @default.
- W2906848589 hasConceptScore W2906848589C149838564 @default.
- W2906848589 hasConceptScore W2906848589C153180895 @default.
- W2906848589 hasConceptScore W2906848589C154945302 @default.
- W2906848589 hasConceptScore W2906848589C162324750 @default.