Matches in SemOpenAlex for { <https://semopenalex.org/work/W3149201639> ?p ?o ?g. }
- W3149201639 abstract "In this paper, we present AISHELL-4, a sizable real-recorded Mandarin speech dataset collected by 8-channel circular microphone array for speech processing in conference scenario. The dataset consists of 211 recorded meeting sessions, each containing 4 to 8 speakers, with a total length of 120 hours. This dataset aims to bridge the advanced research on multi-speaker processing and the practical application scenario in three aspects. With real recorded meetings, AISHELL-4 provides realistic acoustics and rich natural speech characteristics in conversation such as short pause, speech overlap, quick speaker turn, noise, etc. Meanwhile, accurate transcription and speaker voice activity are provided for each meeting in AISHELL-4. This allows the researchers to explore different aspects in meeting processing, ranging from individual tasks such as speech front-end processing, speech recognition and speaker diarization, to multi-modality modeling and joint optimization of relevant tasks. Given most open source dataset for multi-speaker tasks are in English, AISHELL-4 is the only Mandarin dataset for conversation speech, providing additional value for data diversity in speech community. We also release a PyTorch-based training and evaluation framework as baseline system to promote reproducible research in this field." @default.
- W3149201639 created "2021-04-13" @default.
- W3149201639 creator A5010250251 @default.
- W3149201639 creator A5017208328 @default.
- W3149201639 creator A5017825677 @default.
- W3149201639 creator A5024558170 @default.
- W3149201639 creator A5029573919 @default.
- W3149201639 creator A5031232410 @default.
- W3149201639 creator A5033976028 @default.
- W3149201639 creator A5040519711 @default.
- W3149201639 creator A5046975280 @default.
- W3149201639 creator A5056129529 @default.
- W3149201639 creator A5057521182 @default.
- W3149201639 creator A5066595711 @default.
- W3149201639 creator A5087994947 @default.
- W3149201639 date "2021-04-08" @default.
- W3149201639 modified "2023-09-28" @default.
- W3149201639 title "AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario" @default.
- W3149201639 cites W1494198834 @default.
- W3149201639 cites W1589137271 @default.
- W3149201639 cites W1591607137 @default.
- W3149201639 cites W2000221426 @default.
- W3149201639 cites W2112865675 @default.
- W3149201639 cites W2138730338 @default.
- W3149201639 cites W2166637769 @default.
- W3149201639 cites W2170579896 @default.
- W3149201639 cites W2221409856 @default.
- W3149201639 cites W2508048623 @default.
- W3149201639 cites W262275730 @default.
- W3149201639 cites W2726515241 @default.
- W3149201639 cites W2766219058 @default.
- W3149201639 cites W2889048668 @default.
- W3149201639 cites W2892009249 @default.
- W3149201639 cites W2936774411 @default.
- W3149201639 cites W2963242190 @default.
- W3149201639 cites W2969985801 @default.
- W3149201639 cites W2972541922 @default.
- W3149201639 cites W2981087920 @default.
- W3149201639 cites W2981461916 @default.
- W3149201639 cites W2996322586 @default.
- W3149201639 cites W3008283340 @default.
- W3149201639 cites W3015191643 @default.
- W3149201639 cites W3015598461 @default.
- W3149201639 cites W3016232124 @default.
- W3149201639 cites W3016400019 @default.
- W3149201639 cites W3027008958 @default.
- W3149201639 cites W3094005623 @default.
- W3149201639 cites W3094007021 @default.
- W3149201639 cites W3115133930 @default.
- W3149201639 cites W3133834828 @default.
- W3149201639 cites W3140783160 @default.
- W3149201639 cites W3143843080 @default.
- W3149201639 cites W97072897 @default.
- W3149201639 hasPublicationYear "2021" @default.
- W3149201639 type Work @default.
- W3149201639 sameAs 3149201639 @default.
- W3149201639 citedByCount "3" @default.
- W3149201639 countsByYear W31492016392021 @default.
- W3149201639 crossrefType "posted-content" @default.
- W3149201639 hasAuthorship W3149201639A5010250251 @default.
- W3149201639 hasAuthorship W3149201639A5017208328 @default.
- W3149201639 hasAuthorship W3149201639A5017825677 @default.
- W3149201639 hasAuthorship W3149201639A5024558170 @default.
- W3149201639 hasAuthorship W3149201639A5029573919 @default.
- W3149201639 hasAuthorship W3149201639A5031232410 @default.
- W3149201639 hasAuthorship W3149201639A5033976028 @default.
- W3149201639 hasAuthorship W3149201639A5040519711 @default.
- W3149201639 hasAuthorship W3149201639A5046975280 @default.
- W3149201639 hasAuthorship W3149201639A5056129529 @default.
- W3149201639 hasAuthorship W3149201639A5057521182 @default.
- W3149201639 hasAuthorship W3149201639A5066595711 @default.
- W3149201639 hasAuthorship W3149201639A5087994947 @default.
- W3149201639 hasConcept C133892786 @default.
- W3149201639 hasConcept C138885662 @default.
- W3149201639 hasConcept C138954614 @default.
- W3149201639 hasConcept C149838564 @default.
- W3149201639 hasConcept C154945302 @default.
- W3149201639 hasConcept C163294075 @default.
- W3149201639 hasConcept C179926584 @default.
- W3149201639 hasConcept C2776182073 @default.
- W3149201639 hasConcept C2777200299 @default.
- W3149201639 hasConcept C2778263558 @default.
- W3149201639 hasConcept C2778806681 @default.
- W3149201639 hasConcept C28490314 @default.
- W3149201639 hasConcept C41008148 @default.
- W3149201639 hasConcept C41895202 @default.
- W3149201639 hasConcept C61328038 @default.
- W3149201639 hasConcept C68115822 @default.
- W3149201639 hasConcept C76155785 @default.
- W3149201639 hasConceptScore W3149201639C133892786 @default.
- W3149201639 hasConceptScore W3149201639C138885662 @default.
- W3149201639 hasConceptScore W3149201639C138954614 @default.
- W3149201639 hasConceptScore W3149201639C149838564 @default.
- W3149201639 hasConceptScore W3149201639C154945302 @default.
- W3149201639 hasConceptScore W3149201639C163294075 @default.
- W3149201639 hasConceptScore W3149201639C179926584 @default.
- W3149201639 hasConceptScore W3149201639C2776182073 @default.
- W3149201639 hasConceptScore W3149201639C2777200299 @default.
- W3149201639 hasConceptScore W3149201639C2778263558 @default.
- W3149201639 hasConceptScore W3149201639C2778806681 @default.