Matches in SemOpenAlex for { <https://semopenalex.org/work/W4287235989> ?p ?o ?g. }
Showing items 1 to 81 of 81 with 100 items per page.
- W4287235989 abstract "In this paper, we present AISHELL-4, a sizable real-recorded Mandarin speech dataset collected by 8-channel circular microphone array for speech processing in conference scenario. The dataset consists of 211 recorded meeting sessions, each containing 4 to 8 speakers, with a total length of 120 hours. This dataset aims to bridge the advanced research on multi-speaker processing and the practical application scenario in three aspects. With real recorded meetings, AISHELL-4 provides realistic acoustics and rich natural speech characteristics in conversation such as short pause, speech overlap, quick speaker turn, noise, etc. Meanwhile, accurate transcription and speaker voice activity are provided for each meeting in AISHELL-4. This allows the researchers to explore different aspects in meeting processing, ranging from individual tasks such as speech front-end processing, speech recognition and speaker diarization, to multi-modality modeling and joint optimization of relevant tasks. Given most open source dataset for multi-speaker tasks are in English, AISHELL-4 is the only Mandarin dataset for conversation speech, providing additional value for data diversity in speech community. We also release a PyTorch-based training and evaluation framework as baseline system to promote reproducible research in this field." @default.
- W4287235989 created "2022-07-25" @default.
- W4287235989 creator A5001713310 @default.
- W4287235989 creator A5017208328 @default.
- W4287235989 creator A5017825677 @default.
- W4287235989 creator A5029573919 @default.
- W4287235989 creator A5029797051 @default.
- W4287235989 creator A5029817092 @default.
- W4287235989 creator A5031232410 @default.
- W4287235989 creator A5056129529 @default.
- W4287235989 creator A5057521182 @default.
- W4287235989 creator A5058814917 @default.
- W4287235989 creator A5080273554 @default.
- W4287235989 creator A5084524299 @default.
- W4287235989 creator A5087994947 @default.
- W4287235989 date "2021-04-08" @default.
- W4287235989 modified "2023-10-12" @default.
- W4287235989 title "AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario" @default.
- W4287235989 doi "https://doi.org/10.48550/arxiv.2104.03603" @default.
- W4287235989 hasPublicationYear "2021" @default.
- W4287235989 type Work @default.
- W4287235989 citedByCount "0" @default.
- W4287235989 crossrefType "posted-content" @default.
- W4287235989 hasAuthorship W4287235989A5001713310 @default.
- W4287235989 hasAuthorship W4287235989A5017208328 @default.
- W4287235989 hasAuthorship W4287235989A5017825677 @default.
- W4287235989 hasAuthorship W4287235989A5029573919 @default.
- W4287235989 hasAuthorship W4287235989A5029797051 @default.
- W4287235989 hasAuthorship W4287235989A5029817092 @default.
- W4287235989 hasAuthorship W4287235989A5031232410 @default.
- W4287235989 hasAuthorship W4287235989A5056129529 @default.
- W4287235989 hasAuthorship W4287235989A5057521182 @default.
- W4287235989 hasAuthorship W4287235989A5058814917 @default.
- W4287235989 hasAuthorship W4287235989A5080273554 @default.
- W4287235989 hasAuthorship W4287235989A5084524299 @default.
- W4287235989 hasAuthorship W4287235989A5087994947 @default.
- W4287235989 hasBestOaLocation W42872359891 @default.
- W4287235989 hasConcept C133892786 @default.
- W4287235989 hasConcept C138885662 @default.
- W4287235989 hasConcept C138954614 @default.
- W4287235989 hasConcept C149838564 @default.
- W4287235989 hasConcept C179926584 @default.
- W4287235989 hasConcept C2777200299 @default.
- W4287235989 hasConcept C2778263558 @default.
- W4287235989 hasConcept C2778806681 @default.
- W4287235989 hasConcept C28490314 @default.
- W4287235989 hasConcept C41008148 @default.
- W4287235989 hasConcept C41895202 @default.
- W4287235989 hasConcept C61328038 @default.
- W4287235989 hasConcept C68115822 @default.
- W4287235989 hasConcept C76155785 @default.
- W4287235989 hasConceptScore W4287235989C133892786 @default.
- W4287235989 hasConceptScore W4287235989C138885662 @default.
- W4287235989 hasConceptScore W4287235989C138954614 @default.
- W4287235989 hasConceptScore W4287235989C149838564 @default.
- W4287235989 hasConceptScore W4287235989C179926584 @default.
- W4287235989 hasConceptScore W4287235989C2777200299 @default.
- W4287235989 hasConceptScore W4287235989C2778263558 @default.
- W4287235989 hasConceptScore W4287235989C2778806681 @default.
- W4287235989 hasConceptScore W4287235989C28490314 @default.
- W4287235989 hasConceptScore W4287235989C41008148 @default.
- W4287235989 hasConceptScore W4287235989C41895202 @default.
- W4287235989 hasConceptScore W4287235989C61328038 @default.
- W4287235989 hasConceptScore W4287235989C68115822 @default.
- W4287235989 hasConceptScore W4287235989C76155785 @default.
- W4287235989 hasLocation W42872359891 @default.
- W4287235989 hasOpenAccess W4287235989 @default.
- W4287235989 hasPrimaryLocation W42872359891 @default.
- W4287235989 hasRelatedWork W2216290105 @default.
- W4287235989 hasRelatedWork W2667475753 @default.
- W4287235989 hasRelatedWork W2996322586 @default.
- W4287235989 hasRelatedWork W3002397307 @default.
- W4287235989 hasRelatedWork W3046222353 @default.
- W4287235989 hasRelatedWork W3089403906 @default.
- W4287235989 hasRelatedWork W3097700828 @default.
- W4287235989 hasRelatedWork W3149201639 @default.
- W4287235989 hasRelatedWork W4287235989 @default.
- W4287235989 hasRelatedWork W95889140 @default.
- W4287235989 isParatext "false" @default.
- W4287235989 isRetracted "false" @default.
- W4287235989 workType "article" @default.