Matches in SemOpenAlex for { <https://semopenalex.org/work/W4311646269> ?p ?o ?g. }
Showing items 1 to 83 of 83 with 100 items per page.
- W4311646269 abstract "Automatic Speech Recognition (ASR) for air traffic control is generally trained by pooling Air Traffic Controller (ATCO) and pilot data into one set. This is motivated by the fact that pilot's voice communications are more scarce than ATCOs. Due to this data imbalance and other reasons (e.g., varying acoustic conditions), the speech from ATCOs is usually recognized more accurately than from pilots. Automatically identifying the speaker roles is a challenging task, especially in the case of the noisy voice recordings collected using Very High Frequency (VHF) receivers or due to the unavailability of the push-to-talk (PTT) signal, i.e., both audio channels are mixed. In this work, we propose to (1) automatically segment the ATCO and pilot data based on an intuitive approach exploiting ASR transcripts and (2) subsequently consider an automatic recognition of ATCOs' and pilots' voice as two separate tasks. Our work is performed on VHF audio data with high noise levels, i.e., signal-to-noise (SNR) ratios below 15 dB, as this data is recognized to be helpful for various speech-based machine-learning tasks. Specifically, for the speaker role identification task, the module is represented by a simple yet efficient knowledge-based system exploiting a grammar defined by the International Civil Aviation Organization (ICAO). The system accepts text as the input, either manually verified annotations or automatically generated transcripts. The developed approach provides an average accuracy in speaker role identification of about 83%. Finally, we show that training an acoustic model for ASR tasks separately (i.e., separate models for ATCOs and pilots) or using a multitask approach is well suited for the noisy data and outperforms the traditional ASR system where all data is pooled together." @default.
- W4311646269 created "2022-12-27" @default.
- W4311646269 creator A5003975051 @default.
- W4311646269 creator A5021226188 @default.
- W4311646269 creator A5046858513 @default.
- W4311646269 creator A5066348116 @default.
- W4311646269 creator A5076409146 @default.
- W4311646269 creator A5076763342 @default.
- W4311646269 creator A5078502662 @default.
- W4311646269 date "2021-08-27" @default.
- W4311646269 modified "2023-09-27" @default.
- W4311646269 title "Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition" @default.
- W4311646269 doi "https://doi.org/10.48550/arxiv.2108.12175" @default.
- W4311646269 hasPublicationYear "2021" @default.
- W4311646269 type Work @default.
- W4311646269 citedByCount "0" @default.
- W4311646269 crossrefType "posted-content" @default.
- W4311646269 hasAuthorship W4311646269A5003975051 @default.
- W4311646269 hasAuthorship W4311646269A5021226188 @default.
- W4311646269 hasAuthorship W4311646269A5046858513 @default.
- W4311646269 hasAuthorship W4311646269A5066348116 @default.
- W4311646269 hasAuthorship W4311646269A5076409146 @default.
- W4311646269 hasAuthorship W4311646269A5076763342 @default.
- W4311646269 hasAuthorship W4311646269A5078502662 @default.
- W4311646269 hasBestOaLocation W43116462691 @default.
- W4311646269 hasConcept C111919701 @default.
- W4311646269 hasConcept C115961682 @default.
- W4311646269 hasConcept C116834253 @default.
- W4311646269 hasConcept C127413603 @default.
- W4311646269 hasConcept C133892786 @default.
- W4311646269 hasConcept C146978453 @default.
- W4311646269 hasConcept C154945302 @default.
- W4311646269 hasConcept C166961238 @default.
- W4311646269 hasConcept C177264268 @default.
- W4311646269 hasConcept C199360897 @default.
- W4311646269 hasConcept C201995342 @default.
- W4311646269 hasConcept C204201278 @default.
- W4311646269 hasConcept C2778476105 @default.
- W4311646269 hasConcept C2780451532 @default.
- W4311646269 hasConcept C28490314 @default.
- W4311646269 hasConcept C2908850654 @default.
- W4311646269 hasConcept C41008148 @default.
- W4311646269 hasConcept C59822182 @default.
- W4311646269 hasConcept C61328038 @default.
- W4311646269 hasConcept C86803240 @default.
- W4311646269 hasConcept C99498987 @default.
- W4311646269 hasConceptScore W4311646269C111919701 @default.
- W4311646269 hasConceptScore W4311646269C115961682 @default.
- W4311646269 hasConceptScore W4311646269C116834253 @default.
- W4311646269 hasConceptScore W4311646269C127413603 @default.
- W4311646269 hasConceptScore W4311646269C133892786 @default.
- W4311646269 hasConceptScore W4311646269C146978453 @default.
- W4311646269 hasConceptScore W4311646269C154945302 @default.
- W4311646269 hasConceptScore W4311646269C166961238 @default.
- W4311646269 hasConceptScore W4311646269C177264268 @default.
- W4311646269 hasConceptScore W4311646269C199360897 @default.
- W4311646269 hasConceptScore W4311646269C201995342 @default.
- W4311646269 hasConceptScore W4311646269C204201278 @default.
- W4311646269 hasConceptScore W4311646269C2778476105 @default.
- W4311646269 hasConceptScore W4311646269C2780451532 @default.
- W4311646269 hasConceptScore W4311646269C28490314 @default.
- W4311646269 hasConceptScore W4311646269C2908850654 @default.
- W4311646269 hasConceptScore W4311646269C41008148 @default.
- W4311646269 hasConceptScore W4311646269C59822182 @default.
- W4311646269 hasConceptScore W4311646269C61328038 @default.
- W4311646269 hasConceptScore W4311646269C86803240 @default.
- W4311646269 hasConceptScore W4311646269C99498987 @default.
- W4311646269 hasLocation W43116462691 @default.
- W4311646269 hasOpenAccess W4311646269 @default.
- W4311646269 hasPrimaryLocation W43116462691 @default.
- W4311646269 hasRelatedWork W2137069055 @default.
- W4311646269 hasRelatedWork W2184127972 @default.
- W4311646269 hasRelatedWork W2418631473 @default.
- W4311646269 hasRelatedWork W3087422378 @default.
- W4311646269 hasRelatedWork W3135230428 @default.
- W4311646269 hasRelatedWork W3165119447 @default.
- W4311646269 hasRelatedWork W4211118001 @default.
- W4311646269 hasRelatedWork W4286905416 @default.
- W4311646269 hasRelatedWork W642007152 @default.
- W4311646269 hasRelatedWork W2556771176 @default.
- W4311646269 isParatext "false" @default.
- W4311646269 isRetracted "false" @default.
- W4311646269 workType "article" @default.