Matches in SemOpenAlex for { <https://semopenalex.org/work/W3200725407> ?p ?o ?g. }
Showing items 1 to 88 of
88
with 100 items per page.
- W3200725407 abstract "When a sufficiently large far-field training data is presented, jointly optimizing a multichannel frontend and an end-to-end (E2E) Automatic Speech Recognition (ASR) backend shows promising results. Recent literature has shown traditional beamformer designs, such as MVDR (Minimum Variance Distortionless Response) or fixed beamformers can be successfully integrated as the frontend into an E2E ASR system with learnable parameters. In this work, we propose the self-attention channel combinator (SACC) ASR frontend, which leverages the self-attention mechanism to combine multichannel audio signals in the magnitude spectral domain. Experiments conducted on a multichannel playback test data shows that the SACC achieved a 9.3% WERR compared to a state-of-the-art fixed beamformer-based frontend, both jointly optimized with a ContextNet-based ASR backend. We also demonstrate the connection between the SACC and the traditional beamformers, and analyze the intermediate outputs of the SACC." @default.
- W3200725407 created "2021-09-27" @default.
- W3200725407 creator A5011101212 @default.
- W3200725407 creator A5011337004 @default.
- W3200725407 creator A5011519401 @default.
- W3200725407 creator A5012493356 @default.
- W3200725407 creator A5057545580 @default.
- W3200725407 creator A5077676839 @default.
- W3200725407 date "2021-09-10" @default.
- W3200725407 modified "2023-10-16" @default.
- W3200725407 title "Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition" @default.
- W3200725407 cites W1494198834 @default.
- W3200725407 cites W2117678320 @default.
- W3200725407 cites W2327501763 @default.
- W3200725407 cites W2506203739 @default.
- W3200725407 cites W2577366047 @default.
- W3200725407 cites W2600628583 @default.
- W3200725407 cites W2627092829 @default.
- W3200725407 cites W2746095516 @default.
- W3200725407 cites W2767071179 @default.
- W3200725407 cites W2803583024 @default.
- W3200725407 cites W2889367162 @default.
- W3200725407 cites W2890553422 @default.
- W3200725407 cites W2892009249 @default.
- W3200725407 cites W2921496354 @default.
- W3200725407 cites W2936774411 @default.
- W3200725407 cites W2963403868 @default.
- W3200725407 cites W3015527782 @default.
- W3200725407 cites W3015791598 @default.
- W3200725407 cites W3086154751 @default.
- W3200725407 cites W3095173472 @default.
- W3200725407 cites W3096073522 @default.
- W3200725407 cites W3117722298 @default.
- W3200725407 cites W3160207687 @default.
- W3200725407 cites W3162315798 @default.
- W3200725407 cites W47527944 @default.
- W3200725407 doi "https://doi.org/10.48550/arxiv.2109.04783" @default.
- W3200725407 hasPublicationYear "2021" @default.
- W3200725407 type Work @default.
- W3200725407 sameAs 3200725407 @default.
- W3200725407 citedByCount "0" @default.
- W3200725407 crossrefType "posted-content" @default.
- W3200725407 hasAuthorship W3200725407A5011101212 @default.
- W3200725407 hasAuthorship W3200725407A5011337004 @default.
- W3200725407 hasAuthorship W3200725407A5011519401 @default.
- W3200725407 hasAuthorship W3200725407A5012493356 @default.
- W3200725407 hasAuthorship W3200725407A5057545580 @default.
- W3200725407 hasAuthorship W3200725407A5077676839 @default.
- W3200725407 hasBestOaLocation W32007254071 @default.
- W3200725407 hasConcept C127162648 @default.
- W3200725407 hasConcept C134306372 @default.
- W3200725407 hasConcept C154945302 @default.
- W3200725407 hasConcept C202444582 @default.
- W3200725407 hasConcept C28490314 @default.
- W3200725407 hasConcept C33923547 @default.
- W3200725407 hasConcept C36503486 @default.
- W3200725407 hasConcept C41008148 @default.
- W3200725407 hasConcept C74296488 @default.
- W3200725407 hasConcept C76155785 @default.
- W3200725407 hasConcept C9652623 @default.
- W3200725407 hasConceptScore W3200725407C127162648 @default.
- W3200725407 hasConceptScore W3200725407C134306372 @default.
- W3200725407 hasConceptScore W3200725407C154945302 @default.
- W3200725407 hasConceptScore W3200725407C202444582 @default.
- W3200725407 hasConceptScore W3200725407C28490314 @default.
- W3200725407 hasConceptScore W3200725407C33923547 @default.
- W3200725407 hasConceptScore W3200725407C36503486 @default.
- W3200725407 hasConceptScore W3200725407C41008148 @default.
- W3200725407 hasConceptScore W3200725407C74296488 @default.
- W3200725407 hasConceptScore W3200725407C76155785 @default.
- W3200725407 hasConceptScore W3200725407C9652623 @default.
- W3200725407 hasLocation W32007254071 @default.
- W3200725407 hasOpenAccess W3200725407 @default.
- W3200725407 hasPrimaryLocation W32007254071 @default.
- W3200725407 hasRelatedWork W2312116756 @default.
- W3200725407 hasRelatedWork W2359317704 @default.
- W3200725407 hasRelatedWork W2368779261 @default.
- W3200725407 hasRelatedWork W2623347760 @default.
- W3200725407 hasRelatedWork W2778699561 @default.
- W3200725407 hasRelatedWork W2793122029 @default.
- W3200725407 hasRelatedWork W2794438528 @default.
- W3200725407 hasRelatedWork W2893763841 @default.
- W3200725407 hasRelatedWork W3201373518 @default.
- W3200725407 hasRelatedWork W4321764135 @default.
- W3200725407 isParatext "false" @default.
- W3200725407 isRetracted "false" @default.
- W3200725407 magId "3200725407" @default.
- W3200725407 workType "article" @default.