Matches in SemOpenAlex for { <https://semopenalex.org/work/W4319862408> ?p ?o ?g. }
Showing items 1 to 87 of
87
with 100 items per page.
- W4319862408 abstract "In this work, we compare from-scratch sequence-level cross-entropy (full-sum) training of Hidden Markov Model (HMM) and Connectionist Temporal Classification (CTC) topologies for automatic speech recognition (ASR). Besides accuracy, we further analyze their capability for generating high-quality time alignment between the speech signal and the transcription, which can be crucial for many subsequent applications. Moreover, we propose several methods to improve convergence of from-scratch full-sum training by addressing the alignment modeling issue. Systematic comparison is conducted on both Switchboard and LibriSpeech corpora across CTC, posterior HMM with and w/o transition probabilities, and standard hybrid HMM. We also provide a detailed analysis of both Viterbi forced-alignment and Baum-Welch full-sum occupation probabilities." @default.
- W4319862408 created "2023-02-11" @default.
- W4319862408 creator A5025049641 @default.
- W4319862408 creator A5028362932 @default.
- W4319862408 creator A5050968038 @default.
- W4319862408 creator A5087367411 @default.
- W4319862408 creator A5088968292 @default.
- W4319862408 date "2023-01-09" @default.
- W4319862408 modified "2023-09-23" @default.
- W4319862408 title "HMM vs. CTC for Automatic Speech Recognition: Comparison Based on Full-Sum Training from Scratch" @default.
- W4319862408 cites W1494198834 @default.
- W4319862408 cites W1977434607 @default.
- W4319862408 cites W2009150118 @default.
- W4319862408 cites W2016185147 @default.
- W4319862408 cites W2024200390 @default.
- W4319862408 cites W2127141656 @default.
- W4319862408 cites W2166637769 @default.
- W4319862408 cites W2291282366 @default.
- W4319862408 cites W2514741789 @default.
- W4319862408 cites W2597757402 @default.
- W4319862408 cites W2748816379 @default.
- W4319862408 cites W2799923439 @default.
- W4319862408 cites W2808939837 @default.
- W4319862408 cites W2889282842 @default.
- W4319862408 cites W2963587345 @default.
- W4319862408 cites W3103005696 @default.
- W4319862408 cites W3160551958 @default.
- W4319862408 cites W4225302305 @default.
- W4319862408 cites W4225741214 @default.
- W4319862408 doi "https://doi.org/10.1109/slt54892.2023.10022967" @default.
- W4319862408 hasPublicationYear "2023" @default.
- W4319862408 type Work @default.
- W4319862408 citedByCount "0" @default.
- W4319862408 crossrefType "proceedings-article" @default.
- W4319862408 hasAuthorship W4319862408A5025049641 @default.
- W4319862408 hasAuthorship W4319862408A5028362932 @default.
- W4319862408 hasAuthorship W4319862408A5050968038 @default.
- W4319862408 hasAuthorship W4319862408A5087367411 @default.
- W4319862408 hasAuthorship W4319862408A5088968292 @default.
- W4319862408 hasConcept C111919701 @default.
- W4319862408 hasConcept C119857082 @default.
- W4319862408 hasConcept C153180895 @default.
- W4319862408 hasConcept C154945302 @default.
- W4319862408 hasConcept C163836022 @default.
- W4319862408 hasConcept C196956702 @default.
- W4319862408 hasConcept C23224414 @default.
- W4319862408 hasConcept C2781235140 @default.
- W4319862408 hasConcept C28490314 @default.
- W4319862408 hasConcept C41008148 @default.
- W4319862408 hasConcept C50644808 @default.
- W4319862408 hasConcept C54907487 @default.
- W4319862408 hasConcept C60582962 @default.
- W4319862408 hasConcept C8521452 @default.
- W4319862408 hasConcept C98763669 @default.
- W4319862408 hasConceptScore W4319862408C111919701 @default.
- W4319862408 hasConceptScore W4319862408C119857082 @default.
- W4319862408 hasConceptScore W4319862408C153180895 @default.
- W4319862408 hasConceptScore W4319862408C154945302 @default.
- W4319862408 hasConceptScore W4319862408C163836022 @default.
- W4319862408 hasConceptScore W4319862408C196956702 @default.
- W4319862408 hasConceptScore W4319862408C23224414 @default.
- W4319862408 hasConceptScore W4319862408C2781235140 @default.
- W4319862408 hasConceptScore W4319862408C28490314 @default.
- W4319862408 hasConceptScore W4319862408C41008148 @default.
- W4319862408 hasConceptScore W4319862408C50644808 @default.
- W4319862408 hasConceptScore W4319862408C54907487 @default.
- W4319862408 hasConceptScore W4319862408C60582962 @default.
- W4319862408 hasConceptScore W4319862408C8521452 @default.
- W4319862408 hasConceptScore W4319862408C98763669 @default.
- W4319862408 hasFunder F4320320879 @default.
- W4319862408 hasFunder F4320321114 @default.
- W4319862408 hasLocation W43198624081 @default.
- W4319862408 hasOpenAccess W4319862408 @default.
- W4319862408 hasPrimaryLocation W43198624081 @default.
- W4319862408 hasRelatedWork W1893636011 @default.
- W4319862408 hasRelatedWork W2140001811 @default.
- W4319862408 hasRelatedWork W2140539590 @default.
- W4319862408 hasRelatedWork W2161328464 @default.
- W4319862408 hasRelatedWork W2164513574 @default.
- W4319862408 hasRelatedWork W2340122276 @default.
- W4319862408 hasRelatedWork W2379938888 @default.
- W4319862408 hasRelatedWork W2382132287 @default.
- W4319862408 hasRelatedWork W4306891271 @default.
- W4319862408 hasRelatedWork W4319862408 @default.
- W4319862408 isParatext "false" @default.
- W4319862408 isRetracted "false" @default.
- W4319862408 workType "article" @default.