Matches in SemOpenAlex for { <https://semopenalex.org/work/W4376457029> ?p ?o ?g. }
Showing items 1 to 72 of
72
with 100 items per page.
- W4376457029 endingPage "100" @default.
- W4376457029 startingPage "89" @default.
- W4376457029 abstract "Conventional ASR systems use frame-level phoneme posterior to conduct force-alignment (FA) and provide timestamps, while end-to-end ASR systems especially AED based ones are short of such ability. This paper proposes to perform timestamp prediction (TP) while recognizing by utilizing continuous integrate-and-fire (CIF) mechanism in non-autoregressive ASR model - Paraformer. Foucing on the fire place bias issue of CIF, we conduct post-processing strategies including fire-delay and silence insertion. Besides, we propose to use scaled-CIF to smooth the weights of CIF output, which is proved beneficial for both ASR and TP task. Accumulated averaging shift (AAS) and diarization error rate (DER) are adopted to measure the quality of timestamps and we compare these metrics of proposed system and conventional hybrid force-alignment system. The experiment results over manually-marked timestamps testset show that the proposed optimization methods significantly improve the accuracy of CIF timestamps, reducing 66.7% and 82.1% of AAS and DER respectively. Comparing to Kaldi force-alignment trained with the same data, optimized CIF timestamps achieved 12.3% relative AAS reduction." @default.
- W4376457029 created "2023-05-14" @default.
- W4376457029 creator A5008102398 @default.
- W4376457029 creator A5055433405 @default.
- W4376457029 creator A5061850214 @default.
- W4376457029 creator A5062644421 @default.
- W4376457029 date "2023-01-01" @default.
- W4376457029 modified "2023-10-12" @default.
- W4376457029 title "Achieving Timestamp Prediction While Recognizing with Non-autoregressive End-to-End ASR Model" @default.
- W4376457029 cites W121610373 @default.
- W4376457029 cites W1963727751 @default.
- W4376457029 cites W2076596602 @default.
- W4376457029 cites W2127141656 @default.
- W4376457029 cites W2144499799 @default.
- W4376457029 cites W2327501763 @default.
- W4376457029 cites W2747874407 @default.
- W4376457029 cites W2962780374 @default.
- W4376457029 cites W3016167541 @default.
- W4376457029 cites W3023953056 @default.
- W4376457029 cites W3043783436 @default.
- W4376457029 cites W4224935349 @default.
- W4376457029 cites W4225644313 @default.
- W4376457029 cites W4283067311 @default.
- W4376457029 cites W4283828241 @default.
- W4376457029 cites W4285115600 @default.
- W4376457029 doi "https://doi.org/10.1007/978-981-99-2401-1_8" @default.
- W4376457029 hasPublicationYear "2023" @default.
- W4376457029 type Work @default.
- W4376457029 citedByCount "0" @default.
- W4376457029 crossrefType "book-chapter" @default.
- W4376457029 hasAuthorship W4376457029A5008102398 @default.
- W4376457029 hasAuthorship W4376457029A5055433405 @default.
- W4376457029 hasAuthorship W4376457029A5061850214 @default.
- W4376457029 hasAuthorship W4376457029A5062644421 @default.
- W4376457029 hasBestOaLocation W43764570292 @default.
- W4376457029 hasConcept C105795698 @default.
- W4376457029 hasConcept C113954288 @default.
- W4376457029 hasConcept C124101348 @default.
- W4376457029 hasConcept C154945302 @default.
- W4376457029 hasConcept C159877910 @default.
- W4376457029 hasConcept C28490314 @default.
- W4376457029 hasConcept C33923547 @default.
- W4376457029 hasConcept C41008148 @default.
- W4376457029 hasConcept C79403827 @default.
- W4376457029 hasConceptScore W4376457029C105795698 @default.
- W4376457029 hasConceptScore W4376457029C113954288 @default.
- W4376457029 hasConceptScore W4376457029C124101348 @default.
- W4376457029 hasConceptScore W4376457029C154945302 @default.
- W4376457029 hasConceptScore W4376457029C159877910 @default.
- W4376457029 hasConceptScore W4376457029C28490314 @default.
- W4376457029 hasConceptScore W4376457029C33923547 @default.
- W4376457029 hasConceptScore W4376457029C41008148 @default.
- W4376457029 hasConceptScore W4376457029C79403827 @default.
- W4376457029 hasLocation W43764570291 @default.
- W4376457029 hasLocation W43764570292 @default.
- W4376457029 hasOpenAccess W4376457029 @default.
- W4376457029 hasPrimaryLocation W43764570291 @default.
- W4376457029 hasRelatedWork W105617988 @default.
- W4376457029 hasRelatedWork W1547043107 @default.
- W4376457029 hasRelatedWork W1606975172 @default.
- W4376457029 hasRelatedWork W2028868645 @default.
- W4376457029 hasRelatedWork W2368138740 @default.
- W4376457029 hasRelatedWork W2377216019 @default.
- W4376457029 hasRelatedWork W3111068657 @default.
- W4376457029 hasRelatedWork W4244121124 @default.
- W4376457029 hasRelatedWork W4318751845 @default.
- W4376457029 hasRelatedWork W4376457029 @default.
- W4376457029 isParatext "false" @default.
- W4376457029 isRetracted "false" @default.
- W4376457029 workType "book-chapter" @default.