Matches in SemOpenAlex for { <https://semopenalex.org/work/W3025245884> ?p ?o ?g. }
- W3025245884 abstract "To speed up the inference of neural speech synthesis, non-autoregressive models receive increasing attention recently. In non-autoregressive models, additional durations of text tokens are required to make a hard alignment between the encoder and the decoder. The duration-based alignment plays a crucial role since it controls the correspondence between text tokens and spectrum frames and determines the rhythm and speed of synthesized audio. To get better duration-based alignment and improve the quality of non-autoregressive speech synthesis, in this paper, we propose a novel neural alignment model named MoboAligner. Given the pairs of the text and mel spectrum, MoboAligner tries to identify the boundaries of text tokens in the given mel spectrum frames based on the token-frame similarity in the neural semantic space with an end-to-end framework. With these boundaries, durations can be extracted and used in the training of non-autoregressive TTS models. Compared with the duration extracted by TransformerTTS, MoboAligner brings improvement for the non-autoregressive TTS model on MOS (3.74 comparing to FastSpeech's 3.44). Besides, MoboAligner is task-specified and lightweight, which reduces the parameter number by 45% and the training time consuming by 30%." @default.
- W3025245884 created "2020-05-21" @default.
- W3025245884 creator A5019056174 @default.
- W3025245884 creator A5027423694 @default.
- W3025245884 creator A5028066219 @default.
- W3025245884 creator A5034770439 @default.
- W3025245884 creator A5072111039 @default.
- W3025245884 creator A5088930074 @default.
- W3025245884 date "2020-05-18" @default.
- W3025245884 modified "2023-09-26" @default.
- W3025245884 title "MoBoAligner: a Neural Alignment Model for Non-autoregressive TTS with Monotonic Boundary Search" @default.
- W3025245884 cites W2102003408 @default.
- W3025245884 cites W2111284386 @default.
- W3025245884 cites W2129142580 @default.
- W3025245884 cites W2154920538 @default.
- W3025245884 cites W2547875792 @default.
- W3025245884 cites W2605141709 @default.
- W3025245884 cites W2769810959 @default.
- W3025245884 cites W2777302760 @default.
- W3025245884 cites W2788851830 @default.
- W3025245884 cites W2903739847 @default.
- W3025245884 cites W2949382160 @default.
- W3025245884 cites W2952165242 @default.
- W3025245884 cites W2963300588 @default.
- W3025245884 cites W2963403868 @default.
- W3025245884 cites W2970730223 @default.
- W3025245884 cites W3015338123 @default.
- W3025245884 cites W3016160783 @default.
- W3025245884 doi "https://doi.org/10.48550/arxiv.2005.08528" @default.
- W3025245884 hasPublicationYear "2020" @default.
- W3025245884 type Work @default.
- W3025245884 sameAs 3025245884 @default.
- W3025245884 citedByCount "2" @default.
- W3025245884 countsByYear W30252458842020 @default.
- W3025245884 countsByYear W30252458842021 @default.
- W3025245884 crossrefType "posted-content" @default.
- W3025245884 hasAuthorship W3025245884A5019056174 @default.
- W3025245884 hasAuthorship W3025245884A5027423694 @default.
- W3025245884 hasAuthorship W3025245884A5028066219 @default.
- W3025245884 hasAuthorship W3025245884A5034770439 @default.
- W3025245884 hasAuthorship W3025245884A5072111039 @default.
- W3025245884 hasAuthorship W3025245884A5088930074 @default.
- W3025245884 hasBestOaLocation W30252458841 @default.
- W3025245884 hasConcept C103278499 @default.
- W3025245884 hasConcept C105795698 @default.
- W3025245884 hasConcept C112758219 @default.
- W3025245884 hasConcept C115961682 @default.
- W3025245884 hasConcept C119857082 @default.
- W3025245884 hasConcept C124952713 @default.
- W3025245884 hasConcept C126042441 @default.
- W3025245884 hasConcept C134306372 @default.
- W3025245884 hasConcept C142362112 @default.
- W3025245884 hasConcept C151406439 @default.
- W3025245884 hasConcept C153180895 @default.
- W3025245884 hasConcept C154945302 @default.
- W3025245884 hasConcept C159877910 @default.
- W3025245884 hasConcept C194657046 @default.
- W3025245884 hasConcept C24338571 @default.
- W3025245884 hasConcept C2776214188 @default.
- W3025245884 hasConcept C28490314 @default.
- W3025245884 hasConcept C33923547 @default.
- W3025245884 hasConcept C38652104 @default.
- W3025245884 hasConcept C41008148 @default.
- W3025245884 hasConcept C48145219 @default.
- W3025245884 hasConcept C72169020 @default.
- W3025245884 hasConcept C76155785 @default.
- W3025245884 hasConceptScore W3025245884C103278499 @default.
- W3025245884 hasConceptScore W3025245884C105795698 @default.
- W3025245884 hasConceptScore W3025245884C112758219 @default.
- W3025245884 hasConceptScore W3025245884C115961682 @default.
- W3025245884 hasConceptScore W3025245884C119857082 @default.
- W3025245884 hasConceptScore W3025245884C124952713 @default.
- W3025245884 hasConceptScore W3025245884C126042441 @default.
- W3025245884 hasConceptScore W3025245884C134306372 @default.
- W3025245884 hasConceptScore W3025245884C142362112 @default.
- W3025245884 hasConceptScore W3025245884C151406439 @default.
- W3025245884 hasConceptScore W3025245884C153180895 @default.
- W3025245884 hasConceptScore W3025245884C154945302 @default.
- W3025245884 hasConceptScore W3025245884C159877910 @default.
- W3025245884 hasConceptScore W3025245884C194657046 @default.
- W3025245884 hasConceptScore W3025245884C24338571 @default.
- W3025245884 hasConceptScore W3025245884C2776214188 @default.
- W3025245884 hasConceptScore W3025245884C28490314 @default.
- W3025245884 hasConceptScore W3025245884C33923547 @default.
- W3025245884 hasConceptScore W3025245884C38652104 @default.
- W3025245884 hasConceptScore W3025245884C41008148 @default.
- W3025245884 hasConceptScore W3025245884C48145219 @default.
- W3025245884 hasConceptScore W3025245884C72169020 @default.
- W3025245884 hasConceptScore W3025245884C76155785 @default.
- W3025245884 hasLocation W30252458841 @default.
- W3025245884 hasOpenAccess W3025245884 @default.
- W3025245884 hasPrimaryLocation W30252458841 @default.
- W3025245884 hasRelatedWork W2015538044 @default.
- W3025245884 hasRelatedWork W2791758686 @default.
- W3025245884 hasRelatedWork W3004979489 @default.
- W3025245884 hasRelatedWork W3022372506 @default.
- W3025245884 hasRelatedWork W3184187848 @default.
- W3025245884 hasRelatedWork W3197304116 @default.
- W3025245884 hasRelatedWork W3208309985 @default.
- W3025245884 hasRelatedWork W3209239055 @default.