Matches in SemOpenAlex for { <https://semopenalex.org/work/W3152136404> ?p ?o ?g. }
Showing items 1 to 98 of
98
with 100 items per page.
- W3152136404 abstract "This paper presents a novel design of neural network system for fine-grained style modeling, transfer and prediction in expressive text-to-speech (TTS) synthesis. Fine-grained modeling is realized by extracting style embeddings from the mel-spectrograms of phone-level speech segments. Collaborative learning and adversarial learning strategies are applied in order to achieve effective disentanglement of content and style factors in speech and alleviate the content leakage problem in style modeling. The proposed system can be used for varying-content speech style transfer in the single-speaker scenario. The results of objective and subjective evaluation show that our system performs better than other fine-grained speech style transfer models, especially in the aspect of content preservation. By incorporating a style predictor, the proposed system can also be used for text-to-speech synthesis. Audio samples are provided for system demonstration https://daxintan-cuhk.github.io/pl-csd-speech ." @default.
- W3152136404 created "2021-04-13" @default.
- W3152136404 creator A5001795601 @default.
- W3152136404 creator A5006527442 @default.
- W3152136404 date "2021-08-30" @default.
- W3152136404 modified "2023-10-02" @default.
- W3152136404 title "Fine-Grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement" @default.
- W3152136404 cites W2099471712 @default.
- W3152136404 cites W2187089797 @default.
- W3152136404 cites W2608207374 @default.
- W3152136404 cites W2747874407 @default.
- W3152136404 cites W2794490148 @default.
- W3152136404 cites W2795109282 @default.
- W3152136404 cites W2795935804 @default.
- W3152136404 cites W2890964092 @default.
- W3152136404 cites W2897548994 @default.
- W3152136404 cites W2903739847 @default.
- W3152136404 cites W2904459034 @default.
- W3152136404 cites W2937343983 @default.
- W3152136404 cites W2945544731 @default.
- W3152136404 cites W2949281321 @default.
- W3152136404 cites W2952269766 @default.
- W3152136404 cites W2963300588 @default.
- W3152136404 cites W2964138190 @default.
- W3152136404 cites W2964243274 @default.
- W3152136404 cites W2965685620 @default.
- W3152136404 cites W2970730223 @default.
- W3152136404 cites W2971753973 @default.
- W3152136404 cites W2973158936 @default.
- W3152136404 cites W2997399314 @default.
- W3152136404 cites W3015212100 @default.
- W3152136404 cites W3021469861 @default.
- W3152136404 cites W3022876224 @default.
- W3152136404 cites W3025165719 @default.
- W3152136404 cites W3047107405 @default.
- W3152136404 cites W3095505419 @default.
- W3152136404 cites W3130016944 @default.
- W3152136404 cites W3135644023 @default.
- W3152136404 cites W2906797124 @default.
- W3152136404 doi "https://doi.org/10.21437/interspeech.2021-1129" @default.
- W3152136404 hasPublicationYear "2021" @default.
- W3152136404 type Work @default.
- W3152136404 sameAs 3152136404 @default.
- W3152136404 citedByCount "11" @default.
- W3152136404 countsByYear W31521364042021 @default.
- W3152136404 countsByYear W31521364042022 @default.
- W3152136404 countsByYear W31521364042023 @default.
- W3152136404 crossrefType "proceedings-article" @default.
- W3152136404 hasAuthorship W3152136404A5001795601 @default.
- W3152136404 hasAuthorship W3152136404A5006527442 @default.
- W3152136404 hasBestOaLocation W31521364042 @default.
- W3152136404 hasConcept C138885662 @default.
- W3152136404 hasConcept C14999030 @default.
- W3152136404 hasConcept C150899416 @default.
- W3152136404 hasConcept C154945302 @default.
- W3152136404 hasConcept C166957645 @default.
- W3152136404 hasConcept C204321447 @default.
- W3152136404 hasConcept C2776445246 @default.
- W3152136404 hasConcept C2778707766 @default.
- W3152136404 hasConcept C28490314 @default.
- W3152136404 hasConcept C41008148 @default.
- W3152136404 hasConcept C41895202 @default.
- W3152136404 hasConcept C45273575 @default.
- W3152136404 hasConcept C50644808 @default.
- W3152136404 hasConcept C95457728 @default.
- W3152136404 hasConceptScore W3152136404C138885662 @default.
- W3152136404 hasConceptScore W3152136404C14999030 @default.
- W3152136404 hasConceptScore W3152136404C150899416 @default.
- W3152136404 hasConceptScore W3152136404C154945302 @default.
- W3152136404 hasConceptScore W3152136404C166957645 @default.
- W3152136404 hasConceptScore W3152136404C204321447 @default.
- W3152136404 hasConceptScore W3152136404C2776445246 @default.
- W3152136404 hasConceptScore W3152136404C2778707766 @default.
- W3152136404 hasConceptScore W3152136404C28490314 @default.
- W3152136404 hasConceptScore W3152136404C41008148 @default.
- W3152136404 hasConceptScore W3152136404C41895202 @default.
- W3152136404 hasConceptScore W3152136404C45273575 @default.
- W3152136404 hasConceptScore W3152136404C50644808 @default.
- W3152136404 hasConceptScore W3152136404C95457728 @default.
- W3152136404 hasLocation W31521364041 @default.
- W3152136404 hasLocation W31521364042 @default.
- W3152136404 hasLocation W31521364043 @default.
- W3152136404 hasOpenAccess W3152136404 @default.
- W3152136404 hasPrimaryLocation W31521364041 @default.
- W3152136404 hasRelatedWork W2331826121 @default.
- W3152136404 hasRelatedWork W2946200149 @default.
- W3152136404 hasRelatedWork W2970730223 @default.
- W3152136404 hasRelatedWork W2973108288 @default.
- W3152136404 hasRelatedWork W30434212 @default.
- W3152136404 hasRelatedWork W3127598333 @default.
- W3152136404 hasRelatedWork W3203164699 @default.
- W3152136404 hasRelatedWork W4224930731 @default.
- W3152136404 hasRelatedWork W4290074601 @default.
- W3152136404 hasRelatedWork W4310471687 @default.
- W3152136404 isParatext "false" @default.
- W3152136404 isRetracted "false" @default.
- W3152136404 magId "3152136404" @default.
- W3152136404 workType "article" @default.