Matches in SemOpenAlex for { <https://semopenalex.org/work/W2938222313> ?p ?o ?g. }
- W2938222313 abstract "Humans are able to imagine a person's voice from the person's appearance and imagine the person's appearance from his/her voice. In this paper, we make the first attempt to develop a method that can convert speech into a voice that matches an input face image and generate a face image that matches the voice of the input speech by leveraging the correlation between faces and voices. We propose a model, consisting of a speech converter, a face encoder/decoder and a voice encoder. We use the latent code of an input face image encoded by the face encoder as the auxiliary input into the speech converter and train the speech converter so that the original latent code can be recovered from the generated speech by the voice encoder. We also train the face decoder along with the face encoder to ensure that the latent code will contain sufficient information to reconstruct the input face image. We confirmed experimentally that a speech converter trained in this way was able to convert input speech into a voice that matched an input face image and that the voice encoder and face decoder can be used to generate a face image that matches the voice of the input speech." @default.
- W2938222313 created "2019-04-25" @default.
- W2938222313 creator A5001243214 @default.
- W2938222313 creator A5002179552 @default.
- W2938222313 creator A5020693766 @default.
- W2938222313 creator A5062509967 @default.
- W2938222313 creator A5064979071 @default.
- W2938222313 date "2019-04-09" @default.
- W2938222313 modified "2023-10-08" @default.
- W2938222313 title "Crossmodal Voice Conversion." @default.
- W2938222313 cites W115285041 @default.
- W2938222313 cites W1834627138 @default.
- W2938222313 cites W1959608418 @default.
- W2938222313 cites W2099471712 @default.
- W2938222313 cites W2108501770 @default.
- W2938222313 cites W2120605154 @default.
- W2938222313 cites W2156142001 @default.
- W2938222313 cites W2266401008 @default.
- W2938222313 cites W2471520273 @default.
- W2938222313 cites W2518312472 @default.
- W2938222313 cites W2532494225 @default.
- W2938222313 cites W2598581049 @default.
- W2938222313 cites W2608338293 @default.
- W2938222313 cites W2611160234 @default.
- W2938222313 cites W2651834199 @default.
- W2938222313 cites W2774848319 @default.
- W2938222313 cites W2800289214 @default.
- W2938222313 cites W2804998325 @default.
- W2938222313 cites W2887264325 @default.
- W2938222313 cites W2962793481 @default.
- W2938222313 cites W2963035245 @default.
- W2938222313 cites W2963444790 @default.
- W2938222313 cites W2963539064 @default.
- W2938222313 cites W2963567641 @default.
- W2938222313 cites W2963663420 @default.
- W2938222313 cites W2963767194 @default.
- W2938222313 cites W2963807156 @default.
- W2938222313 cites W2963887950 @default.
- W2938222313 cites W2963970792 @default.
- W2938222313 hasPublicationYear "2019" @default.
- W2938222313 type Work @default.
- W2938222313 sameAs 2938222313 @default.
- W2938222313 citedByCount "2" @default.
- W2938222313 countsByYear W29382223132021 @default.
- W2938222313 crossrefType "posted-content" @default.
- W2938222313 hasAuthorship W2938222313A5001243214 @default.
- W2938222313 hasAuthorship W2938222313A5002179552 @default.
- W2938222313 hasAuthorship W2938222313A5020693766 @default.
- W2938222313 hasAuthorship W2938222313A5062509967 @default.
- W2938222313 hasAuthorship W2938222313A5064979071 @default.
- W2938222313 hasConcept C111919701 @default.
- W2938222313 hasConcept C115961682 @default.
- W2938222313 hasConcept C118505674 @default.
- W2938222313 hasConcept C138885662 @default.
- W2938222313 hasConcept C14999030 @default.
- W2938222313 hasConcept C154945302 @default.
- W2938222313 hasConcept C177264268 @default.
- W2938222313 hasConcept C199360897 @default.
- W2938222313 hasConcept C204201278 @default.
- W2938222313 hasConcept C2776760102 @default.
- W2938222313 hasConcept C2779304628 @default.
- W2938222313 hasConcept C28490314 @default.
- W2938222313 hasConcept C31972630 @default.
- W2938222313 hasConcept C41008148 @default.
- W2938222313 hasConcept C41895202 @default.
- W2938222313 hasConcept C61328038 @default.
- W2938222313 hasConceptScore W2938222313C111919701 @default.
- W2938222313 hasConceptScore W2938222313C115961682 @default.
- W2938222313 hasConceptScore W2938222313C118505674 @default.
- W2938222313 hasConceptScore W2938222313C138885662 @default.
- W2938222313 hasConceptScore W2938222313C14999030 @default.
- W2938222313 hasConceptScore W2938222313C154945302 @default.
- W2938222313 hasConceptScore W2938222313C177264268 @default.
- W2938222313 hasConceptScore W2938222313C199360897 @default.
- W2938222313 hasConceptScore W2938222313C204201278 @default.
- W2938222313 hasConceptScore W2938222313C2776760102 @default.
- W2938222313 hasConceptScore W2938222313C2779304628 @default.
- W2938222313 hasConceptScore W2938222313C28490314 @default.
- W2938222313 hasConceptScore W2938222313C31972630 @default.
- W2938222313 hasConceptScore W2938222313C41008148 @default.
- W2938222313 hasConceptScore W2938222313C41895202 @default.
- W2938222313 hasConceptScore W2938222313C61328038 @default.
- W2938222313 hasLocation W29382223131 @default.
- W2938222313 hasOpenAccess W2938222313 @default.
- W2938222313 hasPrimaryLocation W29382223131 @default.
- W2938222313 hasRelatedWork W2414722319 @default.
- W2938222313 hasRelatedWork W2826629083 @default.
- W2938222313 hasRelatedWork W2830614486 @default.
- W2938222313 hasRelatedWork W2835438077 @default.
- W2938222313 hasRelatedWork W2840802228 @default.
- W2938222313 hasRelatedWork W2844408502 @default.
- W2938222313 hasRelatedWork W2848085012 @default.
- W2938222313 hasRelatedWork W2854790965 @default.
- W2938222313 hasRelatedWork W2866243361 @default.
- W2938222313 hasRelatedWork W2879922668 @default.
- W2938222313 hasRelatedWork W3090483352 @default.
- W2938222313 hasRelatedWork W3090772179 @default.
- W2938222313 hasRelatedWork W3097573318 @default.
- W2938222313 hasRelatedWork W3110552966 @default.
- W2938222313 hasRelatedWork W3131710578 @default.