Matches in SemOpenAlex for { <https://semopenalex.org/work/W4385474305> ?p ?o ?g. }
Showing items 1 to 67 of
67
with 100 items per page.
- W4385474305 abstract "Video-to-speech synthesis involves reconstructing the speech signal of a speaker from a silent video. The implicit assumption of this task is that the sound signal is either missing or contains a high amount of noise/corruption such that it is not useful for processing. Previous works in the literature either use video inputs only or employ both video and audio inputs during training, and discard the input audio pathway during inference. In this work we investigate the effect of using video and audio inputs for video-to-speech synthesis during both training and inference. In particular, we use pre-trained video-to-speech models to synthesize the missing speech signals and then train an audio-visual-to-speech synthesis model, using both the silent video and the synthesized speech as inputs, to predict the final reconstructed speech. Our experiments demonstrate that this approach is successful with both raw waveforms and mel spectrograms as target outputs." @default.
- W4385474305 created "2023-08-02" @default.
- W4385474305 creator A5016033078 @default.
- W4385474305 creator A5050734738 @default.
- W4385474305 creator A5060152748 @default.
- W4385474305 date "2023-07-31" @default.
- W4385474305 modified "2023-09-23" @default.
- W4385474305 title "Audio-visual video-to-speech synthesis with synthesized input audio" @default.
- W4385474305 doi "https://doi.org/10.48550/arxiv.2307.16584" @default.
- W4385474305 hasPublicationYear "2023" @default.
- W4385474305 type Work @default.
- W4385474305 citedByCount "0" @default.
- W4385474305 crossrefType "posted-content" @default.
- W4385474305 hasAuthorship W4385474305A5016033078 @default.
- W4385474305 hasAuthorship W4385474305A5050734738 @default.
- W4385474305 hasAuthorship W4385474305A5060152748 @default.
- W4385474305 hasBestOaLocation W43854743051 @default.
- W4385474305 hasConcept C13895895 @default.
- W4385474305 hasConcept C14999030 @default.
- W4385474305 hasConcept C154945302 @default.
- W4385474305 hasConcept C155635449 @default.
- W4385474305 hasConcept C157968479 @default.
- W4385474305 hasConcept C162324750 @default.
- W4385474305 hasConcept C187736073 @default.
- W4385474305 hasConcept C199360897 @default.
- W4385474305 hasConcept C204201278 @default.
- W4385474305 hasConcept C2776214188 @default.
- W4385474305 hasConcept C2779843651 @default.
- W4385474305 hasConcept C2780451532 @default.
- W4385474305 hasConcept C28490314 @default.
- W4385474305 hasConcept C41008148 @default.
- W4385474305 hasConcept C45273575 @default.
- W4385474305 hasConcept C61328038 @default.
- W4385474305 hasConcept C64922751 @default.
- W4385474305 hasConceptScore W4385474305C13895895 @default.
- W4385474305 hasConceptScore W4385474305C14999030 @default.
- W4385474305 hasConceptScore W4385474305C154945302 @default.
- W4385474305 hasConceptScore W4385474305C155635449 @default.
- W4385474305 hasConceptScore W4385474305C157968479 @default.
- W4385474305 hasConceptScore W4385474305C162324750 @default.
- W4385474305 hasConceptScore W4385474305C187736073 @default.
- W4385474305 hasConceptScore W4385474305C199360897 @default.
- W4385474305 hasConceptScore W4385474305C204201278 @default.
- W4385474305 hasConceptScore W4385474305C2776214188 @default.
- W4385474305 hasConceptScore W4385474305C2779843651 @default.
- W4385474305 hasConceptScore W4385474305C2780451532 @default.
- W4385474305 hasConceptScore W4385474305C28490314 @default.
- W4385474305 hasConceptScore W4385474305C41008148 @default.
- W4385474305 hasConceptScore W4385474305C45273575 @default.
- W4385474305 hasConceptScore W4385474305C61328038 @default.
- W4385474305 hasConceptScore W4385474305C64922751 @default.
- W4385474305 hasLocation W43854743051 @default.
- W4385474305 hasOpenAccess W4385474305 @default.
- W4385474305 hasPrimaryLocation W43854743051 @default.
- W4385474305 hasRelatedWork W1520422233 @default.
- W4385474305 hasRelatedWork W1911859126 @default.
- W4385474305 hasRelatedWork W2131711534 @default.
- W4385474305 hasRelatedWork W2184127972 @default.
- W4385474305 hasRelatedWork W2536333783 @default.
- W4385474305 hasRelatedWork W2620660273 @default.
- W4385474305 hasRelatedWork W3015280134 @default.
- W4385474305 hasRelatedWork W4283069119 @default.
- W4385474305 hasRelatedWork W642007152 @default.
- W4385474305 hasRelatedWork W2341426843 @default.
- W4385474305 isParatext "false" @default.
- W4385474305 isRetracted "false" @default.
- W4385474305 workType "article" @default.