Matches in SemOpenAlex for { <https://semopenalex.org/work/W2890952074> ?p ?o ?g. }
- W2890952074 endingPage "8727" @default.
- W2890952074 startingPage "8717" @default.
- W2890952074 abstract "The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of words or phrases, we tackle lip reading as an <italic xmlns:mml=http://www.w3.org/1998/Math/MathML xmlns:xlink=http://www.w3.org/1999/xlink>open-world</i> problem – unconstrained natural language sentences, and in the wild videos. Our key contributions are: (1) we compare two models for lip reading, one using a CTC loss, and the other using a sequence-to-sequence loss. Both models are built on top of the transformer self-attention architecture; (2) we investigate to what extent lip reading is complementary to audio speech recognition, especially when the audio signal is noisy; (3) we introduce and publicly release a new dataset for audio-visual speech recognition, LRS2-BBC, consisting of thousands of natural sentences from British television. The models that we train surpass the performance of all previous work on a lip reading benchmark dataset by a significant margin." @default.
- W2890952074 created "2018-09-27" @default.
- W2890952074 creator A5003562101 @default.
- W2890952074 creator A5018690028 @default.
- W2890952074 creator A5038723822 @default.
- W2890952074 creator A5057678172 @default.
- W2890952074 creator A5079708487 @default.
- W2890952074 date "2022-12-01" @default.
- W2890952074 modified "2023-10-18" @default.
- W2890952074 title "Deep Audio-Visual Speech Recognition" @default.
- W2890952074 cites W1503933356 @default.
- W2890952074 cites W2015143272 @default.
- W2890952074 cites W2029996593 @default.
- W2890952074 cites W2060510034 @default.
- W2890952074 cites W2076029968 @default.
- W2890952074 cites W2076462394 @default.
- W2890952074 cites W2097117768 @default.
- W2890952074 cites W2117539524 @default.
- W2890952074 cites W2127141656 @default.
- W2890952074 cites W2157331557 @default.
- W2890952074 cites W2160815625 @default.
- W2890952074 cites W2194775991 @default.
- W2890952074 cites W2243738093 @default.
- W2890952074 cites W2267805933 @default.
- W2890952074 cites W2289925289 @default.
- W2890952074 cites W2293858598 @default.
- W2890952074 cites W2342662179 @default.
- W2890952074 cites W2404704342 @default.
- W2890952074 cites W2556171197 @default.
- W2890952074 cites W2570575067 @default.
- W2890952074 cites W2952746495 @default.
- W2890952074 cites W2962824709 @default.
- W2890952074 cites W2962901777 @default.
- W2890952074 cites W2963240019 @default.
- W2890952074 cites W2963403664 @default.
- W2890952074 cites W2963528589 @default.
- W2890952074 cites W2963654155 @default.
- W2890952074 cites W2963920996 @default.
- W2890952074 cites W2964283370 @default.
- W2890952074 cites W3106250896 @default.
- W2890952074 doi "https://doi.org/10.1109/tpami.2018.2889052" @default.
- W2890952074 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/30582526" @default.
- W2890952074 hasPublicationYear "2022" @default.
- W2890952074 type Work @default.
- W2890952074 sameAs 2890952074 @default.
- W2890952074 citedByCount "216" @default.
- W2890952074 countsByYear W28909520742018 @default.
- W2890952074 countsByYear W28909520742019 @default.
- W2890952074 countsByYear W28909520742020 @default.
- W2890952074 countsByYear W28909520742021 @default.
- W2890952074 countsByYear W28909520742022 @default.
- W2890952074 countsByYear W28909520742023 @default.
- W2890952074 crossrefType "journal-article" @default.
- W2890952074 hasAuthorship W2890952074A5003562101 @default.
- W2890952074 hasAuthorship W2890952074A5018690028 @default.
- W2890952074 hasAuthorship W2890952074A5038723822 @default.
- W2890952074 hasAuthorship W2890952074A5057678172 @default.
- W2890952074 hasAuthorship W2890952074A5079708487 @default.
- W2890952074 hasBestOaLocation W28909520742 @default.
- W2890952074 hasConcept C119857082 @default.
- W2890952074 hasConcept C121332964 @default.
- W2890952074 hasConcept C13280743 @default.
- W2890952074 hasConcept C138885662 @default.
- W2890952074 hasConcept C154945302 @default.
- W2890952074 hasConcept C165801399 @default.
- W2890952074 hasConcept C185798385 @default.
- W2890952074 hasConcept C195324797 @default.
- W2890952074 hasConcept C204321447 @default.
- W2890952074 hasConcept C205649164 @default.
- W2890952074 hasConcept C28490314 @default.
- W2890952074 hasConcept C41008148 @default.
- W2890952074 hasConcept C41895202 @default.
- W2890952074 hasConcept C554936623 @default.
- W2890952074 hasConcept C62520636 @default.
- W2890952074 hasConcept C66322947 @default.
- W2890952074 hasConcept C774472 @default.
- W2890952074 hasConceptScore W2890952074C119857082 @default.
- W2890952074 hasConceptScore W2890952074C121332964 @default.
- W2890952074 hasConceptScore W2890952074C13280743 @default.
- W2890952074 hasConceptScore W2890952074C138885662 @default.
- W2890952074 hasConceptScore W2890952074C154945302 @default.
- W2890952074 hasConceptScore W2890952074C165801399 @default.
- W2890952074 hasConceptScore W2890952074C185798385 @default.
- W2890952074 hasConceptScore W2890952074C195324797 @default.
- W2890952074 hasConceptScore W2890952074C204321447 @default.
- W2890952074 hasConceptScore W2890952074C205649164 @default.
- W2890952074 hasConceptScore W2890952074C28490314 @default.
- W2890952074 hasConceptScore W2890952074C41008148 @default.
- W2890952074 hasConceptScore W2890952074C41895202 @default.
- W2890952074 hasConceptScore W2890952074C554936623 @default.
- W2890952074 hasConceptScore W2890952074C62520636 @default.
- W2890952074 hasConceptScore W2890952074C66322947 @default.
- W2890952074 hasConceptScore W2890952074C774472 @default.
- W2890952074 hasIssue "12" @default.
- W2890952074 hasLocation W28909520741 @default.
- W2890952074 hasLocation W28909520742 @default.
- W2890952074 hasLocation W28909520743 @default.
- W2890952074 hasLocation W28909520744 @default.