Matches in SemOpenAlex for { <https://semopenalex.org/work/W2096391593> ?p ?o ?g. }
- W2096391593 endingPage "1326" @default.
- W2096391593 startingPage "1306" @default.
- W2096391593 abstract "Visual speech information from the speaker's mouth region has been successfully shown to improve noise robustness of automatic speech recognizers, thus promising to extend their usability in the human computer interface. In this paper, we review the main components of audiovisual automatic speech recognition (ASR) and present novel contributions in two main areas: first, the visual front-end design, based on a cascade of linear image transforms of an appropriate video region of interest, and subsequently, audiovisual speech integration. On the latter topic, we discuss new work on feature and decision fusion combination, the modeling of audiovisual speech asynchrony, and incorporating modality reliability estimates to the bimodal recognition process. We also briefly touch upon the issue of audiovisual adaptation. We apply our algorithms to three multisubject bimodal databases, ranging from small- to large-vocabulary recognition tasks, recorded in both visually controlled and challenging environments. Our experiments demonstrate that the visual modality improves ASR over all conditions and data considered, though less so for visually challenging environments and large vocabulary tasks." @default.
- W2096391593 created "2016-06-24" @default.
- W2096391593 creator A5002107058 @default.
- W2096391593 creator A5008190208 @default.
- W2096391593 creator A5009895550 @default.
- W2096391593 creator A5024184433 @default.
- W2096391593 creator A5079708487 @default.
- W2096391593 date "2003-09-01" @default.
- W2096391593 modified "2023-10-18" @default.
- W2096391593 title "Recent advances in the automatic recognition of audiovisual speech" @default.
- W2096391593 cites W133422505 @default.
- W2096391593 cites W1488776412 @default.
- W2096391593 cites W1523010034 @default.
- W2096391593 cites W1527102103 @default.
- W2096391593 cites W1529401398 @default.
- W2096391593 cites W1549965231 @default.
- W2096391593 cites W1552594198 @default.
- W2096391593 cites W1572240262 @default.
- W2096391593 cites W1573024203 @default.
- W2096391593 cites W1604074770 @default.
- W2096391593 cites W1633840029 @default.
- W2096391593 cites W1797411461 @default.
- W2096391593 cites W1800365115 @default.
- W2096391593 cites W1877570817 @default.
- W2096391593 cites W1896113342 @default.
- W2096391593 cites W1919971835 @default.
- W2096391593 cites W1922557984 @default.
- W2096391593 cites W1955166624 @default.
- W2096391593 cites W1967878178 @default.
- W2096391593 cites W1978380426 @default.
- W2096391593 cites W1997418877 @default.
- W2096391593 cites W2002591263 @default.
- W2096391593 cites W2014621385 @default.
- W2096391593 cites W2015394094 @default.
- W2096391593 cites W2033895145 @default.
- W2096391593 cites W2038010270 @default.
- W2096391593 cites W2038952578 @default.
- W2096391593 cites W2049155405 @default.
- W2096391593 cites W2064347532 @default.
- W2096391593 cites W2098553397 @default.
- W2096391593 cites W2100152238 @default.
- W2096391593 cites W2100969003 @default.
- W2096391593 cites W2104095591 @default.
- W2096391593 cites W2105852393 @default.
- W2096391593 cites W2106137268 @default.
- W2096391593 cites W2106284211 @default.
- W2096391593 cites W2108492645 @default.
- W2096391593 cites W2111135419 @default.
- W2096391593 cites W2111429641 @default.
- W2096391593 cites W2116418373 @default.
- W2096391593 cites W2118977726 @default.
- W2096391593 cites W2120535071 @default.
- W2096391593 cites W2121486117 @default.
- W2096391593 cites W2122272452 @default.
- W2096391593 cites W2124174353 @default.
- W2096391593 cites W2124629003 @default.
- W2096391593 cites W2127211243 @default.
- W2096391593 cites W2132217089 @default.
- W2096391593 cites W2132549764 @default.
- W2096391593 cites W2132999255 @default.
- W2096391593 cites W2133115605 @default.
- W2096391593 cites W2133906911 @default.
- W2096391593 cites W2135081730 @default.
- W2096391593 cites W2135212327 @default.
- W2096391593 cites W2139683134 @default.
- W2096391593 cites W2144788278 @default.
- W2096391593 cites W2145336848 @default.
- W2096391593 cites W2146871184 @default.
- W2096391593 cites W2148659689 @default.
- W2096391593 cites W2150824310 @default.
- W2096391593 cites W2151043030 @default.
- W2096391593 cites W2152239535 @default.
- W2096391593 cites W2153664791 @default.
- W2096391593 cites W2155471382 @default.
- W2096391593 cites W2157190406 @default.
- W2096391593 cites W2158275940 @default.
- W2096391593 cites W2159686933 @default.
- W2096391593 cites W2162598851 @default.
- W2096391593 cites W2163680580 @default.
- W2096391593 cites W2164568552 @default.
- W2096391593 cites W2166846876 @default.
- W2096391593 cites W2169265436 @default.
- W2096391593 cites W2171074980 @default.
- W2096391593 cites W2172803778 @default.
- W2096391593 cites W2217896605 @default.
- W2096391593 cites W2278884322 @default.
- W2096391593 cites W2481376636 @default.
- W2096391593 cites W2487271655 @default.
- W2096391593 cites W2793908218 @default.
- W2096391593 cites W3140321225 @default.
- W2096391593 cites W3169507310 @default.
- W2096391593 cites W4231869218 @default.
- W2096391593 cites W4300873085 @default.
- W2096391593 doi "https://doi.org/10.1109/jproc.2003.817150" @default.
- W2096391593 hasPublicationYear "2003" @default.
- W2096391593 type Work @default.
- W2096391593 sameAs 2096391593 @default.
- W2096391593 citedByCount "642" @default.