Matches in SemOpenAlex for { <https://semopenalex.org/work/W2893015615> ?p ?o ?g. }
Showing items 1 to 97 of 97 with 100 items per page.
- W2893015615 abstract "The Audiovisual Speech Recognition (AVSR) most commonly applied to multimodal learning employs both the video and audio information to do Robust Automatic Speech Recognition. Traditionally, AVSR was regarded as the inference and projection, a lot of restrictions on the ability of it. With the in-depth study, DNN becomes an important part of the toolkit in traditional classification tools, such as automatic speech recognition, image classification, natural language processing. AVSR often use some DNN models including Multimodal Deep Autoencoders (MDAEs), Multimodal Deep Belief Network (MDBN) and Multimodal Deep Boltzmann Machine (MDBM), which are always better than the traditional methods. However, such DNN models have several shortcomings: Firstly, they can’t balance the modal fusion and temporal fusion, or even haven’t temporal fusion; Secondly, the architecture of these models isn’t end-to-end. In addition, the training and testing are cumbersome. We designed a DNN model, the Aggregated Multimodal Bidirectional Recurrent Model (DILATE), to overcome such weakness. The DILATE could be not just trained and tested simultaneously, but alternatively easy to train and prevent overfitting automatically. The experiments show that DILATE is superior to traditional methods and other DNN models in some benchmark datasets." @default.
- W2893015615 created "2018-10-05" @default.
- W2893015615 creator A5028463238 @default.
- W2893015615 creator A5028580598 @default.
- W2893015615 creator A5032118254 @default.
- W2893015615 creator A5038424727 @default.
- W2893015615 creator A5040036617 @default.
- W2893015615 creator A5046000818 @default.
- W2893015615 creator A5046755808 @default.
- W2893015615 creator A5071026135 @default.
- W2893015615 creator A5087246586 @default.
- W2893015615 date "2018-01-01" @default.
- W2893015615 modified "2023-10-16" @default.
- W2893015615 title "Aggregated Multimodal Bidirectional Recurrent Model for Audiovisual Speech Recognition" @default.
- W2893015615 cites W1503933356 @default.
- W2893015615 cites W1978380426 @default.
- W2893015615 cites W2014271812 @default.
- W2893015615 cites W2016043834 @default.
- W2893015615 cites W2053101950 @default.
- W2893015615 cites W2064675550 @default.
- W2893015615 cites W2076462394 @default.
- W2893015615 cites W2088998933 @default.
- W2893015615 cites W2113814270 @default.
- W2893015615 cites W2125838338 @default.
- W2893015615 cites W2127141656 @default.
- W2893015615 cites W2131774270 @default.
- W2893015615 cites W2136155248 @default.
- W2893015615 cites W2164598857 @default.
- W2893015615 cites W2289925289 @default.
- W2893015615 cites W2474638510 @default.
- W2893015615 cites W59325952 @default.
- W2893015615 doi "https://doi.org/10.1007/978-3-030-00021-9_35" @default.
- W2893015615 hasPublicationYear "2018" @default.
- W2893015615 type Work @default.
- W2893015615 sameAs 2893015615 @default.
- W2893015615 citedByCount "0" @default.
- W2893015615 crossrefType "book-chapter" @default.
- W2893015615 hasAuthorship W2893015615A5028463238 @default.
- W2893015615 hasAuthorship W2893015615A5028580598 @default.
- W2893015615 hasAuthorship W2893015615A5032118254 @default.
- W2893015615 hasAuthorship W2893015615A5038424727 @default.
- W2893015615 hasAuthorship W2893015615A5040036617 @default.
- W2893015615 hasAuthorship W2893015615A5046000818 @default.
- W2893015615 hasAuthorship W2893015615A5046755808 @default.
- W2893015615 hasAuthorship W2893015615A5071026135 @default.
- W2893015615 hasAuthorship W2893015615A5087246586 @default.
- W2893015615 hasConcept C108583219 @default.
- W2893015615 hasConcept C13280743 @default.
- W2893015615 hasConcept C153180895 @default.
- W2893015615 hasConcept C154945302 @default.
- W2893015615 hasConcept C185798385 @default.
- W2893015615 hasConcept C192576344 @default.
- W2893015615 hasConcept C205649164 @default.
- W2893015615 hasConcept C22019652 @default.
- W2893015615 hasConcept C28490314 @default.
- W2893015615 hasConcept C41008148 @default.
- W2893015615 hasConcept C50644808 @default.
- W2893015615 hasConcept C81363708 @default.
- W2893015615 hasConceptScore W2893015615C108583219 @default.
- W2893015615 hasConceptScore W2893015615C13280743 @default.
- W2893015615 hasConceptScore W2893015615C153180895 @default.
- W2893015615 hasConceptScore W2893015615C154945302 @default.
- W2893015615 hasConceptScore W2893015615C185798385 @default.
- W2893015615 hasConceptScore W2893015615C192576344 @default.
- W2893015615 hasConceptScore W2893015615C205649164 @default.
- W2893015615 hasConceptScore W2893015615C22019652 @default.
- W2893015615 hasConceptScore W2893015615C28490314 @default.
- W2893015615 hasConceptScore W2893015615C41008148 @default.
- W2893015615 hasConceptScore W2893015615C50644808 @default.
- W2893015615 hasConceptScore W2893015615C81363708 @default.
- W2893015615 hasLocation W28930156151 @default.
- W2893015615 hasOpenAccess W2893015615 @default.
- W2893015615 hasPrimaryLocation W28930156151 @default.
- W2893015615 hasRelatedWork W1892788530 @default.
- W2893015615 hasRelatedWork W2547793174 @default.
- W2893015615 hasRelatedWork W2588077199 @default.
- W2893015615 hasRelatedWork W2890244912 @default.
- W2893015615 hasRelatedWork W2947338973 @default.
- W2893015615 hasRelatedWork W2971766686 @default.
- W2893015615 hasRelatedWork W2989571531 @default.
- W2893015615 hasRelatedWork W2995240650 @default.
- W2893015615 hasRelatedWork W3009035583 @default.
- W2893015615 hasRelatedWork W3011234510 @default.
- W2893015615 hasRelatedWork W3015678833 @default.
- W2893015615 hasRelatedWork W3048912752 @default.
- W2893015615 hasRelatedWork W3081788987 @default.
- W2893015615 hasRelatedWork W3104842308 @default.
- W2893015615 hasRelatedWork W3105099157 @default.
- W2893015615 hasRelatedWork W3132072393 @default.
- W2893015615 hasRelatedWork W3157555801 @default.
- W2893015615 hasRelatedWork W3211053973 @default.
- W2893015615 hasRelatedWork W2898727538 @default.
- W2893015615 hasRelatedWork W2971048891 @default.
- W2893015615 isParatext "false" @default.
- W2893015615 isRetracted "false" @default.
- W2893015615 magId "2893015615" @default.
- W2893015615 workType "book-chapter" @default.