Matches in SemOpenAlex for { <https://semopenalex.org/work/W3157861865> ?p ?o ?g. }
- W3157861865 endingPage "707" @default.
- W3157861865 startingPage "673" @default.
- W3157861865 abstract "This survey provides an overview of the evolution of visually grounded models of spoken language over the last 20 years. Such models are inspired by the observation that when children pick up a language, they rely on a wide range of indirect and noisy clues, crucially including signals from the visual modality co-occurring with spoken utterances. Several fields have made important contributions to this approach to modeling or mimicking the process of learning language: Machine Learning, Natural Language and Speech Processing, Computer Vision and Cognitive Science. The current paper brings together these contributions in order to provide a useful introduction and overview for practitioners in all these areas. We discuss the central research questions addressed, the timeline of developments, and the datasets which enabled much of this work. We then summarize the main modeling architectures and offer an exhaustive overview of the evaluation metrics and analysis techniques." @default.
- W3157861865 created "2021-05-10" @default.
- W3157861865 creator A5022698890 @default.
- W3157861865 date "2022-02-18" @default.
- W3157861865 modified "2023-10-14" @default.
- W3157861865 title "Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques" @default.
- W3157861865 cites W1574972348 @default.
- W3157861865 cites W1614298861 @default.
- W3157861865 cites W1686810756 @default.
- W3157861865 cites W1797268635 @default.
- W3157861865 cites W1861492603 @default.
- W3157861865 cites W1905882502 @default.
- W3157861865 cites W1924770834 @default.
- W3157861865 cites W2006969979 @default.
- W3157861865 cites W2024490156 @default.
- W3157861865 cites W2080320702 @default.
- W3157861865 cites W2095897464 @default.
- W3157861865 cites W2102605133 @default.
- W3157861865 cites W2107917162 @default.
- W3157861865 cites W2108598243 @default.
- W3157861865 cites W2112912048 @default.
- W3157861865 cites W2119775030 @default.
- W3157861865 cites W2123815913 @default.
- W3157861865 cites W2125566341 @default.
- W3157861865 cites W2132921748 @default.
- W3157861865 cites W2134670479 @default.
- W3157861865 cites W2160654481 @default.
- W3157861865 cites W2194775991 @default.
- W3157861865 cites W2230076941 @default.
- W3157861865 cites W2250790822 @default.
- W3157861865 cites W2282219577 @default.
- W3157861865 cites W2507296351 @default.
- W3157861865 cites W2524365899 @default.
- W3157861865 cites W2531381952 @default.
- W3157861865 cites W2533598788 @default.
- W3157861865 cites W2553608650 @default.
- W3157861865 cites W2556930864 @default.
- W3157861865 cites W2586148577 @default.
- W3157861865 cites W2784025607 @default.
- W3157861865 cites W2920166246 @default.
- W3157861865 cites W2927673779 @default.
- W3157861865 cites W2938991416 @default.
- W3157861865 cites W2940544976 @default.
- W3157861865 cites W2950133079 @default.
- W3157861865 cites W2962753610 @default.
- W3157861865 cites W2962813140 @default.
- W3157861865 cites W2962832640 @default.
- W3157861865 cites W2962862718 @default.
- W3157861865 cites W2963115079 @default.
- W3157861865 cites W2963163163 @default.
- W3157861865 cites W2963330681 @default.
- W3157861865 cites W2963403868 @default.
- W3157861865 cites W2963525826 @default.
- W3157861865 cites W2963778889 @default.
- W3157861865 cites W2963799213 @default.
- W3157861865 cites W2963902314 @default.
- W3157861865 cites W2963983719 @default.
- W3157861865 cites W2964001192 @default.
- W3157861865 cites W2964099072 @default.
- W3157861865 cites W2965147078 @default.
- W3157861865 cites W2971709506 @default.
- W3157861865 cites W2972808286 @default.
- W3157861865 cites W2972892814 @default.
- W3157861865 cites W2973135958 @default.
- W3157861865 cites W2984008963 @default.
- W3157861865 cites W2988907666 @default.
- W3157861865 cites W2989358187 @default.
- W3157861865 cites W2995680346 @default.
- W3157861865 cites W3005578234 @default.
- W3157861865 cites W3015300171 @default.
- W3157861865 cites W3035750922 @default.
- W3157861865 cites W3095670406 @default.
- W3157861865 cites W3095881291 @default.
- W3157861865 cites W3100813302 @default.
- W3157861865 cites W3100923070 @default.
- W3157861865 cites W3105148948 @default.
- W3157861865 cites W3111013239 @default.
- W3157861865 cites W3114436296 @default.
- W3157861865 cites W3121480429 @default.
- W3157861865 cites W3158565912 @default.
- W3157861865 cites W3159476814 @default.
- W3157861865 cites W3161348170 @default.
- W3157861865 cites W3170972077 @default.
- W3157861865 cites W3177829661 @default.
- W3157861865 cites W3196698946 @default.
- W3157861865 cites W3197828817 @default.
- W3157861865 cites W3200287550 @default.
- W3157861865 cites W3213502289 @default.
- W3157861865 cites W3217290931 @default.
- W3157861865 cites W385555557 @default.
- W3157861865 cites W2593779438 @default.
- W3157861865 cites W3027324582 @default.
- W3157861865 doi "https://doi.org/10.1613/jair.1.12967" @default.
- W3157861865 hasPublicationYear "2022" @default.
- W3157861865 type Work @default.
- W3157861865 sameAs 3157861865 @default.
- W3157861865 citedByCount "9" @default.
- W3157861865 countsByYear W31578618652020 @default.