Matches in SemOpenAlex for { <https://semopenalex.org/work/W4285390645> ?p ?o ?g. }
- W4285390645 abstract "Objective: Acoustic addressee detection is a challenge that arises in human group interactions, as well as in interactions with technical systems. The research domain is relatively new, and no structured review is available. Especially due to the recent growth of usage of voice assistants, this topic received increased attention. To allow a natural interaction on the same level as human interactions, many studies focused on the acoustic analyses of speech. The aim of this survey is to give an overview on the different studies and compare them in terms of utilized features, datasets, as well as classification architectures, which has so far been not conducted. Methods: The survey followed the Preferred Reporting Items for Systematic reviews and Meta-Analysis (PRISMA) guidelines. We included all studies which were analyzing acoustic and/or acoustic characteristics of speech utterances to automatically detect the addressee. For each study, we describe the used dataset, feature set, classification architecture, performance, and other relevant findings. Results: 1,581 studies were screened, of which 23 studies met the inclusion criteria. The majority of studies utilized German or English speech corpora. Twenty-six percent of the studies were tested on in-house datasets, where only limited information is available. Nearly 40% of the studies employed hand-crafted feature sets, the other studies mostly rely on Interspeech ComParE 2013 feature set or Log-FilterBank Energy and Log Energy of Short-Time Fourier Transform features. 12 out of 23 studies used deep-learning approaches, the other 11 studies used classical machine learning methods. Nine out of 23 studies furthermore employed a classifier fusion. Conclusion: Speech-based automatic addressee detection is a relatively new research domain. Especially by using vast amounts of material or sophisticated models, device-directed speech is distinguished from non-device-directed speech. Furthermore, a clear distinction between in-house datasets and pre-existing ones can be drawn and a clear trend toward pre-defined larger feature sets (with partly used feature selection methods) is apparent." @default.
- W4285390645 created "2022-07-14" @default.
- W4285390645 creator A5033995312 @default.
- W4285390645 creator A5047373591 @default.
- W4285390645 creator A5073575554 @default.
- W4285390645 date "2022-07-14" @default.
- W4285390645 modified "2023-09-30" @default.
- W4285390645 title "Acoustic-Based Automatic Addressee Detection for Technical Systems: A Review" @default.
- W4285390645 cites W1570557094 @default.
- W4285390645 cites W1851144431 @default.
- W4285390645 cites W201747637 @default.
- W4285390645 cites W2034438017 @default.
- W4285390645 cites W2047247026 @default.
- W4285390645 cites W2053115512 @default.
- W4285390645 cites W2076948628 @default.
- W4285390645 cites W2091746061 @default.
- W4285390645 cites W2115999181 @default.
- W4285390645 cites W2128093162 @default.
- W4285390645 cites W2128740179 @default.
- W4285390645 cites W2137639365 @default.
- W4285390645 cites W2144005487 @default.
- W4285390645 cites W2144622310 @default.
- W4285390645 cites W2150610356 @default.
- W4285390645 cites W2171092607 @default.
- W4285390645 cites W2180721986 @default.
- W4285390645 cites W2488227055 @default.
- W4285390645 cites W2561158378 @default.
- W4285390645 cites W2563559521 @default.
- W4285390645 cites W2612919868 @default.
- W4285390645 cites W2745600062 @default.
- W4285390645 cites W2751801591 @default.
- W4285390645 cites W2759491482 @default.
- W4285390645 cites W2769024834 @default.
- W4285390645 cites W2797759721 @default.
- W4285390645 cites W2887080793 @default.
- W4285390645 cites W2888185730 @default.
- W4285390645 cites W2890875924 @default.
- W4285390645 cites W2898147565 @default.
- W4285390645 cites W2912581782 @default.
- W4285390645 cites W2942968409 @default.
- W4285390645 cites W2965288462 @default.
- W4285390645 cites W2979348174 @default.
- W4285390645 cites W3012624518 @default.
- W4285390645 cites W3022797014 @default.
- W4285390645 cites W3040433873 @default.
- W4285390645 cites W3082482205 @default.
- W4285390645 cites W3089726186 @default.
- W4285390645 cites W3093949709 @default.
- W4285390645 cites W3096755600 @default.
- W4285390645 cites W3124974106 @default.
- W4285390645 cites W3141500890 @default.
- W4285390645 cites W3158874131 @default.
- W4285390645 cites W3173575109 @default.
- W4285390645 cites W3189912401 @default.
- W4285390645 cites W4210849719 @default.
- W4285390645 cites W4212904291 @default.
- W4285390645 cites W4214889565 @default.
- W4285390645 cites W4237337958 @default.
- W4285390645 cites W4238144524 @default.
- W4285390645 cites W4239890301 @default.
- W4285390645 cites W4287643567 @default.
- W4285390645 cites W4287694128 @default.
- W4285390645 cites W4294215472 @default.
- W4285390645 cites W4385826981 @default.
- W4285390645 doi "https://doi.org/10.3389/fcomp.2022.831784" @default.
- W4285390645 hasPublicationYear "2022" @default.
- W4285390645 type Work @default.
- W4285390645 citedByCount "0" @default.
- W4285390645 crossrefType "journal-article" @default.
- W4285390645 hasAuthorship W4285390645A5033995312 @default.
- W4285390645 hasAuthorship W4285390645A5047373591 @default.
- W4285390645 hasAuthorship W4285390645A5073575554 @default.
- W4285390645 hasBestOaLocation W42853906451 @default.
- W4285390645 hasConcept C105795698 @default.
- W4285390645 hasConcept C108583219 @default.
- W4285390645 hasConcept C119857082 @default.
- W4285390645 hasConcept C138885662 @default.
- W4285390645 hasConcept C154945302 @default.
- W4285390645 hasConcept C177264268 @default.
- W4285390645 hasConcept C186370098 @default.
- W4285390645 hasConcept C199360897 @default.
- W4285390645 hasConcept C204321447 @default.
- W4285390645 hasConcept C2776401178 @default.
- W4285390645 hasConcept C2778827112 @default.
- W4285390645 hasConcept C28490314 @default.
- W4285390645 hasConcept C33923547 @default.
- W4285390645 hasConcept C41008148 @default.
- W4285390645 hasConcept C41895202 @default.
- W4285390645 hasConceptScore W4285390645C105795698 @default.
- W4285390645 hasConceptScore W4285390645C108583219 @default.
- W4285390645 hasConceptScore W4285390645C119857082 @default.
- W4285390645 hasConceptScore W4285390645C138885662 @default.
- W4285390645 hasConceptScore W4285390645C154945302 @default.
- W4285390645 hasConceptScore W4285390645C177264268 @default.
- W4285390645 hasConceptScore W4285390645C186370098 @default.
- W4285390645 hasConceptScore W4285390645C199360897 @default.
- W4285390645 hasConceptScore W4285390645C204321447 @default.
- W4285390645 hasConceptScore W4285390645C2776401178 @default.
- W4285390645 hasConceptScore W4285390645C2778827112 @default.
- W4285390645 hasConceptScore W4285390645C28490314 @default.