Matches in SemOpenAlex for { <https://semopenalex.org/work/W4384824100> ?p ?o ?g. }
- W4384824100 endingPage "20" @default.
- W4384824100 startingPage "1" @default.
- W4384824100 abstract "As one of the most preferred forms of Human-Computer Interaction (HCI) today, speech-based HCI enables people to communicate verbally with machines, leveraging technologies such as speech recognition and speech synthesis. The current paradigm of speech-based HCI focuses only on the content of speech, failing to capture the deeper pointing information in voice interaction. In particular, when multiple smart voice devices are nearby and a person intends to interact with one of them, the lack of extra pointing information (such as the role played by the direction of eye gaze) can cause unintended responses from the other devices, resulting in a poor interaction experience. Hence, an interesting problem arises: can devices become aware of the orientation of a human voice using only the acoustic speech signal? Little research has studied this topic, apart from a few preliminary works that leave much room for improvement. The main challenge of this study lies in capturing the concealed orientation information embedded within the speech signal while maintaining the scheme’s practicality and high precision. In this paper, we propose Oriennet for identifying the orientation of the human voice. With a series of features designed around the indoor voice propagation model and the mouth radiation pattern, together with an attention mechanism, Oriennet achieves 95% accuracy in judging whether people are facing the device or not. Even for the fine-grained task of classifying a person’s orientation among 8 different directions, our work achieves an accuracy of 74%, far outperforming existing works. We have validated the robustness of Oriennet under various conditions (noisy environments; different people, rooms, languages, and locations; fewer microphones), demonstrating its promising applicability in real-life scenarios." @default.
- W4384824100 created "2023-07-21" @default.
- W4384824100 creator A5050267088 @default.
- W4384824100 creator A5054082208 @default.
- W4384824100 date "2023-07-19" @default.
- W4384824100 modified "2023-09-24" @default.
- W4384824100 title "Voice Orientation Recognition: New Paradigm of Speech-Based Human-Computer Interaction" @default.
- W4384824100 cites W16132381 @default.
- W4384824100 cites W2037684811 @default.
- W4384824100 cites W2052384514 @default.
- W4384824100 cites W2056132907 @default.
- W4384824100 cites W2098651198 @default.
- W4384824100 cites W2101284180 @default.
- W4384824100 cites W2107049458 @default.
- W4384824100 cites W2119932130 @default.
- W4384824100 cites W2147494647 @default.
- W4384824100 cites W2149264547 @default.
- W4384824100 cites W2150309794 @default.
- W4384824100 cites W2162443378 @default.
- W4384824100 cites W2290536607 @default.
- W4384824100 cites W2403444653 @default.
- W4384824100 cites W253649471 @default.
- W4384824100 cites W2586354743 @default.
- W4384824100 cites W2618099328 @default.
- W4384824100 cites W2794284562 @default.
- W4384824100 cites W2795679027 @default.
- W4384824100 cites W2955455382 @default.
- W4384824100 cites W2963341956 @default.
- W4384824100 cites W2974856071 @default.
- W4384824100 cites W3021395787 @default.
- W4384824100 cites W3029297226 @default.
- W4384824100 cites W3093949709 @default.
- W4384824100 cites W3102138973 @default.
- W4384824100 cites W3108981297 @default.
- W4384824100 cites W3118630692 @default.
- W4384824100 cites W3170201991 @default.
- W4384824100 cites W3174086521 @default.
- W4384824100 cites W3195997474 @default.
- W4384824100 cites W3198914896 @default.
- W4384824100 cites W3201142223 @default.
- W4384824100 cites W3201854521 @default.
- W4384824100 cites W3206706278 @default.
- W4384824100 cites W3211278025 @default.
- W4384824100 cites W3211438798 @default.
- W4384824100 cites W4213305518 @default.
- W4384824100 cites W4214867578 @default.
- W4384824100 cites W4221094122 @default.
- W4384824100 cites W4225496162 @default.
- W4384824100 cites W4283740590 @default.
- W4384824100 cites W4292572011 @default.
- W4384824100 cites W4319862670 @default.
- W4384824100 cites W766042757 @default.
- W4384824100 doi "https://doi.org/10.1080/10447318.2023.2233128" @default.
- W4384824100 hasPublicationYear "2023" @default.
- W4384824100 type Work @default.
- W4384824100 citedByCount "0" @default.
- W4384824100 crossrefType "journal-article" @default.
- W4384824100 hasAuthorship W4384824100A5050267088 @default.
- W4384824100 hasAuthorship W4384824100A5054082208 @default.
- W4384824100 hasConcept C107457646 @default.
- W4384824100 hasConcept C120665830 @default.
- W4384824100 hasConcept C121332964 @default.
- W4384824100 hasConcept C127413603 @default.
- W4384824100 hasConcept C154945302 @default.
- W4384824100 hasConcept C16345878 @default.
- W4384824100 hasConcept C192209626 @default.
- W4384824100 hasConcept C201995342 @default.
- W4384824100 hasConcept C2524010 @default.
- W4384824100 hasConcept C2779916870 @default.
- W4384824100 hasConcept C2780451532 @default.
- W4384824100 hasConcept C28490314 @default.
- W4384824100 hasConcept C33923547 @default.
- W4384824100 hasConcept C41008148 @default.
- W4384824100 hasConceptScore W4384824100C107457646 @default.
- W4384824100 hasConceptScore W4384824100C120665830 @default.
- W4384824100 hasConceptScore W4384824100C121332964 @default.
- W4384824100 hasConceptScore W4384824100C127413603 @default.
- W4384824100 hasConceptScore W4384824100C154945302 @default.
- W4384824100 hasConceptScore W4384824100C16345878 @default.
- W4384824100 hasConceptScore W4384824100C192209626 @default.
- W4384824100 hasConceptScore W4384824100C201995342 @default.
- W4384824100 hasConceptScore W4384824100C2524010 @default.
- W4384824100 hasConceptScore W4384824100C2779916870 @default.
- W4384824100 hasConceptScore W4384824100C2780451532 @default.
- W4384824100 hasConceptScore W4384824100C28490314 @default.
- W4384824100 hasConceptScore W4384824100C33923547 @default.
- W4384824100 hasConceptScore W4384824100C41008148 @default.
- W4384824100 hasLocation W43848241001 @default.
- W4384824100 hasOpenAccess W4384824100 @default.
- W4384824100 hasPrimaryLocation W43848241001 @default.
- W4384824100 hasRelatedWork W1553305517 @default.
- W4384824100 hasRelatedWork W1973969444 @default.
- W4384824100 hasRelatedWork W1998941159 @default.
- W4384824100 hasRelatedWork W2021400925 @default.
- W4384824100 hasRelatedWork W2081647779 @default.
- W4384824100 hasRelatedWork W2807438818 @default.
- W4384824100 hasRelatedWork W3206877762 @default.
- W4384824100 hasRelatedWork W4243607943 @default.
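
The listing above pairs a subject with a predicate, an object, and a graph label, mirroring the `?p ?o ?g` pattern in the header line. As a minimal sketch, assuming each line follows the shape `- <subject> <predicate> <object> @<graph>.`, the rows could be split into tuples as follows. The names `Quad`, `LINE_RE`, and `parse_line` are hypothetical helpers introduced only for illustration; they are not part of SemOpenAlex or any library.

```python
import re
from typing import NamedTuple, Optional


class Quad(NamedTuple):
    """One row of the listing (hypothetical container for illustration)."""
    subject: str
    predicate: str
    obj: str
    graph: str


# Assumed line shape: "- <subject> <predicate> <object> @<graph>."
# The object may contain spaces (e.g. the abstract), so it is matched lazily
# and the graph label is anchored to the end of the line.
LINE_RE = re.compile(r'^-\s+(\S+)\s+(\S+)\s+(.+?)\s+@(\w+)\.$')


def parse_line(line: str) -> Optional[Quad]:
    """Return a Quad for a listing row, or None for lines such as the header."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    subject, predicate, obj, graph = match.groups()
    # Literal values are wrapped in double quotes; identifiers are bare.
    if len(obj) >= 2 and obj.startswith('"') and obj.endswith('"'):
        obj = obj[1:-1]
    return Quad(subject, predicate, obj, graph)


sample = [
    'Matches in SemOpenAlex for { <https://semopenalex.org/work/W4384824100> ?p ?o ?g. }',
    '- W4384824100 startingPage "1" @default.',
    '- W4384824100 cites W16132381 @default.',
    '- W4384824100 type Work @default.',
]

# Keep only the rows that match the assumed shape; the header is skipped.
quads = [q for q in (parse_line(s) for s in sample) if q is not None]
cited = [q.obj for q in quads if q.predicate == "cites"]
```

Grouping the resulting tuples by `predicate` would then give, for example, the full list of cited works or concept identifiers for the record.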