Matches in SemOpenAlex for { <https://semopenalex.org/work/W1681324445> ?p ?o ?g. }
- W1681324445 abstract "At a cocktail party, we can selectively attend to a single voice and filter out all the other acoustical interferences. This perceptual ability has motivated the emergence of a new field of study known as computational auditory scene analysis (CASA) which aims to build speech separation systems that incorporate principles of auditory organization. This dissertation investigates four aspects of CASA processing: location-based speech segregation in multisource environments, binaural tracking of multiple moving sources, binaural sound segregation in reverberant environments, and monaural segregation of reverberant speech. The principal cues used by the auditory system to determine locations are the interaural time difference (ITD) and interaural intensity difference (IID) between the two ears. We observe that within a narrow frequency band, modifications to the relative strength of the target source with respect to the interference trigger systematic changes for ITD and IID. Moreover, for a fixed spatial configuration, this interaction produces a characteristic clustering in the binaural feature space. Consequently, we propose a supervised learning approach to estimate the ideal binary mask using the estimated binaural features. A systematic evaluation in terms of signal-to-noise ratio (SNR) as well as automatic speech recognition (ASR) scores shows that the resulting system produces masks very close to the ideal binary ones in anechoic conditions. Furthermore, the model produces large speech intelligibility improvements with normal listeners. While the above binaural systems perform optimally in anechoic conditions, reverberation affects the ITD and IID cues and therefore degrades their performance. For reverberant conditions, we propose a binaural segregation system that combines target cancellation through adaptive filtering and a binary decision rule to estimate the ideal binary mask. Specifically, we observe a correlation between the attenuation produced by the target cancellation stage and the relative strength between target and interference which is used subsequently to determine the target dominant T-F units. A major advantage of the proposed system is that, while requiring a fixed target location, it imposes no restrictions on the number, location or content of the interfering sources. An extensive comparison using SNR as well as ASR results shows that our system outperforms standard two-microphone beamforming approaches. (Abstract shortened by UMI.)" @default.
- W1681324445 created "2016-06-24" @default.
- W1681324445 creator A5051837453 @default.
- W1681324445 creator A5053306339 @default.
- W1681324445 date "2005-01-01" @default.
- W1681324445 modified "2023-09-26" @default.
- W1681324445 title "Auditory-based algorithms for sound segregation in multisource and reverberant environments" @default.
- W1681324445 cites W121527727 @default.
- W1681324445 cites W126774805 @default.
- W1681324445 cites W133022121 @default.
- W1681324445 cites W1490760466 @default.
- W1681324445 cites W1492221128 @default.
- W1681324445 cites W1500640442 @default.
- W1681324445 cites W1508165687 @default.
- W1681324445 cites W1515932869 @default.
- W1681324445 cites W1519903322 @default.
- W1681324445 cites W1522599309 @default.
- W1681324445 cites W1526819877 @default.
- W1681324445 cites W1536929369 @default.
- W1681324445 cites W1544582875 @default.
- W1681324445 cites W1548802052 @default.
- W1681324445 cites W1560013842 @default.
- W1681324445 cites W1563371067 @default.
- W1681324445 cites W1565658805 @default.
- W1681324445 cites W1567098080 @default.
- W1681324445 cites W1575829986 @default.
- W1681324445 cites W1578856370 @default.
- W1681324445 cites W1581848821 @default.
- W1681324445 cites W160800111 @default.
- W1681324445 cites W1648602279 @default.
- W1681324445 cites W1808196926 @default.
- W1681324445 cites W1869758581 @default.
- W1681324445 cites W1907143431 @default.
- W1681324445 cites W1908335757 @default.
- W1681324445 cites W1946152311 @default.
- W1681324445 cites W1956364622 @default.
- W1681324445 cites W1963527011 @default.
- W1681324445 cites W1964705026 @default.
- W1681324445 cites W1966948181 @default.
- W1681324445 cites W1969105061 @default.
- W1681324445 cites W1971016029 @default.
- W1681324445 cites W1979493366 @default.
- W1681324445 cites W1980459899 @default.
- W1681324445 cites W1984357105 @default.
- W1681324445 cites W1989320958 @default.
- W1681324445 cites W1991139021 @default.
- W1681324445 cites W1991850640 @default.
- W1681324445 cites W1995186998 @default.
- W1681324445 cites W1997709277 @default.
- W1681324445 cites W2005596566 @default.
- W1681324445 cites W2005679925 @default.
- W1681324445 cites W2011833091 @default.
- W1681324445 cites W2013020033 @default.
- W1681324445 cites W2015563078 @default.
- W1681324445 cites W2015636737 @default.
- W1681324445 cites W2018368999 @default.
- W1681324445 cites W2021396690 @default.
- W1681324445 cites W2022377850 @default.
- W1681324445 cites W2026257400 @default.
- W1681324445 cites W2029421006 @default.
- W1681324445 cites W2032026767 @default.
- W1681324445 cites W2033174849 @default.
- W1681324445 cites W2033284130 @default.
- W1681324445 cites W2034040413 @default.
- W1681324445 cites W2041638389 @default.
- W1681324445 cites W2044222806 @default.
- W1681324445 cites W2046317813 @default.
- W1681324445 cites W2050758723 @default.
- W1681324445 cites W2057889776 @default.
- W1681324445 cites W2059744545 @default.
- W1681324445 cites W2060011767 @default.
- W1681324445 cites W2065691163 @default.
- W1681324445 cites W2066381245 @default.
- W1681324445 cites W2074354966 @default.
- W1681324445 cites W2088805357 @default.
- W1681324445 cites W2095380628 @default.
- W1681324445 cites W2096554907 @default.
- W1681324445 cites W2096779346 @default.
- W1681324445 cites W2097191389 @default.
- W1681324445 cites W2101609516 @default.
- W1681324445 cites W2106325442 @default.
- W1681324445 cites W2106981472 @default.
- W1681324445 cites W2107274532 @default.
- W1681324445 cites W2107493093 @default.
- W1681324445 cites W2110322414 @default.
- W1681324445 cites W2111164644 @default.
- W1681324445 cites W2113139249 @default.
- W1681324445 cites W2116590962 @default.
- W1681324445 cites W2117678320 @default.
- W1681324445 cites W2119599673 @default.
- W1681324445 cites W2120076267 @default.
- W1681324445 cites W2123337747 @default.
- W1681324445 cites W2126127391 @default.
- W1681324445 cites W2126681966 @default.
- W1681324445 cites W2127923214 @default.
- W1681324445 cites W2128599137 @default.
- W1681324445 cites W2129905273 @default.
- W1681324445 cites W2130650491 @default.
- W1681324445 cites W2132056833 @default.
- W1681324445 cites W2134751974 @default.