Sound Source Localization Using a Profile Fitting Method with Sound Reflectors(<Special Section>Speech Dynamics by Ear, Eye, Mouth and Machine)
スポンサーリンク
概要
- 論文の詳細を見る
In a two-microphone approach, interchannel differences in time (ICTD) and interchannel differences in sound level (ICLD) have generally been used for sound source localization. But those cues are not effective for vertical localization in the median plane (direct front). For that purpose, spectral cues based on features of head-related transfer functions (HRTF) have been investigated, but they are not robust enough against signal variations and environmental noise. In this paper, we use a "profile" as a cue while using a combination of reflectors specially designed for vertical localization. The observed sound is converted into a profile containing information about reflections as well as ICTD and ICLD data. The observed profile is decomposed into signal and noise by using template profiles associated with sound source locations. The template minimizing the residual of the decomposition gives the estimated sound source location. Experiments show this method can correctly provide a rough estimate of the vertical location even in a noisy environment.
- 社団法人電子情報通信学会の論文
- 2004-05-01
著者
-
Ichikawa Osamu
Tokyo Research Laboratory Ibm Japan Ltd.
-
Takiguchi Tetsuya
Ibm Tokyo Research Laboratory
-
Ichikawa O
Tokyo Research Laboratory Ibm Japan Ltd.
-
Nishimura Masafumi
Ibm Tokyo Research Laboratory
-
Nishimura Masafumi
Tokyo Research Lab. Ibm Japan
-
TAKIGUCHI Tetsuya
Tokyo Research Laboratory, IBM Japan Ltd.
関連論文
- Automatic Prosody Labeling Using Multiple Models for Japanese(Speech and Hearing)
- Local Peak Enhancement for In-Car Speech Recognition in Noisy Environment
- Simultaneous Adaptation of Echo Cancellation and Spectral Subtraction for In-Car Speech Recognition(Speech Enhancement, Multi-channel Acoustic Signal Processing)
- Sound Source Localization Using a Profile Fitting Method with Sound Reflectors(Speech Dynamics by Ear, Eye, Mouth and Machine)
- Speech Enhancement by Profile Fitting Method (Special Issue on Speech Information Processing)
- Improved HMM Separation for Distant-Talking Speech Recognition(Speech Dynamics by Ear, Eye, Mouth and Machine)