Voice Activity Detection with Array Signal Processing in the Wavelet Domain (Engineering Acoustics)
Abstract
In speech enhancement with an adaptive microphone array, voice activity detection (VAD) is indispensable for adaptation control. Although many VAD methods have been proposed as pre-processors for speech recognition and compression, they can hardly discriminate the nonstationary interferences that frequently occur in real environments. In this research, we propose a novel VAD method that performs array signal processing in the wavelet domain. In that domain, temporal, spectral, and spatial information can be integrated to achieve robust voice activity discrimination, even for a nonstationary interference arriving from a direction close to that of the speech. The signals acquired by the microphone array are first decomposed into appropriate subbands using wavelet packets to extract their temporal and spectral features. A directionality check and direction estimation are then executed on each subband to perform VAD with respect to the spatial information. Computer simulation results for sound data demonstrate that the proposed method retains its discriminability even for interference arriving from a direction close to that of the speech.
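The pipeline outlined in the abstract (subband decomposition, then a per-subband directionality check and direction estimate) can be illustrated with a rough two-microphone sketch. This is not the authors' algorithm: the Haar wavelet packet, the use of a cross-correlation lag as a stand-in for direction, the energy-weighted vote, and every name and threshold (`haar_packet`, `subband_lag`, `vad_frame`, `coh_min`, `vote_min`) are assumptions made purely for illustration.

```python
import numpy as np

def haar_packet(x, levels):
    """Full Haar wavelet packet tree (assumed wavelet); returns 2**levels subbands."""
    bands = [np.asarray(x, dtype=float)]
    for _ in range(levels):
        nxt = []
        for b in bands:
            b = b[: len(b) // 2 * 2]                 # force even length
            even, odd = b[0::2], b[1::2]
            nxt.append((even + odd) / np.sqrt(2.0))  # low-pass half
            nxt.append((even - odd) / np.sqrt(2.0))  # high-pass half
        bands = nxt
    return bands

def subband_lag(a, b, max_lag):
    """Lag (in subband samples) maximizing the normalized cross-correlation.

    A positive lag means b is a delayed copy of a. Returns (lag, peak value).
    """
    norm = np.sqrt(np.dot(a, a) * np.dot(b, b))
    if norm == 0.0:
        return 0, 0.0
    best_lag, best_val = 0, -np.inf
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            val = np.dot(a[: len(a) - lag], b[lag:])
        else:
            val = np.dot(a[-lag:], b[: len(b) + lag])
        if val > best_val:
            best_lag, best_val = lag, val
    return best_lag, best_val / norm

def vad_frame(mic1, mic2, target_lag, levels=2, max_lag=3,
              coh_min=0.6, vote_min=0.5):
    """Hypothetical decision rule: speech is declared active when subbands
    holding at least vote_min of the frame energy are both directional
    (correlation peak >= coh_min) and point at the target lag."""
    bands1 = haar_packet(mic1, levels)
    bands2 = haar_packet(mic2, levels)
    energy = np.array([np.dot(b, b) for b in bands1])
    total = energy.sum()
    if total == 0.0:
        return False
    votes = 0.0
    for e, b1, b2 in zip(energy, bands1, bands2):
        lag, peak = subband_lag(b1, b2, max_lag)
        if peak >= coh_min and lag == target_lag:   # directionality + direction
            votes += e
    return bool(votes / total >= vote_min)
```

With `levels=2` the subbands are decimated by 4, so a 4-sample inter-microphone delay appears as a lag of 1 in each subband. Calling `vad_frame(mic1, mic2, target_lag=1)` on a broadband source delayed by 4 samples would therefore be expected to report activity, while the same source delayed by 8 samples (a nearby but different direction) would not. Narrowband or reverberant signals would need a more careful direction estimator than this peak search.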
- Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2003-11-01
Authors
-
Hioka Y
Keio Univ. Yokohama‐shi Jpn
-
HIOKA Yusuke
Department of System Design Engineering, the Faculty of Science and Technology, Keio University
-
HAMADA Nozomu
Department of System Design Engineering, the Faculty of Science and Technology, Keio University
Related papers
- Voice Activity Detection with Array Signal Processing in the Wavelet Domain (Engineering Acoustics)
- DOA Estimation of Multiple Speech Sources from a Stereophonic Mixture in Underdetermined Case
- An Estimation Method of Sound Source Orientation Using Eigenspace Variation of Spatial Correlation Matrix