Noise Robust Speech Recognition Using Subband-Crosscorrelation Analysis
スポンサーリンク
概要
- 論文の詳細を見る
This paper describes subband-crossorrelation analysis (SBXCOR) using two input channel signals. SBXCOR is an extended signal processing technique of subband-autocorrelation analysis (SBCOR) that extracts periodicities associated with the inverse of center frequencies present in speech signals. In addition, to extract more periodicity information associated with the inverse of center frequencies, the multi-delay weighting (MDW) processing is applied to SBXCOR. In experiments, the noise robustness of SBXCOR is evaluated using a DTW word recognizer under (1) a simulated acousic condition with white noise and (2) a real acoustic condition in a sound proof room with human speech-like noise. As the results, under the simulated acoustic condition, it is shown that SBXCOR is more robust than the conventional one-channel SBCOR, but less robust than SBCOR extracted from the two-channel-summed signal. Furthermore, by applying MDW processing, the performance of SBXCOR improved about 2% at SNR 0dB. The resultant performance of SBXCOR with MDW processing was much better than those of smoothed group delay spectrum (SGDS) and mel-filterbank cepstral coefficient (MFCC) below SNR 10dB. The results under the real acoustic condition were almost the same as the simulated acoustic condition.
- 社団法人電子情報通信学会の論文
- 1998-10-25
著者
-
TAKEDA Kazuya
Department of Nuclear Engineering, School of Engineering, Tokai University
-
Takeda K
Nagoya Univ. Nagoya Jpn
-
Takeda Kazuya
Department Of Information Electronics Graduate School Of Engineering Nagoya University
-
Kajita S
Center For Information Media Studies Nagoya University
-
Takeda K
Center For Integrated Acoustic Information Research Graduate School Of Engineering Nagoya University
-
ITAKURA Fumitada
Center for Information Media Studies, Nagoya University
-
Itakura F
Graduate School Of Information Engineering Meijo University
-
Itakura Fumitada
Center For Information Media Studies Nagoya University
-
KAJITA Shoji
Center for Information Media Studies, Nagoya University
関連論文
- 磁化シートプラズマを用いたガス・ダイバータの基礎実験
- CENSREC-1-C : An evaluation framework for voice activity detection under noisy environments
- IMPROVEMENT OF CHOLEDOCHOSCOPY : CHROMOENDOCHOLEDOCHOSCOPY, AUTOFLUORESCENCE IMAGING, OR NARROW-BAND IMAGING
- AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
- CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments(Speech and Hearing)
- MC-32 Development of microdrive assembly process
- Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones(Feature Extraction and Acoustic Medelings, Corpus-Based Speech Technologies)
- SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
- Driver's irritation detection using speech recognition results (音声・第10回音声言語シンポジウム)
- Driver's irritation detection using speech recognition results (音声言語情報処理)
- Driver's irritation detection using speech recognition results (言語理解とコミュニケーション・第10回音声言語シンポジウム)
- サブバンドに含まれる周波数成分の瞬時周波数に基づく推定
- Lack of Interaction Between Cefdinir and Calcium Polycarbophil : In vitro and In vivo Studies
- Predicting the Degradation of Speech Recognition Performance from Sub-band Dynamic Ranges (特集 音声言語情報処理とその応用)
- A model of perceptual distance for group delays based on ellipsoidal mapping
- The effect of group delay spectrum on timbre
- Direction of Arrival Estimation Using Nonlinear Microphone Array
- Speech Enhancement Using Nonlinear Microphone Array Based on Noise Adaptive Complementary Beamforming
- Speech Enhancement Using Nonlinear Microphone Array Based on Complementary Beamforming (Special Section on Digital Signal Processing)
- Noise Robust Speech Recognition Using Subband-Crosscorrelation Analysis
- An Acoustically Oriented Vocal-Tract Model
- Adaptive Nonlinear Regression Using Multiple Distributed Microphones for In-Car Speech Recognition(Speech Enhancement, Multi-channel Acoustic Signal Processing)
- On the use of two-mass vocal cord model in characterizing the stress speech (音声)
- Particle Size Distribution Measurement of Free-Falling Fine Particles in a Dusty Plasma Experiment
- Relaxation behavior of laser-peening residual stress under tensile loading investigated by X-ray and neutron diffraction