Histogram equalization for noise-robust speech recognition using discrete-mixture HMMs
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we introduce a new method of robust speech recognition under noisy conditions based on discrete-mixture hidden Markov models (DMHMMs). DMHMMs were originally proposed to reduce calculation costs in the decoding process. Recently, we have applied DMHMMs to noisy speech recognition, and found that they were effective for modeling noisy speech. Towards the further improvement of noise-robust speech recognition, we propose a novel normalization method for DMHMMs based on histogram equalization (HEQ). The HEQ method can compensate the nonlinear effects of additive noise. It is generally used for the feature space normalization of continuous-mixture HMM (CMHMM) systems. In this paper, we propose both model space and feature space normalization of DMHMMs by using HEQ. In the model space normalization, codebooks of DMHMMs are modified by the transform function derived from the HEQ method. The proposed method was compared using both conventional CMHMMs and DMHMMs. The results showed that the model space normalization of DMHMMs by multiple transform functions was effective for noise-robust speech recognition.
- 社団法人日本音響学会の論文
著者
-
Kohda Masaki
Graduate School Of Science And Engineering Yamagata University
-
KOSAKA Tetsuo
Graduate School of Science and Engineering, Yamagata University
-
Kosaka Tetsuo
Graduate School Of Science And Engineering Yamagata University
-
Katoh Masaharu
Graduate School of Science and Engineering, Yamagata University
-
Katoh Masaharu
Graduate School Of Science And Engineering Yamagata University
関連論文
- Lecture Speech Recognition Using Discrete-Mixture HMMs
- Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition
- Histogram equalization for noise-robust speech recognition using discrete-mixture HMMs
- Robust Speech Recognition Using Discrete-Mixture HMMs(Speech and Hearing)