Robust Speech Recognition by Model Adaptation and Normalization Using Pre-Observed Noise
Abstract
Users require speech recognition systems that offer both rapid response and high accuracy. Recognition accuracy is degraded by additive noise, caused by ambient sound, and by convolutional noise, caused by the transfer characteristics of the acoustic space, especially in distant-talking situations. Existing model adaptation techniques achieve robustness against these two types of noise by using HMM composition and CMN (cepstral mean normalization), respectively. Because they need a sample of the user's speech as well as an additive noise sample to generate the required models, they cannot respond rapidly, even though the additive noise alone could be captured in a preceding step. The technique proposed here uses only the additive noise, observed in that preceding step, to generate a model that is adapted and normalized against both types of noise. When the user's speech sample is captured, only online CMN needs to be performed before recognition starts, so the technique offers rapid response. In addition, to cover the unpredictable S/N values that occur in real applications, the technique creates HMMs for several S/N values. Simulations using artificial speech data show that the proposed technique increased the character correct rate by 11.62% compared to CMN.
- 2008-03-01
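As a rough illustration of the cepstral mean normalization the abstract refers to, the sketch below contrasts batch CMN, which needs the whole utterance, with a generic online variant that updates a running mean frame by frame. It is not the authors' implementation: the function names, the exponential-smoothing update, the `alpha` value, and the toy feature values are all assumptions made for illustration. The point it shows is that a fixed channel appears as a constant offset in the cepstral domain, so subtracting the mean removes it.

```python
from typing import List, Sequence

Frame = List[float]


def batch_cmn(frames: Sequence[Sequence[float]]) -> List[Frame]:
    """Subtract the per-dimension mean computed over the whole utterance."""
    if not frames:
        return []
    n = len(frames)
    dim = len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    return [[f[d] - mean[d] for d in range(dim)] for f in frames]


def online_cmn(
    frames: Sequence[Sequence[float]],
    initial_mean: Sequence[float],
    alpha: float = 0.98,  # hypothetical smoothing factor, not from the paper
) -> List[Frame]:
    """Normalize frame by frame using an exponentially smoothed running mean.

    The mean is updated with the current frame before subtraction, so no
    future frames are needed and recognition can start immediately.
    """
    mean = list(initial_mean)
    out: List[Frame] = []
    for f in frames:
        mean = [alpha * m + (1.0 - alpha) * x for m, x in zip(mean, f)]
        out.append([x - m for x, m in zip(f, mean)])
    return out


# Toy cepstral features (made-up values): three frames, two coefficients.
clean = [[1.0, 2.0], [3.0, 0.0], [2.0, 4.0]]
# A fixed channel adds a constant vector to every frame in the cepstral domain.
channel = [0.5, -1.0]
distorted = [[x + h for x, h in zip(f, channel)] for f in clean]

# Batch CMN yields the same normalized features with or without the channel.
normalized_clean = batch_cmn(clean)
normalized_distorted = batch_cmn(distorted)
```

In the online variant, `initial_mean` could be seeded from earlier observations, and a larger `alpha` makes the running mean change more slowly; both choices are tuning assumptions rather than details taken from the abstract.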
Authors
-
Kobashikawa Satoshi
NTT Cyber Space Laboratories, NTT Corporation
-
Takahashi Satoshi
NTT Cyber Space Laboratories, NTT Corporation
Related papers
- Efficient data selection for speech recognition based on prior confidence estimation
- Efficient Combination of Likelihood Recycling and Batch Calculation for Fast Acoustic Likelihood Calculation
- Robust Speech Recognition by Model Adaptation and Normalization Using Pre-Observed Noise