Acoustic Model Adaptation Using First-Order Linear Prediction for Reverberant Speech (Speech Recognition, <Special Section> Statistical Modeling for Speech Processing)
スポンサーリンク
概要
- 論文の詳細を見る
This paper describes a hands-free speech recognition technique based on acoustic model adaptation to reverberant speech. In handsfree speech recognition, the recognition accuracy is degraded by reverberation, since each segment of speech is affected by the reflection energy of the preceding segment. To compensate for the reflection signal we introduce a frame-by-frame adaptation method adding the reflection signal to the means of the acoustic model. The reflection signal is approximated by a first-order linear prediction from the observation signal at the preceding frame, and the linear prediction coefficient is estimated with a maximum likelihood method by using the EM algorithm, which maximizes the likelihood of the adaptation data. Its effectiveness is confirmed by word recognition experiments on reverberant speech.
- 社団法人電子情報通信学会の論文
- 2006-03-01
著者
-
Ariki Yasuo
Kobe Univ. Kobe‐shi Jpn
-
Ariki Yasuo
The Department Of Computer And System Engineering Kobe University
-
TAKIGUCHI Tetsuya
the IBM Tokyo Research Laboratory
-
NISHIMURA Masafumi
the IBM Tokyo Research Laboratory
-
Takiguchi Tetsuya
Kobe Univ. Kobe‐shi Jpn
関連論文
- CENSREC-1-C : An evaluation framework for voice activity detection under noisy environments
- Acoustic Model Adaptation Using First-Order Linear Prediction for Reverberant Speech (Speech Recognition, Statistical Modeling for Speech Processing)
- Language Modeling Using PLSA-Based Topic HMM
- A Low-Power Real-Time SIFT Descriptor Generation Engine for Full-HDTV Video Recognition