Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood
スポンサーリンク
概要
- 論文の詳細を見る
Automatic speech recognition (ASR) in reverberant environments is a challenging task. Most dereverberation techniques address this problem through signal processing and enhances the reverberant waveform independent from the speech recognizer. In this paper, we propose a novel scheme to perform dereverberation in relation with the likelihood of the back-end ASR system. Our proposed approach effectively selects the dereverberation parameters, in the form of multiband scale factors, so that they improve the likelihood of the acoustic model. Then, the acoustic model is retrained using the optimal parameters. During the recognition phase, we implement additional optimization of the parameters. By using Gaussian mixture model (GMM), the process for selecting the scale factors become efficient. Moreover, we remove the dependency of the adopted dereverberation technique on the room impulse response (RIR) measurement, by using an artificial RIR generator and selecting based on the acoustic likelihood. Experimental results show significant improvement in recognition performance with the proposed method over the conventional approach.
論文 | ランダム
- センシュアス・ホリゾント--仮想現実とテ-マ・パ-ク (ハイパ-都市--遊戯空間の身体論)
- 虚像としての性身体--性の器官とメディアの境界をめぐって (***グラフィ)
- 解説 治験に係る補償・賠償責任の最新事情--情報公開法の施行や弁護士会広告規制緩和で治験環境はどう変わるか
- 治験事故と補償に関する一考察--改正GCPを踏まえて
- 「劉玄徳酔走黄鶴楼」の考察--三国平話と三国雑劇-4-