Trigger-Based Language Model Adaptation for Automatic Transcription of Panel Discussions(Speech Recognition, <Special Section> Statistical Modeling for Speech Processing)
スポンサーリンク
概要
- 論文の詳細を見る
We present a novel trigger-based language model adaptation method oriented to the transcription of meetings. In meetings, the topic is focused and consistent throughout the whole session, therefore keywords can be correlated over long distances. The trigger-based language model is designed to capture such long-distance dependencies, but it is typically constructed from a large corpus, which is usually too general to derive taskdependent trigger pairs. In the proposed method, we make use of the initial speech recognition results to extract task-dependent trigger pairs and to estimate their statistics. Moreover, we introduce a back-off scheme that also exploits the statistics estimated from a large corpus. The proposed model reduced the test-set perplexity considerably more than the typical triggerbased language model constructed from a large corpus, and achieved a remarkable perplexity reduction of 44% over the baseline when combined with an adapted trigram language model. In addition, a reduction in word error rate was obtained when using the proposed language model to rescore word graphs.
- 社団法人電子情報通信学会の論文
- 2006-03-01
著者
-
Kawahara Tatsuya
The School Of Informatics Kyoto University
-
TRONCOSO Carlos
the School of Informatics, Kyoto University
-
Troncoso Carlos
The School Of Informatics Kyoto University
関連論文
- Verification of Speech Recognition Results Incorporating In-domain Confidence and Discourse Coherence Measures(Speech Recognition, Statistical Modeling for Speech Processing)
- Trigger-Based Language Model Adaptation for Automatic Transcription of Panel Discussions(Speech Recognition, Statistical Modeling for Speech Processing)