Task Adaptation in Syllable Trigram Models for Continuous Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Abstract
In speech recognition systems that handle unlimited vocabulary and rely on stochastic language models, recognition performance decreases when the target recognition task changes, because the language model is no longer appropriate. This paper describes two approaches for adapting a specific or general syllable trigram model to a new task. One uses a small amount of text data similar to the target task; the other uses supervised learning with the most recent input phrases together with similar text. In this paper, these adaptation methods are called "preliminary learning" and "successive learning", respectively. The adaptation methods are evaluated using syllable perplexity and phrase recognition rates. With preliminary learning, adaptation using 1000 phrases of similar text reduced the perplexity from 24.5 to 14.3; with successive learning, adaptation using 1000 phrases that include the 100 most recent phrases reduced it to 12.1. The recognition rates also improved from 42.3% to 51.3% and 52.9%, respectively. Text similarity for the two approaches is also studied in this paper.
- Paper of the Institute of Electronics, Information and Communication Engineers (IEICE)
- 1993-01-25
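The abstract evaluates adaptation by syllable perplexity. As a rough illustration only, the sketch below shows how per-syllable perplexity of a trigram model can be computed and how merging counts from a small task text tends to lower it. This record does not give the authors' estimation or adaptation formulas, so everything here is an assumption: the function names `trigram_counts`, `adapt`, and `perplexity`, the count-merging scheme with its `weight` parameter, the add-alpha smoothing, and the toy romanized syllable data.

```python
import math
from collections import Counter

def trigram_counts(phrases):
    """Count syllable trigrams and their two-syllable contexts, padding phrase boundaries."""
    tri, ctx = Counter(), Counter()
    for syllables in phrases:
        padded = ["<s>", "<s>"] + list(syllables) + ["</s>"]
        for i in range(2, len(padded)):
            context = (padded[i - 2], padded[i - 1])
            tri[context + (padded[i],)] += 1
            ctx[context] += 1
    return tri, ctx

def adapt(base, task, weight=1.0):
    """Merge weighted task counts into base counts (a hypothetical scheme, not the paper's)."""
    tri, ctx = Counter(base[0]), Counter(base[1])
    for key, count in task[0].items():
        tri[key] += weight * count
    for key, count in task[1].items():
        ctx[key] += weight * count
    return tri, ctx

def perplexity(model, phrases, vocab_size, alpha=1.0):
    """Per-syllable perplexity using add-alpha smoothed trigram probabilities."""
    tri, ctx = model
    log_prob, n = 0.0, 0
    for syllables in phrases:
        padded = ["<s>", "<s>"] + list(syllables) + ["</s>"]
        for i in range(2, len(padded)):
            context = (padded[i - 2], padded[i - 1])
            p = (tri[context + (padded[i],)] + alpha) / (ctx[context] + alpha * vocab_size)
            log_prob += math.log2(p)
            n += 1
    return 2 ** (-log_prob / n)

# Toy data (made up for illustration): a "general" text and a small task-like text.
general_text = [["ko", "n", "ni", "chi", "wa"]] * 3
task_text = [["ka", "i", "gi"], ["ka", "i", "gi", "shi", "tsu"]]
test_text = [["ka", "i", "gi"]]

# Vocabulary: all observed syllables plus the end-of-phrase symbol.
vocab = {s for phrase in general_text + task_text for s in phrase} | {"</s>"}

base_model = trigram_counts(general_text)
adapted_model = adapt(base_model, trigram_counts(task_text))

base_ppl = perplexity(base_model, test_text, len(vocab))
adapted_ppl = perplexity(adapted_model, test_text, len(vocab))
print(f"base: {base_ppl:.2f}, adapted: {adapted_ppl:.2f}")
```

On this toy data the adapted model assigns higher probability to the task-like test phrase, so its perplexity is lower than that of the base model. The figures reported in the abstract come from the authors' own models and data and cannot be reproduced from this sketch.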
Authors
- Shikano Kiyohiro, NTT Human Interface Laboratories
- Matsunaga Sho-ichi, NTT Human Interface Laboratories
- Yamada Tomokazu, NTT Human Interface Laboratories
Related papers
- Speaker Weighted Training of HMM Using Multiple Reference Speakers
- Task Adaptation in Syllable Trigram Models for Continuous Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- Isolated Word Recognition Using Pitch Pattern Information