Efficient discriminative training of error corrective models using high-WER competitors (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
スポンサーリンク
概要
- 論文の詳細を見る
We focus on error corrective models for speech recognition, which select a more accurate word sequence in a word N-best list produced by a speech recognizer. In general, an error corrective model is trained so that the correct word sequence is discriminated by the model from all the sequences in each N-best list. However, we show that a more accurate model can be provided through the discrimination of only two word sequences; one is the correct word sequence and the other is the sequence with the highest word error rate (WER) in the list. We think there are two reasons for this effect: (1) many important error patterns can be obtained since typical errors frequently appear in the word sequences with high WERs, and (2) training with fewer word sequences alleviates the difficulty of parameter estimation. In addition, the model size becomes more compact and the training time is significantly reduced with our method. Experiments using the Corpus of Spontaneous Japanese (CSJ) show that our proposed method generates accurate and compact models, and specifically performs robustly for different tasks and linguistic features.
- 社団法人電子情報通信学会の論文
- 2008-03-13
著者
-
Nakamura Atsushi
Ntt Communication Science Laboratories Ntt Corporation
-
Oba Takanobu
Ntt Communication Science Laboratories Ntt Corporation
-
Hori Takaaki
Ntt Communication Science Laboratories Ntt Corporation
関連論文
- Improved Phoneme-History-Dependent Search Method for Large-Vocabulary Continuous-Speech Recognition
- Improved Sequential Dependency Analysis Integrating Labeling-Based Sentence Boundary Detection
- Efficient discriminative training of error corrective models using high-WER competitors (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Efficient discriminative training of error corrective models using high-WER competitors
- Speech Recognition Based on Student's t-Distribution Derived from Total Bayesian Framework(Speech Recognition, Statistical Modeling for Speech Processing)
- Selection of Shared-State Hidden Markov Model Structure Using Bayesian Criterion(the 2003 IEICE Excellent Paper Award)
- Efficient Combination of Likelihood Recycling and Batch Calculation for Fast Acoustic Likelihood Calculation
- Production-Oriented Models for Speech Recognition(Speech Recognition, Statistical Modeling for Speech Processing)
- Model Shrinkage for Discriminative Language Models