Effectiveness of Word String Language Models on Noisy Broadcast News Speech Recognition
Abstract
Experiments were conducted to examine a language-modeling approach to improving recognition of noisy speech. By adopting appropriate word strings as new units of processing, speech recognition performance was improved through acoustic effects as well as through a reduction in test-set perplexity. Three kinds of word string language models were evaluated. Their additional lexical entries were selected based on combinations of part-of-speech information, word length, occurrence frequency, and the log likelihood ratio of hypotheses about bigram frequency. All three word string models reduced errors in broadcast news speech recognition and also lowered test-set perplexity. The word string model based on the log likelihood ratio gave the largest improvement for noisy speech recognition: in the experiments using speaker-dependent, noise-adapted triphones, it reduced deletion errors by 26%, substitution errors by 9.3%, and insertion errors by 13%. The error reduction achieved by word string models was more prominent for noisy speech than for studio-clean speech.
- Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2002-07-01
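The abstract mentions selecting word strings by a log likelihood ratio over hypotheses about bigram frequency, but this record does not give the formula. The sketch below assumes the commonly used binomial log likelihood ratio for collocations (Dunning's statistic), which compares the hypothesis that the second word is independent of the first against the hypothesis that it depends on it. The function names `bigram_llr` and `score_bigrams` are hypothetical and the paper's exact criterion may differ.

```python
import math
from collections import Counter


def _log_l(k, n, x):
    """Binomial log likelihood k*log(x) + (n-k)*log(1-x), taking 0*log(0) as 0."""
    total = 0.0
    if k > 0:
        total += k * math.log(x)
    if n - k > 0:
        total += (n - k) * math.log(1.0 - x)
    return total


def bigram_llr(c12, c1, c2, n):
    """Assumed criterion: -2 log(lambda) for the bigram (w1, w2).

    c12: count of the bigram (w1, w2)
    c1:  count of bigrams whose first word is w1
    c2:  count of bigrams whose second word is w2
    n:   total number of bigrams
    """
    p = c2 / n                                        # independence: P(w2)
    p1 = c12 / c1                                     # dependence: P(w2 | w1)
    p2 = (c2 - c12) / (n - c1) if n > c1 else 0.0     # P(w2 | not w1)
    return 2.0 * (
        _log_l(c12, c1, p1) + _log_l(c2 - c12, n - c1, p2)
        - _log_l(c12, c1, p) - _log_l(c2 - c12, n - c1, p)
    )


def score_bigrams(tokens):
    """Score every adjacent word pair in a token list with bigram_llr."""
    bigrams = Counter(zip(tokens, tokens[1:]))
    first, second = Counter(), Counter()
    for (w1, w2), c in bigrams.items():
        first[w1] += c
        second[w2] += c
    n = sum(bigrams.values())
    return {
        (w1, w2): bigram_llr(c, first[w1], second[w2], n)
        for (w1, w2), c in bigrams.items()
    }
```

Under this assumption, high-scoring pairs would be candidates for new lexical entries. The statistic is two-sided, so a practical selection step would probably also require that the pair occurs more often than independence predicts.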
Authors
- Takagi Kazuyuki, University of Electro-Communications
- OGURO Rei, University of Electro-Communications
- OZEKI Kazuhiko, University of Electro-Communications
- Ozeki K, Univ. Electro-communications Chofu-shi Jpn
Related papers
- Effectiveness of Word String Language Models on Noisy Broadcast News Speech Recognition
- The Use of Overlapped Sub-Bands in Multi-Band, Multi-SNR, Multi-Path Recognition of Noisy Word Utterances
- Automatic Adjustment of Subband Likelihood Recombination Weights for Improving Noise-Robustness of a Multi-SNR Multi-Band Speaker Identification System (Speech and Hearing)