Improved Phoneme-History-Dependent Search Method for Large-Vocabulary Continuous-Speech Recognition
スポンサーリンク
概要
- 論文の詳細を見る
This paper presents an improved phoneme-history-dependent (PHD) search algorithm. This method is an optimum algorithm under the assumption that the starting time of a recognized word depends on only a few preceding phonemes (phoneme history). The computational cost and the number of recognition errors can be reduced if the phoneme-history-dependent search uses re-selection of the preceding word and an appropriate length of phoneme histories. These improvements increase the speed of decoding and help to ensure that the resulting word graph has the correct word sequence. In a 65k-word domain-independent Japanese read-speech dictation task and 1000-word spontaneous-speech airline-ticket-reservation task, the improved PHD search was 1.2-1.8 times faster than a traditional word-dependent search under the condition of equal word accuracy. The improved search reduced the number of errors by a maximum of 21% under the condition of equal processing time. The results also show that our search can generate more compact and accurate word graphs than those of the original PHD search method. In addition, we investigated the optimum length of the phoneme history in the search.
- 社団法人電子情報通信学会の論文
- 2003-06-01
著者
-
NODA Yoshiaki
NTT Cyber Space Laboratories
-
Noda Yoshiaki
Ntt Communications Corporation
-
Hori Takaaki
Ntt Communication Science Laboratories Ntt Corporation
-
Hori Takaaki
Ntt Cyber Space Laboratories Ntt Corporation
-
MATSUNAGA Shoichi
NTT Cyber Space Laboratories, NTT Corporation
-
Matsunaga Shoichi
Ntt Cyber Space Laboratories Ntt Corporation
関連論文
- 音声合成のための合成単位の基礎
- Improved Phoneme-History-Dependent Search Method for Large-Vocabulary Continuous-Speech Recognition
- Improved Sequential Dependency Analysis Integrating Labeling-Based Sentence Boundary Detection
- Efficient discriminative training of error corrective models using high-WER competitors (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Efficient discriminative training of error corrective models using high-WER competitors
- Topic Extraction Based on Continuous Speech Recognition in Broadcast News Speech
- Model Shrinkage for Discriminative Language Models