Multisegment Multiple VQ Codebooks-Based Speaker Independent Isolated-Word Recognition Using Unbiased Mel Cepstrum
Abstract
In this paper, we propose a new approach to speaker-independent isolated-word speech recognition using multisegment multiple vector quantization (VQ) codebooks. A separate set of multisegment multiple VQ codebooks is designed for each word in the recognition vocabulary: the word is divided equally into multiple segments, the number of which correlates with the number of syllables or phonemes in the word, and two individual VQ codebooks, one for instantaneous and one for transitional speech features, are designed for each segment. With this approach, the influence of within-word coarticulation can be minimized, the time-sequence information of speech can be exploited, and differences in word length across the vocabulary and variations in speaking rate are accommodated automatically. Moreover, mel-cepstral coefficients based on unbiased estimation of log spectrum (UELS) are used, and they are compared experimentally with LPC-derived mel-cepstral coefficients. Recognition experiments using test databases consisting of 100 Japanese words (Waseda database) and 216 phonetically balanced words (ATR database) confirmed the effectiveness of the new method and the new speech features. The approach is described, its computational complexity and memory requirements are analyzed, and the experimental results are presented.
- Paper of the Institute of Electronics, Information and Communication Engineers (IEICE)
- 1995-09-25
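The segment-wise codebook scheme outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the plain k-means codebook training, the simple frame-difference transitional features, the equal weighting of the two distortions, and the toy ramp data are all assumptions made for illustration. It also assumes every utterance has at least as many frames as segments.

```python
import random

def split_equal(frames, n_segments):
    """Divide a frame sequence into n_segments contiguous, nearly equal parts."""
    n = len(frames)
    bounds = [i * n // n_segments for i in range(n_segments + 1)]
    return [frames[bounds[i]:bounds[i + 1]] for i in range(n_segments)]

def deltas(frames):
    """Transitional features (assumed form): difference of neighbouring frames, edges clamped."""
    last = len(frames) - 1
    return [[b - a for a, b in zip(frames[max(t - 1, 0)], frames[min(t + 1, last)])]
            for t in range(len(frames))]

def sq_dist(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v))

def train_codebook(vectors, size, iters=10, seed=0):
    """Plain k-means, a stand-in for whichever codebook design algorithm the paper uses."""
    rng = random.Random(seed)
    codebook = [list(v) for v in rng.sample(vectors, min(size, len(vectors)))]
    for _ in range(iters):
        cells = [[] for _ in codebook]
        for v in vectors:
            nearest = min(range(len(codebook)), key=lambda k: sq_dist(v, codebook[k]))
            cells[nearest].append(v)
        for k, cell in enumerate(cells):
            if cell:  # keep the old centroid when a cell is empty
                codebook[k] = [sum(col) / len(cell) for col in zip(*cell)]
    return codebook

def avg_distortion(vectors, codebook):
    """Mean squared distance from each vector to its nearest codeword."""
    return sum(min(sq_dist(v, c) for c in codebook) for v in vectors) / len(vectors)

def train_word_model(utterances, n_segments, size):
    """One (instantaneous, transitional) codebook pair per segment of the word."""
    inst = [[] for _ in range(n_segments)]
    trans = [[] for _ in range(n_segments)]
    for frames in utterances:
        for s, seg in enumerate(split_equal(frames, n_segments)):
            inst[s].extend(seg)
        for s, seg in enumerate(split_equal(deltas(frames), n_segments)):
            trans[s].extend(seg)
    return [(train_codebook(inst[s], size), train_codebook(trans[s], size))
            for s in range(n_segments)]

def word_distortion(frames, model):
    """Sum of per-segment distortions against both codebooks (equal weights assumed)."""
    n_segments = len(model)
    inst_segs = split_equal(frames, n_segments)
    trans_segs = split_equal(deltas(frames), n_segments)
    return sum(avg_distortion(i, cb_i) + avg_distortion(t, cb_t)
               for i, t, (cb_i, cb_t) in zip(inst_segs, trans_segs, model))

def recognize(frames, models):
    """Pick the vocabulary word whose codebooks give the smallest total distortion."""
    return min(models, key=lambda w: word_distortion(frames, models[w]))

def ramp(start, stop, n):
    """Toy 2-D 'feature' sequence whose first component moves linearly in time."""
    return [[start + (stop - start) * t / (n - 1), 0.5] for t in range(n)]

# Two toy "words" with the same set of feature values but opposite time order.
models = {
    "rise": train_word_model([ramp(0.0, 1.0, 10), ramp(0.1, 0.9, 14)], n_segments=2, size=2),
    "fall": train_word_model([ramp(1.0, 0.0, 10), ramp(0.9, 0.1, 14)], n_segments=2, size=2),
}
result = recognize(ramp(0.0, 1.0, 20), models)
```

Because each utterance is split into the same number of segments regardless of its length, the test sequence of 20 frames is compared against models trained on 10 and 14 frames without any explicit time alignment, which is the kind of length and speaking-rate adaptation the abstract refers to.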
Authors
- Zhou Liang
  Precision and Intelligence Laboratory, Tokyo Institute of Technology
- Imai Satoshi
  Precision and Intelligence Laboratory, Tokyo Institute of Technology
Related papers
- Multisegment Multiple VQ Codebooks-Based Speaker Independent Isolated-Word Recognition Using Unbiased Mel Cepstrum
- Generalized Cepstral Modeling of Degraded Speech and Its Application to Speech Enhancement (Special Section of Papers Selected from the 7th Digital Signal Processing Symposium)
- Combining Multiple Classifiers in a Hybrid System for High Performance Chinese Syllable Recognition
- Harmonics Estimation Based on Instantaneous Frequency and Its Application to Pitch Determination of Speech
- A New Approach of Parsing and Search Based on the Divide and Conquer Strategy for Continuous Speech Recognition