Automatic Determination of the Number of Mixture Components for Continuous HMMs Based on a Uniform Variance Criterion
Abstract
We discuss how to automatically determine the number of mixture components in continuous mixture density HMMs (CHMMs). CHMMs have come into wide use in recent years. One of the major problems with a CHMM is how to determine its structure, that is, how many mixture components and states it has and what its optimal topology is. So far, the number of mixture components has been determined heuristically. To solve this problem, we first investigate the influence of the number of mixture components on model parameters and the output log-likelihood value. As a result, in contrast to the "mixture number uniformity" applied in conventional approaches to determine the number of mixture components, we propose the principle of "distribution size uniformity". An algorithm is introduced for automatically determining the number of mixture components. The performance of this algorithm is shown through recognition experiments involving all Japanese phonemes. Two types of experiments are carried out. One assumes that the number of mixture components for each state is the same within a phonetic model but may vary between states belonging to different phonemes. The other assumes that each state has a variable number of mixture components. Both experiments give better results than the conventional method.
- Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 1995-06-25
Authors
- Sagayama Shigeki (NTT Human Interface Laboratories)
- Kosaka Tetsuo (ATR Interpreting Telecommunications Research Labs)
Related papers
- Spoken Sentence Recognition Based on HMM-LR with Hybrid Language Modeling (Special Issue on Natural Language Processing and Understanding)
- LR Parsing with a Category Reachability Test Applied to Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- Speaker-Consistent Parsing for Speaker-Independent Continuous Speech Recognition
- Automatic Determination of the Number of Mixture Components for Continuous HMMs Based on a Uniform Variance Criterion
- Unsupervised Speaker Adaptation Using All-Phoneme Ergodic Hidden Markov Network
- Speech Recognition Using Function-Word N-Grams and Content-Word N-Grams
- Discriminative Training Based on Minimum Classification Error for a Small Amount of Data Enhanced by Vector-Field-Smoothed Bayesian Learning