Genetic Algorithm Based Optimization of Partly-Hidden Markov Model Structure Using Discriminative Criterion(Speech Recognition, <Special Section> Statistical Modeling for Speech Processing)
Abstract
Discriminative modeling is applied to optimize the structure of a Partly-Hidden Markov Model (PHMM). The PHMM was proposed in our previous work to deal with complicated temporal changes in acoustic features. It can represent observation-dependent behavior in both observations and state transitions. The previous PHMM formulation used a common structure for all models. However, the structure that gives the best performance is expected to differ from category to category. In this paper, we design a new structure optimization method in which the dependencies of the states and the observations of the PHMM are defined separately for each model using the weighted likelihood-ratio maximization (WLRM) criterion. The WLRM criterion gives high discriminability between the correct category and the incorrect categories, and therefore yields model structures with good discriminative performance. We define the optimal structures as the combination of model structures that best satisfies the WLRM criterion among all possible structure combinations. A genetic algorithm is applied as an adequate approximation of a full search. Results of continuous lecture talk speech recognition show the effectiveness of the proposed structure optimization: it reduced word errors compared with an HMM and with a PHMM that uses a common structure for all models.
- Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2006-03-01
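The search described in the abstract, a genetic algorithm standing in for a full search over structure combinations, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the encoding (one integer structure choice per model), the operators (truncation selection, one-point crossover, random-reset mutation, elitism), the function names, and the parameter values are not taken from the paper. In particular, `toy_score` is a hypothetical placeholder and not the WLRM criterion, which the abstract does not define.

```python
import random

def genetic_search(fitness, n_genes, n_choices, pop_size=20,
                   generations=30, mutation_rate=0.1, seed=0):
    """Approximate a full search over n_choices ** n_genes combinations.

    Each chromosome is a list with one structure index per model
    (an assumed encoding, not the paper's).
    """
    rng = random.Random(seed)
    pop = [[rng.randrange(n_choices) for _ in range(n_genes)]
           for _ in range(pop_size)]
    for _ in range(generations):
        # Truncation selection: the better half become parents.
        ranked = sorted(pop, key=fitness, reverse=True)
        parents = ranked[:pop_size // 2]
        # Elitism: carry the current best combination over unchanged.
        children = [ranked[0][:]]
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_genes)       # one-point crossover
            child = a[:cut] + b[cut:]
            # Random-reset mutation, applied gene by gene.
            child = [rng.randrange(n_choices)
                     if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        pop = children
    return max(pop, key=fitness)

# Hypothetical stand-in for a discriminative criterion: counts how many
# models were assigned an arbitrary "preferred" structure index.
PREFERRED = [2, 0, 1, 1, 0, 2]

def toy_score(structure):
    return sum(1 for g, p in zip(structure, PREFERRED) if g == p)

best = genetic_search(toy_score, n_genes=6, n_choices=3)
print(best, toy_score(best))
```

In a real system the fitness call would be the expensive step, since each candidate combination has to be scored on training data. That cost is the reason to evaluate only a small population per generation rather than every combination.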
Authors
- Kobayashi Tetsunori, Waseda Univ. Tokyo Jpn
- Kobayashi Tetsunori, The School Of Science And Engineering Waseda University
- OGAWA Tetsuji, the School of Science and Engineering, Waseda University
- Ogawa Tetsuji, The School Of Science And Engineering Waseda University
- KOBAYASHI Tetsunori, the School of Science and Engineering, Waseda University
Related papers
- Ears of the Robot : Direction of Arrival Estimation Based on Pattern Recognition Using Robot-Mounted Microphones
- Mutual Information Based Dynamic Integration of Multiple Feature Streams for Robust Real-Time LVCSR
- Filter Bank Subtraction for Robust Speech Recognition (Special Issue on Speech Information Processing)
- Simultaneous Subtitling System for Broadcast News Programs with a Speech Recognizer(Special Issue on the 2001 IEICE Excellent Paper Award)
- Ears of the Robot : Three Simultaneous Speech Segregation and Recognition Using Robot-Mounted Microphones(Speech and Hearing)
- Genetic Algorithm Based Optimization of Partly-Hidden Markov Model Structure Using Discriminative Criterion(Speech Recognition, Statistical Modeling for Speech Processing)
- High Quality Synthetic Speech Generation Using Synchronized Oscillators (Special Section on Speech Synthesis: Current Technologies and Equipment)