Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, the robustness of the posterior-based confidence measures is improved by utilizing entropy information, which is calculated for speech-unit-level posteriors using only the best recognition result, without requiring a larger computational load than conventional methods. Using different normalization methods, two posterior-based entropy confidence measures are proposed. Practical details are discussed for two typical levels of hidden Markov model (HMM)-based posterior confidence measures, and both levels are compared in terms of their performances. Experiments show that the entropy information results in significant improvements in the posterior-based confidence measures. The absolute improvements of the out-of-vocabulary (OOV) rejection rate are more than 20% for both the phoneme-level confidence measures and the state-level confidence measures for our embedded test sets, without a significant decline of the in-vocabulary accuracy.
- (社)電子情報通信学会の論文
- 2010-09-01
著者
-
SUN Yanqing
Institute of Acoustics, Chinese Academy of Sciences
-
Yan Yonghong
Institute Of Acoustics Chinese Academy Of Science
-
Zhao Qingwei
Thinkit Speech Lab Institute Of Acoustics Chinese Academy Of Sciences
-
Yan Yonghong
Thinkit Speech Lab.
-
Yan Yonghong
Thinkit Speech Laboratory Institute Of Acoustics Chinese Academy Of Sciences Beijing
-
Zhou Yu
Institute For Advanced Ceramics School Of Materials Science And Engineering Harbin Institute Of Tech
-
Zhou Yu
Institute Of Acoustics Chinese Academy Of Sciences
-
Zhao Qingwei
Institute Of Acoustics Chinese Academy Of Sciences
-
Sun Yanqing
Institute Of Acoustics Chinese Academy Of Sciences
-
Pan Fuping
Institute Of Acoustics Chinese Academy Of Sciences
-
ZHANG Pengyuan
Institute of Acoustics, Chinese Academy of Sciences
-
Zhang Pengyuan
Institute Of Acoustics Chinese Academy Of Sciences
関連論文
- Acoustic Feature Optimization Based on F-Ratio for Robust Speech Recognition
- Effects of single-channel speech enhancement algorithms on Mandarin speech intelligibility (応用音響)
- ICONE11-36512 RESEARCHING ON KNOWLEDGE ARCHITECTURE OF DESIGN BY ANALYSIS BASED ON ASEME CODE
- ICONE11-36511 AN OBJECT-ORIENTED HYBRID KNOWLEDGE REPRESENTATION METHOD BASED ON THE ASME CODE
- Approximate Decision Function and Optimization for GMM-UBM Based Speaker Verification
- Using a Kind of Novel Phonotactic Information for SVM Based Speaker Recognition
- Robust Speaker Clustering Using Affinity Propagation
- An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference and Error Patterns
- Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech
- A One-Pass Real-Time Decoder Using Memory-Efficient State Network
- Development of a Mandarin-English Bilingual Speech Recognition System for Real World Music Retrieval
- Automatic Singing Performance Evaluation for Untrained Singers
- Melody Track Selection Using Discriminative Language Model
- Automatic Language Identification with Discriminative Language Characterization Based on SVM
- Speech Enhancement Using Improved Adaptive Null-Forming in Frequency Domain with Postfilter
- Microstructural Characterization of Spark Plasma Sintered In Situ TiB Reinforced Ti Matrix Composite by EBSD and TEM
- Effects of the Temporal Fine Structure in Different Frequency Bands on Mandarin Tone Perception
- Acoustic Feature Optimization Based on F-Ratio for Robust Speech Recognition
- A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
- Enhanced Electrical Conductivities of Complex Hydrides Li_2(BH_4)(NH_2) and Li_4(BH_4)(NH_2)_3 by Melting
- Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition
- Two-Microphone Noise Reduction Using Spatial Information-Based Spectral Amplitude Estimation