Approximate Decision Function and Optimization for GMM-UBM Based Speaker Verification
スポンサーリンク
概要
- 論文の詳細を見る
The GMM-UBM framework has been proved to be one of the most effective approaches to the automatic speaker verification (ASV) task in recent years. In this letter, we first propose an approximate decision function of traditional GMM-UBM, from which it is shown that the contribution to classification of each Gaussian component is equally important. However, research in speaker perception shows that a different speech sound unit defined by Gaussian component makes a different contribution to speaker verification. This motivates us to emphasize some sound units which have discriminability between speakers while de-emphasize the speech sound units which contain little information for speaker verification. Experiments on 2006 NIST SRE core task show that the proposed approach outperforms traditional GMM-UBM approach in classification accuracy.
- (社)電子情報通信学会の論文
- 2009-09-01
著者
-
Yan Yonghong
Thinkit Speech Lab. Institute Of Acoustics Chinese Academy Of Sciences
-
Yan Yonghong
Institute Of Acoustics Chinese Academy Of Science
-
Zhao Qingwei
Thinkit Speech Lab Institute Of Acoustics Chinese Academy Of Sciences
-
XIAO Xiang
ThinkIT Speech Lab., Institute of Acoustics, Chinese Academy of Sciences
-
ZHANG Xiang
ThinkIT Speech Lab., Institute of Acoustics, Chinese Academy of Sciences
-
WANG Haipeng
ThinkIT Speech Lab., Institute of Acoustics, Chinese Academy of Sciences
-
SUO Hongbin
ThinkIT Speech Lab., Institute of Acoustics, Chinese Academy of Sciences
-
Yan Yonghong
Thinkit Speech Lab Institute Of Acoustics Chinese Academy Of Sciences
-
Yan Yonghong
Thinkit Speech Lab.
-
Yan Yonghong
Thinkit Speech Laboratory Institute Of Acoustics Chinese Academy Of Sciences Beijing
-
Xiao Xiang
Thinkit Speech Lab. Institute Of Acoustics Chinese Academy Of Sciences
-
Suo Hongbin
Thinkit Speech Lab Institute Of Acoustics Chinese Academy Of Sciences
-
Zhang Xiang
Thinkit Speech Lab Institute Of Acoustics Chinese Academy Of Sciences
-
Wang Haipeng
Thinkit Speech Lab. Institute Of Acoustics Chinese Academy Of Sciences
-
Wang Haipeng
Thinkit Speech Lab, Institute of Acoustics, Chinese Academy of Sciences
-
Xiao Xiang
Thinkit Speech Lab, Institute of Acoustics, Chinese Academy of Sciences
関連論文
- Effects of single-channel speech enhancement algorithms on Mandarin speech intelligibility (応用音響)
- Approximate Decision Function and Optimization for GMM-UBM Based Speaker Verification
- Using a Kind of Novel Phonotactic Information for SVM Based Speaker Recognition
- Robust Speaker Clustering Using Affinity Propagation
- An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference and Error Patterns
- Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech
- A One-Pass Real-Time Decoder Using Memory-Efficient State Network
- Development of a Mandarin-English Bilingual Speech Recognition System for Real World Music Retrieval
- Automatic Singing Performance Evaluation for Untrained Singers
- Melody Track Selection Using Discriminative Language Model
- Automatic Language Identification with Discriminative Language Characterization Based on SVM
- A two-element-microphone-array-based speech recognition system in vehicle environment(Commemoration of the Japan-China Joint Conference on Acoustics 2007 (JCA2007))
- Speech Enhancement Using Improved Adaptive Null-Forming in Frequency Domain with Postfilter
- Effects of the Temporal Fine Structure in Different Frequency Bands on Mandarin Tone Perception
- Acoustic Feature Optimization Based on F-Ratio for Robust Speech Recognition
- A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
- Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition
- Two-Microphone Noise Reduction Using Spatial Information-Based Spectral Amplitude Estimation
- A bayesian logistic regression approach to spoken language identification