Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition
スポンサーリンク
概要
- 論文の詳細を見る
Gaussian Mixture Models (GMM) are ubiquitously used in state-of-the-art speaker recognition systems. The popular GMM-SVM paradigm uses Maximum A Posteriori (MAP) speaker-adapted GMM models by stacking the mean vectors into a supervector that is fed into a Support Vector Machine classifier. In this paper, we modify the standard relevance MAP algorithm to better fit the speaker recognition task. We propose to emphasize the adaptation of the Gaussian mixtures according to the inter-speaker variability exhibited on a training set, thus accounting for both the occupation count and the speaker discrimination ability during adaptation. We evaluate our proposal on a relevance MAP based GMM-SVM system using a large telephone speech corpus such as the one provided in the 2006 NIST Speaker Recognition Evaluation. We show that despite its simplicity this technique is effective.
- 2010-12-13
著者
-
Marc Ferras
Tokyo Institute of Technology
-
Koichi Shinoda
Tokyo Institute of Technology
-
Sadaoki Furui
Tokyo Institute of Technology
-
Koichi Shinoda
Tokyo Insitute Of Technology
関連論文
- Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition
- Optimal use of trees in structural MAP adaptation for speaker verification
- Speaker Adaptation for Dialogue Act Classification
- Distance-based Graph Linearization and Sampled Max-sum Algorithm for Efficient 3D Potential Decoding of Macromolecules
- Fusing deep speaker specific features and MFCC for robust speaker verification