Optimal use of trees in structural MAP adaptation for speaker verification
スポンサーリンク
概要
- 論文の詳細を見る
In speaker verification, the Structural Maximum-A-Posteriori(SMAP) adaptation technique is often used to train speaker-adapted acoustic models by using available speech data in an efficient and flexible manner. In SMAP adaptation, a tree structure is used to represent the acoustic space of the human voice. We observed that one particular tree structure is not necessarily optimal for modeling the acoustic space of all speakers. In this paper, we propose a voting approach as a way to combine the decisions of multiple SMAP-adapted systems using different tree structures. We expect that this approach is more robust than using a single tree structure. We evaluate our proposed method on the 10sec4w-10sec4w task of NIST SRE 2006 and show that our method is more effective than the conventional SMAP adaptation as well as relevance MAP adaptation.
- 2010-12-13
著者
-
Marc Ferras
Tokyo Institute of Technology
-
Koichi Shinoda
Tokyo Institute of Technology
-
Sadaoki Furui
Tokyo Institute of Technology
-
Sangeeta Biswas
Tokyo Institute of Technology
-
Koichi Shinoda
Tokyo Insitute Of Technology
関連論文
- Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition
- Optimal use of trees in structural MAP adaptation for speaker verification
- Speaker Adaptation for Dialogue Act Classification
- Distance-based Graph Linearization and Sampled Max-sum Algorithm for Efficient 3D Potential Decoding of Macromolecules
- Fusing deep speaker specific features and MFCC for robust speaker verification