Speaker Verification Using MMAP Adaptation (言語理解とコミュニケーション)
スポンサーリンク
概要
- 論文の詳細を見る
This paper proposes maximum a posteriori (MAP) adaptation of Gaussian mixture models (GMM) using multiple priors for text-independent speaker verification. Although the hierarchical prior used in structural MAP (SMAP) adaptation has been proven to outperform the prior used in relevance MAP adaptation, there may still be some complementary information in the relevance prior which might be useful. An idea of combining these two priors is introduced here in the MAP framework. We call this method multiprior MAP (MMAP). We evaluated our proposed method on NIST SRE 2006 10sec4w-10sec4w task. We compared MMAP with classical maximum likelihood (ML) estimation, relevance MAP and SMAP adaptation techniques and proved its effectiveness. We also investigated the effect of Z-Norm and T-Norm on the likelihood ratio scores of different systems here.
- 2011-12-12
著者
-
Shinoda Koichi
Department Of Computer Science Graduate School Of Information Science And Engineering Tokyo Institut
-
Furui Sadaoki
Department Of Computer Science Graduate School Of Information Science And Engineering Tokyo Institut
-
Biswas Sangeeta
Department Of Computer Science Graduate School Of Information Science And Engineering Tokyo Institute Of Technology
-
ROHDIN Johan
Department of Computer Science, Graduate School of Information Science and Engineering, Tokyo Institute of Technology
-
Rohdin Johan
Department Of Computer Science Graduate School Of Information Science And Engineering Tokyo Institute Of Technology
-
BISWAS Sangeeta
Department of Computer Science, Graduate School of Information Science and Engineering
関連論文
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Tree-Structured Clustering Methods for Piecewise Linear-Transformation-Based Noise Adaptation(Speech and Hearing)
- Robust Scene Extraction Using Multi-Stream HMMs for Baseball Broadcast(Image Processing and Video Processing)
- Automatic recognition of Indonesian declarative questions and statements using polynomial coefficients of the pitch contours
- Initial evaluation of the drivers' Japanese speech corpus in a car environment (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Accent analysis for Mandarin large vocabulary continuous speech recognition (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Evaluation of a Noise-Robust Multi-Stream Speaker Verification Method Using F_0 Information
- Topic Extraction Based on Continuous Speech Recognition in Broadcast News Speech
- Robust Acoustic Modeling for Speech Recognition
- Invited: Robust Acoustic Modeling for Speech Recognition (国際ワークショップ"Beyond HMM")
- Robust Acoustic Modeling for Speech Recognition
- Noise Robust Speech Recognition Using F_0 Contour Information(Speech Dynamics by Ear, Eye, Mouth and Machine)
- Phonology and Morphology Modeling in a Very Large Vocabulary Hungarian Dictation System(Speech and Hearing)
- 連続発話認識のための言語モデル
- Dynamic Bayesian Network-Based Acoustic Models Incorporating Speaking Rate Effects(Speech and Hearing)
- Neural-network-based HMM adaptation for noisy speech recognition
- Nonlinear Normalization Using g-Logarithm for Robust Speech Recognition
- Speaker Verification Using MMAP Adaptation (言語理解とコミュニケーション)
- Speaker Verification Using MMAP Adaptation (音声)
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Two-pass Approach for Recognizing Code-Switching Speech
- Two-pass Approach for Recognizing Code-Switching Speech
- Two-pass Approach for Recognizing Code-Switching Speech
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model
- A video watermarking method to objects robust against various attacks
- Speaker Verification Using MMAP Adaptation
- A video watermarking method to objects robust against various attacks
- A video watermarking method to objects robust against various attacks
- Two-pass Approach for Recognizing Code-Switching Speech