Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model
Abstract
A novel online speaker clustering method based on a generative model is proposed. It employs an incremental variant of variational Bayesian learning and makes a probabilistic (non-deterministic) decision for each input utterance on the basis of the history of preceding utterances. It is therefore expected to be robust against errors in cluster estimation and utterance classification, and hence applicable to many real-time applications. Experimental results show that it produces 50% fewer classification errors than a conventional online method. They also show that combining the method with unsupervised speaker adaptation reduces the number of speech recognition errors.
Authors
- Shinoda Koichi, Department of Computer Science, Graduate School of Information Science and Engineering, Tokyo Institut
- NAGATOMO Kentaro, Information and Media Processing Labs., NEC Corporation
- KOSHINAKA Takafumi, Information and Media Processing Labs., NEC Corporation
Related papers
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Robust Scene Extraction Using Multi-Stream HMMs for Baseball Broadcast (Image Processing and Video Processing)
- Automatic recognition of Indonesian declarative questions and statements using polynomial coefficients of the pitch contours
- Initial evaluation of the drivers' Japanese speech corpus in a car environment (Speech) -- (International Workshop "Asian workshop on speech science and technology")
- Robust Acoustic Modeling for Speech Recognition
- Invited: Robust Acoustic Modeling for Speech Recognition (International Workshop "Beyond HMM")
- Nonlinear Normalization Using g-Logarithm for Robust Speech Recognition
- Speaker Verification Using MMAP Adaptation (Natural Language Understanding and Communication)
- Speaker Verification Using MMAP Adaptation (Speech)
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Two-pass Approach for Recognizing Code-Switching Speech
- Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model
- A video watermarking method to objects robust against various attacks
- Speaker Verification Using MMAP Adaptation