Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model

概要

論文の詳細を見る
A novel online speaker clustering method based on a generative model is proposed. It employs an incremental variant of variational Bayesian learning and provides probabilistic (non-deterministic) decisions for each input utterance, on the basis of the history of preceding utterances. It can be expected to be robust against errors in cluster estimation and the classification of utterances, and hence to be applicable to many real-time applications. Experimental results show that it produces 50% fewer classification errors than does a conventional online method. They also show that it is possible to reduce the number of speech recognition errors by combining the method with unsupervised speaker adaptation.

著者

Shinoda Koichi
Department Of Computer Science Graduate School Of Information Science And Engineering Tokyo Institut
NAGATOMO Kentaro
Information and Media Processing Labs., NEC Corporation
KOSHINAKA Takafumi
Information and Media Processing Labs., NEC Corporation

関連論文

Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
Robust Scene Extraction Using Multi-Stream HMMs for Baseball Broadcast(Image Processing and Video Processing)
Automatic recognition of Indonesian declarative questions and statements using polynomial coefficients of the pitch contours
Initial evaluation of the drivers' Japanese speech corpus in a car environment (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
Robust Acoustic Modeling for Speech Recognition
Invited: Robust Acoustic Modeling for Speech Recognition (国際ワークショップ"Beyond HMM")
Robust Acoustic Modeling for Speech Recognition
Nonlinear Normalization Using g-Logarithm for Robust Speech Recognition
Speaker Verification Using MMAP Adaptation (言語理解とコミュニケーション)
Speaker Verification Using MMAP Adaptation (音声)
Subject Adaptation and Adaptive Training for Gait-based Person Identification
Subject Adaptation and Adaptive Training for Gait-based Person Identification
Two-pass Approach for Recognizing Code-Switching Speech
Two-pass Approach for Recognizing Code-Switching Speech
Two-pass Approach for Recognizing Code-Switching Speech
Subject Adaptation and Adaptive Training for Gait-based Person Identification
Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model
A video watermarking method to objects robust against various attacks
Speaker Verification Using MMAP Adaptation
A video watermarking method to objects robust against various attacks
A video watermarking method to objects robust against various attacks
Two-pass Approach for Recognizing Code-Switching Speech

Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model

スポンサーリンク

概要

著者

関連論文

スポンサーリンク