Robust Acoustic Modeling for Speech Recognition
スポンサーリンク
概要
- 論文の詳細を見る
While Hidden Markov Models (HMMs) have been successfully applied to automatic speech recognition, they are not still robust enough against differences in speakers, speaking-styles, and environmental noises. To tackle this problem, we need to study the inner structure of speech by using large corpus and rich computational power. In this direction, the model size tends to be increase and hence the data insufficiency problem becomes more serious. In this paper, we focus on robust modeling against data insufficiency. Approaches based on information criteria such as Minimum Description Length and structural approaches in which models are changed according to the amount of data availabl are discussed.. While these techniques have been important for HMM research, it will be more important in the research beyond HMM.
- 社団法人電子情報通信学会の論文
- 2004-12-13
著者
-
SHINODA Koichi
Department of Computer Science, Graduate School of Information Science and Engineering, Tokyo Instit
-
Shinoda Koichi
Department Of Computer Science Tokyo Institute Of Technology
-
Shinoda Koichi
Department Of Computer Science Graduate School Of Information Science And Engineering Tokyo Institut
関連論文
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Robust Scene Extraction Using Multi-Stream HMMs for Baseball Broadcast(Image Processing and Video Processing)
- Automatic recognition of Indonesian declarative questions and statements using polynomial coefficients of the pitch contours
- Initial evaluation of the drivers' Japanese speech corpus in a car environment (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Robust Acoustic Modeling for Speech Recognition
- Invited: Robust Acoustic Modeling for Speech Recognition (国際ワークショップ"Beyond HMM")
- Robust Acoustic Modeling for Speech Recognition
- Nonlinear Normalization Using g-Logarithm for Robust Speech Recognition
- Speaker Verification Using MMAP Adaptation (言語理解とコミュニケーション)
- Speaker Verification Using MMAP Adaptation (音声)
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Two-pass Approach for Recognizing Code-Switching Speech
- Two-pass Approach for Recognizing Code-Switching Speech
- Two-pass Approach for Recognizing Code-Switching Speech
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model
- A video watermarking method to objects robust against various attacks
- Speaker Verification Using MMAP Adaptation
- A video watermarking method to objects robust against various attacks
- A video watermarking method to objects robust against various attacks
- Two-pass Approach for Recognizing Code-Switching Speech