Speaker Adaptation Based on a Maximum Observation Probability Criterion
スポンサーリンク
概要
- 論文の詳細を見る
A speaker adaptation technique that maximizes the observation probability of an input speech is proposed. It is applied to semi-continuous hidden Markov model (SCHMM) speech recognizers. The proposed algorithm adapts the mean μ and the covariance Σ iteratively by the gradient search technique so that the features of the adaptation speech data could achieve maximum observation probabilities. The mixture coefficients and the state transition probabilities are adapted by the model interpolation scheme. The main advantage of this scheme is that the means and the variances, which are common to all states in SCHMM, are adapted independently from the other parameters of SCHMM. It allows fast and precise adaptation especially when there is a large acoustic mismatch between the reference model and a new speaker. Also, it is possible that this scheme could be adopted to other areas which use codebook. The proposed adaptation algorithm was evaluated by a male speaker-dependent, a female speaker-dependent, and a speaker-independent recognizers. The experimental results on the isolated word recognition showed that the proposed adaptation algorithm achieved 46.03% average enhancement in the male speaker-dependent recognizer, 52.18% in the female speaker-dependent recognizer, and 9.84% in the speaker-independent recognizer.
- 社団法人電子情報通信学会の論文
- 2001-02-01
著者
-
Youn Dae-hee
The Dept. Of Electrical And Electronic Eng. Yonsei Univ. Seoul Korea.
-
Youn D‐h
Yonsei Univ. Seoul Kor
-
LEE Chungyong
Dept. of Electrical and Electronic Eng., Yonsei University
-
Youn Dae-hee
Department Of Electrical And Electronic Engineering Yonsei University
-
LEE Chungyong
the Dept. of Electrical and Electronic Eng., Yonsei Univ., Seoul, Korea.
-
YANG Tae-Young
Dept. of Electrical and Computer Eng. Yonsei Univ.
-
YOUN Dae-Hee
Dept. of Electrical and Computer Eng. Yonsei Univ.
-
Lee Chungyong
Dept. Of Electrical And Electronic Eng. Yonsei University
-
Lee Chungyong
The Dept. Of Electrical And Electronic Eng. Yonsei Univ. Seoul Korea.
-
Yang Tae-young
The Lg Electronics Institute Of Technology Seoul Korea.
-
YOUN Dae-Hee
Dept. of Electrical & Electronic Eng. Yonsei University
関連論文
- Block Tomlinson-Harashima Precoding with Vector Perturbation for Multiuser MIMO Downlink
- Bark Coherence Function for Speech Quality Evaluation over CDMA System
- An Efficient Active Noise Control Algorithm Based on the Lattice-Transversal Joint(LTJ) Filter Structure
- A GMM-Based Feature Selection Algorithm for Multi-Class Classification
- Design of Time-Varying Reverberators for Low Memory Applications
- Duration Modeling Using Cumulative Duration Probability
- A Grammatical Structure of the FSN for the Recognition of Korean Price Sentences
- Speaker Adaptation Based on a Maximum Observation Probability Criterion
- A Very Low Complexity VSELP Speech Coder Using Regular Pulse Basis Vectors (Special Section of Papers Selected from ITC-CSCC'96)
- Performance Comparison of Single and Multi-Stage Algebraic Codebooks(Speech and Hearing)
- Performance Comparison of Single and Multi-Stage Algebraic Codebooks
- A New Method for Calibration of NLOS Error in Positioning Systems
- Voice Conversion Using Low Dimensional Vector Mapping(Regular Section)
- An Efficient Searching Algorithm for Receive Minimum Distance in MIMO Systems with ML Receiver
- An Efficient Multi-Mode Precoding for MIMO Systems with Full CSI(Wireless Communication Technologies)
- An MMSE-Nulling Partial-PIC Receiver for Multiuser Downlink MIMO MC-CDMA Systems(Wireless Communication Technologies)
- Block Tomlinson-Harashima Precoding with Vector Perturbation for Multiuser MIMO Downlink
- A New Approximation of the Receive Minimum Distance and Its Application to MIMO Systems(Wireless Communication Technologies)
- A Structured Codebook with Various Codeword Configurations for Downlink MIMO Systems
- Efficient Windowing Scheme for MDCT-Based TCX in AMR-WB+
- Amplify-and-Forward Relay Filter Design with MIMO System for Two-Way Relay Channels