Confidence Scoring for Accurate HMM-Based Speech Recognition by Using Monophone-Level Normalization Based on Subspace Method (<Special Issue>Special Issue on Speech Information Processing)
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, a novel confidence scoring method that is applied to N-best hypotheses (word candidates) output from an HMM-based classifier is proposed. In the first pass of the proposed method, the HMM-based classifier with monophone models outputs N-best hypotheses and boundaries of all monophones in the hypotheses. In the second pass, an SM (Subspace Method)-based verifier tests the hypotheses by comparing confidence scores. To test the hypotheses, at first, the SM-based verifier calculates the similarity between phone vectors and an eigen vector set of monophones, then this similarity score is converted into a likelihood score with normalization of acoustic quality, and finally, an HMM-based likelihood of word level and an SM-based likelihood of monophone level are combined to formulate the confidence measure. Two kinds of experiments were performed to evaluate this confidence measure on speaker-independent word recognition. The results showed that the proposed confidence scoring method significantly reduced the word error rate from 4.7% obtained by the standard HMM classifier to 2.0%, and in an unknown word rejection, it reduced the equal error rate from 9.0% to 6.5%.
- 社団法人電子情報通信学会の論文
- 2003-03-01
著者
-
NITTA Tsuneo
Graduate School of Engineering, Toyohashi University of Technology
-
Nitta T
Graduate School Of Engineering Toyohashi University Of Technology
-
Nitta Tsuneo
The Graduate School Of Engineering Toyohashi University Of Technology
-
Nitta Tsuneo
Graduate School Of Engineering Toyohashi University Of Technology
-
GHULAM Muhammad
Graduate School of Engineering, Toyohashi University of Technology
-
FUKUDA Takashi
Tokyo Research Laboratory, IBM Japan Ltd.
-
Fukuda T
Tokyo Research Laboratory Ibm Japan Ltd.
-
SATO Takaharu
Graduate School of Engineering, Toyohashi University of Technology
-
FUKUDA Takashi
Graduate School of Engineering, Toyohashi University of Technology
-
Ghulam Muhammad
Graduate School Of Engineering Toyohashi University Of Technology
-
Sato T
Graduate School Of Engineering Toyohashi University Of Technology:(present Address)fujitsu Tohoku Sy
-
Fukuda Takashi
Graduate School Of Engineering Toyohashi University Of Technology
関連論文
- Distinctive Phonetic Feature (DPF) Extraction Based on MLNs and Inhibition/Enhancement Network
- Orthogonalized Distinctive Phonetic Feature Extraction for Noise-Robust Automatic Speech Recognition(Speech Dynamics by Ear, Eye, Mouth and Machine)
- Distinctive Phonetic Feature (DPF) Extraction Based on MLNs and Inhibition/Enhancement Network
- Canonicalization of Feature Parameters for Robust Speech Recognition Based on Distinctive Phonetic Feature (DPF) Vectors
- Photocurrent Excitation Spectra Observed with An-Al Heteroelectrodes Biased Reversely and Reflection Spectra in Trans-Polyacetylene
- PS-ZCPA Based Feature Extraction with Auditory Masking, Modulation Enhancement and Noise Reduction for Robust ASR(Speech Recognition, Statistical Modeling for Speech Processing)
- Confidence Scoring for Accurate HMM-Based Speech Recognition by Using Monophone-Level Normalization Based on Subspace Method (Special Issue on Speech Information Processing)
- Pitch-Synchronous Peak-Amplitude (PS-PA)-Based Feature Extraction Method for Noise-Robust ASR(Speech and Hearing)
- Local Peak Enhancement for In-Car Speech Recognition in Noisy Environment
- Volume Holographic Imaging Element with Background Noise Reduction Function for Eye-Gaze Detection under White Light Illumination
- Changes of Bacterial Population in Frozen Soil
- Search Method for Inhibitors of Staphyloxanthin Production by Methicillin-Resistant Staphylococcus aureus
- Trichocyalides A and B, new inhibitors of alkaline phosphatase activity in bone morphogenetic protein-stimulated myoblasts, produced by Trichoderma sp. FKI-5513
- A new ascochlorin derivative from Cylindrocarpon sp. FKI-4602
- New dinapinone derivatives, potent inhibitors of triacylglycerol synthesis in mammalian cells, produced by Talaromyces pinophilus FKI-3864