A Comparative Study of Output Probability Functions in HMMs
スポンサーリンク
概要
- 論文の詳細を見る
One of the most effective methods in speech recognition is the HMM which has been used to model speech statistically. The discrete distribution and the continuous distribution HMMs have been widely used in various applications. However, in recent years, HMMs with various output probability functions have been proposed to further improve recognition performance, e.g. the Gaussian mixture continuous and the semi-continuous distributed HMMs. We recently have also proposed the RBF (radial basis function)-based HMM and the VQ-distortion based HMM which use a RBF function and VQ-distortion measure at each state instead of an output probability density function used by traditional HMMs. In this paper, we describe the RBF-based HMM and the VQ-distortion based HMM and compare their performance with the discrete distributed, the Gaussian mixture distributed and the semi-continuous distributed HMMs based on their speech recognition performance rates through experiments on speaker-independent spoken digit recognition. Our results confirmed that the RBF-based and VQ-distortion based HMMs are more robust and superior to traditional HMMs.
- 社団法人電子情報通信学会の論文
- 1995-06-25
著者
-
ZHAO Li
Department of Physiology, China Pharmaceutical University
-
Nakagawa Seiichi
Department of Information and Computer Sciences, Toyohashi University of Technology
-
Zhao Li
中華人民共和国
-
Nakagawa S
Toyohashi Univ. Technol. Toyohashi Jpn
-
Zhao Li
Faculty Of Bioscience And Biotechnology Tokyo Institute Of Technology
-
Zhao L
Saitama Univ. Saitama
-
Zhao Li
Faculty Of Information And Computer Sciences Toyohashi University Of Technology
-
Nakagawa Seiichi
Faculty of Information and Computer Sciences, Toyohashi University of Technology
-
Suzuki Hideyuki
Faculty of Information and Computer Sciences, Toyohashi University of Technology
-
Nakagawa Seiichi
Faculty Of Engineering Toyohashi University Of Technology
-
Suzuki Hideyuki
Faculty Of Engineering Saitama University
関連論文
- The Anticancer Activities of Wogonin in Murine Sarcoma S180 both in Vitro and in Vivo(Pharmacology)
- Topic dependent language model based on on-line voting (言語理解とコミュニケーション)
- A transitive translation for Indonesian-Japanese CLQA (自然言語処理)
- Indonesian-Japanese Transitive Translation using English for CLIR
- 1P028 Protein-protein interactions between gp13, a neck protein, and the connector, gp15, of bacteriophage T4
- 2P112Structural analysis of the tail completion proteins, P3 and P15, of T4 phage by analytical ultracentrifugation and electron microscopy
- 2PA131 Stoichiometry and Inter-subunit Interactions of the Wedge Initiation Complex, Gp10-gp11, of Bacteriophage T4
- A Comparative Study of Output Probability Functions in HMMs
- Gambogic Acid Inhibits Proliferation of Human Lung Carcinoma SPC-A1 Cells in Vivo and in Vitro and Represses Telomerase Activity and Telomerase Reverse Transcriptase mRNA Expression in the Cells(Pharmacology)
- Gambogic Acid Induces Apoptosis and Regulates Expressions of Bax and Bcl-2 Protein in Human Gastric Carcinoma MGC-803 Cells(Pharmacology)
- A Low-Loss 5 GHz Bandpass Filter Using HTS Quarter-Wavelength Coplanar Waveguide Resonators(Special Issue on Superconductive Electronics)
- N, N-Dialkylation of Aminocarbene Complexes under Phase-Transfer Conditions
- Synthesis of (1,5-Diazabicyclo[4.3.0]nonan-2-ylidene)pentacarbonylchromium and -tungsten Using Reaction of 2-Unsaturated Carbene Complexes with 6-Membered Hydrazine
- Text-Independent Speaker Identification Utilizing Likelihood Normalization Technique
- A Statistical Method of Evaluating Pronunciation Proficiency for English Words Spoken by Japanese(Speech and Hearing)
- A Spoken Dialog System with Verification and Clarification Queries (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task(Spoken Language Systems, Corpus-Based Speech Technologies)
- An Unsupervised Speaker Adaptation Method for Lecture-Style Spontaneous Speech Recognition Using Multiple Recognition Systems(Spoken Language Systems, Corpus-Based Speech Technologies)
- Relationship among Recognition Rate, Rejection Rate and False Alarm Rate in a Spoken Word Recognition System
- Continuous Speech Recognition Using an On-Line Speaker Adaptation Method Based on Automatic Speaker Clustering (Special Issue on Speech Information Processing)
- A Spoken Dialog System for Spontaneous Conversations Considering Response Timing and Response Type