Recognition of phonemes in continuous speech using a modified LVQ2 method
スポンサーリンク
概要
- 論文の詳細を見る
This paper proposed a new phoneme recognition method based on the Learning Vector Quatization(LVQ2)algorithm proposed by Kohonen. We propose three versions of a modified training algorithm to overcome a shortcoming of the LVQ2 method. In the modified LVQ2 algorithm, p reference vectors are modified at the same time if the correct class in within the N-th rank where N is set to some constant. Using this algorithm, the phoneme recognition scores obtained by the modified LVQ2 algorithm were higher than those obtained by the original LVQ2 algorithm. Furthermore, we propose a segmentation and recognition method for phonemes in continuous speech. At first a likelihood matrix is computed using the reference vectors, where each row indicates the likelihood sequence of each phoneme and each column indicates the likelihood of all phonemes for each 10-ms unit. The optimum phoneme sequence is computed from the likelihood matrix using the DP with duration constraints. We applied this method to a multi-speaker-dependent phoneme recognition task for continuous speech uttered Bunsetsu by Bunsetsu. The phoneme recognition score was 85. 5% for the speech samples in continuous speech.
- 社団法人日本音響学会の論文
著者
-
Sone Toshio
Research Institute Of Electrical Communication Tohoku University
-
Makino Shozo
Research Center For Applied Information Sciences Tohoku University
-
Makino Shozo
Research Center For Applied Information Science Tohoku University
-
Sone T
Tohoku Univ. Sendai Jpn
-
Endo Mitsuru
Matsushita Research Institute Tokyo Inc.
関連論文
- Loudness and noisiness of a repeated impact sound : Results of round robin tests in Japan(II)
- Preliminary investigation on shape estimation of concrete pile by vibration analysis
- Analysis and recognition of Korean isolated vowels using formant frequency
- A temporal integration model for loudness perception of repeated impulsive sounds
- Equal-loudness level contours for pure tone under free field listening conditions (I) : Some data and considerations on experimental conditions
- Growth of the loudness of a tone burst with a duration up to 10 seconds
- Sound Field Reproduction by Controlling the Transfer Functions from the Source to Multiple Points in Close Proximity
- Sound localization in headphone reproduction by simulating transfer functions from the sound source to the external ear
- 正中面に置かれた二音源による音像定位
- はりの遠方場における振動インテンシティの能動制御
- Adaptive Control of Vibration Intensity in a Beam in the Frequency Domain (Special Section on Advanced Signal Processing Techniques for Analysis of Acoustical and Vibrational Signals)
- Information of Loudness in Aural Communication
- Recognition of phonemes in continuous speech using a modified LVQ2 method
- The Third WESTPAC : The Third Western Pacific Regional Acoustics Conference
- BINAURAL PERCEPTION OF THE MODULATION DEPTH OF AM SIGNALS
- The Fourth WESTPRAC : The Fourth Western Pacific Regional Acoustics Conference