Toward speech recognition of continuously spoken Chinese sentences.
Abstract
A fundamental speech recognition procedure for continuously spoken simple Chinese sentences is described, and the method of specific regions is proposed. Phonemes are identified every 10 ms by extracting the number of zero-crossings, PARCOR coefficients, F1, F2, etc. from speech waves. In the Chinese language, each Chinese character is pronounced as a monosyllable and has a definite meaning. Using these characteristics, continuously spoken speech waves are divided into monosyllables, and the vowel segment of each monosyllable is partitioned into 8 minor segments. The average of the first 2 formant frequencies over the first minor vowel segment points to a specific region on the F1-F2 plane. Since this region determines a group of monosyllables or Chinese characters that have similar vowels, a monosyllable can be identified from among them. Moreover, use of a syntactic state transition network improves recognition scores for sentences. Average recognition scores for 130 characters and 33 sentences uttered by 3 male adults are 90.7% and 75.7%, respectively.
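The specific-region step described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the per-frame (F1, F2) input format, the rectangular shape of the regions, the region names, and all frequency bounds in `REGIONS` are hypothetical, since the abstract does not give them. Only the partition into 8 minor segments and the averaging over the first one come from the text.

```python
from statistics import mean

def split_into_minor_segments(frames, n_segments=8):
    """Split per-frame (F1, F2) pairs into n_segments consecutive, nearly equal parts."""
    n = len(frames)
    bounds = [i * n // n_segments for i in range(n_segments + 1)]
    return [frames[bounds[i]:bounds[i + 1]] for i in range(n_segments)]

def first_segment_formants(frames, n_segments=8):
    """Average F1 and F2 over the first minor segment of a vowel."""
    first = split_into_minor_segments(frames, n_segments)[0]
    if not first:
        raise ValueError("vowel segment too short to form a first minor segment")
    avg_f1 = mean(f1 for f1, _ in first)
    avg_f2 = mean(f2 for _, f2 in first)
    return avg_f1, avg_f2

# Hypothetical regions on the F1-F2 plane, in Hz: (F1 min, F1 max, F2 min, F2 max).
# The names and bounds are placeholders, not values from the paper.
REGIONS = {
    "a-like": (600, 1000, 1000, 1600),
    "i-like": (200, 400, 2000, 2800),
    "u-like": (250, 450, 600, 1100),
}

def find_region(f1, f2, regions=REGIONS):
    """Return the name of the first region containing (f1, f2), or None."""
    for name, (f1_lo, f1_hi, f2_lo, f2_hi) in regions.items():
        if f1_lo <= f1 <= f1_hi and f2_lo <= f2 <= f2_hi:
            return name
    return None

# A 160 ms vowel analysed every 10 ms gives 16 frames, so 2 frames per minor segment.
vowel_frames = [(700, 1200), (720, 1240)] + [(500, 1500)] * 14
f1, f2 = first_segment_formants(vowel_frames)
candidate_group = find_region(f1, f2)
```

In this sketch the returned region name stands in for the group of monosyllables with similar vowels; picking a single monosyllable within that group, and applying the syntactic state transition network, are separate steps not shown here.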
Papers from the Acoustical Society of Japan (一般社団法人 日本音響学会)
- How large is the individual difference in hearing sensitivity?: Establishment of ISO 28961 on the statistical distribution of hearing thresholds of otologically normal young persons
- Applying generation process model constraint to fundamental frequency contours generated by hidden-Markov-model-based speech synthesis
- Vocal cord vibration in the production of consonants: Observation by means of high-speed digital imaging using a fiberscope
- The early reflections of the impulse response in an auditorium.
- Multiple reflections between rigid plane panels.