Myanmar text-to-speech system with rule-based tone synthesis
スポンサーリンク
概要
- 論文の詳細を見る
We have introduced a novel Myanmar text to speech (MyanmarTTS) system with rule-based tone synthesis. Myanmar is a tonal language that possesses unique characteristics compared with other tonal languages such as Chinese, Vietnamese and Thai. Such languages have complicated fundamental-frequency (<I>F</I><SUB>0</SUB>) patterns of tones, and <I>F</I><SUB>0</SUB> is of foremost importance. Myanmar tones are unique in their simplistic pattern related not only to <I>F</I><SUB>0</SUB> but also, more specifically to duration. Myanmar tones have different durations between short-tone and long-tone groups. In accordance, we defined a tone rule employing two parameters <I>F</I><SUB>0</SUB> at the center of the syllable and the syllable’s duration. The rule is implemented with a linear <I>F</I><SUB>0</SUB> pattern. Large variability exists in the <I>F</I><SUB>0</SUB> and duration uttered by different speakers of different syllables. Hence, for tone synthesis, normalization of the <I>F</I><SUB>0</SUB> and duration is important and necessary to discriminate tones. We proposed a normalization method and the effectiveness of this method was confirmed in the distribution of the <I>F</I><SUB>0</SUB> and duration. The intelligibility of the synthesized tone was confirmed through listening tests with correct rates of 95.6% for male and 97.8% for female speech. As a result, we showed that the linear pattern is sufficient for Myanmar tone synthesis.
著者
-
Takara Tomio
Department Of Information Engineering University Of The Ryukyus
-
Win Kyawt
Department of Information Engineering, University of the Ryukyus
関連論文
- Vietnamese Text-To-Speech system with precise tone generation
- Perceptual characteristics of broken and drop tones in Vietnamese
- Myanmar text-to-speech system with rule-based tone synthesis