Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, tone nucleus model is employed to represent and convert F0 contour for synthesizing an emotional Mandarin speech from a neutral speech. Compared with previous prosody transforming methods, the proposed method 1) only converts the tone nucleus part of each syllable rather than the whole F0 contour to avoid the data sparseness problems; 2) builds mapping functions for well-chosen tone nucleus model parameters to better capture Mandarin tonal information. Using only a modest amount of training data, the perceptual accuracy achieved by our method was shown to be comparable to that obtained by a professional speaker.
- 2011-07-14
著者
-
Keikichi Hirose
Department Of Information And Communication Engineering The University Of Tokyo
-
Nobuaki Minematsu
Department Of Information And Communication Engineering The University Of Tokyo
-
Miaomiao Wen
東京大学大学院工学系研究科
-
Miaomiao Wang
東京大学大学院工学系研究科
-
Keikichi Hirose
東京大学大学院情報理工学系研究科
-
Nobuaki Minematsu
東京大学大学院情報理工学系研究科
-
Miaomiao Wang
Department Of Electrical Engineering And Information Systems The University Of Tokyo
関連論文
- An Investigation of Hidden Structure Model
- Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model
- Prosody Improvement for HMM-based Mandarin Speech Synthesis Using the Tone Nucleus Model
- A Preliminary Perceptual Analysis on the Relationship of Phoneme Duration and Speaking Rate
- 言語クラスタリングと自動単位選抜による波形重畳音声合成