Prosody reconstruction by rescaling fundamental frequency contours in order to synthesize communicative speech (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
スポンサーリンク
概要
- 論文の詳細を見る
This paper presents a method of prosody reconstruction that can be used to synthesize conversational speech. In our method, we use a conventional text-to-speech engine to initially generate reading-style prosody for input text. We then use a frequency modulation technique to rescale the fundamental frequency (F_0) contours to add the communicative functions of intonation to the synthesized speech. The frequency modulation technique is based on a functional F_0 model, and the transformation scales are modeled by combining simple piece-wise-linear patterns according to input tags. We conducted two experiments to evaluate our method: modulating the F_0 range of reading-style prosody when synthesizing Japanese speech to convey "good news" and "bad news", and making a narrow focus when synthesizing Chinese dialog to convey emphasis. The results showed that our method could use much para-linguistic information to achieve specific communicative purposes.
- 社団法人電子情報通信学会の論文
- 2008-03-13
著者
-
Sakai Shinsuke
National Institute Of Information And Communications Technology:atr Spoken Language Translation Rese
-
Ni Jinfu
National Institute Of Information And Communications Technology:atr Spoken Language Translation Rese
-
Nakamura Satoshi
National Inst. Information And Communications Technol. (nict) Kyoto‐fu Jpn
関連論文
- CENSREC-1-C : An evaluation framework for voice activity detection under noisy environments
- Class-Dependent Modeling for Dialog Translation
- Using Mutual Information Criterion to Design an Efficient Phoneme Set for Chinese Speech Recognition
- A Non-stationary Noise Suppression Method Based on Particle Filtering and Polyak Averaging(Speech Recognition, Statistical Modeling for Speech Processing)
- Learning, Generation and Recognition of Motions by Reference-Point-Dependent Probabilistic Models
- Prosody reconstruction by rescaling fundamental frequency contours in order to synthesize communicative speech (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Prosody reconstruction by rescaling fundamental frequency contours in order to synthesize communicative speech
- Ambient Browser: Web Browser for Daily Use (日韓合同ワークショップ 1st Korea-Japan Joint Workshop on Ubiquitous Computing and Networking Systems (ubiCNS 2005))
- A Bayesian Model of Transliteration and Its Human Evaluation When Integrated into a Machine Translation System
- CENSREC-4: An evaluation framework for distant-talking speech recognition in reverberant environments
- Situated Spoken Dialogue with Robots Using Active Learning