Phoneme Power Control for Speech Synthesis (Special Section on Speech Synthesis: Current Technologies and Equipment)
スポンサーリンク
概要
- 論文の詳細を見る
This paper proposes a new method of phoneme power control for speech synthesis by rule. The innovation of this method lies in its use of the phoneme environment and the relationship between speech power and pitch frequency. First, the permissible threshold (PT) for power modification is measured by subjective experiments using power manipulated speech material. As a result, it is concluded that the PT of power modification is 4.1 dB. This experimental result is significant when discussing power control and gives a criterion for power control accuracy. Next, the relationship between speech power and pitch frequency is analyzed using a very large speech data base. The results show that the relationship between phoneme power and pitch frequency is affected by the kind of phoneme, the adjoining phonemes, rising or falling pitch, and initial or final position in the sentence. Finally, we propose that the phoneme power should be controlled by pitch frequency and phoneme environment. This proposal is implemented in a waveform concatenation type text-to-speech synthesizer. This new method yields an averaged root mean square error between real and estimated speech power of 2.17 dB. This value indicates that 94 of the estimated power values are within the permissible threshold of human perception.
- 社団法人電子情報通信学会の論文
- 1993-11-25
著者
-
Sato Hirokazu
Speech And Acoustics Laboratory Ntt Human Interface Laboratories
-
Itoh Kenzo
Speech and Acoustics Laboratory, NTT Human Interface Laboratories
-
Hirokawa Tomohisa
Speech and Acoustics Laboratory, NTT Human Interface Laboratories
-
Itoh Kenzo
Speech And Acoustics Laboratory Ntt Human Interface Laboratories
-
Hirokawa Tomohisa
Speech And Acoustics Laboratory Ntt Human Interface Laboratories
関連論文
- High Quality Speech Synthesis System Based on Waveform Concatenation of Phoneme Segment (Special Section on Speech Synthesis: Current Technologies and Equipment)
- Phoneme Power Control for Speech Synthesis (Special Section on Speech Synthesis: Current Technologies and Equipment)