Automatic Estimation of Accentual Attribute Values of Words for Accent Sandhi Rules of Japanese Text-to-Speech Conversion (Special Issue on Speech Information Processing)
Abstract
Accurate estimation of the accentual attribute values of words, which is required to apply Japanese word accent sandhi rules to prosody generation, is an important factor in realizing high-quality text-to-speech (TTS) conversion. The rules were formulated by Sagisaka et al. [1], [2] and are widely used in Japanese TTS conversion systems. Applying them, however, requires the values of a few accentual attributes for each constituent word of the input text. These attribute values cannot be found in any public database or accent dictionary of Japanese. Further, they are difficult to estimate even for native speakers of Japanese relying only on introspection about their mother tongue. This paper proposes an algorithm that estimates these values automatically from a large amount of data on the accent types of accentual phrases, collected through a long series of listening experiments. The proposed algorithm takes inter-speaker differences in knowledge of accent sandhi into account. To improve the coverage of the estimated values over the collected data, the rules were tentatively modified. Evaluation experiments using two-mora accentual phrases showed the high validity of the estimated values and the modified rules, and also revealed some defects caused by the variety of linguistic expressions in Japanese.
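The estimation idea in the abstract can be illustrated with a minimal sketch. It is not the algorithm of the paper: the candidate values in `CANDIDATES`, the rule in `toy_rule`, and the per-speaker majority vote in `estimate_attribute` are all hypothetical stand-ins chosen for illustration, since the abstract gives neither the actual rules of Sagisaka et al. nor the way inter-speaker differences are handled. The sketch only shows the general shape of the approach: pick the attribute value whose rule-based predictions agree best with the accent types that listeners reported.

```python
from collections import Counter, defaultdict

# Hypothetical candidate values for one accentual attribute (not from the paper).
CANDIDATES = ("A", "B", "C")


def toy_rule(attr, head_accent):
    """Toy stand-in for an accent sandhi rule (NOT the Sagisaka rules).

    Maps the attribute value of the second word and the accent type of
    the first word to a predicted accent type for the whole phrase.
    """
    if attr == "A":
        return head_accent      # phrase keeps the accent of the first word
    if attr == "B":
        return 0                # phrase becomes unaccented (type 0)
    return head_accent + 1      # accent position shifts by one


def estimate_attribute(observations):
    """Estimate one word's attribute value from observed phrase accents.

    observations: list of (speaker, head_accent, observed_phrase_accent).
    For each speaker, choose the candidate that matches the most of that
    speaker's observations; then take a majority vote across speakers.
    The per-speaker vote is an assumption made for this sketch.
    """
    by_speaker = defaultdict(list)
    for speaker, head, observed in observations:
        by_speaker[speaker].append((head, observed))

    votes = Counter()
    for items in by_speaker.values():
        scores = {
            c: sum(toy_rule(c, head) == observed for head, observed in items)
            for c in CANDIDATES
        }
        # Ties are broken by the order of CANDIDATES.
        votes[max(CANDIDATES, key=lambda c: scores[c])] += 1

    winner, _ = votes.most_common(1)[0]
    return winner, dict(votes)


if __name__ == "__main__":
    # Made-up observations from three hypothetical speakers.
    data = [
        ("s1", 1, 0), ("s1", 2, 0),
        ("s2", 1, 0), ("s2", 3, 0),
        ("s3", 1, 1), ("s3", 2, 2),
    ]
    print(estimate_attribute(data))
```

With this made-up data, two speakers are best explained by value `"B"` and one by `"A"`, so the sketch returns `"B"` together with the vote counts. Keeping the per-speaker votes visible makes any disagreement between speakers explicit instead of averaging it away.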
- Paper of the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2003-03-01
Authors
-
Hirose Keikichi
Graduate School Of Frontier Sciences The University Of Tokyo
-
Minematsu Nobuaki
Graduate School Of Information Science And Technology The University Of Tokyo
-
KITA Ryuji
Graduate School of Information Science and Technology, The University of Tokyo
-
MINEMATSU Nobuaki
Graduate School of Frontier Sciences, The University of Tokyo
Related papers
- Regularized Maximum Likelihood Linear Regression Adaptation for Computer-Assisted Language Learning Systems
- Speaker Verification in Realistic Noisy Environment in Forensic Science
- Automatic Estimation of Accentual Attribute Values of Words for Accent Sandhi Rules of Japanese Text-to-Speech Conversion (Special Issue on Speech Information Processing)
- Prosodic Analysis and Modeling of Nagauta Singing to Generate Prosodic Contours from Standard Scores (Speech Dynamics by Ear, Eye, Mouth and Machine)