Perceptually-Related F0 Parameters for Automatic Classification of Phrase Final Tones (Speech Synthesis and Prosody, <Special Section> Corpus-Based Speech Technologies)
Abstract
Automatic labeling of prosodic features is an important topic when constructing large speech databases for speech synthesis or analysis purposes. Perceptually-related F0 parameters are proposed with the aim of automatically classifying phrase final tones. Analyses are conducted to verify how consistently subjects are able to categorize phrase final tones, and how perceptual features are related to the categories. Three types of acoustic parameters are proposed and analyzed for representing the perceptual features related to the tone categories: one related to pitch movement within the phrase final, one related to pitch reset prior to the phrase final, and one related to the length of the phrase final. A classification tree is constructed to evaluate automatic classification of phrase final tones, resulting in 79.2% accuracy for the consistently categorized samples, using the best combination among the proposed acoustic parameters.
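The three parameter types named in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the semitone scale, the half-split definition of pitch movement, the definition of pitch reset, the 10 ms frame shift, and all function and key names.

```python
import math


def hz_to_semitones(f_hz, ref_hz=100.0):
    """Convert F0 in Hz to semitones relative to ref_hz (assumed reference)."""
    return 12.0 * math.log2(f_hz / ref_hz)


def mean(values):
    return sum(values) / len(values)


def final_tone_features(f0_prev, f0_final, frame_s=0.01):
    """Hypothetical features for one phrase-final segment.

    f0_prev:  voiced F0 samples (Hz) of the segment before the phrase final
    f0_final: voiced F0 samples (Hz) of the phrase-final segment
    frame_s:  assumed frame shift in seconds
    """
    if len(f0_final) < 2 or not f0_prev:
        raise ValueError("need at least 2 final frames and 1 preceding frame")
    prev_st = [hz_to_semitones(f) for f in f0_prev]
    final_st = [hz_to_semitones(f) for f in f0_final]
    half = len(final_st) // 2
    # Pitch movement within the phrase final:
    # mean of the second half minus mean of the first half (assumed definition).
    movement = mean(final_st[half:]) - mean(final_st[:half])
    # Pitch reset prior to the phrase final:
    # start of the final relative to the preceding segment (assumed definition).
    reset = mean(final_st[:half]) - mean(prev_st)
    # Length of the phrase final in seconds.
    duration = len(final_st) * frame_s
    return {"movement_st": movement, "reset_st": reset, "duration_s": duration}


feats = final_tone_features([100.0, 100.0], [100.0, 100.0, 200.0, 200.0])
print(feats)
```

Feature vectors of this kind could then be passed to any decision-tree learner together with the perceptual tone labels; which parameter combination works best is an empirical question that the paper addresses.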
- Paper of the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2005-03-01