A context clustering technique for improvement of tone intelligibility of average-voice-based Thai speech synthesis (Speech) -- (International Workshop "Asian workshop on speech science and technology")

Abstract

This paper describes a novel approach to the context clustering process in speaker-independent HMM-based Thai speech synthesis, aimed at improving the tone intelligibility of both the average voice and the speaker-adapted voice. In our previous work, phrase intonation features extracted from a generative model were proposed to improve tone intelligibility. In the present work, we propose a number of tonal features, including tone-geometrical features and phrase intonation features, to be exploited in the context clustering process of the HMM training stage. In the experiments, subjective evaluations of tone intelligibility are conducted for both the average voice and the adapted voice. The effects of the extracted features on the decision trees are also evaluated. Two core experiments were conducted that take the gender of the training speech into account. The first experiment shows that the proposed tonal features can improve the tone intelligibility of the female speech model beyond that of the male speech model, while the second experiment shows that the proposed tonal features yield a greater improvement in tone intelligibility for the gender-dependent model than for the gender-independent model. Both experimental results confirm that the tone correctness of the synthesized speech is significantly improved when most of the extracted features are used.
A paper of the Institute of Electronics, Information and Communication Engineers (IEICE)
2008-03-13