Automatic Measurement of Pressed/Breathy Phonation at Acoustic Centres of Reliability in Continuous Speech (<Special Issue>Special Issue on Speech Information Processing)
スポンサーリンク
概要
- 論文の詳細を見る
With the aim of enabling concatenative synthesis of expressive speech, we herein report progress towards developing robust and automatic algorithms for paralinguistic annotation of very large recorded-speech corpora. In particular, we describe a method of combining robust acoustic-prosodic and cepstral analyses to locate centres of acoustic-phonetic reliability in the speech stream, wherein physiologically meaningful parameters related to voice quality can be estimated more reliably. We then report some evaluations of a specific voice-quality parameter known as the glottal Amplitude Quotient (AQ), which was proposed in [2], [6] and is here measured automatically at centres of reliability in continuous speech. Analyses of a large, single-speaker corpus of emotional speech first validate the perceptual importance of the AQ parameter in quantifying the mode of phonation along the pressed-modal-breathy continuum, then reveal some of its phonetic, prosodic, and paralinguistic dependencies.
- 社団法人電子情報通信学会の論文
- 2003-03-01
著者
-
Mokhtari Parham
Jst
-
CAMPBELL Nick
CREST-ESP Project, at ATR Human Information Science Laboratories
-
Campbell Nick
Crest-esp Project At Atr Human Information Science Laboratories