Invited Paper What are the Essential Cues for Understanding Spoken Language ? (国際ワークショップ:Speech Dynamics by Ear, Eye, Mouth and Machine)
スポンサーリンク
概要
- 論文の詳細を見る
Classical models of speech recognition assume that a detailed, short-term analysis of the acoustic signal is essential for accurately decoding the speech signal and that this decoding process is rooted in the phonetic segment. This paper presents an alternative view, one in which the time scales required to accurately describe and model spoken language are both shorter and longer than the phonetic segment, and are inherently wedded to the syllable. The syllable reflects a singular property of the acoustic signal - the mod - ulation spectrum - which provides a principled, quantitative framework to describe the process by which the listener proceeds from sound to meaning. The ability to understand spoken language (i.e., intelligibility) vitally depends on the integrity of the modulation spectrum within the core range of the syllable (3-10 Hz) and reflects the variation in syllable emphasis associated with the concept of prosodic prominence ("accent"). A model of spoken language is described in which the prosodic properties of the speech signal are embedded in the temporal dynamics associated with the syllable, a unit serving as the organizational interface among the various tiers of linguistic representation.
- 社団法人電子情報通信学会の論文
- 2003-06-20
著者
関連論文
- What are the Essential Cues for Understanding Spoken Language?(Speech Dynamics by Ear, Eye, Mouth and Machine)
- Special issue on introduction to the amazing world of sounds with demonstrations(introduction to the amazing world of sounds with demonstrations)
- Study on Noise Reduction of Ventilator Noise in Recoded Speech Signals (国際ワークショップ Frontiers in Speech and Hearing Research)
- Invited Paper What are the Essential Cues for Understanding Spoken Language ? (国際ワークショップ:Speech Dynamics by Ear, Eye, Mouth and Machine)
- Speech Dynamics by Ear, Eye, Mouth and Machine