Extraction of Low Dimensional Representation of Vowels in Articulatory Space (国際ワークショップ Frontiers in Speech and Hearing Research)
スポンサーリンク
概要
- 論文の詳細を見る
Cognitive science shows that perceptual task, such as similarity measurement, is carried out in a low dimensional representation space, and patterns are represented in a topological view in the low dimensional space by considering their similarity relationship. Accordingly, it is assumed that speech production and perception in the high level are carried out in lower dimensions with a similar topology. In this paper, we used an unsupervised learning method, i.e., Locally Linear Embedding (LLE) to extract low dimensional structure of five Japanese vowels using articulatory data with eight observation points. The results showed that the learned topological structure in articulatory space with a low dimension is consistent with F1-F2 pattern of the vowels in acoustic domain. Because the acoustic data is produced by the articulatory movement which is controlled in motor area in brain, these kinds of topological structures may suggest the cognition of speech in high level by invariant topology mapping between different spaces. Also, from neural mechanism for pattern encoding aspect in high level, with the evolution of neurons which are exposed to so many patterns, the plasticity of neurons is adapted to encode all speech patterns efficiently by reference encoding which explores their similarity relationship between them.
- 社団法人電子情報通信学会の論文
- 2006-03-20
著者
-
Lu Xugang
Atr Spoken Language Communication Research Laboratories
-
Lu Xugang
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Lu Xugang
Japan Advanced Institute Of Science And Technology
-
Dang Jianwu
Japan Advanced Inst. Of Sci. And Technol. Ishikawa Jpn
-
Lu Xugang
Information School Japan Advanced Institute Of Science And Technology
-
Dang Jianwu
Information School Japan Advanced Institute Of Science And Technology
関連論文
- Robust voice activity detection based on noise eigenspace
- A model-based investigation of activations of the tongue muscles in vowel production
- Speech Enhancement based on Noise Eigenspace Projection
- A speech enhancement framework based on noise eigenspace projection (音声)
- Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems
- Sub-Band Temporal Envelope Restoration for ASR in Reverberation Environment (国際ワークショップ Frontiers in Speech and Hearing Research)
- Robust speech feature extraction based on auditory neuronal adaptation mechanism
- A Model-Based Learning Process for Modeling Coarticulation of Human Speech(Knowledge, Information and Creativity Support System)
- Normalization of vocal tract shape using radial basis function (音声)
- Normalization of vocal tract shape using radial basis function