Continuous Speech Segmentation Based on a Self-Learning Neuro-Fuzzy System (Special Section on Digital Signal Processing)
スポンサーリンク
概要
- 論文の詳細を見る
For reducing requirement of large memory and minimizing computation complexity in a large-vocabulary continuous speech recognition system, speech segmentation plays an important role in speech recognition systems. In this paper, we formulate the speech segmentation as a two-phase problem. Phase 1 (frame labeling) involves labeling frames of speech data. Frames are classified into three types: (1) silence, (2) consonant and (3) vowel according to two segmentation features. In phase 2 (syllabic unit segmentation) we apply the concept of transition states to segment continuous speech data into syllabic units based on the labeled frames. The novel class of hyperrectangular composite neural networks (HRCNNs) is used to cluster frames. The HRCNNs integrate the rule-based approach and neural network paradigms, therefore, this special hybrid system may neutralize the disadvantages of each alternative. The parameters of the trained HRCNNs are utilized to extract both crisp and fuzzy classification rules. In our experiments, a database containing continuous reading-rate Mandarin speech recorded from newscast was utilized to illustrate the performance of the proposed speaker independent speech segmentation system. The effectiveness of the proposed segmentation system is confirmed by the experimental results.
- 社団法人電子情報通信学会の論文
- 1996-08-25
著者
-
Su Mu-chun
Department Of Computer Science And Information Engineering National Central University
-
Su Mu-chun
Department Of Electrical Engineering Tamkang University
-
HSIEH Ching-Tang
Department of Electrical Engineering Tamkang University
-
HSU Chih-Hsu
Department of Electrical Engineering Tamkang University
-
Hsieh C‐t
Tamkang Univ. Taipei County Twn
関連論文
- A Competitive Learning Algorithm Using Symmetry
- A Healing Mechanism to Improve the Topological Preserving Property of Feature Maps
- Continuous Speech Segmentation Based on a Self-Learning Neuro-Fuzzy System (Special Section on Digital Signal Processing)
- Associative-Memory-Based Human Face Detection
- A Novel Bandelet-Based Image Inpainting
- Progressive Image Inpainting Based on Wavelet Transform(Image Coding, Information Theory and Its Applications)
- Robust Speaker Identification System Based on Multilayer Eigen-Codebook Vector Quantization(Speech Dynamics by Ear, Eye, Mouth and Machine)
- Generalized Fuzzy Kohonen Clustering Networks (Special Section on Information Theory and Its Applications)