Isolated Word Recognition Using Pitch Pattern Information
スポンサーリンク
概要
- 論文の詳細を見る
This paper describes a new technique for isolated word recognition that uses both pitch information and spectral information. In conventional methods, words with similar phoneme features tend to be misrecognized even if their phonemes are accented differently because these methods use only spectral information. It is possible to improve recognition accuracy by considering pitch patterns of words. Many phonetically-similar Japanese words are classified by pitch patterns. In this technique, a pitch pattern template is produced by averaging pitch patterns obtained from a set of words which have the same accent pattern. A measure for word recognition is proposed. This measure based on a combination of the phoneme likelihood and the pitch pattern distance which is the distance between a pitch pattern of an input speech and pitch pattern templates. Speaker-dependent word recognition experiments were carried out using 216 Japanese words uttered by five male and five female speakers. The proposed technique reduces the recognition error rate by 40 compared with the conventional method using only phoneme likelihood.
- 社団法人電子情報通信学会の論文
- 1993-02-25
著者
-
Sagayama Shigeki
ATR Interpreting Telephony Research Laboratories
-
Takahashi Satoshi
NTT Human Interface Laboratories
-
Matsunaga Sho-ichi
NTT Human Interface Laboratories
関連論文
- A voice conversion based on phoneme segment mapping
- LR Parsing with a Category Reachability Test Applied to Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- A pairwise discriminant approach using artificial neural networks for continuous speech recognition
- Speaker Weighted Training of HMM Using Multiple Reference Speakers
- An HMM State Duration Control Algorithm Applied to Large-Vocabulary Spontaneous Speech Recognition
- Speaker Adaptation Based on Vector Field Smoothing
- Task Adaptation in Syllable Trigram Models for Continuous Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- Minimum error classification training of HMMs : Implementation details and experimental results
- Isolated Word Recognition Using Pitch Pattern Information
- Three Different LR Parsing Algorithms for Phoneme-Context-Dependent HMM-Based Continuous Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)