Speaker-Consistent Parsing for Speaker-Independent Continuous Speech Recognition
スポンサーリンク
概要
- 論文の詳細を見る
This paper describes a novel speaker-independent speech recognition method, called "speaker-consistent parsing", which is based on an intra-speaker correlation called the speaker-consistency principle. We focus on the fact that a sentence or a string of words is uttered by an individual speaker even in a speaker-independent task. Thus, the proposed method searches through speaker variations in addition to the contents of utterances. As a result of the recognition process, an appropriate standard speaker is selected for speaker adaptation. This new method is experimentally compared with a conventional speaker-independent speech recognition method. Since the speaker-consistency principle best demonstrates its effect with a large number of training and test speakers, a small-scale experiment may not fully exploit this principle. Nevertheless, even the results of our small-scale experiment show that the new method significantly outperforms the conventional method. In addition, this framework's speaker selection mechanism can drastically reduce the likelihood map computation.
- 社団法人電子情報通信学会の論文
- 1995-06-25
著者
-
Sagayama Shigeki
Ntt Human Interface Laboratories
-
SINGER Harald
ATR音声翻訳通信研究所
-
Singer H
Atr音声翻訳通信研
-
Singer Harald
Atr Interpreting Telecommunications Research Laabs.
-
Yamaguchi Kouichi
SHARP Corporation, Information Technology Research Labs.
-
Matsunaga Shoichi
ATR Interpreting Telecommunications Research Labs
-
Matsunaga Shoichi
Atr Interpreting Telecommunications Research Laboratories
-
Yamaguchi Kouichi
Sharp Corporation Information Technology Research Labs.
関連論文
- Spoken Sentence Recognition Based on HMM-LR with Hybrid Language Modeling (Special Issue on Natural Language Processing and Understanding)
- LR Parsing with a Category Reachability Test Applied to Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- 旅行会話タスクにおけるTARSPRECの性能評価
- 日英音声翻訳システムATR-MATRIXにおける音声認識用音響・言語モデル
- スペクトルサブバンドセントロイドを用いた雑音下での連続音声認識
- 自由発話音声認識における音響分析の比較
- どこでも出来る音声翻訳:クライエントサーバーATR-MATRIX
- 音素配列構造の制約を用いた音素タイプライタ
- An Overview of Speech Recognition with Applications for Medical Professionals
- 話者適応が誤認識特性に及ぼす影響について
- Speaker-Consistent Parsing for Speaker-Independent Continuous Speech Recognition
- Automatic Determination of the Number of Mixture Components for Continuous HMMs Based on a Uniform Variance Criterion
- Unsupervised Speaker Adaptation Using All-Phoneme Ergodic Hidden Markov Network
- Speech Recognition Using Function-Word N-Grams and Content-Word N-Grams
- Discriminative Training Based on Minimum Classification Error for a Small Amount of Data Enhanced by Vector-Field-Smoothed Bayesian Learning