VLSI Architecture of GMM Processing and Viterbi Decoder for 60,000-Word Real-Time Continuous Speech Recognition
Abstract
We propose a low-memory-bandwidth, high-efficiency VLSI architecture for real-time continuous speech recognition with a 60,000-word vocabulary. The architecture combines a cache that exploits the locality of speech recognition, beam pruning with a dynamic threshold, two-stage language model searching, a Gaussian Mixture Model (GMM) architecture parallelized at the mixture level and the frame level, a parallel Viterbi architecture, and pipelined operation between Viterbi transition and GMM processing. Results show that the architecture achieves an 88.24% reduction in required frequency (66.74 MHz) and an 84.04% reduction in memory bandwidth (549.91 MB/s) for real-time 60,000-word continuous speech recognition.
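As a rough software illustration of two of the components named in the abstract, the sketch below computes a GMM log-likelihood for one feature frame and applies beam pruning with a threshold that tightens when too many states survive. It is a minimal sketch under stated assumptions, not the paper's hardware design: the diagonal-covariance form, the function names `gmm_log_likelihood` and `beam_prune`, and the `max_active` tightening policy are all hypothetical choices made for illustration, since the abstract does not specify how the dynamic threshold is computed.

```python
import math


def gmm_log_likelihood(x, weights, means, variances):
    """Log-likelihood of feature vector x under a diagonal-covariance GMM.

    Assumption: diagonal covariances (common in speech recognition, but not
    stated in the abstract).
    """
    log_terms = []
    for w, mu, var in zip(weights, means, variances):
        # Log of one weighted diagonal Gaussian component.
        ll = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            ll += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
        log_terms.append(ll)
    # Log-sum-exp over mixture components, stabilized by the largest term.
    m = max(log_terms)
    return m + math.log(sum(math.exp(t - m) for t in log_terms))


def beam_prune(scores, beam_width, max_active):
    """Keep states whose score is within beam_width of the best score.

    Assumed dynamic policy (hypothetical): if more than max_active states
    survive, raise the threshold to the max_active-th best score.
    Returns the surviving states and the threshold actually used.
    """
    best = max(scores.values())
    threshold = best - beam_width
    survivors = {s: v for s, v in scores.items() if v >= threshold}
    if len(survivors) > max_active:
        threshold = sorted(survivors.values(), reverse=True)[max_active - 1]
        survivors = {s: v for s, v in survivors.items() if v >= threshold}
    return survivors, threshold
```

In this sketch, pruning a frame with scores `{"a": -1.0, "b": -2.0, "c": -10.0, "d": -3.0}`, a beam width of 5.0, and `max_active=2` first drops `"c"` and then tightens the threshold so that only `"a"` and `"b"` remain. Tied scores at the threshold can leave more than `max_active` survivors; a hardware design would need an explicit rule for that case.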