One-Pass Search Algorithm for Continuous Speech Recognition Using Generalized LR Parsing : A CFG-Driven, Frame-Synchronous HMM-Based Approach
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we present a novel continuous speech recognition algorithm that integrates three major technologies : (1)hidden Markov models for speech, (2)a generalized LR parser for handling context-free grammar (CFG) constraints, and (3)the one-pass search algorithm for efficient searching. We also introduce three techniques that used in the development of the algorithm : (1)LR path-merging, (2)the use of a shared tree-structured stack, and(3)LR-parser-based dynamic network generation. By means of the proposed algorithm, an optimal hypothesis can be found efficiently for a given speech signal according to a specified CFG in a frame-synchronous process. We implemented an experimental Japanese speech recognition system based on the proposed algorithm, using discrete-type context-independent HMMs without duration control. The system attained a recognition accuracy of 84.1%-88.1%, depending on the beam width. We also experimentally compared our algorithm with the following two methods : (1)the one-pass search algorithm using the finite-state approximation for a CFG, and(2)the HMM-LR algorithm. The experiments showed that the proposed algorithm attained higher accuracy when the beam width was small.
- 一般社団法人情報処理学会の論文
- 1995-05-15
著者
-
Yano Yoneo
Faculty Of Engineering Tokushima University
-
Kita Kenji
Faculty of Engineering, Tokushima University
-
Morimoto Tsuyoshi
ATR Interpreting Telecommunications Research Laboratories
-
Kita K
Faculty Of Engineering Tokushima University
-
Kita Kenji
Faculty Of Engineering Tokushima University
-
Yano Yoneo
Tokushima University
-
Morimoto T
Atr Interpreting Telecommunications Res. Lab. Kyoto Jpn
-
Yano Y
Tokushima University
-
Kita Kenji
Faculty Of Engineering The University Of Tokushima
関連論文
- One-Pass Search Algorithm for Continuous Speech Recognition Using Generalized LR Parsing : A CFG-Driven, Frame-Synchronous HMM-Based Approach
- Spoken Sentence Recognition Based on HMM-LR with Hybrid Language Modeling (Special Issue on Natural Language Processing and Understanding)
- LR Parsing with a Category Reachability Test Applied to Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- A Visualized Explanation Approach to Self-Explanation Learning Environment : VISTA : VIsual Story Teller Architecture
- Kanji Laboratory: An Environmental ICAI System for Kanji Learning (Special Issue on Intelligent CAI and Hypermedia)
- A Comparative Study of Automatic Extraction of Collocations from Corpora : Mutual Information vs. Cost Criteria
- Educational Control of Game Style Learning Environment
- KASTAM: A Model of Kanji Learning for Knowledge Stability
- Development of an Environmental ICAI System for English Conversation Learning (Special Issue on Intelligent CAI and Hypermedia)
- Semantic-level Transfer in Japanese-German Speech Translation : Some Experiences
- FUSING MULTIMODAL INPUTS
- ANALYSIS AND INTEGRATION OF MULTIMODAL INPUTS IN INTERPRETING TELECOMMUNICATIONS
- パターンを用いた曖昧性の検出と再確認方式
- A Unification-Based Japanese Parser for Speech-to-Speech Translation (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- Analysis of Gestures in a Multimedia/Multimodal Interpreting Experiment
- An Empirical Study on Rule Granularity and Unification Interleaving in Unification-Based Parsers
- Continuous Speech Recognition Using a Combination of Syntactic Constraints and Dependency Relationships
- An lnformation-Theoretic Model of Discourse for Next Utterance Type Prediction
- Skeleton Pruning Based on the Total Bisector Angle of the End Branches