LR Parsing with a Category Reachability Test Applied to Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we propose an extended LR parsing algorithm, called LR parsing with a category reachability test (the LR-CRT algorithm). The LR-CRT algorithm enables a parser to efficiently recognize those sentences that belong to a specified grammatical category. The key point of the algorithm is to use an augmented LR parsing table in which each action entry contains a set of reachable categories. When executing a shift or reduce action, the parser checks whether the action can reach a given category using the augmented table. We apply the LR-CRT algorithm to improve a speech recognition system based on two-level LR parsing. This system uses two kinds of grammars, inter-and intra-phrase grammars, to recognize Japanese sentential speech. Two-level LR parsing guides the search of speech recognition through two-level symbol prediction, phrase category prediction and phone prediction, based on these grammars. The LR-CRT algorithm makes possible the efficient phone prediction based on the phrase category prediction. The system was evaluated using sentential speech data uttered phrase by phrase, and attained a word accuracy of 97.5% and a sentence accuracy of 91.2%.
- 社団法人電子情報通信学会の論文
- 1993-01-25
著者
-
Kita Kenji
Faculty of Engineering, Tokushima University
-
Morimoto Tsuyoshi
ATR Interpreting Telecommunications Research Laboratories
-
Sagayama S
Atr Interpreting Telephony Research Lab.
-
Sagayama Shigeki
Ntt Human Interface Laboratories
-
Sagayama Shigeki
ATR Interpreting Telephony Research Laboratories
-
Kita Kenji
ATR Interpreting Telephony Research Laboratories
-
Kita K
Faculty Of Engineering Tokushima University
-
Kita Kenji
Faculty Of Engineering Tokushima University
-
Morimoto T
Atr Interpreting Telecommunications Res. Lab. Kyoto Jpn
関連論文
- One-Pass Search Algorithm for Continuous Speech Recognition Using Generalized LR Parsing : A CFG-Driven, Frame-Synchronous HMM-Based Approach
- A voice conversion based on phoneme segment mapping
- Spoken Sentence Recognition Based on HMM-LR with Hybrid Language Modeling (Special Issue on Natural Language Processing and Understanding)
- LR Parsing with a Category Reachability Test Applied to Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- A pairwise discriminant approach using artificial neural networks for continuous speech recognition
- Speaker Weighted Training of HMM Using Multiple Reference Speakers
- Semantic-level Transfer in Japanese-German Speech Translation : Some Experiences
- FUSING MULTIMODAL INPUTS
- ANALYSIS AND INTEGRATION OF MULTIMODAL INPUTS IN INTERPRETING TELECOMMUNICATIONS
- パターンを用いた曖昧性の検出と再確認方式
- Speaker-Consistent Parsing for Speaker-Independent Continuous Speech Recognition
- A Unification-Based Japanese Parser for Speech-to-Speech Translation (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- Speaker Adaptation Based on Vector Field Smoothing
- Analysis of Gestures in a Multimedia/Multimodal Interpreting Experiment
- Automatic Determination of the Number of Mixture Components for Continuous HMMs Based on a Uniform Variance Criterion
- Minimum error classification training of HMMs : Implementation details and experimental results
- An Empirical Study on Rule Granularity and Unification Interleaving in Unification-Based Parsers
- Unsupervised Speaker Adaptation Using All-Phoneme Ergodic Hidden Markov Network
- HMM speech recognition using stochastic language models
- Continuous Speech Recognition Using a Combination of Syntactic Constraints and Dependency Relationships
- An lnformation-Theoretic Model of Discourse for Next Utterance Type Prediction
- Speech Recognition Using Function-Word N-Grams and Content-Word N-Grams
- Isolated Word Recognition Using Pitch Pattern Information
- Three Different LR Parsing Algorithms for Phoneme-Context-Dependent HMM-Based Continuous Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- Skeleton Pruning Based on the Total Bisector Angle of the End Branches
- Discriminative Training Based on Minimum Classification Error for a Small Amount of Data Enhanced by Vector-Field-Smoothed Bayesian Learning