Language Modeling Using Patterns Extracted from Parse Trees for Speech Recognition (<Special Issue>Special Issue on Speech Information Processing)
スポンサーリンク
概要
- 論文の詳細を見る
We propose new language models to represent, phrasal structures by patterns extracted from parse trees. First, modified word trigram models are proposed. They are extracted from sentences analyzed by the preprocessing of the parser with knowledge. Since sentences are analyzed to create sub-trees of a few words, these trigram models can represent relations among a few neighbor words more strongly than conventional word trigram models. Second, word pattern models are used on these modified word trigram models. The word patterns are extracted from parse trees and can represent phrasal structures and much longer word-dependency than trigram models. Experimental results show that modified trigram models are more effective than traditional trigram models and that pattern models attain slight improvements over modified trigram models. Furthermore, additional experiments show that pattern models are more effective for long sentences.
- 社団法人電子情報通信学会の論文
- 2003-03-01
著者
-
Jitsuhiro Takatoshi
Spoken Language Translation Research Laboratories Advanced Telecommunications Research Institute Int
-
Kikui Genichiro
Spoken Language Translation Research Laboratories Advanced Telecommunications Research Institute Int
-
YAMAMOTO Hirofumi
Spoken Language Translation Research Laboratories, Advanced Telecommunications Research Institute In
-
YAMADA Setsuo
Spoken Language Translation Research Laboratories, Advanced Telecommunications Research Institute In
-
SAGISAKA Yoshinori
Spoken Language Translation Research Laboratories, Advanced Telecommunications Research Institute In
-
Yamada Setsuo
Spoken Language Translation Research Laboratories Advanced Telecommunications Research Institute Int
-
Sagisaka Yoshinori
Spoken Language Translation Research Laboratories Advanced Telecommunications Research Institute Int
-
Yamamoto Hirofumi
Spoken Language Translation Research Laboratories Advanced Telecommunications Research Institute Int
関連論文
- Automatic Generation of Non-uniform and Context-Dependent HMMs Based on the Variational Bayesian Approach(Feature Extraction and Acoustic Medelings, Corpus-Based Speech Technologies)
- Language Modeling Using Patterns Extracted from Parse Trees for Speech Recognition (Special Issue on Speech Information Processing)
- Automatic Generation of Non-uniform HMM Topologies Based on the MDL Criterion(Speech and Hearing)