Model Shrinkage for Discriminative Language Models
スポンサーリンク
概要
- 論文の詳細を見る
This paper describes a technique for overcoming the model shrinkage problem in automatic speech recognition (ASR), which allows application developers and users to control the model size with less degradation of accuracy. Recently, models for ASR systems tend to be large and this can constitute a bottleneck for developers and users without special knowledge of ASR with respect to introducing the ASR function. Specifically, discriminative language models (DLMs) are usually designed in a high-dimensional parameter space, although DLMs have gained increasing attention as an approach for improving recognition accuracy. Our proposed method can be applied to linear models including DLMs, in which the score of an input sample is given by the inner product of its features and the model parameters, but our proposed method can shrink models in an easy computation by obtaining simple statistics, which are square sums of feature values appearing in a data set. Our experimental results show that our proposed method can shrink a DLM with little degradation in accuracy and perform properly whether or not the data for obtaining the statistics are the same as the data for training the model.
- 2012-05-01
著者
-
Nakamura Atsushi
Ntt Communication Science Laboratories Ntt Corporation
-
Oba Takanobu
Ntt Communication Science Laboratories Ntt Corporation
-
Hori Takaaki
Ntt Communication Science Laboratories Ntt Corporation
-
Ito Akinori
The Graduate School Of Engineering Tohoku University
関連論文
- Improved Phoneme-History-Dependent Search Method for Large-Vocabulary Continuous-Speech Recognition
- Improved Sequential Dependency Analysis Integrating Labeling-Based Sentence Boundary Detection
- Efficient discriminative training of error corrective models using high-WER competitors (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Efficient discriminative training of error corrective models using high-WER competitors
- Speech Recognition Based on Student's t-Distribution Derived from Total Bayesian Framework(Speech Recognition, Statistical Modeling for Speech Processing)
- Selection of Shared-State Hidden Markov Model Structure Using Bayesian Criterion(the 2003 IEICE Excellent Paper Award)
- Efficient Combination of Likelihood Recycling and Batch Calculation for Fast Acoustic Likelihood Calculation
- Production-Oriented Models for Speech Recognition(Speech Recognition, Statistical Modeling for Speech Processing)
- Model Shrinkage for Discriminative Language Models