Adaptation to Pronunciation Variations in Indonesian Spoken Query-Based Information Retrieval
スポンサーリンク
概要
- 論文の詳細を見る
Recognition errors of proper nouns and foreign words significantly decrease the performance of ASR-based speech applications such as voice dialing systems, speech summarization, spoken document retrieval, and spoken query-based information retrieval (IR). The reason is that proper nouns and words that come from other languages are usually the most important key words. The loss of such words due to misrecognition in turn leads to a loss of significant information from the speech source. This paper focuses on how to improve the performance of Indonesian ASR by alleviating the problem of pronunciation variation of proper nouns and foreign words (English words in particular). To improve the proper noun recognition accuracy, proper-noun specific acoustic models are created by supervised adaptation using maximum likelihood linear regression (MLLR). To improve English word recognition, the pronunciation of English words contained in the lexicon is fixed by using rule-based English-to-Indonesian phoneme mapping. The effectiveness of the proposed method was confirmed through spoken query based Indonesian IR. We used Inference Network-based (IN-based) IR and compared its results with those of the classical Vector Space Model (VSM) IR, both using a tf-idf weighting schema. Experimental results show that IN-based IR outperforms VSM IR.
- (社)電子情報通信学会の論文
- 2010-09-01
著者
-
Furui Sadaoki
Tokyo Inst. Of Technol. Tokyo Jpn
-
Furui Sadaoki
Tokyo Institute Of Technology
-
LESTARI Dessi
Tokyo Institute of Technology
関連論文
- Tree-Structured Clustering Methods for Piecewise Linear-Transformation-Based Noise Adaptation(Speech and Hearing)
- Recent Progress in Corpus-Based Spontaneous Speech Recognition(Feature Extraction and Acoustic Medelings, Corpus-Based Speech Technologies)
- THE USE OF FINITE-STATE TRANSDUCERS FOR MODELING PHONOLOGICAL AND MORPHOLOGICAL CONSTRAINTS IN AUTOMATIC SPEECH RECOGNITION
- Adaptation to Pronunciation Variations in Indonesian Spoken Query-Based Information Retrieval
- Committee-Based Active Learning for Speech Recognition
- Robust Gait-Based Person Identification against Walking Speed Variations
- Selected Topics from LVCSR Research for Asian Languages at Tokyo Tech
- Active Learning Using Phone-Error Distribution for Speech Modeling
- Distance-based Factor Graph Linearization and Sampled Max-sum Algorithm for Efficient 3D Potential Decoding of Macromolecules
- Active Learning Using Phone-Error Distribution for Speech Modeling