A two-element-microphone-array-based speech recognition system in vehicle environment(<Special issue>Commemoration of the Japan-China Joint Conference on Acoustics 2007 (JCA2007))
スポンサーリンク
概要
- 論文の詳細を見る
We propose a two-element-microphone-array based speech recognition system featuring auditory subband null-forming and a single channel speech enhancement module. Experiments show that the proposed methods effectively improve the speech quality and recognition rate. Further work may concern the combination of null-forming and post-filtering to deal with transient interferences as in Case 3 mentioned in previous chapter.
- 社団法人日本音響学会の論文
著者
-
Yan Yonghong
Thinkit Speech Lab. Institute Of Acoustics Chinese Academy Of Sciences
-
Fu Qiang
Thinkit Speech Lab. Institute Of Acoustics Chinese Academy Of Sciences
-
Yan Yonghong
Thinkit Speech Lab Institute Of Acoustics Chinese Academy Of Sciences
-
Zhang Heng
ThinkIT Speech Lab., Institute of Acoustics, Chinese Academy of Sciences
-
Zhang Heng
Thinkit Speech Lab. Institute Of Acoustics Chinese Academy Of Sciences
関連論文
- Approximate Decision Function and Optimization for GMM-UBM Based Speaker Verification
- Using a Kind of Novel Phonotactic Information for SVM Based Speaker Recognition
- Robust Speaker Clustering Using Affinity Propagation
- An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference and Error Patterns
- Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech
- A One-Pass Real-Time Decoder Using Memory-Efficient State Network
- Development of a Mandarin-English Bilingual Speech Recognition System for Real World Music Retrieval
- Automatic Singing Performance Evaluation for Untrained Singers
- Melody Track Selection Using Discriminative Language Model
- Automatic Language Identification with Discriminative Language Characterization Based on SVM
- A two-element-microphone-array-based speech recognition system in vehicle environment(Commemoration of the Japan-China Joint Conference on Acoustics 2007 (JCA2007))
- Speech Enhancement Using Improved Adaptive Null-Forming in Frequency Domain with Postfilter
- Effects of the Temporal Fine Structure in Different Frequency Bands on Mandarin Tone Perception
- A bayesian logistic regression approach to spoken language identification