Speech Enhancement Using Improved Adaptive Null-Forming in Frequency Domain with Postfilter
スポンサーリンク
概要
- 論文の詳細を見る
In this letter, a two channel frequency domain speech enhancement algorithm is proposed. The algorithm is designed to achieve better overall performance with relatively small array size. An improved version of adaptive null-forming is used, in which noise cancelation is implemented in auditory subbands. And an OM-LSA based postfiltering stage further purifies the output. The algorithm also features interaction between the array processing and the postfilter to make the filter adaptation more robust. This approach achieves considerable improvement on signal-to-noise ratio (SNR) and subjective quality of the desired speech. Experiments confirm the effectiveness of the proposed system.
- (社)電子情報通信学会の論文
- 2008-12-01
著者
-
YAN Yonghong
Institute of Acoustics, Chinese Academy of Sciences
-
Yan Yonghong
Thinkit Speech Lab. Institute Of Acoustics Chinese Academy Of Sciences
-
Yan Yonghong
Institute Of Acoustics Chinese Academy Of Science
-
Fu Qiang
Thinkit Speech Lab. Institute Of Acoustics Chinese Academy Of Sciences
-
Zhang Heng
ThinkIT Speech Lab., Institute of Acoustics, Chinese Academy of Sciences
-
ZHANG Heng
Institute of Acoustics, Chinese Academy of Sciences
-
FU Qiang
Institute of Acoustics, Chinese Academy of Sciences
-
Zhang Heng
Thinkit Speech Lab. Institute Of Acoustics Chinese Academy Of Sciences
関連論文
- Acoustic Feature Optimization Based on F-Ratio for Robust Speech Recognition
- Effects of single-channel speech enhancement algorithms on Mandarin speech intelligibility (応用音響)
- Approximate Decision Function and Optimization for GMM-UBM Based Speaker Verification
- Using a Kind of Novel Phonotactic Information for SVM Based Speaker Recognition
- Robust Speaker Clustering Using Affinity Propagation
- An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference and Error Patterns
- Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech
- A One-Pass Real-Time Decoder Using Memory-Efficient State Network
- Development of a Mandarin-English Bilingual Speech Recognition System for Real World Music Retrieval
- Automatic Singing Performance Evaluation for Untrained Singers
- Melody Track Selection Using Discriminative Language Model
- Automatic Language Identification with Discriminative Language Characterization Based on SVM
- A two-element-microphone-array-based speech recognition system in vehicle environment(Commemoration of the Japan-China Joint Conference on Acoustics 2007 (JCA2007))
- Speech Enhancement Using Improved Adaptive Null-Forming in Frequency Domain with Postfilter
- Effects of the Temporal Fine Structure in Different Frequency Bands on Mandarin Tone Perception
- Acoustic Feature Optimization Based on F-Ratio for Robust Speech Recognition
- A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
- Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition
- Mitigation of Ionizing Radiation-induced Bone Marrow Suppression by p38 Inhibition and G-CSF Administration
- Retinoblastoma 94 Enhances Radiation Treatment of Esophageal Squamous Cell Carcinoma in Vitro and in Vivo
- Two-Microphone Noise Reduction Using Spatial Information-Based Spectral Amplitude Estimation