Critical Band Subspace-Based Speech Enhancement Using SNR and Auditory Masking Aware Technique(Speech and Hearing)
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, a new subspace-based speech enhancement algorithm is presented. First, we construct a perceptual filterbank from psycho-acoustic model and incorporate it in the subspace-based enhancement approach. This filterbank is created through a five-level wavelet packet decomposition. The masking properties of the human auditory system are then derived based on the perceptual filterbank. Finally, the prior SNR and the masking threshold of each critical band are taken to decide the attenuation factor of the optimal linear estimator. Five different types of in-car noises in TAICAR database were used in our evaluation. The experimental results demonstrated that our approach outperformed conventional subspace and spectral subtraction methods.
- 社団法人電子情報通信学会の論文
- 2007-07-01
著者
-
YANG Chung-Hsien
Department of Electrical Engineering, National Cheng Kung University
-
WANG Jia-Ching
Department of Electrical Engineering, National Cheng Kung University
-
WANG Jhing-Fa
Department of Electrical Engineering, National Cheng Kung University
-
Wang Jhing‐fa
Department Of Electrical Engineering National Cheng Kung University
-
Wang Jhing-fa
Department Of Electrical Engineering National Cheng Kung University
-
Wang Jia-ching
Department Of Electrical Engineering National Cheng Kung University
-
Yang Chung-hsien
Department Of Electrical Engineering National Cheng Kung University
-
LEE Hsiao-Ping
Department of Electrical Engineering, National Cheng-Kung University
-
Lee Hsiao-ping
Department Of Electrical Engineering National Cheng-kung University
-
WANG Jia-Ching
Department of Computer Science and Information Engineering, National Central University
関連論文
- Region similarity based edge detection for motion estimation in H.264/AVC
- A Block-Based Architecture for Lifting Scheme Discrete Wavelet Transform(Image)
- Critical Band Subspace-Based Speech Enhancement Using SNR and Auditory Masking Aware Technique(Speech and Hearing)
- Novel Stroke Decomposition for Noisy and Degraded Chinese Characters Using SOGD Filters(Image Recognition, Computer Vision)
- Efficient Coding Translation of GSM and G.729 Speech Coders across Mobile and IP Networks(Speech and Hearing)
- VLSI Architecture and Implementation for Speech Recognizer Based on Discriminative Bayesian Neural Network(Special Section on Digital Signal Processing)
- A Novel Fast Mode Decision Algorithm for H.264/AVC Using Particle Swarm Optimization