A speech enhancement framework based on noise eigenspace projection (音声)
スポンサーリンク
概要
- 論文の詳細を見る
The performance of most speech enhancement algorithms declines under low-SNR conditions because of residual noise (or speech distortion) and the degradation of voice activity detector (VAD) performance. We therefore propose a speech enhancement approach based on noise eigenspace projection. When noisy speech is projected into the noise eigenspace, the noise energy is packed to a subspace consisting of dimensions with larger eigenvalues. This subspace is fairly dominated by noise. Removing the noise subspace can greatly reduce the noise at the cost of little speech loss. At the same time, the eigenspace dimensions having little noise are used to make a robust VAD. Using the proposed algorithm as a pre-processing block for conventional enhancement algorithms can efficiently reduce the residual noise under low-SNR conditions.
- 社団法人電子情報通信学会の論文
- 2007-07-19
著者
-
Unoki Masashi
School of Information Science, Japan Advanced Institute of Science and Technology
-
Ying Dongwen
School of Information Science, Japan Advanced Institute of Science and Technology
-
Dang Jianwu
School of Information Science, Japan Advanced Institute of Science and Technology
-
Unoki Masashi
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Dang Jianwu
Japan Advanced Inst. Of Sci. And Technol. Ishikawa Jpn
-
Dang Jianwu
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Ying Dongwen
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Unoki Masashi
Information School Japan Advanced Institute Of Science And Technology
-
Dang Jianwu
Information School Japan Advanced Institute Of Science And Technology
-
Unoki Masashi
Japan Advanced Inst. Sci. And Technol. Ishikawa Jpn
-
鵜木 祐史
School of Information Science, Japan Advanced Institute of Science and Technology
-
党 建武
School of Information Science, Japan Advanced Institute of Science and Technology
関連論文
- Study on a method of suppressing noise based on the MTF concept
- An MTF-based method of blind restoration for improving intelligibility of bone-conducted speech
- A study on the LP-based blind model in restoring bone-conducted speech (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- An LP-based blind restoration method for improving intelligibility of bone-conducted speech (音声)
- Robust voice activity detection based on noise eigenspace
- A speech dereverberation method based on the MTF concept in power envelope restoration
- An improved method based on the MTF concept for restoring the power envelope from a reverberant signal
- Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Improvement of robustness using selective sound segregation for automatic speech recognition systems in noisy environments
- LP-baesd method of blind restoration to improve intelligibility of bone-conducted speech
- A model-based investigation of activations of the tongue muscles in vowel production
- A study on audio watermarking method based on the cochlear delay characteristics
- Fundamental frequency estimation for noisy speech based on instantaneous amplitude and frequency
- Estimation of fundamental frequency of reverberant speech by utilizing complex cepstrum analysis
- Speech Enhancement based on Noise Eigenspace Projection
- A speech enhancement framework based on noise eigenspace projection (音声)
- Estimate of auditory filter shape using notched-noise masking for various signal frequencies
- Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems
- Sub-Band Temporal Envelope Restoration for ASR in Reverberation Environment (国際ワークショップ Frontiers in Speech and Hearing Research)
- A Study on Restoration of Bone-Conducted Speech with LPC-Based Model (国際ワークショップ Frontiers in Speech and Hearing Research)
- A computational model of co-modulation masking release
- A method of signal extraction from noisy signal based on auditory scene analysis
- Robust speech feature extraction based on auditory neuronal adaptation mechanism
- A Model-Based Learning Process for Modeling Coarticulation of Human Speech(Knowledge, Information and Creativity Support System)
- Normalization of vocal tract shape using radial basis function (音声)
- Optimization and Evaluation of a Coarticulation Model based on Observation and Simulation
- Parameter Optimization for a Coarticulation Model Based on Observation and Simulation (国際ワークショップ Frontiers in Speech and Hearing Research)
- Extraction of Low Dimensional Representation of Vowels in Articulatory Space (国際ワークショップ Frontiers in Speech and Hearing Research)
- Comparison of Emotion Perception among Different Cultures
- A Computational Tongue Model and its Clinical Application
- Investigation of usual and unusual articulation based on simulations and observations (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Investigation of coarticulation in continuous speech of Japanese
- Investigation of coarticulation effects on vocal tract shapes of vowels based on similarity
- Study on Speech Watermarking Based on Modifications to LSFs for Tampering Detection
- Study on Speech Watermarking Based on Modifications to LSFs for Tampering Detection
- Study on Speech Watermarking Based on Modifications to LSFs for Tampering Detection
- Study on Blind Method of Estimating Speech Transmission Index from Noisy Reverberant Amplitude-Modulated-Signals
- Study on Semi-scramble Method for Speech Signals Based on Phonemic Restoration