Robust voice activity detection based on noise eigenspace
スポンサーリンク
概要
- 論文の詳細を見る
In this study, we propose a voice activity detector (VAD) based on a noise eigenspace, which improve the robustness of VAD by utilizing the compression capability of the eigenspace. A noise eigenspace is constructed by using eigenvalue decomposition of the noise correlation matrix. When noisy speech is projected into the noise eigenspace, the noise energy is packed into a few dimensions with large eigenvalues, and those dimensions hopefully possess relatively less speech, because the speech energy distribution is usually different from noise energy distribution. The noise can be reduced by discarding those dimensions with large noise energy, while no significant loss occurs in speech. To track noise variation, the noise eigenspace is periodically updated, where the computation cost for eigenspace construction can be kept at an acceptable level. The proposed VAD was evaluated using the TIMIT database mixed with several noises. The experiment showed that the proposed VAD is more accurate than previous VADs in noisy environments.
著者
-
Ying Dongwen
School of Information Science, Japan Advanced Institute of Science and Technology
-
Shi Yu
Microsoft Research Asia
-
Lu Xugang
School of Information Science, Japan Advanced Institute of Science and Technology
-
Dang Jianwu
School of Information Science, Japan Advanced Institute of Science and Technology
-
Soong Frank
Microsoft Research Asia
-
Lu Xugang
Atr Spoken Language Communication Research Laboratories
-
Lu Xugang
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Dang Jianwu
Japan Advanced Inst. Of Sci. And Technol. Ishikawa Jpn
-
Dang Jianwu
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Ying Dongwen
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Lu Xugang
Information School Japan Advanced Institute Of Science And Technology
-
Dang Jianwu
Information School Japan Advanced Institute Of Science And Technology
関連論文
- Robust voice activity detection based on noise eigenspace
- A model-based investigation of activations of the tongue muscles in vowel production
- Speech Enhancement based on Noise Eigenspace Projection
- A speech enhancement framework based on noise eigenspace projection (音声)
- Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems
- Sub-Band Temporal Envelope Restoration for ASR in Reverberation Environment (国際ワークショップ Frontiers in Speech and Hearing Research)
- Robust speech feature extraction based on auditory neuronal adaptation mechanism
- A Model-Based Learning Process for Modeling Coarticulation of Human Speech(Knowledge, Information and Creativity Support System)
- Normalization of vocal tract shape using radial basis function (音声)
- Normalization of vocal tract shape using radial basis function
- Optimization and Evaluation of a Coarticulation Model based on Observation and Simulation
- Parameter Optimization for a Coarticulation Model Based on Observation and Simulation (国際ワークショップ Frontiers in Speech and Hearing Research)
- Extraction of Low Dimensional Representation of Vowels in Articulatory Space (国際ワークショップ Frontiers in Speech and Hearing Research)
- Comparison of Emotion Perception among Different Cultures
- A Computational Tongue Model and its Clinical Application
- Investigation of usual and unusual articulation based on simulations and observations (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Investigation of coarticulation in continuous speech of Japanese
- Investigation of coarticulation effects on vocal tract shapes of vowels based on similarity