Voice activity detection in noise using modulation spectrum of speech: Investigation of speech frequency and modulation frequency ranges
スポンサーリンク
概要
- 論文の詳細を見る
Voice activity detection (VAD) in noisy environments is a very important preprocessing scheme in speech communication technology, a field which includes speech recognition, speech coding, speech enhancement and captioning video contents. We have developed a VAD method for noisy environments based on the modulation spectrum. In Experiment 1, we investigate the optimal ranges of speech and modulation frequencies for the proposed algorithm by using the simulated data in the CENSREC-1-C corpus. Results show that when we combine an upper limit frequency between 1,000 and 2,000 Hz with a lower limit frequency of less than 300 Hz as speech frequency bands, error rates are lower than with other bands. Furthermore, when we use the frequency components of the modulation spectrum between 3–9, 3–11, 3–14, 3–18, 4–9, 4–11, 4–14, 4–18, 5–7, 5–9, 5–11, or 5–14 Hz, the proposed method performs VAD well. In Experiment 2, we use one of the best parameter settings from Experiment 1 and evaluate the real environment data in the CENSREC-1-C corpus by comparing our method with other conventional methods. Improvements were observed from the VAD results for each SNR condition and noise type.
- Acoustical Society of Japanの論文
著者
-
Arai Takayuki
Graduate School Of Sci. And Technol. Sophia Univ.
-
Arai Takayuki
Graduate School of Science and Technology, Sophia University
-
Pek Kimhuoch
Graduate School of Science and Technology, Sophia University
-
Kanedera Noboru
Ishikawa National College of Technology
関連論文
- A Novel Synthesis of Highly Branched-Alkyl Aryl Ketones Using the Conjugate Addition of the Gilman Lithio Cuprate to β-Methylthio-α,β-unsaturated Ketones : Abnormal Reactivity of the Intermediary Enolates toward Molecular Oxygen
- Voice activity detection in noise using modulation spectrum of speech: Investigation of speech frequency and modulation frequency ranges