Harmonicity Based Dereverberation for Improving Automatic Speech Recognition Performance and Speech Intelligibility (Speech Enhancement, <Special Section> Multi-channel Acoustic Signal Processing)
Abstract
A speech signal captured by a distant microphone is generally smeared by reverberation, which severely degrades both speech intelligibility and Automatic Speech Recognition (ASR) performance. Previously, we proposed a single-microphone dereverberation method named "Harmonicity based dEReverBeration (HERB)." HERB estimates the inverse filter for an unknown room transfer function by exploiting an essential feature of speech, namely its harmonic structure. In previous studies, improvements in speech intelligibility were shown solely with spectrograms, and improvements in ASR performance were confirmed only with a matched-condition acoustic model. In this paper, we further investigate HERB's potential with regard to these two factors. First, we examined speech intelligibility by means of objective indices and found that HERB is capable of improving speech intelligibility to approximately that of clean speech. Second, since HERB alone could not improve ASR performance sufficiently, we analyzed the HERB mechanism with a view to achieving further improvements. Taking the analysis results into account, we proposed an appropriate ASR configuration and conducted experiments. The experimental results confirmed that, when HERB is used with an ASR adaptation scheme such as MLLR and a multi-condition acoustic model, it is very effective for improving ASR performance even in unknown, severely reverberant environments.
- Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2005-07-01
Authors
-
Kinoshita Keisuke
NTT Communication Science Laboratories
-
Miyoshi Masato
NTT Corp., Kyoto-fu, Japan
-
Miyoshi Masato
NTT Communication Science Laboratories, NTT Corporation
-
Nakatani Tomohiro
NTT Communication Science Laboratories
-
Miyoshi Masato
NTT Communication Science Laboratories
Related papers
- Sound image rendering using a loudspeaker and a fully open-air headphone-set
- On a Blind Speech Dereverberation Algorithm Using Multi-Channel Linear Prediction (Engineering Acoustics)
- Speech dereverberation algorithm using transfer function estimates with overestimated order
- Blind dereverberation algorithm for speech signals based on multi-channel linear prediction
- Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals (Engineering Acoustics)
- A Study on Frequency Characteristics and Transmission Path of Audible Sound Perceived when the Tragus is Vibrated by Amplitude-Modulated Ultrasound
- Fast estimation of a precise dereverberation filter based on the harmonic structure of speech
- Harmonicity Based Dereverberation for Improving Automatic Speech Recognition Performance and Speech Intelligibility (Speech Enhancement, Multi-channel Acoustic Signal Processing)
- Sparse source separation based on simultaneous clustering of source locational and spectral features
- Calculating Inverse Filters for Speech Dereverberation
- A new algorithm for blind estimation of common poles in multiple transmission paths based on linear prediction
- Sound timbre control using estimates of room resonance modes
- Acoustic Nonlinear Effect on Auricular Cartilage Vibrated with Amplitude-Modulated Ultrasound