Sub-Band Temporal Envelope Restoration for ASR in Reverberation Environment (International Workshop: Frontiers in Speech and Hearing Research)

Abstract

Dereverberation algorithms usually assume that the room acoustics are known, so the room impulse response is estimated before dereverberation. However, it is difficult to estimate the characteristics of room acoustics from observed reverberant signals alone. Our method is motivated by speech intelligibility experiments that show the importance of temporal envelopes for speech perception. We propose a sub-band power envelope estimation algorithm for reverberant speech based on the modulation transfer function (MTF). In our algorithm, the room impulse response is modeled as white noise modulated by an exponential decay, and speech in each frequency sub-band is modeled as a white-noise carrier with temporal modulation. The reverberant speech is the convolution of the room impulse response with the speech signal. Based on a theoretical analysis of these stochastic signals, we can restore the temporal power envelope of speech in each sub-band by inverse filtering the power envelope. The algorithm is designed as a front-end processor for ASR and is tested on a Japanese digit string recognition task. Reverberant speech is generated artificially by convolving room impulse responses with speech. Recognition results show that the proposed dereverberation algorithm yields an average improvement of 16.11% for reverberation times from 0.5 s to 1.5 s, compared with an auditory power spectrum based method (AFCC).
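The power envelope inverse filtering described in the abstract can be illustrated with a minimal sketch for a single sub-band. The abstract does not give the filter itself, so the following rests on assumptions: the reverberation time T60 is taken as known, the power envelope of the impulse response decays by 60 dB over T60, and the reverberant power envelope is the convolution of the clean power envelope with that decay. Under those assumptions the inverse filter reduces to a first-order difference. The function names, the `gain` parameter, and the test signal are hypothetical and are not taken from the paper.

```python
import numpy as np

def decay_power_envelope(t60, fs, n):
    """Power envelope of an exponentially decaying impulse response.

    Assumes the power drops by 60 dB (a factor of 10**-6) after t60 seconds.
    """
    t = np.arange(n) / fs
    return np.exp(-6.0 * np.log(10.0) * t / t60)

def restore_power_envelope(env_rev, t60, fs, gain=1.0):
    """Inverse-filter a reverberant power envelope in one sub-band.

    Implements x[n] = (y[n] - b * y[n-1]) / gain, where b is the
    per-sample decay factor of the assumed exponential power envelope.
    """
    b = np.exp(-6.0 * np.log(10.0) / (t60 * fs))
    prev = np.concatenate(([0.0], env_rev[:-1]))
    restored = (env_rev - b * prev) / gain
    # A power envelope cannot be negative, so clip small negative values.
    return np.maximum(restored, 0.0)

# Hypothetical example: a 4 Hz modulated power envelope sampled at 100 Hz.
fs = 100.0
t60 = 1.0
n = 200
t = np.arange(n) / fs
clean = 1.0 + 0.8 * np.sin(2.0 * np.pi * 4.0 * t)

# Simulate reverberation on the envelope, then undo it.
rev = np.convolve(clean, decay_power_envelope(t60, fs, n))[:n]
restored = restore_power_envelope(rev, t60, fs)
```

In this idealized setting the restored envelope matches the clean one. With real reverberant speech the envelope would first have to be extracted in each sub-band and T60 would have to be estimated, neither of which is covered by this sketch.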
Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
2006-03-20
