An Approach Using Combination of Multiple Features through Sigmoid Function for Speech-Presence/Absence Discrimination
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we present an approach of detecting speech presence for which the decision rule is based on a combination of multiple features using a sigmoid function. A minimum classification error (MCE) training is used to update the weights adjustment for the combination. The features, consisting of three parameters: the ratio of ZCR, the spectral energy, and spectral entropy, are combined linearly with weights derived from the sub-band domain. First, the Bark-scale wavelet decomposition (BSWD) is used to split the input speech into 24 critical sub-bands. Next, the feature parameters are derived from the selected frequency sub-band to form robust voice feature parameters. In order to discard the seriously corrupted frequency sub-band, a strategy of adaptive frequency sub-band extraction (AFSE) dependant on the sub-band SNR is then applied to only the frequency sub-band used. Finally, these three feature parameters, which only consider the useful sub-band, are combined through a sigmoid type function incorporating optimal weights based on MSE training to detect either a speech present frame or a speech absent frame. Experimental results show that the performance of the proposed algorithm is superior to the standard methods such as G.729B and AMR2.
- (社)電子情報通信学会の論文
- 2011-08-01
著者
-
Wang Kun-ching
Department Of Information Technology & Communication Shin Chien University
-
Chin Chiun-li
Chung Shan Medical University
関連論文
- An Adaptive Wavelet-Based Denoising Algorithm for Enhancing Speech in Non-stationary Noise Environment
- An Approach Using Combination of Multiple Features through Sigmoid Function for Speech-Presence/Absence Discrimination