Automatic Adjustment of Subband Likelihood Recombination Weights for Improving Noise-Robustness of a Multi-SNR Multi-Band Speaker Identification System(Speech and Hearing)
スポンサーリンク
概要
- 論文の詳細を見る
This paper is concerned with improving noise-robustness of a multi-SNR multi-band speaker identification system by introducing automatic adjustment of subband likelihood recombination weights. The adjustment is performed on the basis of subband power calculated from the noise observed just before the speech starts in the input signal. To evaluate the noise-robustness of this system, text-independent speaker identification experiments were conducted on speech data corrupted with noises recorded in five environments: "bus," "car," "office," "lobby," and "restaurant". It was found that the present method reduces the identification error by 15.9% compared with the multi-SNR multi-band method with equal recombination weights at 0 dB SNR. The performance of the present method was compared with a clean fullband method in which a speaker model training is performed on clean speech data, and spectral subtraction is applied to the input signal in the speaker identification stage. When the clean fullband method without spectral subtraction is taken as a baseline, the multi-SNR multi-band method with automatic adjustment of recombination weights attained 56.8% error reduction on average, while the average error reduction rate of the clean fullband method with spectral subtraction was 11.4% at 0 dB SNR.
- 社団法人電子情報通信学会の論文
- 2004-11-01
著者
-
YOSHIDA KENICHI
University of Tsukuba
-
Takagi Kazuyuki
University Of Electro-communications
-
OZEKI Kazuhiko
University of Electro-Communications
-
Yoshida Kenichi
University Of Electro-communications
関連論文
- Automating Viewers' Side Annotations on TV Drama from Internet Bulletin Boards(コンテンツ処理,新たな適用領域を切り開く情報システム)
- Asymmetric Characteristics of Internet Based on Traffic Measurement and Analysis(New Technologies in the Internet and their Applications)
- High-Speed IP Meter HIM and Its Application in LAN/WAN Environments(Special Issue on New Technologies in the Internet and their Applications)
- Effectiveness of Word String Language Models on Noisy Broadcast News Speech Recognition
- The Use of Overlapped Sub-Bands in Multi-Band, Multi-SNR, Multi-Path Recognition of Noisy Word Utterances
- Automatic Adjustment of Subband Likelihood Recombination Weights for Improving Noise-Robustness of a Multi-SNR Multi-Band Speaker Identification System(Speech and Hearing)
- DNS Traffic Analysis — CDN and the World IPv6 Launch