Noise Suppression Based on Multi-Model Compositions Using Multi-Pass Search with Multi-Label N-gram Models
スポンサーリンク
概要
- 論文の詳細を見る
We propose a noise suppression method based on multi-model compositions and multi-pass search. In real environments, input speech for speech recognition includes many kinds of noise signals. To obtain good recognized candidates, suppressing many kinds of noise signals at once and finding target speech is important. Before noise suppression, to find speech and noise label sequences, we introduce multi-pass search with acoustic models including many kinds of noise models and their compositions, their n-gram models, and their lexicon. Noise suppression is frame-synchronously performed using the multiple models selected by recognized label sequences with time alignments. We evaluated this method using the E-Nightingale task, which contains voice memoranda spoken by nurses during actual work at hospitals. The proposed method obtained higher performance than the conventional method.
- (社)電子情報通信学会の論文
- 2008-03-01
著者
-
Kogure Kiyoshi
Atr Knowledge Science Laboratories
-
Toriyama Tomoji
Atr Knowledge Sci. Lab. Kyoto‐fu Jpn
-
Toriyama Tomoji
Atr Knowledge Science Laboratories
-
JITSUHIRO Takatoshi
ATR Knowledge Science Laboratories
-
Jitsuhiro Takatoshi
Atr Knowledge Sci. Lab. Kyoto‐fu Jpn
-
Kogure Kiyoshi
Atr Knowledge Sci. Lab. Kyoto‐fu Jpn
関連論文
- Applicability of Camera Works to Free Viewpoint Videos with Annotation and Planning
- Robust Foreground Segmentation from Color Video Sequences Using Background Subtraction with Multiple Thresholds(Videos)
- Noise Suppression Based on Multi-Model Compositions Using Multi-Pass Search with Multi-Label N-gram Models
- ATR Parallel Decoding Based Speech Recognition System Robust to Noise and Speaking Styles(Speech Recognition, Statistical Modeling for Speech Processing)