Distant Speech Recognition Using a Microphone Array Network
スポンサーリンク
概要
- 論文の詳細を見る
In this work, spatial information consisting of the position and orientation angle of an acoustic source is estimated by an artificial neural network (ANN). The estimated position of a speaker in an enclosed space is used to refine the estimated time delays for a delay-and-sum beamformer, thus enhancing the output signal. On the other hand, the orientation angle is used to restrict the lexicon used in the recognition phase, assuming that the speaker faces a particular direction while speaking. To compensate the effect of the transmission channel inside a short frame analysis window, a new cepstral mean normalization (CMN) method based on a Gaussian mixture model (GMM) is investigated and shows better performance than the conventional CMN for short utterances. The performance of the proposed method is evaluated through Japanese digit/command recognition experiments.
論文 | ランダム
- Session X Analysis of Lipids-1- (米国油化学協会-日本油化学協会第2回合同研究発表-討論会)
- 28. 癌細胞診における悪性基準の再検討 第4報 癌および良性異型細胞の退行変性像 (I. 一般講演 , 第8回 日本臨床細胞学会総会講演要旨)
- 27. 癌細胞診における悪性基準の再検討 第3報 組織診と細胞診の細胞悪性所見の比較 (I. 一般講演 , 第8回 日本臨床細胞学会総会講演要旨)
- 抗菌作用のある非イオン系界面活性剤
- Nuclear Magnetic Resonance Study in Dilute Copper- Cobalt Alloy