Distant Speech Recognition Using a Microphone Array Network
スポンサーリンク
概要
- 論文の詳細を見る
In this work, spatial information consisting of the position and orientation angle of an acoustic source is estimated by an artificial neural network (ANN). The estimated position of a speaker in an enclosed space is used to refine the estimated time delays for a delay-and-sum beamformer, thus enhancing the output signal. On the other hand, the orientation angle is used to restrict the lexicon used in the recognition phase, assuming that the speaker faces a particular direction while speaking. To compensate the effect of the transmission channel inside a short frame analysis window, a new cepstral mean normalization (CMN) method based on a Gaussian mixture model (GMM) is investigated and shows better performance than the conventional CMN for short utterances. The performance of the proposed method is evaluated through Japanese digit/command recognition experiments.
論文 | ランダム
- SF-055-3 肝移植におけるSmall-for-size graft機能不全の解明と治療への応用
- SF-053-5 高圧酸素下(Hyperbaric oxygen)肝冷保存の有用性に関する研究
- SF-053-4 新たな冷保存法とradical oxygen species (ROS)制御を用いた肝保存戦略
- 開発途上国における気候変動適応策と国際協力 (特集 科学技術外交--動き出した海外プロジェクト)
- 改訂請求払保証統一規則の解説--ICC Uniform Rules for Demand Guarantees:略称URDG758(4)