Distant Speech Recognition Using a Microphone Array Network
Abstract
In this work, an artificial neural network (ANN) estimates spatial information about an acoustic source, namely its position and orientation angle. The estimated position of a speaker in an enclosed space is used to refine the estimated time delays for a delay-and-sum beamformer, thus enhancing the output signal. The orientation angle, in turn, is used to restrict the lexicon used in the recognition phase, on the assumption that the speaker faces a particular direction while speaking. To compensate for the effect of the transmission channel within a short frame analysis window, a new cepstral mean normalization (CMN) method based on a Gaussian mixture model (GMM) is investigated; it performs better than conventional CMN on short utterances. The proposed method is evaluated through Japanese digit/command recognition experiments.
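To illustrate how an estimated speaker position can feed a delay-and-sum beamformer, here is a minimal sketch. It is not the paper's implementation: the ANN position estimator is not reproduced, and the function names (`steering_delays`, `delay_and_sum`), the free-field propagation model, the 343 m/s speed of sound, and the rounding of delays to whole samples are all assumptions made for illustration.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value, not taken from the paper


def steering_delays(source_pos, mic_positions, sample_rate):
    """Per-microphone delays in whole samples, relative to the nearest microphone.

    Assumes free-field propagation from an already estimated source position.
    """
    distances = [math.dist(source_pos, mic) for mic in mic_positions]
    nearest = min(distances)
    return [round((d - nearest) / SPEED_OF_SOUND * sample_rate) for d in distances]


def delay_and_sum(channels, delays):
    """Advance each channel by its delay, then average the aligned samples."""
    # Only keep the samples for which every shifted channel still has data.
    usable = min(len(ch) - d for ch, d in zip(channels, delays))
    return [
        sum(ch[n + d] for ch, d in zip(channels, delays)) / len(channels)
        for n in range(usable)
    ]
```

In this sketch, a more accurate position estimate translates directly into more accurate delays, so the channels add coherently for the target speaker while uncorrelated noise is averaged down. A practical system would likely use fractional delays rather than whole-sample shifts.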