Rapid environment adaptation for speech recognition.
スポンサーリンク
概要
- 論文の詳細を見る
This paper proposes a <I>rapid environment adaptation algorithm based on spectrum equalization</I> (REALISE). In practical speech recognition applications, differences between training and testing environments often seriously diminish recognition accuracy. These environmental differences can be classified into two types: difference in additive noise and difference in multiplicative noise in the spectral domain. The proposed method calculates time-alignment between a testing utterance and the closest reference pattern to it, and then calculates the noise differences between the two according to the timealignment. Then, we adapt all reference patterns to the testing environment using the differences. Finally, the testing utterance is recognized using the adapted reference patterns. In a 250 Japanese word recognition task, in which the training and testing microphones were of two different types, REALISE improved recognition accuracy from 87% to 96%.
- 一般社団法人 日本音響学会の論文
一般社団法人 日本音響学会 | 論文
- How large is the individual difference in hearing sensitivity?: Establishment of ISO 28961 on the statistical distribution of hearing thresholds of otologically normal young persons
- Applying generation process model constraint to fundamental frequency contours generated by hidden-Markov-model-based speech synthesis
- Vocal cord vibration in the production of consonants. Observation by means of high-speed digital imaging using a fiberscope.:Observation by means of high-speed digital imaging using a fiberscope
- The early reflections of the impulse response in an auditorium.
- Multiple reflections between rigid plane panels.