Initial evaluation of the drivers' Japanese speech corpus in a car environment (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")

概要

論文の詳細を見る
Car navigation systems are getting more and more popular and many of them equip a speech recognition system for hands-free interface. However, the speech input interface is not widely used because of insufficient recognition performance. In order to improve the recognition performance and make the speech interface more practical, a real-car-environment speech corpus "Drivers' Japanese Speech Corpus in a Car Environment" is under construction by a project supported by the Japanese Ministry of Economy, Trade and Industry. In this study, we used the command task portion of the corpus recorded under three conditions: idling, running in a city, and running on a highway. We used the data from the corpus only as a test set and made a recognition system by optimally combining several existing corpora with several noise robustness techniques. Experimental results show that using an HMM trained on multiple conditions with spectral subtraction is the best for the car noises. Recognition performance was largely improved and more than 90% word accuracy was achieved for all the recording conditions. In particular, over a 50% absolute improvement in accuracy was observed for speeches given by female speakers uttered when driving on a highway.
社団法人電子情報通信学会の論文
2008-03-13