Intentional Voice Command Detection for Trigger-Free Speech Interface
スポンサーリンク
概要
- 論文の詳細を見る
In this paper we introduce a new framework of audio processing, which is essential to achieve a trigger-free speech interface for home appliances. If the speech interface works continually in real environments, it must extract occasional voice commands and reject everything else. It is extremely important to reduce the number of false alarms because the number of irrelevant inputs is much larger than the number of voice commands even for heavy users of appliances. The framework, called Intentional Voice Command Detection, is based on voice activity detection, but enhanced by various speech/audio processing techniques such as emotion recognition. The effectiveness of the proposed framework is evaluated using a newly-collected large-scale corpus. The advantages of combining various features were tested and confirmed, and the simple LDA-based classifier demonstrated acceptable performance. The effectiveness of various methods of user adaptation is also discussed.
- 2010-09-01
著者
-
Obuchi Yasunari
Central Research Laboratory Hitachi Ltd.
-
Sumiyoshi Takashi
Central Research Laboratory Hitachi Ltd.
関連論文
- Multi-Input Feature Combination in the Cepstral Domain for Practical Speech Recognition Systems
- Intentional Voice Command Detection for Trigger-Free Speech Interface
- Emotion Recognition using Mel-Frequency Cepstral Coefficients
- Stepwise Phase Difference Restoration Method for DOA Estimation of Multiple Sources
- Multichannel Two-Stage Beamforming with Unconstrained Beamformer and Distortion Reduction
- Noise suppression method for preprocessor of time-lag speech recognition system based on bidirectional optimally modified log spectral amplitude estimation
- Emotion Recognition using Mel-Frequency Cepstral Coefficients