Automatic Allocation of Training Data for Speech Understanding Based on Multiple Model Combinations
スポンサーリンク
概要
- 論文の詳細を見る
The optimal way to build speech understanding modules depends on the amount of training data available. When only a small amount of training data is available, effective allocation of the data is crucial to preventing overfitting of statistical methods. We have developed a method for allocating a limited amount of training data in accordance with the amount available. Our method exploits rule-based methods for when the amount of data is small, which are included in our speech understanding framework based on multiple model combinations, i.e., multiple automatic speech recognition (ASR) modules and multiple language understanding (LU) modules, and then allocates training data preferentially to the modules that dominate the overall performance of speech understanding. Experimental evaluation showed that our allocation method consistently outperforms baseline methods that use a single ASR module and a single LU module while the amount of training data increases.
著者
-
Komatani Kazunori
Graduate School of Informatics, Kyoto University
-
Ogata Tetsuya
Graduate School of Informatics, Kyoto University
-
Ogata Tetsuya
Graduate School Of Informatics Kyoto Univ. Yoshida-honmachi Sakyo-ku 606-8501 Kyoto Jpn
-
Okuno Hiroshi
Graduate School Of Informatics Kyoto University
-
FUNAKOSHI Kotaro
Honda Research Institute Japan Co., Ltd.
-
NAKANO Mikio
Honda Research Institute Japan Co., Ltd.
-
FUNAKOSHI Kotaro
Honda Research Institute Japan, Co., Ltd.
-
KATSUMARU Masaki
Graduate School of Informatics, Kyoto University
-
KOMATANI Kazunori
Graduate School of Engineering, Nagoya University
関連論文
- 5R-5 A Music Retrieval Approach from Alternative Genres of Query by Adjusting Instrument Volume
- Inter-modality mapping in robot with recurrent neural network
- 4R-3 Probabilistic Classification of Monophonic Instrument Playing Techniques
- Detecting Robot-Directed Speech by Situated Understanding in Physical Interaction
- Predicting Object Dynamics From Visual Images Through Active Sensing Experiences
- Experience-based imitation using RNNPB
- Drumix: an audio player with real-time drum-part rearrangement functions for active music listening (特集 インタラクション技術の原理と応用)
- Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music (特集:便利で身近な音楽情報処理)
- Dynamic Communication of Humanoid Robot with Multiple People Based on Interaction Distance (論文特集:人間と共生する情報システム)
- Acquisition of Motion Primitives of Robot in Human-Navigation Task : Towards Human-Robot Interaction based on ``Quasi-Symbols
- Open-end human-robot interaction from the dynamical systems perspective : mutual adaptation and incremental learning
- Dynamic Communication of Humanoid Robot with Multiple People Based on Interaction Distance
- Reinforcement learning of a continuous motor sequence with hidden states
- Dynamic perception after visually guided grasping by a human-like autonomous robot
- 1ZN-2 Score Following by Particle Filtering for Music Robots
- Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening
- Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening
- Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals(Engineering Acoustics)
- Target Speech Detection and Separation for Communication with Humanoid Robots in Noisy Home Environments
- Self-organization of Dynamic Object Features Based on Bidirectional Training
- Human Tracking System Integrating Sound and Face Localization Using an Expectation-Maximization Algorithm in Real Environments
- Selecting Help Messages by Using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems
- Design and Implementation of Robot Audition System 'HARK'-Open Source Software for Listening to Three Simultaneous Speakers
- A Method for Predicting Stressed Words in Teaching Materials for English Jazz Chants
- Automatic Allocation of Training Data for Speech Understanding Based on Multiple Model Combinations
- Robust Multipitch Analyzer against Initialization based on Latent Harmonic Allocation using Overtone Corpus
- Robust Multipitch Analyzer against Initialization based on Latent Harmonic Allocation using Overtone Corpus
- A Musical Robot that Synchronizes with a Coplayer Using Non-Verbal Cues
- Classification of Known and Unknown Environmental Sounds Based on Self-Organized Space Using a Recurrent Neural Network
- Automatic Allocation of Training Data for Speech Understanding Based on Multiple Model Combinations