Ears of the Robot : Three Simultaneous Speech Segregation and Recognition Using Robot-Mounted Microphones(Speech and Hearing)
スポンサーリンク
概要
- 論文の詳細を見る
A new type of sound source segregation method using robot-mounted microphones, which are free from strict head related transfer function (HRTF) estimation, has been proposed and successfully applied to three simultaneous speech recognition systems. The proposed segregation method is executed with sound intensity differences that are due to the particular arrangement of the four directivity microphones and the existence of a robot head acting as a sound barrier. The proposed method consists of three-layered signal processing: two-line SAFIA (binary masking based on the narrow band sound intensity comparison), two-line spectral subtraction and their integration. We performed 20K vocabulary continuous speech recognition test in the presence of three speakers' simultaneous talk, and achieved more than 70% word error reduction compared with the case without any segregation processing.
- 社団法人電子情報通信学会の論文
- 2007-09-01
著者
-
Ogawa Tetsuji
Department of Endodontology and Periodontology, Hiroshima University School of Dentistry
-
Ogawa Tetsuji
Waseda Univ. Tokyo Jpn
-
Kobayashi Tetsunori
Waseda Univ. Tokyo Jpn
-
Kobayashi Tetsunori
Department Of Computer Science Waseda University
-
MOCHIKI Naoya
Department of Computer Science, Waseda University
-
Mochiki Naoya
Department Of Computer Science Waseda University
-
Kobayashi Tetsunori
Dep. Of Computer Sci. Waseda Univ.
-
OGAWA Tetsuji
Department of Computer Science, Waseda University
-
Ogawa Tetsuji
Department of Advanced General Dentistry, Hiroshima University Hospital
関連論文
- CENSREC-1-C : An evaluation framework for voice activity detection under noisy environments
- New Attachment to Periodontally Diseased Root Surfaces Treated with Hydrochloric Acid
- C-5 Interfacial Structure between Implant Materials and Periodontal Tissue in the Rat
- Cell-bound Pullulanase from Streptomyces sp. No.27
- Ears of the Robot : Direction of Arrival Estimation Based on Pattern Recognition Using Robot-Mounted Microphones
- Mutual Information Based Dynamic Integration of Multiple Feature Streams for Robust Real-Time LVCSR
- Filter Bank Subtraction for Robust Speech Recognition (Special Issue on Speech Information Processing)
- Simultaneous Subtitling System for Broadcast News Programs with a Speech Recognizer(Special Issue on the 2001 IEICE Excellent Paper Award)
- Influence of Lombard Effect : Accuracy Analysis of Simulation-Based Assessments of Noisy Speech Recognition Systems for Various Recognition Conditions
- A Low-Band Spectrum Envelope Reconstruction Method for PSOLA-Based F_0 Modification(Speech and Hearing)
- Fusion-Based Age-Group Classification Method Using Multiple Two-Dimensional Feature Extraction Algorithms(Pattern Recognition)
- Ears of the Robot : Three Simultaneous Speech Segregation and Recognition Using Robot-Mounted Microphones(Speech and Hearing)
- Genetic Algorithm Based Optimization of Partly-Hidden Markov Model Structure Using Discriminative Criterion(Speech Recognition, Statistical Modeling for Speech Processing)
- Hybrid Voice Conversion of Unit Selection and Generation Using Prosody Dependent HMM(Speech and Hearing)
- Extension of Hidden Markov Models for Multiple Candidates and Its Application to Gesture Recognition(Image Recognition, Computer Vision)
- Speech Enhancement Using a Square Microphone Array in the Presence of Directional and Diffuse Noise
- Ultrastructural and Immunocytochemical Characterization of the Bone-Single Crystal Sapphire Implant Interface in the Rat Maxilla
- Conversational robots: An approach to conversation protocol issues that utilizes the paralinguistic information available in a robot-human setting
- Prevalence of drug-resistant opportunistic microorganisms in oral cavity after treatment for oral cancer