Missing Feature Theory Applied to Robust Speech Recognition over IP Network(<Special Section>Speech Dynamics by Ear, Eye, Mouth and Machine)
スポンサーリンク
概要
- 論文の詳細を見る
This paper addresses problems involved in performing speech recognition over mobile and IP networks. The main problem is speech data loss caused by packet loss in the network. We present two missing-feature-based approaches that recover lost regions of speech data. These approaches are based on the reconstruction of missing frames or on marginal distributions. For comparison, we also use a packing method, which skips lost data. We evaluate these approaches with packet loss models, i.e., random loss and Gilbert loss models. The results show that the marginal-distributed-based technique is most effective for a packet loss environment; the degradation of word accuracy is only 5% when the packet loss rate is 30% and only 3% when mean burst loss length is 24 frames in the case of DSR front-end. The simple data imputation method is also effective in the case of clean speech.
- 社団法人電子情報通信学会の論文
- 2004-05-01
著者
-
Kuroiwa S
Chiba University And National Institute Of Information And Communications Technology
-
Kuroiwa Shingo
Graduate School Of Advanced Integration Science Chiba University
-
Kuroiwa Shingo
Faculty Of Engineering The University Of Tokushima
-
Kuroiwa Shingo
University Of Tokushima
-
NAKAMURA Satoshi
ATR Spoken Language Translation Research Labs.
-
Shikano Kiyohiro
Chiba University And National Institute Of Information And Communications Technology
-
Endo Tokiko
School Of Medicine Nagoya University
-
Endo T
Atr Spoken Language Translation Research Laboratories
-
ENDO Toshiki
ATR Spoken Language Translation Research Laboratories
-
Nakamura S
National Institute Of Information And Communications Technology
-
Nakamura Satoshi
Atr Spoken Language Translation Res. Lab. Kyoto Jpn
-
Nakamura Satoshi
Atr Spoken Language Communication Res. Lab. Kyoto‐fu Jpn
-
Nakamura Satoshi
National Institute Of Information And Communications Technology
-
Kuroiwa S
University of Tokushima
-
Nakamura S
ATR Spoken Language Translation Research Laboratories
関連論文
- Fuzzy Cluster Analysis and its Evaluation Method(BIOMETRICS AND ITS APPLICATIONS)
- Photovoltaic Effect in Schottky Junction of Poly(3-alkylthiophene)/Al with Various Alkyl Chain Lengths and Regioregularities
- Photocarrier Transport in Regioregular Poly (3-octadecylthiophene) : Optical Propertles of Condensed Matter
- Combination Therapy with Vascular Endothelial Growth Factor Neutralizing Antibody and Mitomycin C on Human Gastric Cancer Xenograft
- Alkyl Chain Length Dependence of Field-Effect Mobilities in Regioregular Poly(3-Alkylthiophene)Films
- Dependencies of Field Effect Mobility on Regioregularity and Side Chain Length in Poly (Alkylthiophene) Films (Special Issue on Organic Molecular Electronics for the 21st Century)
- Regioregularity vs Regiorandomness : Effect on Photocarrier Transport in Poly(3-hexylthiophene)
- CENSREC-1-C : An evaluation framework for voice activity detection under noisy environments
- Noise and Channel Distortion Robust ASR System for DARPA SPINE2 Task (Special Issue on Speech Information Processing)
- A Study on Acoustic Modeling of Pauses for Recognizing Noisy Conversational Speech (Special Issue on Speech Information Processing)
- The Cell Surface Glycoprotein of Haloarcula japonica TR-1
- Characterization of a Novel Human Tumor Necrosis Factor-α Mutant with Increased Cytotoxic Activity
- Breast Tumor Classification by Neural Networks Fed with Sequential-Dependence Factors to the Input Layer
- Quantitative analysis of pattern of gonial proliferation during sexual maturation in Japanese scallop Patinopecten yessoensis
- GnRH-PROMOTED SPERMATOGONIAL PROLIFERATION OF SCALLOP MEDIATES THROUGH STEROIDOGENESIS(Endocrinology,Abstracts of papers presented at the 76^ Annual Meeting of the Zoological Society of Japan)
- MOLECULAR CLONING OF A PUTATIVE SEROTONIN RECEPTOR EXPRESSED IN THE OVARY OF SCALLOP, PATINOPECTEN YESSOENSIS(Developmental Biology,Abstracts of papers presented at the 76^ Annual Meeting of the Zoological Society of Japan)
- REGULATION OF GONIAL MULTIPLICATION BY A GnRH-LIKE FACTOR IN THE CENTRAL NERVOUS SYSTEM OF THE PATINOPECTEN YESSOENSIS(Endocrinology)(Proceedings of the Seventy-Third Annual Meeting of the Zoological Society of Japan)
- AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
- Missing Feature Theory Applied to Robust Speech Recognition over IP Network(Speech Dynamics by Ear, Eye, Mouth and Machine)
- Comparison of a 10 V Josephson Junction Array System and a Conventional 10 V Measuring System by Measuring Zener Reference Standard
- 1-V Josephson-Junction-Array Voltage Standard and Development of 10-V Josephson Junction Array at ETL
- Use of the Josephoson Junction Array Voltage Standard in Industry
- CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments(Speech and Hearing)
- A Design for a Collaborative Steering System of Microphone Array and Video Camera Toward Multi-Lingual Tele-Conference (特集 インタラクション技術の革新と実用化)
- A design of adaptive beamformer based on average speech spectrum for noisy speech recognition
- A Microphone Array-Based 3-D N-Best Search Method for Recognizing Multiple Sound Sources
- 3D N-best 探索法に基づく複数音源の位置推定と音声認識の統合
- 複数話者の音声認識における音源方向経路間距離を用いた3-D N-best探索法の評価
- The present status, progress, and usage of speech databases in Japan
- Thermophilic Alkaline Xylanase from Newly Isolated Alkaliphilic and Thermophilic Bacillus sp. Strain TAR-1
- Degradation of Human Hair by a Thermostable Alkaline Protease from Alkaliphilic Bacillus sp. No.AH-101
- Molecular Cloning, Nucleotide Sequence, and Expression of the Structural Gene for Alkaline Serine Protease from Alkaliphilic Bacillus sp.221
- IMPROVING ACCURACY IN PARAMETER ESTIMATION IN AN EXTENDED KALMAN PARTICLE FILTERS FOR NOISY SPEECH RECOGNITION
- ATR Parallel Decoding Based Speech Recognition System Robust to Noise and Speaking Styles(Speech Recognition, Statistical Modeling for Speech Processing)
- Search computing based on Google API for QA system (自然言語処理)
- Search computing based on Google API for QA system (言語理解とコミュニケーション)
- Construction of Audio-Visual Speech Corpus Using Motion-Capture System and Corpus Based Facial Animation(Life-like Agent and its Communication)
- Multi-Lingual Multi-Function Multi-Media Intelligent System
- Nonparametric Speaker Recognition Method Using Earth Mover's Distance(Speaker Recognition, Statistical Modeling for Speech Processing)
- Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance
- Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance
- Speaker Recognition using a Non-parametric Speaker Model Representation and Earth Mover's Distance
- Charge-Independence-Breaking Interactions in sd-Shell Nuclei : Nuclear Physics
- Translation of Japanese Noun Compounds at Super-Function Based MT System
- Passive hybrid subtractive beamformer for near-field sound sources
- Learning, Generation and Recognition of Motions by Reference-Point-Dependent Probabilistic Models
- An Acoustic Modeling Method Robustagainst Changes of Speaking Stylein Error Recovery
- A Hybrid HMM/BN Acoustic Model Utilizing Pentaphone-Context Dependency(Speech Recognition, Statistical Modeling for Speech Processing)
- Improving Acoustic Model Precision by Incorporating a Wide Phonetic Context Based on a Bayesian Framework(Speech Recognition, Statistical Modeling for Speech Processing)
- A Hybrid HMM/BN Acoustic Model for Automatic Speech Recognition (Special Issue on Speech Information Processing)
- A Model of Mental State Transition Network
- A New Question Answering System for Chinese Restricted Domain(Language,Human Communication II)
- Effects of Phoneme Type and Frequency on Distributed Speaker Identification and Verification(Speech and Hearing)
- MIXTURE OF FACTOR ANALYZED HMM
- Iterative Estimation and Compensation of Signal Direction for Moving Sound Source by Mobile Microphone Array(Engineering Acoustics)
- TIME-VARYING NOISE COMPENSATION BY SEQUENTIAL MONTE CARLO METHOD
- Ambient Browser: Web Browser for Daily Use (日韓合同ワークショップ 1st Korea-Japan Joint Workshop on Ubiquitous Computing and Networking Systems (ubiCNS 2005))
- Burst Error Recovery for Huffman Coding(Algorithm Theory)
- Improving Parsing of 'BA' Sentences for Machine Translation
- Audio-Visual Speech Recognition Based on Optimized Product HMMs and GMM Based-MCE-GPD Stream Weight Estimation (Special Issue on Speech Information Processing)
- Situated Spoken Dialogue with Robots Using Active Learning