A MFCC-Based CELP Speech Coder for Server-Based Speech Recognition in Network Environments(Speech/Audio Processing,<Special Section>Multimedia and Mobile Signal Processing)
スポンサーリンク
概要
- 論文の詳細を見る
Existing standard speech coders can provide high quality speech communication. However, they tend to degrade the performance of automatic speech recognition (ASR) systems that use the reconstructed speech. The main cause of the degradation is in that the linear predictive coefficients (LPCs), which are typical spectral envelope parameters in speech coding, are optimized to speech quality rather than to the performance of speech recognition. In this paper, we propose a speech coder using mel-frequency cepstral coefficients (MFCCs) instead of LPCs to improve the performance of a server-based speech recognition system in network environments. To develop the proposed speech coder with a low-bit rate, we first explore the interframe correlation of MFCCs, which results in the predictive quantization of MFCC. Second, a safety-net scheme is proposed to make the MFCC-based speech coder robust to channel errors. As a result, we propose an 8.7kbps MFCC-based CELP coder. It is shown that the proposed speech coder has a comparable speech quality to 8kbps G.729 and the ASR system using the proposed speech coder gives the relative word error rate reduction by 6.8% as compared to the ASR system using G.729 on a large vocabulary task (AURORA4).
- 社団法人電子情報通信学会の論文
- 2007-03-01
著者
-
KIM Hong
Department of Physiology, College of Medicine, Kyung Hee University
-
Lee Gil
Department Of Information And Communications Gist:(present Office)sc Management Center Sumsung Elect
-
YOON Jae
Department of Information and Communications, Gwangju Institute of Science and Technology (GIST)
-
Lee Gil
Department Of Chemical And Biological Engineering Korea University
-
Yoon Jae
Gwangju Inst. Of Sci. And Technol. (gist) Gwangju Kor
-
Kim Hong
Department Of Chemistry Advanced Materials Chemistry Research Center Korea University
-
Kim Hong
Gwangju Inst. Of Sci. And Technol. (gist) Gwangju Kor
-
Yoon Jae
Department Of Information And Communications Gwangju Institute Of Science And Technology (gist)
-
Kim Hong
Department of Applied Sciences, Korea Maritime University, Busan 606-791, Korea
関連論文
- Maternal Alcohol Administration Suppresses Expression of Nitric Oxide Synthase in the Hippocampus of Offspring Rats
- Folium mori Increases Cell Proliferation and Neuropeptide Y Expression in Dentate Gyrus of Streptozotocin-Induced Diabetic Rats
- Effects of Paeonia radix on 5-Hydroxytryptamine Synthesis and Tryptophan Hydroxylase Expression in the Dorsal Raphe of Exercised Rats
- A dministration of Folium mori Extract Decreases Nitric Oxide Synthase Expression in the Hypothalamus of Streptozotocin-Induced Diabetic Rats
- A New Carbazole-based Conjugated Multibranched Molecule and Its Tetramer as Hole Transporting Materials
- Protective Effect of Gabapentin on N-Methyl-D-aspartate-Induced Excitotoxicity in Rat Hippocampal CA1 Neurons
- Effect of postnatal treadmill exercise on c-Fos expression in the hippocampus of rat pups born from the alcohol-intoxicated mothers
- Maternal swimming during pregnancy enhances short-term memory and neurogenesis in the hippocampus of rat pups
- Influence of prenatal noise and music on the spatial memory and neurogenesis in the hippocampus of developing rats
- A MFCC-Based CELP Speech Coder for Server-Based Speech Recognition in Network Environments(Speech/Audio Processing,Multimedia and Mobile Signal Processing)
- Seroprevalence to the Circumsporozoite Protein Peptide Antigen of Plasmodium vivax in Korean Children
- Direct Application of Avall PCR Restriction Fragment Length Polymorphism Analysis (Avall PRA) Targeting 644 bp Heat Shock Protein 65 (hsp65) Gene to Sputum Samples
- Overcoming Two Post-fertilization Genetic Barriers in Interspecific Hybridization between Capsicum annuum and C. baccatum for Introgression of Anthracnose Resistance
- Structural and Luminescence Characteristics of Post-Annealed ZnO Films on Si (111) in H_2O Ambient
- A Statistical Approach to Error Compensation in Spectral Quantization(Speech and Hearing)
- Polydiacetylene Supramolecules Embedded in PVA Film for Strip-type Chemosensors
- Activation of extracellular signal regulated kinase 1/2 in human dermal microvascular endothelial cells stimulated by antiendothelial cell antibodies in sera of patients with Behcet's disease
- Plasma-Enhanced Chemical Vapor Deposition Growth of Fluorinated Amorphous Carbon Thin Films Using C_4F_8 and Si_2H_6/He for Low-Dielectric-Constant Intermetallic-Layer Dielectrics
- Clinical usefulness of 18F-FDG PET-CT for patients with gallbladder cancer and cholangiocarcinoma
- Successful elimination of Ascaris lumbricoides from the gallbladder by conservative medical therapy
- HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis
- Identification of Heat Shock Protein 90-Associated 84-kDa Phosphoprotein
- Conjunctival Nevus-like Lesions Originating from a Sclerotomy Site After 23-Gauge Transconjunctival Sutureless Vitrectomy
- Harmonic Model Based Excitation Enhancement for Low-Bit-Rate Speech Coding(Speech and Hearing)
- Efficient Transformation of α,β-Enone to Substituted Furans via Phosphoniosilylation
- Orlicz norm estimates for Polsson maximal operators
- Use of high-performance liquid chromatographic and microbiological analyses for evaluating the presence or absence of active metabolites of the antifungal posaconazole in human plasma
- High-performance liquid chromatographic analysis of the anti-tumor agent SCH 66336 in cynomolgus monkey plasma and evaluation of its chiral inversion in animals
- High-performance liquid chromatographic determination and stability of 5-(3-methyltriazen-1-yl)-imidazo-4-carboximide, the biologically active product of the antitumor agent temozolomide, in human plasma
- Age-related changes in the microarchitecture of collagen fibrils in the articular disc of the rat temporomandibular joint
- Marie Unna Hypotrichosis in an Asian Family
- Basic Study on the Radio Frequency Characteristics of the Transmission Lines Employing Periodically Perforated Ground Metal on GaAs Monolithic Microwave Integrated Circuit and Their Equivalent Ciruits
- DILUTION CHARACTERISTICS OF THERMAL DIFFUSERS IN COASTAL REGIONS WITH STRONG CURRENTS
- Plasma-Enhanced Chemical Vapor Deposition Growth of Fluorinated Amorphous Carbon Thin Films Using C4F8 and Si2H6/He for Low-Dielectric-Constant Intermetallic-Layer Dielectrics
- Time Course of Vasospasm : Its Clinical Significance
- Nesidioblastosis in an Adult with Hyperinsulinemic Hypoglycemia
- A Case of Transient Myopia After Blunt Eye Trauma
- Valuing Health Risks from Air Pollution : A Review of the Literature and a Conceptual Model
- Compensation of Speech Coding Distortion for Wireless Speech Recognition(Speech and Hearing)
- A Study of Stacked Buffer Layers for the Epitaxial Growth of Zn0.58Mg0.42O Films on c-Sapphire by Pulsed Laser Deposition
- Structural and Luminescence Characteristics of Post-Annealed ZnO Films on Si (111) in H2O Ambient
- Highly Conductive Flexible Multi-Walled Carbon Nanotube Sheet Films for Transparent Touch Screen
- Prevalence of Plasmid-Mediated Quinolone Resistance Genes and Ciprofloxacin Resistance in Pediatric Bloodstream Isolates of Enterobacteriaceae over a 9-Year Period
- Effect of autologous platelet-rich plasma on persistent corneal epithelial defect after infectious keratitis
- Highly Conductive Flexible Multi-Walled Carbon Nanotube Sheet Films for Transparent Touch Screen (Special Issue : Active-Matrix Flatpanel Displays and Devices : TFT Technologies and FPD Materials)
- Effect of Austenitizing Temperature on Microstructure and Mechanical Properties of 12% Cr Steel.
- Identification of Heat Shock Protein 90-Associated 84-kDa Phosphoprotein.
- Laparoscopic major liver resection in Korea : a multicenter study