HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we propose a new mask estimation method for the computational auditory scene analysis (CASA) of speech using two microphones. The proposed method is based on a hidden Markov model (HMM) in order to incorporate an observation that the mask information should be correlated over contiguous analysis frames. In other words, HMM is used to estimate the mask information represented as the interaural time difference (ITD) and the interaural level difference (ILD) of two channel signals, and the estimated mask information is finally employed in the separation of desired speech from noisy speech. To show the effectiveness of the proposed mask estimation, we then compare the performance of the proposed method with that of a Gaussian kernel-based estimation method in terms of the performance of speech recognition. As a result, the proposed HMM-based mask estimation method provided an average word error rate reduction of 61.4% when compared with the Gaussian kernel-based mask estimation method.
- (社)電子情報通信学会の論文
- 2008-09-01
著者
-
Park Ji
Department of Herbology, College of Oriental Medicine, Daegu Haany University
-
KIM Hong
Department of Physiology, College of Medicine, Kyung Hee University
-
YOON Jae
Department of Information and Communications, Gwangju Institute of Science and Technology (GIST)
-
Park Ji
Department Of Information And Communications Gwangju Institute Of Science And Technology (gist)
-
Kim Hong
Department Of Chemistry Advanced Materials Chemistry Research Center Korea University
-
Kim Hong
Department Of Information And Communications Gwangju Institute Of Science And Technology (gist)
-
Park Ji
Department Of Biotechnology Pukyong National University
-
Yoon Jae
Department Of Information And Communications Gwangju Institute Of Science And Technology (gist)
-
PARK Ji
Department of Biomedical Engineering, College of Biomedical Science & Engineering, Inje University
-
Kim Hong
Department of Applied Sciences, Korea Maritime University, Busan 606-791, Korea
関連論文
- The Anti-osteoporotic Effect of Aqueous Extracts of Gastrodiae Rhizoma In Vitro and In Vivo
- Maternal Alcohol Administration Suppresses Expression of Nitric Oxide Synthase in the Hippocampus of Offspring Rats
- Folium mori Increases Cell Proliferation and Neuropeptide Y Expression in Dentate Gyrus of Streptozotocin-Induced Diabetic Rats
- Effects of Paeonia radix on 5-Hydroxytryptamine Synthesis and Tryptophan Hydroxylase Expression in the Dorsal Raphe of Exercised Rats
- A dministration of Folium mori Extract Decreases Nitric Oxide Synthase Expression in the Hypothalamus of Streptozotocin-Induced Diabetic Rats
- Whole Blood and Red Blood Cell Manganese Reflected Signal Intensities of T1-Weighted Magnetic Resonance Images better than Plasma Manganese in Liver Cirrhotics
- A New Carbazole-based Conjugated Multibranched Molecule and Its Tetramer as Hole Transporting Materials
- Protective Effect of Gabapentin on N-Methyl-D-aspartate-Induced Excitotoxicity in Rat Hippocampal CA1 Neurons
- Effect of postnatal treadmill exercise on c-Fos expression in the hippocampus of rat pups born from the alcohol-intoxicated mothers
- Maternal swimming during pregnancy enhances short-term memory and neurogenesis in the hippocampus of rat pups
- Influence of prenatal noise and music on the spatial memory and neurogenesis in the hippocampus of developing rats
- A MFCC-Based CELP Speech Coder for Server-Based Speech Recognition in Network Environments(Speech/Audio Processing,Multimedia and Mobile Signal Processing)
- 3-6.Microsatellite Instability and Loss of Bax Expression in Endometroid Endometrial Cancers(Session 4 Oncology 4)
- Seroprevalence to the Circumsporozoite Protein Peptide Antigen of Plasmodium vivax in Korean Children
- Silicon-containing Bilayer Resist Based on a Single Component Nonchemically Amplified Resist System
- Role of Helicobacter pylori infection in aberrant DNA methylation along multistep gastric carcinogenesis
- Direct Application of Avall PCR Restriction Fragment Length Polymorphism Analysis (Avall PRA) Targeting 644 bp Heat Shock Protein 65 (hsp65) Gene to Sputum Samples
- Overcoming Two Post-fertilization Genetic Barriers in Interspecific Hybridization between Capsicum annuum and C. baccatum for Introgression of Anthracnose Resistance
- Enantioseparation of racemic N-acylarylalkylamines on various amino alcohol derived π-acidic chiral stationary phases
- Structural and Luminescence Characteristics of Post-Annealed ZnO Films on Si (111) in H_2O Ambient
- Room Temperature Processable Organic-Inorganic Hybrid Photolithographic Materials Based on a Methoxysilane Cross-Linker
- A Statistical Approach to Error Compensation in Spectral Quantization(Speech and Hearing)
- Activation of extracellular signal regulated kinase 1/2 in human dermal microvascular endothelial cells stimulated by antiendothelial cell antibodies in sera of patients with Behcet's disease
- The flavonoid quercetin induces apoptosis and inhibits migration through a MAPK-dependent mechanism in osteoblasts
- Clinical usefulness of 18F-FDG PET-CT for patients with gallbladder cancer and cholangiocarcinoma
- Successful elimination of Ascaris lumbricoides from the gallbladder by conservative medical therapy
- HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis
- Identification of Heat Shock Protein 90-Associated 84-kDa Phosphoprotein
- Three-Dimensional Kinematic Analysis during Upslope Walking with Different Inclinations by Healthy Adults
- O-1-194 Cost Effectiveness of Laparoscopic Appendectomy in Korean Academic Hospital
- Conjunctival Nevus-like Lesions Originating from a Sclerotomy Site After 23-Gauge Transconjunctival Sutureless Vitrectomy
- Endometrial carcinoma in a patient having 45,X Turner syndrome with gonadal mosaicism
- Efficacy of array comparative genomic hybridization in a fetus with an inherited apparently balanced translocation : A case report
- Harmonic Model Based Excitation Enhancement for Low-Bit-Rate Speech Coding(Speech and Hearing)
- Efficient Transformation of α,β-Enone to Substituted Furans via Phosphoniosilylation
- Orlicz norm estimates for Polsson maximal operators
- Use of high-performance liquid chromatographic and microbiological analyses for evaluating the presence or absence of active metabolites of the antifungal posaconazole in human plasma
- High-performance liquid chromatographic analysis of the anti-tumor agent SCH 66336 in cynomolgus monkey plasma and evaluation of its chiral inversion in animals
- High-performance liquid chromatographic determination and stability of 5-(3-methyltriazen-1-yl)-imidazo-4-carboximide, the biologically active product of the antitumor agent temozolomide, in human plasma
- RAPD identification of genetic variation in seaweed Hizikia fusiformis(Fucales, Phaeophyta)
- Assay of omeprazole and omeprazole sulfone by semi-microcolumn liquid chromatography with mixed-function precolumn
- 野生型(AhR+/+)ならびにアリールハイドロカーボン受容体欠損(AhR-/-)マウス肝臓における2,3,7,8-Tetrachlorodibenzo-p-Dioxinによる遺伝子誘導プロファイル(毒性学)
- Age-related changes in the microarchitecture of collagen fibrils in the articular disc of the rat temporomandibular joint
- Marie Unna Hypotrichosis in an Asian Family
- Basic Study on the Radio Frequency Characteristics of the Transmission Lines Employing Periodically Perforated Ground Metal on GaAs Monolithic Microwave Integrated Circuit and Their Equivalent Ciruits
- The Temporal Change of Cortical Activation Induced by the Ongoing Effects of Transcranial Direct Current Stimulation
- Comparative Proteomic Analysis of Soybean Nodulation Using a Supernodulation Mutant, SS2-2
- DILUTION CHARACTERISTICS OF THERMAL DIFFUSERS IN COASTAL REGIONS WITH STRONG CURRENTS
- Time Course of Vasospasm : Its Clinical Significance
- Nesidioblastosis in an Adult with Hyperinsulinemic Hypoglycemia
- Segmental dilatation of the ileum presenting as a cystic lesion on prenatal ultrasonography in one twin
- A Case of Transient Myopia After Blunt Eye Trauma
- Valuing Health Risks from Air Pollution : A Review of the Literature and a Conceptual Model
- Cortical Activation Pattern according to Discrimination of One-Point and Two-Point Tactile Sensory Inputs : an fMRI Study
- Compensation of Speech Coding Distortion for Wireless Speech Recognition(Speech and Hearing)
- Study of a-Plane GaN Epitaxial Lateral Overgrowth Using Carbonized Photoresist Mask on r-Plane Sapphire
- Prognostic implications of microscopic involvement of surgical resection margin in patients with differentiated papillary thyroid cancer after high-dose radioactive iodine ablation
- A Study of Stacked Buffer Layers for the Epitaxial Growth of Zn0.58Mg0.42O Films on c-Sapphire by Pulsed Laser Deposition
- Structural and Luminescence Characteristics of Post-Annealed ZnO Films on Si (111) in H2O Ambient
- Prevalence of Plasmid-Mediated Quinolone Resistance Genes and Ciprofloxacin Resistance in Pediatric Bloodstream Isolates of Enterobacteriaceae over a 9-Year Period
- Effect of autologous platelet-rich plasma on persistent corneal epithelial defect after infectious keratitis
- Effect of Austenitizing Temperature on Microstructure and Mechanical Properties of 12% Cr Steel.
- Synthesis and Characterization of Y2O3:Eu Phosphor Derived by Solution-Combustion Method
- Identification of Heat Shock Protein 90-Associated 84-kDa Phosphoprotein.
- Laparoscopic major liver resection in Korea : a multicenter study