Applying Sparse KPCA for Feature Extraction in Speech Recognition(Feature Extraction and Acoustic Medelings, <Special Section>Corpus-Based Speech Technologies)
スポンサーリンク
概要
- 論文の詳細を見る
This paper presents an analysis of the applicability of Sparse Kernel Principal Component Analysis (SKPCA) for feature extraction in speech recognition, as well as, a proposed approach to make the SKPCA technique realizable for a large amount of training data, which is an usual context in speech recognition systems. Although the KPCA (Kernel Principal Component Analysis) has proved to be an efficient technique for being applied to speech recognition, it has the disadvantage of requiring training data reduction, when its amount is excessively large. This data reduction is important to avoid computational unfeasibility and/or an extremely high computational burden related to the feature representation step of the training and the test data evaluations. The standard approach to perform this data reduction is to randomly choose frames from the original data set, which does not necessarily provide a good statistical representation of the original data set. In order to solve this problem a likelihood related re-estimation procedure was applied to the KPCA framework, thus creating the SKPCA, which nevertheless is not realizable for large training databases. The proposed approach consists in clustering the training data and applying to these clusters a SKPCA like data reduction technique generating the reduced data clusters. These reduced data clusters are merged and reduced in a recursive procedure until just one cluster is obtained, making the SKPCA approach realizable for a large amount of training data. The experimental results show the efficiency of SKPCA technique with the proposed approach over the KPCA with the standard sparse solution using randomly chosen frames and the standard feature extraction techniques.
- 社団法人電子情報通信学会の論文
- 2005-03-01
著者
-
ZEN Heiga
Department of Computer Science and Engineering, Nagoya Institute of Technology
-
TOKUDA Keiichi
Department of Computer Science and Engineering, Nagoya Institute of Technology
-
Zen Heiga
Department Of Computer Science And Engineering Nagoya Institute Of Technology
-
Tokuda Keiichi
Department Of Computer Science And Engineering Nagoya Institute Of Technology
-
Lima Amaro
Department Of Computer Science And Engineering Nagoya Institute Of Technology
-
NANKAKU Yoshihiko
Department of Computer Science and Engineering, Nagoya Institute of Technology
-
KITAMURA Tadashi
Department of Computer Science and Engineering, Nagoya Institute of Technology
-
RESENDE Fernando
Department of Electronics and Computer Science
-
Kitamura Tadashi
Department Of Computer Science And Engineering Nagoya Institute Of Technology
-
Kitamura Tadashi
Department Of Cardiothoracic Surgery The University Of Tokyo
-
Nankaku Yoshihiko
Department Of Computer Science And Engineering Nagoya Institute Of Technology
-
Tokuda Keiichi
Department Of Computer Science Naogya Institute Of Technology
-
Zen Heiga
Department Of Computer Science Naogya Institute Of Technology
-
Kitamura Tadashi
Department Of Cardiothoracic Surgery Faculty Of Medicine University Of Tokyo
関連論文
- The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006
- Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005(Speech and Herring)
- Effects of Nicorandil on Cardiovascular Events in Patients With Coronary Artery Disease in The Japanese Coronary Artery Disease (JCAD) Study
- Gender Differences in Patients With Coronary Artery Disease in Japan : The Japanese Coronary Artery Disease Study (The JCAD Study)
- Beta-Blocker Prescription Among Japanese Cardiologists and Its Effect on Various Outcomes
- Relationship Between Renal Dysfunction and Severity of Coronary Artery Disease in Japanese Patients
- PJ-038 Cystatin C Predicts Severity of Coronary Artery Disease Even in Patients without Chronic Kidney Disease (CKD)(PJ007,Kidney/Renal Circulation/CKD 1 (H),Poster Session (Japanese),The 73rd Annual Scientific Meeting of The Japanese Circulation Society)
- Applying Sparse KPCA for Feature Extraction in Speech Recognition(Feature Extraction and Acoustic Medelings, Corpus-Based Speech Technologies)
- On the Use of Kernel PCA for Feature Extraction in Speech Recognition(Speech and Hearing)
- The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006
- A Hidden Semi-Markov Model-Based Speech Synthesis System(Speech and Hearing)
- State Duration Modeling for HMM-Based Speech Synthesis(Speech and Hearing)
- A Training Method of Average Voice Model for HMM-Based Speech Synthesis(Digital Signal Processing)
- A Context Clustering Technique for Average Voice Models (Special Issue on Speech Information Processing)
- Speaker Adaptation of Pitch and Spectrum for HMM-Based Speech Synthesis
- Multi-Space Probability Distribution HMM(Special Issue on the 2000 IEICE Excellent Paper Award)
- Vector Quantization of Speech Spectral Parameters Using Statistics of Static and Dynamic Features
- Text-Independent Speaker Identification Using Gaussian Mixture Models Based on Multi-Space Probability Distribution (Special Issue on Biometric Person Authentication)
- Establishment of a method of anonymization of DNA samples in genetic research
- A Reordering Model Using a Source-Side Parse-Tree for Statistical Machine Translation
- Clinical Experience with Cryopreserved Allografts for Aortic Infection
- A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System
- Plasma Cystatin C Concentration Reflects the Severity of Coronary Artery Disease in Patients Without Chronic Kidney Disease
- Mixture Density Models Based on Mel-Cepstral Representation of Gaussian Process(Digital Signal Processing)
- Pseudoaneurysm Developed after Aortic Root Homograft Implantation
- LMS-Based Algorithms with Multi-Band Decomposition of the Estimation Error Applied to System Identification (Special Section on Digital Signal Processing)
- Multi-Band Decomposition of the Linear Prediction Error Applied to Adaptive AR Spectral Estimation
- Adaptive AR Spectral Estimation Based on Wavelet Decomposition of the Linear Prediction Error
- A Covariance-Typing Technique for HMM-Based Speech Synthesis
- Characteristics of Multi-Layer Perceptron Models in Enhancing Degraded Speech
- Prevalence of Vitreous Hemorrhage Following Coronary Revascularization in Patients With Diabetic Retinopathy
- Parameter Sharing in Mixture of Factor Analyzers for Speaker Identification(Feature Extraction and Acoustic Medelings, Corpus-Based Speech Technologies)
- Deterministic Annealing EM Algorithm in Acoustic Modeling for Speaker and Speech Recognition(Feature Extraction and Acoustic Medelings, Corpus-Based Speech Technologies)
- Continuous Speech Recognition Based on General Factor Dependent Acoustic Models(Feature Extraction and Acoustic Medelings, Corpus-Based Speech Technologies)
- Bayesian Context Clustering Using Cross Validation for Speech Recognition
- Physical Model-Based Indirect Measurement of Blood Flow and Pressures for Pulsatile Circulatory Assist-In Vitro Study
- Reformulating the HMM as a Trajectory Model
- Reformulating the HMM as a Trajectory Model
- Reformulating the HMM as a Trajectory Model
- Intensively Lowering Both Low-Density Lipoprotein Cholesterol and Blood Pressure Does Not Reduce Cardiovascular Risk in Japanese Coronary Artery Disease Patients
- ファジィ推論とリアルタイムモデルの併用について
- Speech recognition based on statistical models including multiple phonetic decision trees
- A Modeling Support Tool for a Global Human Model on the Internet
- The Development of a Physiological Simulation System for the Human Circulatory System Coupling Macro and Micro Models
- Report of the Committee on the classification and diagnostic criteria of diabetes mellitus : The Committee of the Japan Diabetes Society on the diagnostic criteria of diabetes mellitus
- Diagnostic Simulation Tool for a Circulatory System Model Based on Interpretive Structural Modeling
- How can a robot have consciousness?
- Animal-like Behavior Design of Small Robots by the Model of Subjective World and Behavior
- An Integrated Simulation Tool for Modeling the Human Circulatory System(Bioengineering)
- Animal-like behavior design of small robots by consciousness-based architecture
- A Case of Implantation of ICD over 30 Years after CABG for Coronary Arterial Lesions Due to Kawasaki Disease
- Moderate Prosthesis-Patient Mismatch May Be Negligible in Elderly Patients Undergoing Conventional Aortic Valve Replacement for Aortic Stenosis
- A Bayesian Framework Using Multiple Model Structures for Speech Recognition
- Neutrophil Elastase Inhibitor Sivelestat Attenuates Perioperative Inflammatory Response in Pediatric Heart Surgery With Cardiopulmonary Bypass:A Prospective Randomized Study
- Speaker interpolation for HMM-based speech synthesis system