Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we propose a new speaker-class modeling and its adaptation method for the LVCSR system and evaluate the method on the Corpus of Spontaneous Japanese (CSJ). In this method, closer speakers are selected from training speakers and the acoustic models are trained by using their utterances for each evaluation speaker. One of the major issues of the speaker-class model is determining the selection range of speakers. In order to solve the problem, several models which have a variety of speaker range are prepared for each evaluation speaker in advance, and the most proper model is selected on a likelihood basis in the recognition step. In addition, we improved the recognition performance using unsupervised speaker adaptation with the speaker-class models. In the recognition experiments, a significant improvement could be obtained by using the proposed speaker adaptation based on speaker-class models compared with the conventional adaptation method.
- (社)電子情報通信学会の論文
- 2010-09-01
著者
-
ITO Takashi
Graduate School of Engineering, Tohoku University
-
Kato Masaharu
Graduate School Of Life Science Tohoku University
-
Kohda Masaki
Graduate School Of Science And Engineering Yamagata University
-
KOSAKA Tetsuo
Graduate School of Science and Engineering, Yamagata University
-
TAKEDA Yuui
Faculty of Engineering, Yamagata University
-
Kosaka Tetsuo
Graduate School Of Science And Engineering Yamagata University
-
Takeda Yuui
Faculty Of Engineering Yamagata University
-
Kato Masaharu
Graduate School Of Arts And Sciences The Univresity Of Tokyo
-
ITO Takashi
Graduate School of Science and Engineering, Yamagata University
関連論文
- The Evaluation of New Amorphous Hydrocarbon Film aCHx, for Copper Barrier Dielectric Film in Low-k Copper Metallization
- The Excess Light Energy that is neither Utilized in photosynthesis nor Dissipated by Photoprotective Mechanisms Determines the Rate of Photoinactivation Photosystem II
- The effect of head motion on the accuracy of sound localization
- High-throughput Fluorescence Labelling of Full-length cDNA Products Based on a Reconstituted Translation System
- ACT Domain Repeat Protein 7, ACR7, Interacts with a Chaperone HSP18.0-CII in Rice Nuclei
- Interaction of N-Acetylglutamate Kinase with a PII-Like Protein in Rice
- Lecture Speech Recognition Using Discrete-Mixture HMMs
- Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition
- FE-based Analysis for the Microstructure Evolution in Hot Bar Rolling
- Histogram equalization for noise-robust speech recognition using discrete-mixture HMMs
- Robust Speech Recognition Using Discrete-Mixture HMMs(Speech and Hearing)
- Characteristics of Nano-Grating N-Channel MOSFETs for Improved Current Drivability(Semiconductor Materials and Devices)
- Lateral Recrystallized Si Thin Films with Large Tensile Strain for High Performance Thin Film Transistors
- Audio-visual like in auditory spatial discrimination
- Enlargement of Crystal-Grains in Thin Silicon Films Using Continuous-Wave Laser Irradiation
- Evaluation of New Amorphous Hydrocarbon Film for Copper Barrier Dielectric Film in Low-$k$ Copper Metallization
- Low-Temperature Recrystallization of Ferroelectric Lead Zirconate Titanate Thin Films on Glass Substrate Using Continuous-Wave Green Laser
- Analysis of Continuous-Wave Laser Lateral Crystallized Polycrystalline Silicon Thin Films with Large Tensile Strain
- Roughness Reduction in Polycrystalline Silicon Thin Films Formed by Continuous-Wave Laser Lateral Crystallization with Cap SiO2 Thin Films
- Enlargement of Crystal Grains in Thin Silicon Films by Continuous-Wave Laser Irradiation