Hybrid Voice Conversion of Unit Selection and Generation Using Prosody Dependent HMM(Speech and Hearing)
スポンサーリンク
概要
- 論文の詳細を見る
We propose a hybrid voice conversion method which employs a combination of techniques using HMM-based unit selection and spectrum generation. In the proposed method, the HMM-based unit selection selects the most likely unit for the required phoneme context from the target speaker's corpus when candidates of the target unit exist in the corpus. Unit selection is performed based on the sequence of the spectral probability distribution obtained from the adapted HMMs. On the other hand, when a target unit does not exist in a corpus, a target waveform is generated from the adapted HMM sequence by maximizing the spectral likelihood. The proposed method also employs the HMM in which the spectral probability distribution is adjusted to the target prosody using the weight defined by the prosodic probability of each distribution. To show the effectiveness of the proposed method, sound quality and speaker individuality tests were conducted. The results revealed that the proposed method could produce high-quality speech and individuality of the synthesized sound was more similar to the target speaker compared to conventional methods.
- 社団法人電子情報通信学会の論文
- 2006-11-01
著者
-
OKUBO Tadashi
Department of Biological Sciences, Faculty of Fisheries, Hokkaido University
-
Okubo Tadashi
Department Of Computer Science Waseda University
-
Kobayashi Tetsunori
Department Of Computer Science Waseda University
-
MOCHIZUKI Ryo
Department of Computer Science, Waseda University
-
Mochizuki Ryo
Department Of Computer Science Waseda University:av Core Technology Development Center Matsushita El
-
Mochizuki Ryo
Department Of Computer Science Waseda University
関連論文
- Ultrastructural changes in gill chloride cells during smoltification in wild and hatchery-reared masu salmon Oncorhynchus masou
- Adult Bone Marrow Cells with Transdifferentiation Potential for Cadiomyocytes Possess the Early Vascular Progenitor Properties and Activatable Cardiomyocyte Differentiation Systems
- Identification of a single cell-derived bone marrow cell line with transdifferentiation potential for cardiomyocytes
- Genome Structure and Differential Expression of Two Isoforms of a Novel PDZ-Containing Myosin (MysPDZ) (Myo18A)
- Stromal Cells Provide Signals Different from Cytokines for STAT5 Activation in Hematopoietic Cells
- Effects of Human Granulocyte-Macrophage Colony Stimulating Factor(hGM-CSF)on Lymphoid and Myeloid Differentiation of Sorted Hematopoietic Stem Cells from hGM-CSF Receptor Gene Transgenic Mice
- Stroma - dependent Maintenance of Cytokine Responsive Hematopoietic Progenitor Cells Derived from Long - term Bone Marrow Culture
- Ears of the Robot : Direction of Arrival Estimation Based on Pattern Recognition Using Robot-Mounted Microphones
- JS-I-5 LIGATION OF PATENT DUCTUS ARTERIOSUS IN PREMATURE INFANTS : REPORT OF 36 CASES
- S-II-5. A Quantitative Stufy on CSF Dynamics of the"Normal Pressure Hydrocephalus" : Radioisotope Ventricular Clearance Using a Gamma Camera
- B-34. Radioisotope Ventricular Clearance on the C.S.F. Dynamics : A Study on Microcephaly and Hydrocephalus
- Influence of Lombard Effect : Accuracy Analysis of Simulation-Based Assessments of Noisy Speech Recognition Systems for Various Recognition Conditions
- A Low-Band Spectrum Envelope Reconstruction Method for PSOLA-Based F_0 Modification(Speech and Hearing)
- Fusion-Based Age-Group Classification Method Using Multiple Two-Dimensional Feature Extraction Algorithms(Pattern Recognition)
- Ears of the Robot : Three Simultaneous Speech Segregation and Recognition Using Robot-Mounted Microphones(Speech and Hearing)
- Hybrid Voice Conversion of Unit Selection and Generation Using Prosody Dependent HMM(Speech and Hearing)
- Speech Enhancement Using a Square Microphone Array in the Presence of Directional and Diffuse Noise
- Infantile Pulmonary Alveolar Proteinosis with Interstitial Pneumonia : Bilateral Simultaneous Lung Lavage Utilizing Extracorporeal Membrane Oxygenation and Steroid Therapy
- Single-Stage Transmedial Approach to a Stanford Type B Dissection in a Patient with Marfan's Syndrome
- Conversational robots: An approach to conversation protocol issues that utilizes the paralinguistic information available in a robot-human setting