A voice conversion based on phoneme segment mapping
スポンサーリンク
概要
- 論文の詳細を見る
Voice conversion is a technique to change speaker individuality;i. e. , speech uttered by a speaker is changed to sound as if another speaker had uttered it. In this paper, we propose a voice conversion algorithm that uses speech segments as conversion units. Input speech is decomposed into speech segments by a speech recognition module and the segments are replaced by speech segments uttered by another speaker. This algorithm makes it possible to convert not only the static characteristics but also speaker individuality contained in phoneme segments. The proposed voice conversion algorithm was performed between two male speakers. Spectrum distortion between target speech and the converted speech was reduced to one-third the natural spectrum difference between the two speakers. A litening experiment showed that, in terms of speaker identification accuracy, the speech converted by segment-sized units gave a score 20% higher than the speech converted frame-by-frame.
- 社団法人日本音響学会の論文
著者
-
Sagayama S
Atr Interpreting Telephony Research Lab.
-
Abe Masanobu
ATR Interpreting Telephony Research Laboratories
-
Sagayama Shigeki
ATR Interpreting Telephony Research Laboratories
関連論文
- A voice conversion based on phoneme segment mapping
- Spoken Sentence Recognition Based on HMM-LR with Hybrid Language Modeling (Special Issue on Natural Language Processing and Understanding)
- LR Parsing with a Category Reachability Test Applied to Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- A pairwise discriminant approach using artificial neural networks for continuous speech recognition
- Speaker Weighted Training of HMM Using Multiple Reference Speakers
- Speaker Adaptation Based on Vector Field Smoothing
- Minimum error classification training of HMMs : Implementation details and experimental results
- Isolated Word Recognition Using Pitch Pattern Information
- Three Different LR Parsing Algorithms for Phoneme-Context-Dependent HMM-Based Continuous Speech Recognition (Special Issue on Speech and Discourse Processing in Dialogue Systems)