Phonetically Balanced Text Corpus Design Using a Similarity Measure for a Stereo Super-Wideband Speech Database
Abstract
In this paper, we propose a text corpus design method for a Korean stereo super-wideband speech database. Since speech coding generally requires only a small text corpus, the corpus should be designed to reflect the pronunciation behavior of natural conversation so that speech quality tests are efficient. To this end, the proposed design method uses a similarity measure between the phoneme distribution of natural conversation and that of the designed text corpus. We first collect and refine text data from textbooks and websites. Next, a corpus is designed from the refined text data by using the similarity measure to compare phoneme distributions. We then construct a Korean stereo super-wideband speech (K-SW) database using the designed text corpus, with the recording environment set to meet the conditions defined by ITU-T. Finally, the subjective quality of the K-SW database is evaluated using an ITU-T super-wideband codec to demonstrate that the K-SW database is useful for developing and evaluating super-wideband codecs.
- 2011-07-01
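The selection step described in the abstract can be illustrated with a small sketch. The abstract does not state which similarity measure or search strategy the authors use, so the cosine similarity, the greedy selection loop, and all names below (`distribution`, `cosine_similarity`, `select_sentences`, the toy phoneme symbols) are assumptions made for illustration only, not the paper's method.

```python
from collections import Counter
import math


def distribution(counts):
    """Normalize raw phoneme counts into relative frequencies."""
    total = sum(counts.values())
    return {k: v / total for k, v in counts.items()} if total else {}


def cosine_similarity(p, q):
    """Cosine similarity between two phoneme distributions (assumed measure)."""
    keys = set(p) | set(q)
    dot = sum(p.get(k, 0.0) * q.get(k, 0.0) for k in keys)
    norm_p = math.sqrt(sum(v * v for v in p.values()))
    norm_q = math.sqrt(sum(v * v for v in q.values()))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0


def select_sentences(candidates, target, n_select):
    """Greedily pick sentences whose pooled phoneme distribution
    moves closest to the target distribution (hypothetical strategy).

    candidates: list of sentences, each a list of phoneme symbols.
    target: dict mapping phoneme symbol to relative frequency.
    """
    selected = []
    counts = Counter()
    remaining = list(candidates)
    for _ in range(min(n_select, len(remaining))):
        # Choose the sentence that maximizes similarity once added.
        best = max(
            remaining,
            key=lambda s: cosine_similarity(
                distribution(counts + Counter(s)), target
            ),
        )
        selected.append(best)
        counts += Counter(best)
        remaining.remove(best)
    return selected, cosine_similarity(distribution(counts), target)


# Toy data: a made-up target distribution standing in for the phoneme
# statistics of natural conversation, and four made-up candidate sentences.
target = {"a": 0.5, "k": 0.25, "s": 0.25}
candidates = [["a", "k"], ["a", "s"], ["k", "k", "k"], ["s", "s", "s"]]
selected, score = select_sentences(candidates, target, n_select=2)
```

A greedy loop is used here only because it is short and easy to follow; it does not guarantee the best possible subset, and a real design would work on phoneme transcriptions of the refined text rather than toy symbol lists.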
Authors
-
OH Yoo
School of Information and Communications, Gwangju Institute of Science and Technology (GIST)
-
KIM Hong
School of Information and Communications, Gwangju Institute of Science and Technology (GIST)
-
Bae Hyun
ETRI
-
Kim Hong
Gwangju Inst. Sci. And Technol. Gwangju Kor
-
Kim Hong
Dept. Of Information And Communications Gwangju Institute Of Science And Technology
-
LEE Mi
ETRI
-
Kim Hong
School Of Electronics And Information Engineering Cheongju University
-
Oh Yoo
School Of Information And Communications Gwangju Institute Of Science And Technology (GIST)
-
Kim Yong
School Of Information And Communications Gwangju Institute Of Science And Technology (GIST)
-
Oh Yoo
School Of Information And Communications Gwangju Inst. Of Sci. And Technol. (GIST)
-
Lee Mi
IT Convergence Technology Research Laboratory Electronics And Telecommunications Research Institute
-
Bae Hyun
IT Convergence Technology Research Laboratory Electronics And Telecommunications Research Institute
-
KIM Mina
School of Information and Communications, Gwangju Institute of Science and Technology (GIST)
-
Kim Mina
School Of Information And Communications Gwangju Institute Of Science And Technology (GIST)
-
Kim Hong
School of Electronic and Information Engineering, Cheongju University, 36 Naedok-Dong Sangdang-Gu, Cheongju, Chungbuk 360-764, Korea
Related papers
- Correlation between organic chemical reaction and chemical shift in carbon-doped silicon oxide film (Electron devices: 15th Asia-Pacific Workshop on Fundamentals and Applications of Advanced Semiconductor Devices (AWAD2007))
- Correlation between organic chemical reaction and chemical shift in carbon-doped silicon oxide film (Silicon devices and materials: 15th Asia-Pacific Workshop on Fundamentals and Applications of Advanced Semiconductor Devices (AWAD2007))
- A Hybrid Acoustic and Pronunciation Model Adaptation Approach for Non-native Speech Recognition
- A Statistical Approach to Error Compensation in Spectral Quantization(Speech and Hearing)
- Bandwidth-Scalable Stereo Audio Coding Based on a Layered Structure
- The Loss Kinetics of Substitutional Carbon in Si1-xCx Regrown by Solid Phase Epitaxy
- A205 NUMERICAL STUDY ON TRIBRACHIAL FLAME PROPAGATION IN A 2-D MIXING LAYER(Laminar flame-1)
- Phonetically Balanced Text Corpus Design Using a Similarity Measure for a Stereo Super-Wideband Speech Database
- Correlation of Grain Size of Pentacene-Deposited Surface and Carbon Content Analyzed by X-ray Photoelectron Spectroscopy
- The Loss Kinetics of Substitutional Carbon in Si1-xCx Regrown by Solid Phase Epitaxy