Formant frequency estimation of high-pitched speech by homomorphic prediction
スポンサーリンク
概要
- 論文の詳細を見る
The conventional model of the linear prediction analysis suffers from difficulties in estimating vocal tract characteristics of high-pitched speakers. This is because the autocorrelation function used by the autocorrelation method of linear prediction for estimating autoregressive coefficients is actually an "aliased" version of that of the vocal tract impulse response. This "aliasing" occurs due to the periodic nature of voiced speech. Generally it is accepted that homomorphic filtering can be used to obtain an estimate of vocal tract impulse response which is free from periodicity. Thus linear prediction of the resulting vocal tract impulse response (referred to as homomorphic prediction) is expected to be free from variations of fundamental frequencies. To our knowledge any experimental study, however, has not yet appeared on the suitability of this method for analyzing high-pitched speech. This paper presents a detail study on the prospects of homomorphic prediction as a formant tracking tool especially for high-pitched speech where linear prediction fails to obtain accurate estimation. The formant frequencies estimated using the proposed method are found to be accurate by more than an order of magnitude compared to the conventional procedure. The accuracy of formant estimation is verified on synthetic vowels for a wide range of pitch periods covering typical male and high-pitched female speakers. The validity of the proposed method is also examined by inspecting the spectral envelopes of natural speech spoken by high-pitched female speakers. We noticed that almost all the previous methods dealing with this limitation of linear prediction are based on the covariance technique where the obtained AR filter can be unstable. The solutions obtained by the current method are guaranteed to be stable which makes it superior for many speech analysis applications.
- 社団法人日本音響学会の論文
著者
-
Rahman Shahidur
Department Of Crop Botany Bangladesh Agricultural University
-
Rahman Shahidur
Department Of Information And Computer Sciences Saitama University
-
SHIMAMURA Tetsuya
Department of Information and Computer Sciences, Saitama University
-
Shimamura Tetsuya
Department Of Information And Computer Sciences Saitama University
関連論文
- Pretreatment with a Low Concentration of Methyl Viologen Decreases the Effects of Salt Stress on Chloroplast Ultrastructure in Rice Leaves (Oryza sativa L.)(Cell Biology)
- Spectrum Estimation by Noise-Compensated Data Extrapolation(Digital Signal Processing)
- Coefficients--Delay Simultaneous Adaptation Scheme for Linear Equalization of Nonminimum Phase Channels(Digital Signal Processing)
- Equalizer-Aided Time Delay Tracking Based on L_1-Normed Finite Differences(Digital Signal Processing)
- A New Method of Noise Variance Estimation from Low-Order Yule-Walker Equations (Digital Signal Processing)
- Identification of ARMA Speech Models Using an Effective Representation of Voice Source(Speech and Hearing)
- Formant frequency estimation of high-pitched speech by homomorphic prediction
- Noise Estimation Using High Frequency Regions for Spectral Subtraction