母音・半母音・有声破裂音の知覚におけるホルマント遷移の変化量と変化速度との間の相補性およびその識別機構 (<特集>聴覚)
スポンサーリンク
概要
- 論文の詳細を見る
Motion of Formant Frequency is often as essential cue for distinguishing phonemes in speech sounds. The purpose of this research is to examine the perceptual effect of change in amount and rate of formant frequency change and a signal processing in auditory system. Synthetic word-like sounds whose first formant frequency was changed systematically as shown in Fig. 8 and eq. (3. 1), whereas the other formants were kept constant, were used as the experimental material. Analysis of 7 listeners' responses in the hearing test of the word-like sounds shows the following findings : 1. These sounds are listened as one of /aba/, /awa/, /aua/, and /aaa/. 1. The perception of a stop consonant is influenced by the rate of frequency change and is relatively insensitive to change in the length of the central constant interval. The perception of semi-vowel and vowel, on the contrary, is influenced largely by the length of the central constant interval as well as the rate of frequency change. 3. If the rate of the upward transition is different from that of downward, a perceptual rate of formant motion is to be expected at some value between the upward and the downward rates. The upward transition has larger influence in the perception of stop consonant than the downward transition. 4. An increase in the rate of first formant frequency change reduces the amount of frequency change required to switch the identification to a particular consonant. With regard to this there is a mutually complementary effect between the amount and the rate of formant change. 5. The close examination for phoneme boundaries between /b/, /w/, /u/, and /a/ in these speech like sounds shows that the loci of phoneme boundaries obtained from listening tests can be represented by some hyperbolic curve in (?F, ?F) plane as follows ; (?F - ?F_θ) = C / (?F - ?F_θ) . . . . . . (4. 1) where ?F is the amount of transition of first formant frequency, ?F is the rate of transition of first formant frequency, ?F_θ, ?F_θ and C are constants depending upon which phonemes concern with. The fact that phoneme boundaris are given by eq. (4. 1) suggests that phonemes can be discriminated by extracting ?F, ?F and making a decision of whether the inquality : {(?F - ?F_θ) (?F - ?F_θ) - C } > 0 . . . . . . (6. 3) is satisfied. An operation for the decision must be also described in terms of "processed" signals in auditory domain something like a physiological or psychological quantity, and is supposed to be simpler in such a domain than in acoustic domain. A processor whose operation is defined by : f(t) = F(t) + 1 / 2R_M Σ^^R_M_ _R = 1 {K_1 exp (-T_c ・R) (F(t) - F(t - R)) + K_2 exp (-T_C・R) (F(t) -F(t - R))} . . . . . . (6. 5) has been postulated as a conversion for speech perception from a signal F(t) as a function of time t in acoustic domain to a signal f(t) in auditory domain. Effects of change in time span R_M, time constant T_C, and rate of transition of formant frequency in the input signal were examined by computer simulation. When the parameters have such values as : R_M = 8 (≒ 80 msec), T_C = 0. 15~0. 08 (≒ 100 mces), and K_1 = K_2 = 2, the phoneme boundary between /w/ and /b/ takes quite square shape in (?f, ?f - plane. From those findings, it is concluded that phoneme discrimination can be made for the processed signals by two simple threshold logics, one of which is for ?f and the other for ?f, instead of calculating rather complex ineq. (6. 3), and that the processor has short time memory of about 2R_M ( ≒ 160 msec), delayed response of about R_M (≒ 80 msec), and time constant T_C of about 100 msec.
- 社団法人日本音響学会の論文
- 1974-03-01
著者
関連論文
- 声道の3次元的形状の音響的影響の有限要素法による解析
- 声道の音響管モデルにおける鼻腔と口腔の結合の強さに関する検討
- 声道の音響管モデルの3次元有限要素法によるシミュレーション : 曲がり, 分岐, 及び断面形状の効果
- 第13回国際音響学会議
- 潜水時の高圧気体環境におけるコンデンサマイクロホンの特性
- 高圧ヘリウム空気中のグラスウール吸音材の音響特性
- 鼻腔・副鼻腔のMRI観測とその変形による鼻道音響伝達特性の検討
- 実体型鼻腔音響管の有限要素法を用いた解析
- 鼻腔の音響伝達特性のシミュレーションモデルによる検討
- MRIデータに基づく三次元実体型鼻腔モデルの有限要素法による解析
- MRIデータに基づく三次元実体型鼻腔モデルの有限要素法による解析
- 鼻腔・副鼻腔の3次元音響管モデルによるシミュレーション
- 潜水シミュレータ内の深度による背景騒音
- 潜水浮上時におけるダイバー血管中の気泡発生のパソコンによる検出システム(BUBSIPP)
- FEMモデルと実形モデルとの比較による梨状窩の音響特性の検討
- 第2回西太平洋地区音響学会議
- 線形予測分析法を用いた潜水浮上時における血液中の気泡検出
- 年令, 性別による日本語5母音のピッチ周波数とホルマント周波数の変化
- 状態ベクトルと自己相関係数による音声の記述と識別
- 音響管内にある分岐の開口端補正とその形状との関連について
- 3マイクロホンを用いた吸音材の音響特性の測定
- 230m有人潜水シミュレーション実験における聴力測定
- 高圧気体中のグラスウール吸音材の吸音特性
- 高圧気体環境における吸音材の吸音率について
- ヘリウム音声データベースの応用 : 音素の継続時間とバズバーについて
- 1チップDSPによる実時間ヘリウム音声修復装置の設計
- 破裂子音における口腔内圧及び放射音の測定とシミュレーション
- 声道内音圧及び頬の振動加速度による頬インピーダンスの測定
- DSP を用いた線形予測分析法による血液中の自動気泡検知(ABD)システム
- 第l5回国際音響学会議(ICA)
- ヘリウム音声
- 最近の高度難聴者用聴覚補助装置
- 母音・半母音・有声破裂音の知覚におけるホルマント遷移の変化量と変化速度との間の相補性およびその識別機構 (<特集>聴覚)