Human language identification with reduced segmental information
スポンサーリンク
概要
- 論文の詳細を見る
We conducted human language identification experiments using signals with reduced segmental information with Japanese and bilingual subjects. American English and Japanese excerpts from the OGI Multi-Language Telephone Speech Corpus were processed by spectral-envelope removal (SER), vowel extraction from SER (VES) and temporal-envelope modulation (TEM). The processed excerpts of speech were provided as stimuli for perceptual experiments. We calculated D indices from the subjects' responses, ranging from -2 to +2 where positive/negative values indicate correct/incorrect responses, respectively. With the SER signal, where the spectral-envelope is eliminated, humans could still identify the languages fairly successfully. The overall D index of Japanese subjects for this signal was + 1.17. With the VES signal, which retains only vowel sections of the SER signal, the D index was lower (+0.35). With the TEM signal, composed of white-noise-driven intensity envelopes from several frequency bands, the D index rose from +0.29 to +1.69 corresponding to the increasing number of bands from 1 to 4. Results varied depending on the stimulus language. Japanese and bilingual subjects scored differently from each other. These results indicate that humans can identify languages using signals with drastically reduced segmental information. The results also suggest variation due to the phonetic typologies of languages and subjects' knowledge.
- 社団法人日本音響学会の論文
著者
-
Arai Takayuki
Department Of Chemistry Faculty Of Engineering Gunma University
-
Komatsu Masahiko
Department Of Electrical And Electronics Engineering Sophia University:department Of Linguistics Uni
-
MORI Kazuya
Department of Immunology and Microbiology, Meiji University of Oriental Medicine
-
Murahara Yuji
Department Of Electrical And Electronics Engineering Sophia University
-
Aoyagi Makiko
Center for the Teaching of Foreign Languages, Sophia University
-
Mori Kazuya
Department Of Electrical And Electronics Engineering Sophia University
-
Mori Kazuya
Department Of Chemical Engineering Graduate School Of Engineering Hiroshima University
-
Aoyagi Makiko
Center For The Teaching Of Foreign Languages Sophia University
関連論文
- Perception of speaker identity and its relation to the phonological features (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Inverse correlation of intelligibility of speech in reverberation with the amount of overlap-masking(ACOUSTICAL LETTER)
- Decreasing speaking-rate with steady-state suppression to improve speech intelligibility in reverberant environments
- The Effects of Speech-Rate Slowing for Improving Speech Intelligibility in Reverberant Environments (国際ワークショップ Frontiers in Speech and Hearing Research)
- Suppressing steady-state portions of speech for improving intelligibility in various reverberant environments
- Effects of suppressing steady-state portions of speech on intelligibility in reverberant environments
- Synthesis and Properties of Fluorine-Containing Poly(arylenemethylene)s as New Heat Resistant Denatured Phenolic Resins
- Synthesis and Brain Regional Distribution of [^C]NPS 1506 in Mice and Rat: an N-Methyl-D-aspartate (NMDA) Receptor Antagonist(Medicinal Chemistry)
- Speech Processing for Hearing-Impaired Listeners Considering Threshold Elevation in the Critical Band with an Expanded Auditory Filter (国際ワークショップ Frontiers in Speech and Hearing Research)
- Improving Speech Intelligibility for Elderly Listeners by Steady-State Suppression (国際ワークショップ Frontiers in Speech and Hearing Research)
- Perception of long vowels in Japanese by Children
- Inactivation of Rat Cytochrome P450 2D Enzyme by a Further Metabolite of 4-Hydroxypropranolol, the Major and Active Metabolite of Propranolol
- Gel-type tongue for a physical model of the human vocal tract as an educational tool in acoustics of speech production
- Effects of linguistic contents on perceptual speaker identification : Comparison of familiar and unknown speaker identifications
- Comparison of consonant identification improvements by steady-state suppression via a loudspeaker system between with and without natural sounds from a talker in reverberation(Commemoration of the Japan-China Joint Conference on Acoustics 2
- The effect of pre-processing approach for improving speech intelligibility in a hall : Comparison between diotic and dichotic listening conditions
- Idiosyncrasy of nasal sounds in human speaker identification and their acoustic properties
- Visualization of Brain Activities of Single-Trial and Averaged Multiple-Trials MEG Data(Neuro, Fuzzy, GA)(Nonlinear Theory and its Applications)
- Digital pattern playback : Converting spectrograms to sound for educational purposes(introduction to the amazing world of sounds with demonstrations)
- Implementation of Steady-State Suppression Using a Digital Signal Processor for Real-Time Processing--Evaluation of the Processing in an Actual Hall (国際ワークショップ Frontiers in Speech and Hearing Research)
- Steady-state suppression for improving syllable identification in reverberant environments : A case study in an elderly person
- Critical-band based frequency compression for digital hearing aids
- Effect of the Carbamoyl Group Attached to an Axial Ligand Portion of a Novel Bleomycin Model on a Dioxygen Activating Reaction
- Distal Effect of Amide and Amino Groups on the Oxygen Activation Ability and Rate of the Redox Reaction of Simplified Analogs of Bleomycin
- Demonstrations for education in acoustics in Japan(introduction to the amazing world of sounds with demonstrations)
- The perivascular space as a path of hematopoietic progenitor cells and mature T cells between the blood circulation and the thymic parenchyma
- Comparing the characteristics of the plate and cylinder type vocal tract models
- Speech perception experiment using binaural integration of phonemic and prosodic information
- Analysis of spontaneous Japanese in a multi-language telephone-speech corpus
- Modulation cepstrum discriminating between speech and environmental noise
- Education system in acoustics of speech production using physical models of the human vocal tract(Applied Systems)
- Preparation of Thermosensitive Microgel Adsorbent for Quick Adsorption of Heavy Metal Ions by a Temperature Change
- Padding zero into steady-state portions of speech as a preprocess for improving intelligibility in reverberant environments
- Processing of consonant clusters by Japanese native speakers: Influence of English learning backgrounds
- Masking speech with its time-reversed signal
- Human language identification with reduced segmental information
- Effects of stimulus contents and speaker familiarity on perceptual speaker identification
- Sliding three-tube model as a simple educational tool for vowel production(introduction to the amazing world of sounds with demonstrations)
- Lung model and head-shaped model with visible vocal tract as educational tools in acoustics
- Moxibustion activates host defense against herpes simplex virus type I through augmentation of cytokine production
- What Is Rhythm? Can We Capture Syllable Shapes From Intensity Contours? (国際ワークショップ Frontiers in Speech and Hearing Research)
- Automatic Language Identification Using Sequential Information of Phonemes
- One-pot Synthesis of Permethylated α-CD-based Rotaxanes Having Alkylene Chain Axles and Their Structural Characteristics
- B2-3. Production variation of English schwa and Japanese listeners' perceptual assimilation pattern of English schwa(Summaries of Talks at the 26^ General Meeting)
- Identification of English voiceless fricatives in multispeaker babble noise by native Japanese and English listeners: Influence of English proficiency