Phrase Recognition in Conversational Speech Using Prosodic and Phonemic Information (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Abstract
In this paper, a new scheme for phrase recognition in conversational speech is proposed, in which prosodic and phonemic information processing are combined. This approach is employed both to produce candidates of phrase boundaries and to discriminate phonemes. The fundamental frequency patterns of continuous utterances are statistically analyzed, and the likelihood of the occurrence of a phrase boundary is calculated for every frame. At the same time, the likelihood of the phonemic characteristics of each frame is obtained using a hierarchical clustering method. These two scores, along with lexical and grammatical constraints, can be used to develop possible word sequences or word lattices that correspond to the continuous speech utterances. Our preliminary experiment shows the feasibility of applying prosody to continuous speech recognition, especially for conversational-style utterances.
- Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 1993-01-25
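
The abstract mentions a phrase-boundary likelihood computed for every frame, which is then used to produce boundary candidates. As a rough illustration only, the sketch below picks candidate frames as local maxima of a per-frame score that also exceed a threshold. The function name `boundary_candidates`, the threshold rule, and the toy scores are assumptions made for this example; the record does not describe how the paper actually selects candidates.

```python
from typing import List, Sequence


def boundary_candidates(
    boundary_loglik: Sequence[float],
    threshold: float,
) -> List[int]:
    """Return frame indices whose score is a local maximum and >= threshold.

    Hypothetical selection rule for illustration; not the authors' method.
    """
    candidates = []
    # The first and last frames are skipped because they lack two neighbours.
    for t in range(1, len(boundary_loglik) - 1):
        score = boundary_loglik[t]
        is_peak = score > boundary_loglik[t - 1] and score >= boundary_loglik[t + 1]
        if score >= threshold and is_peak:
            candidates.append(t)
    return candidates


# Toy per-frame log-likelihoods (made up for illustration, not data from the paper).
scores = [-5.0, -4.0, -1.0, -3.0, -6.0, -2.0, -0.5, -0.7, -4.0]
print(boundary_candidates(scores, threshold=-1.5))
```

In a fuller system these candidate frames would be scored together with the per-frame phonemic likelihoods and the lexical and grammatical constraints; that combination is not sketched here because the record gives no details of it.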
Authors
- Shirai Katsuhiko, School of Science and Engineering, Waseda University
- Okawa Shigeki, School of Science and Engineering, Waseda University
- Kobayashi Tetsunori, School of Science and Engineering, Waseda University
- Kobayashi T, Waseda Univ. Tokyo
- Endo Takashi, School of Science and Engineering, Waseda University
- Endo Takashi, School of Integrated Design Engineering, Keio University
Related papers
- A 110-MHz/1-Mb Synchronous TagRAM (Special Section on the 1993 VLSI Circuits Symposium (Joint Issue with the IEEE Journal of Solid-State Circuits, Vol.29, No.4 April 1994))
- A Current-Controlled Latch Sense Amplifier and a Static Power-Saving Input Buffer for Low-Power Architecture
- Adaptive Transmit Permission Probability Control in CDMA Cellular Packet Communications with Site Diversity
- Service Fairness in CDMA Cellular Packet Systems with Site Diversity Reception (Special Issue on Multimedia Mobile Communication Systems)
- Development of a Lip-Sync Algorithm Based on an Audio-Visual Corpus
- An Efficient Lip-Reading Method Robust to Illumination Variations
- Design and Creation of Speech and Text Corpora of Dialogue (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- Airborne Dual-Frequency Polarimetric and Interferometric SAR (Special Issue on Advances in Radar Systems)
- Non-Gaussianity of ocean images by SAR (Synthetic Aperture Radar, ICSANE 2010 (International Conference on Space, Aeronautical and Navigational Electronics))
- Dependency of Backscattering from Ocean Surface on Wind Direction by using Pi-SAR: Low wind speed case (WSANE 2009 (Workshop for Space, Aeronautical and Navigational Electronics))
- Dependency of Backscattering from Ocean Surface on Ocean Winds Observed by an Airborne SAR (WSANE 2008 (Workshop for Space, Aeronautical and Navigational Electronics))
- A Flexible Search Managing Circuitry for High-Density Dynamic CAMs (Special Section on High Speed and High Density Multi Functional LSI Memories)
- An Experimental SAR Estimation of Human Head Exposure to UHF Near Fields Using Dry-Phantom Models and a Thermograph (Special Issue on Biological Effects of Electromagnetic Fields)
- A Bitline Control Circuit Scheme and Redundancy Technique for High-Density Dynamic Content Addressable Memories (Special Issue on LSI Memories)
- Conversation Robot Participating in Group Conversation (Special Issue on the 2001 IEICE Excellent Paper Award)
- Extension of Hidden Markov Models for Multiple Candidates and Its Application to Gesture Recognition (Image Recognition, Computer Vision)
- Recognizing Reverberant Speech Based on Amplitude and Frequency Modulation
- Sounds of Speech Based Spoken Document Categorization: A Subword Representation Method (Speech Dynamics by Ear, Eye, Mouth and Machine)
- Study of vibration-assisted micro-EDM-The effect of vibration on machining time and stability of discharge
- Extraction of Human Face and Transformable Region by Facial Expression Based on Extended Labeled Graph Matching