A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information
Abstract
Speech recognition methods that fuse visual and auditory information have recently been studied. This paper investigates which mouth shape images are suitable for such fusion. Mouth shape features extracted from gray-level images and from binary images are adopted, and speech recognition is performed using a linear combination method. Based on the recognition results, the paper examines which mouth shape features are effective for fusing visual and auditory information, and confirms the effectiveness of using the two kinds of mouth shape features together.
- Paper published by the Institute of Electronics, Information and Communication Engineers (IEICE)
- 1995-11-25
Authors
- Hayashi Y (College Of Engineering University Of Osaka Prefecture)
- OGIHARA Akio (College of Engineering, University of Osaka Prefecture)
- TAKAMATSU Shinobu (College of Engineering, University of Osaka Prefecture)
- Shintani Akira (The College Of Engineering Osaka Prefecture University)
- Hayashi Yasuhisa (College Of Engineering University Of Osaka Prefecture)
- Ogihara A (Osaka Prefecture Univ. Sakai‐shi Jpn)
- Ogihara Akio (College Of Engineering Osaka Prefecture University)
- Doi Nobuhiro (Graduate School Of Information Production And Systems Waseda University)
- Doi Naoshi (College of Engineering, University of Osaka Prefecture)
- Shintani Akira (College of Engineering, University of Osaka Prefecture)
- Doi Naoshi (Graduate School Of Information Production And Systems Waseda University)
- Hayashi Yasuhisa (The College Of Engineering University Of Osaka Prefecture)
- Takamatsu Shinobu (The College Of Engineering Osaka Prefecture University)
- Takamatsu Shinobu (College Of Engineering University Of Osaka Prefecture)
Related papers
- An Analysis on Minimum Searching Principle of Chaotic Neural Network (Special Section of Selected Papers from the 8th Karuizawa Workshop on Circuits and Systems)
- Bit Length Optimization of Fractional Part on Floating to Fixed Point Conversion for High-Level Synthesis (Logic and High Synthesis) (VLSI Design and CAD Algorithms)
- A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information
- Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information : Special Section of Letters Selected from 1994 IEICE Spring Conference
- Speech Recognition of Isolated Digits Using Simultaneous Generative Histogram (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
- A Continuous Speech Recognition Algorithm Utilizing Island-Driven A* Search (Special Section of Letters Selected from the 1993 IEICE Spring Conference)
- Bit-Length Optimization Method for High-Level Synthesis Based on Non-linear Programming Technique (System Level Design, VLSI Design and CAD Algorithms)
- Asymmetric Neural Network and Its Application to Knapsack Problem
- An Improvement of the Pseudoinverse Rule with Diagonal Elements (Special Section of Papers Selected from JTC-CSCC'93)
- An Autocorrelation Associative Neural Network with Self-Feedbacks (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
- A Neural Network with a Function of Inhibiting Subtours on TSP (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
- Associative Neural Network Models Based on a Measure of Manhattan Length (Special Section on the 5th Karuizawa Workshop on Circuits and Systems)
- Speech Recognition Based on Fusion of Visual and Auditory Information Using Full-Frame Color Image (Special Section of Letters Selected from the 1996 IEICE General Conference)
- An Isolated Word Speech Recognition Using Fusion of Auditory and Visual Information (Special Section of Papers Selected from JTC-CSCC'95)
- Binary Neural Network with Negative Self-Feedback and Its Application to N-Queens Problem (Special Issue on Neurocomputing)
- A Theoretical Analysis of Neural Networks with Nonzero Diagonal Elements (Special Section on the 5th Karuizawa Workshop on Circuits and Systems)
- An Extraction Method of Lip Shape for Independent Speaker
- An Isolated Word Speech Recognition Based on Fusion of Visual and Auditory Information Using 30-frame/s and 24-bit Color Image (Special Section on Digital Signal Processing)
- A Correcting Method for Pitch Extraction Using Neural Networks (Special Section of Papers Selected from JTC-CSCC'93)