Speech Recognition Based on Fusion of Visual and Auditory Information Using Full-Frame Color Image (Special Section of Letters Selected from the 1996 IEICE General Conference)
スポンサーリンク
概要
- 論文の詳細を見る
We propose a method to fuse auditory information and visual information for accurate speech recognition. This method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. In addition, we use full-frame color image as visual information in order to improve the accuracy of the proposed speech recognition system. We have performed experiments comparing the proposed method with the method using either auditory in-formation or visual information, and confirmed the validity of the proposed method.
- 社団法人電子情報通信学会の論文
- 1996-11-25
著者
-
Shintani Akira
The College Of Engineering Osaka Prefecture University
-
Ogihara A
Osaka Prefecture Univ. Sakai‐shi Jpn
-
Ogihara Akio
the College of Engineering, University of Osaka Prefecture
-
IGAWA Satoru
the College of Engineering, Osaka Prefecture University
-
TAKAMATSU Shinobu
the College of Engineering, Osaka Prefecture University
-
Takamatsu Shinobu
The College Of Engineering Osaka Prefecture University
-
Igawa Satoru
The College Of Engineering Osaka Prefecture University
-
Ogihara Akio
the College of Engineering, Osaka Prefecture University
関連論文
- An Analysis on Minimum Searching Principle of Chaotic Neural Network (Special Section of Selected Papers from the 8th Karuizawa Workshop on Circuits and Systems)
- A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information
- Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information : Special Section of Letters Selected from 1994 IEICE Spring Conference
- Speech Recognition of lsolated Digits Using Simultaneous Generative Histogram (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
- A Continuous Speech Recognition Algorithm Utilizing Island-Driven A^* Search (Special Section of Letters Selected from the 1993 IEICE Spring Conference)
- Asymmetric Neural Network and Its Application to Knapsack Problem
- An Improvement of the Pseudoinverse Rule with Diagonal Elements (Special Section of Papers Selected from JTC-CSCC'93)
- An Autocorrelation Associative Neural Network with Self-Feedbacks (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
- A Neural Network with a Function of Inhibiting Subtours on TSP (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
- Associative Neural Network Models Based on a Measure of Manhattan Length (Special Section on the 5th Karuizawa Workshop on Circuits and Systems)
- Speech Recognition Based on Fusion of Visual and Auditory Information Using Full-Frame Color Image (Special Section of Letters Selected from the 1996 IEICE General Conference)
- An Isolated Word Speech Recognition Using Fusion of Auditory and Visual Information (Special Section of Papers Selected from JTC-CSCC'95)
- Binary Neural Network with Negative Self-Feedback and Its Application to N-Queens Problem (Special Issue on Neurocomputing)
- A Theoretical Analysis of Neural Networks with Nonzero Diagonal Elements (Special Section on the 5th Karuizawa Workshop on Circuits and Systems)