A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information

概要

論文の詳細を見る
Recently, some speech recognition methods using fusion of visual and auditory information have been researched. In this paper, a study on the mouth shape image suitable for fusion of visual and auditory information has been described. Features of mouth shape which are extracted from gray level image and binary image are adopted, and speech recognition using linear combination method has been performed. From results of speech recognition, the studies on the mouth shape features which are effective in fusion of visual and auditory information have been performed. And the effectiveness of using two kinds of mouth shape features also has been confirmed.
社団法人電子情報通信学会の論文
1995-11-25

著者

Hayashi Y
College Of Engineering University Of Osaka Prefecture
OGIHARA Akio
College of Engineering, University of Osaka Prefecture
TAKAMATSU Shinobu
College of Engineering, University of Osaka Prefecture
Shintani Akira
The College Of Engineering Osaka Prefecture University
Hayashi Yasuhisa
College Of Engineering University Of Osaka Prefecture
Ogihara A
Osaka Prefecture Univ. Sakai‐shi Jpn
Ogihara Akio
College Of Engineering Osaka Prefecture University
Doi Nobuhiro
Graduate School Of Information Production And Systems Waseda University
Doi Naoshi
College of Engineering, University of Osaka Prefecture
Shintani Akira
College of Engineering, University of Osaka Prefecture
Doi Naoshi
Graduate School Of Information Production And Systems Waseda University
Hayashi Yasuhisa
The College Of Engineering University Of Osaka Prefecture
Takamatsu Shinobu
The College Of Engineering Osaka Prefecture University
Takamatsu Shinobu
College Of Engineering University Of Osaka Prefecture

関連論文

An Analysis on Minimum Searching Principle of Chaotic Neural Network (Special Section of Selected Papers from the 8th Karuizawa Workshop on Circuits and Systems)
Bit Length Optimization of Fractional Part on Floating to Fixed Point Conversion for High-Level Synthesis(Logic and High Synthesis)(VLSI Design and CAD Algorithms)
A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information
Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information : Special Section of Letters Selected from 1994 IEICE Spring Conference
Speech Recognition of lsolated Digits Using Simultaneous Generative Histogram (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
A Continuous Speech Recognition Algorithm Utilizing Island-Driven A^* Search (Special Section of Letters Selected from the 1993 IEICE Spring Conference)
Bit-Length Optimization Method for High-Level Synthesis Based on Non-linear Programming Technique(System Level Design,VLSI Design and CAD Algorithms)
Asymmetric Neural Network and Its Application to Knapsack Problem
An Improvement of the Pseudoinverse Rule with Diagonal Elements (Special Section of Papers Selected from JTC-CSCC'93)
An Autocorrelation Associative Neural Network with Self-Feedbacks (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
A Neural Network with a Function of Inhibiting Subtours on TSP (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
Associative Neural Network Models Based on a Measure of Manhattan Length (Special Section on the 5th Karuizawa Workshop on Circuits and Systems)
Speech Recognition Based on Fusion of Visual and Auditory Information Using Full-Frame Color Image (Special Section of Letters Selected from the 1996 IEICE General Conference)
An Isolated Word Speech Recognition Using Fusion of Auditory and Visual Information (Special Section of Papers Selected from JTC-CSCC'95)
Binary Neural Network with Negative Self-Feedback and Its Application to N-Queens Problem (Special Issue on Neurocomputing)
A Theoretical Analysis of Neural Networks with Nonzero Diagonal Elements (Special Section on the 5th Karuizawa Workshop on Circuits and Systems)
An Extraction Method of Lip Shape for Independent Speaker
An Isolated Word Speech Recognition Based on Fusion of Visual and Auditory Information Using 30-frame/s and 24-bit Color Image (Special Section on Digital Signal Processing)
A Correcting Method for Pitch Extraction Using Neural Networks (Special Section of Papers Selected from JTC-CSCC'93)

A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information

スポンサーリンク

概要

著者

関連論文

スポンサーリンク