An Efficient Lip-Reading Method Robust to Illumination Variations
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, for real-time automatic image transform based lip-reading under illumination variations, an efficient (smaller feature data size) and robust (better recognition under different lighting conditions) method is proposed. Image transform based approach obtains a compressed representation of image pixel values of speaker's mouth and is reported to show superior lip-reading performance. However, this approach inevitably produces large feature vectors relevant to lip information to require much computation time for lip-reading even when principal component analysis (PCA) is applied. To reduce the necessary dimension of feature vectors, the proposed method folded the lip image based on its symmetry in a frame image. This method also compensates the unbalanced illumination between the left and the right lip areas. Additionally, to filter out the inter-frame time-domain spectral distortion of each pixel contaminated by illumination noise, our method adapted the hi-pass filtering on the variations of pixel values between consecutive frames. In the experimental results performed on database recorded at various lighting conditions, the proposed lip-folding or/and inter-frame filtering reduced much the necessary number of feature data, principal components in this work, and showed superior recognition rate compared to the conventional method.
- 社団法人電子情報通信学会の論文
- 2002-09-01
著者
-
Shirai Katsuhiko
School Of Science And Engineering Waseda University
-
Shirai Katsuhiko
Department Of Information And Computer Science Waseda University
-
Lee J
Gyeongin National Univ. Education Inchon Kor
-
Kim Jinyoung
Department Of Electronics Computer And Information Eng. And Rrc Hecs Chon-nam National University
-
KIM Jinyoung
The Department of Electronics, Computer and Information Eng. and RRC HECS, Chonnam National Universi
-
LEE Joohun
The Department of Internet Broadcasting, Dong-Ah College
関連論文
- A Synchronous Completion Prediction Adder (SCPA)
- Development of a Lip-Sync Algorithm Based on an Audio-Visual Corpus
- An Efficient Lip-Reading Method Robust to Illumination Variations
- Phrase Recognition in Conversational Speech Using Prosodic and Phonemic Information (Special Issue on Speech and Discourse Processing in Dialogue Systems)
- A Robust Recursive Least Square Algorithm against Impulsive Noise(Digital Signal Processing)
- Recognizing Reverberant Speech Based on Amplitude and Frequency Modulation
- Sounds of Speech Based Spoken Document Categorization : A Subword Representation Method(Speech Dynamics by Ear, Eye, Mouth and Machine)
- Extraction of Human Face and Transformable Region by Facial Expression Based on Extended Labeled Graph Matching
- Linguistic Intelligent CAI System Using Speech Data-Base
- ANALYSIS OF PATH FLOW CHANGES CAUSED BY TRAFFIC INFORMATION PROVISION USING DYNAMIC PATH FLOW ESTIMATION