Appearance feature extraction versus image transform-based approach for visual speech recognition
スポンサーリンク
概要
- 論文の詳細を見る
In this paper we propose a new appearance based system which consists of two stages: visual speech feature extraction and classification, followed by recognition of the extracted feature, thereby the result is a complete lip-reading system. This lip-reading system employs our Hyper Column Model (HCM) approach to extract and classify the visual features and uses the Hidden Markov Model (HMM) for recognition. This paper addresses mainly the first stage; i.e. feature extraction and classification. We investigate the HCM performance to achieve feature extraction and classification and then compare the performance when replacing HCM with Fast Discrete Cosine Transform (FDCT). Unlike FDCT, HCM could extract the entire features without any loss. Also the experiments have shown that HCM is generally better than FDCT and provides a good distribution of the phonemes in the feature space for recognition purposes. For fair comparison, two databases are exploited with three different sets of resolution for each database. One of these two databases is designed to include shifted and scaled objects. Experiments reveal that HCM is capable of recovering and dealing with such image restrictions whereas the effectiveness of FDCT drops drastically especially for new subjects.
論文 | ランダム
- ジュニアクラブの運営の現状 (平成15年度日本体操競技・器械運動学会発表要旨) -- (シンポジウム2 日本のジュニア体操競技の現状と将来)
- 社会人体操競技選手の練習環境 (プロジェクト研究 体操競技に関する基本調査プロジェクト(第6報)わが国の社会人体操競技に関する基本調査)
- ジュニアにおける競技方法の分析と問題点 (プロジェクト研究 体操競技に関する基本調査(5)わが国のジュニア体操競技に関する基本調査)
- ジュニアクラブの練習環境の実態と問題点 (プロジェクト研究 体操競技に関する基本調査(5)わが国のジュニア体操競技に関する基本調査)
- ジェルソミーナとカビリア