Segmentation of the Speaker's Face Region with Audiovisual Correlation
スポンサーリンク
概要
- 論文の詳細を見る
The ability to find the speakers face region in a video is useful for various applications. In this work, we develop a novel technique to find this region within different time windows, which is robust against the changes of view, scale, and background. The main thrust of our technique is to integrate audiovisual correlation analysis into a video segmentation framework. We analyze the audiovisual correlation locally by computing quadratic mutual information between our audiovisual features. The computation of quadratic mutual information is based on the probability density functions estimated by kernel density estimation with adaptive kernel bandwidth. The results of this audiovisual correlation analysis are incorporated into graph cut-based video segmentation to resolve a globally optimum extraction of the speakers face region. The setting of any heuristic threshold in this segmentation is avoided by learning the correlation distributions of speaker and background by expectation maximization. Experimental results demonstrate that our method can detect the speakers face region accurately and robustly for different views, scales, and backgrounds.
- (社)電子情報通信学会の論文
- 2010-07-01
著者
-
Sato Yoichi
Institute of Industrial Science, The University of Tokyo
-
Liu Yuyu
Institute Of Industrial Science The University Of Tokyo
-
Sato Yoichi
Institute Of Industrial Science The University Of Tokyo
-
Sato Yoichi
Institute For Molecular Science
関連論文
- Reflectance Estimation under Complex Illumination
- Reconstruction an object's shape from its appearance manifold under moving light (コンピュータビジョンとイメージメディア)
- Laser Emission under ^4F_ and ^4F_ Pumping in Nd : LSB Micro-Laser
- Segmentation of the Speaker's Face Region with Audiovisual Correlation
- On the in situ estimation of surface acoustic impedance in interiors of arbitrary shape by acoustical inverse methods
- An MDL Approach to Learning Activity Grammars(Gestures)
- 環境への自動適応を伴うアピアランスベース頭部姿勢推定(テーマセッション,コンピュータビジョンとパターン認識のための機械学習)
- 環境への自動適応を伴うアピアランスベース頭部姿勢推定(テーマセッション,コンピュータビジョンとパターン認識のための機械学習)
- Driving Feature Extraction from High and Low Skilled Drivers in Curve Sections Based on Machine Learning
- Optimal Sampling for Efficient BRDF Acquisition
- Optimal Sampling for Efficient BRDF Acquisition