Modeling Interactions between Low-Level and High-Level Features for Human Action Recognition
スポンサーリンク
概要
- 論文の詳細を見る
Recognizing human action in complex scenes is a challenging problem in computer vision. Some action-unrelated concepts, such as camera position features, could significantly affect the appearance of local spatio-temporal features, and therefore the performance of low-level features based methods degrades. In this letter, we define the action-unrelated concept: the position of camera as high-level features. We observe that they can serve as a prior to local spatio-temporal features for human action recognition. We encode this prior by modeling interactions between spatio-temporal features and camera position features. We infer camera position features from local spatio-temporal features via these interactions. The parameters of this model are estimated by a new max-margin algorithm. We evaluate the proposed method on KTH, IXMAS and Youtube actions datasets. Experimental results show the effectiveness of the proposed method.
著者
-
SHAO Yunxue
State Key Laboratory of Management and Control for Complex Systems Institute of Automation, Chinese Academy of Sciences
-
WANG Chunheng
State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences
-
XIAO Baihua
State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences
-
ZHOU Wen
State Key Laboratory of Management and Control for Complex Systems Institute of Automation, Chinese Academy of Sciences
-
ZHANG Zhong
State Key Laboratory of Management and Control for Complex Systems Institute of Automation, Chinese Academy of Sciences
-
XIAO Baihua
State Key Laboratory of Management and Control for Complex Systems Institute of Automation, Chinese Academy of Sciences
-
WANG Chunheng
State Key Laboratory of Management and Control for Complex Systems Institute of Automation, Chinese Academy of Sciences
関連論文
- Modeling Interactions between Low-Level and High-Level Features for Human Action Recognition
- Nonlinear Metric Learning with Deep Independent Subspace Analysis Network for Face Verification