Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Learning spatial interest regions from videos to inform action recognition in still images

: Eweiwi, A.; Cheema, M.S.; Bauckhage, C.

Volltext (PDF; )

Seidl, T.:
16th LWA Workshops: KDML, IR and FGWM 2014. Proceedings : Aachen, Germany, September 8-10, 2014
Aachen, 2014 (CEUR Workshop Proceedings 1226)
Conference "Learning, Knowledge, Adaptation" (LWA) <16, 2014, Aachen>
Workshop "Knowledge Discovery, Data Mining and Machine Learning" (KDML) <2014, Aachen>
Konferenzbeitrag, Elektronische Publikation
Fraunhofer IAIS ()

Common approaches to human action recognition from images rely on local descriptors for classification. Typically, these descriptors are computed in the vicinity of key points which either result from running a key point detector or from dense or random sampling of pixel coordinates. Such key points are not a-priori related to human activities and thus of limited information with regard to action recognition. In this paper, we propose to identify action-specific key points in images using information available from videos. Our approach does not require manual segmentation or templates but applies non-negative matrix factorization to optical flow fields extracted from videos. The resulting basisflows are found to to be indicative of action specific image regions and therefore allow for an informed sampling of key points. We also present a generative model that allows for characterizing joint distributions of regions of interest and a human actions. In practical experiments, we determine correspondences between regions of interest that were automatically learned from videos and manually annotated locations of human body parts available from independent benchmark image data sets. We observe high correlations between learned interest regions and body parts most relevant for different actions.