Fraunhofer-Gesellschaft

Publica

Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Hybrid person detection and tracking in H.264/AVC video streams

 
: Wojaczek, Philipp; Laumer, M.; Amon, P.; Hutter, A.; Kaup, A.

Braz, José (Ed.); Battiato, Sebastiano (Ed.); Imai, Francisco (Ed.) ; Institute for Systems and Technologies of Information, Control and Communication -INSTICC-, Setubal:
10th International Conference on Computer Vision Theory and Applications, VISAPP 2015. Proceedings. Vol.I : Berlin, Germany, 11 - 14 March 2015; Part of VISIGRAPP, the 10th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
SciTePress, 2015
ISBN: 978-989-758-089-5
S.478-485
International Conference on Computer Vision Theory and Applications (VISAPP) <10, 2015, Berlin>
International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP) <10, 2015, Berlin>
Englisch
Konferenzbeitrag
Fraunhofer FHR ()

Abstract
In this paper we present a new hybrid framework for detecting and tracking persons in surveillance video streams compressed according to the H.264/AVC video coding standard. The framework consists of three stages and operates in both the compressed and the pixel domain of the video. The combination of compressed and pixel domain represents the hybrid character. Its main objective is to significantly reduce the amount of computation required, in particular for frames and image regions with few people present. In its first stage the proposed framework evaluates the header information for each compressed frame in the video sequence, namely the macroblock type information. This results in a coarse binary mask segmenting the frame into foreground and background. Only the foreground regions are processed further in the second stage that searches for persons in the image pixel domain by applying a person detector based on the Implicit Shape Model. The third stage segments each detected person further with a newly developed method that fuses information from the first two stages. This helps obtaining a finer segmentation for calculating a color histogram suitable for tracking the person using the mean shift algorithm. The proposed framework was experimentally evaluated on a publicly available test set. The results demonstrate that the proposed framework reliably separates frames with and without persons such that the computational load is significantly reduced while the detection performance is kept.

: http://publica.fraunhofer.de/dokumente/N-410696.html