Here you will find scientific publications from the Fraunhofer Institutes.

Deep learning based multi-category object detection in aerial images

Authors: Sommer, L.W.; Schuchert, Tobias; Beyerer, Jürgen

Full text: urn:nbn:de:0011-n-4614082 (16 MByte PDF)
MD5 Fingerprint: feae3e8dd47f9210b3c77dc950935a4c
Copyright Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited.
Created on: 27 March 2018

Sadjadi, Firooz A. (Ed.) ; Society of Photo-Optical Instrumentation Engineers -SPIE-, Bellingham/Wash.:
Automatic Target Recognition XXVII : 10-11 April 2017, Anaheim, California, United States
Bellingham, WA: SPIE, 2017 (Proceedings of SPIE 10202)
ISBN: 978-1-5106-0905-1
ISBN: 978-1-5106-0906-8
Paper 1020209, 8 pp.
Conference "Automatic Target Recognition" <27, 2017, Anaheim/Calif.>
Conference paper, electronic publication
Fraunhofer IOSB
deep learning; multi-category object detection; aerial imagery

Multi-category object detection in aerial images is an important task for many applications such as surveillance, tracking, or search and rescue. In recent years, deep learning approaches using features extracted by convolutional neural networks (CNNs) have significantly improved detection accuracy on detection benchmark datasets compared to traditional approaches based on hand-crafted features, as used for object detection in aerial images. However, these approaches cannot be transferred directly to aerial images, because the feature maps of the network architectures used have insufficient resolution for handling small instances. This results in poor localization accuracy or missed detections, since the network architectures were explored and optimized for datasets that differ considerably from aerial images, in particular in object size and in the image fraction occupied by an object. In this work, we propose a deep neural network derived from the Faster R-CNN approach for multi-category object detection in aerial images. We show how the detection accuracy can be improved by replacing the network architecture with an architecture especially designed for handling small object sizes. Furthermore, we investigate the impact of different parameters of the detection framework on the detection accuracy for small objects. Finally, we demonstrate the suitability of our network for object detection in aerial images by comparing it to traditional baseline approaches and deep learning based approaches on the publicly available DLR 3K Munich Vehicle Aerial Image Dataset, which comprises multiple object classes such as car, van, truck, bus and camper.
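
The feature-map resolution argument in the abstract can be illustrated with simple arithmetic. The following is a minimal sketch, not the authors' method: it assumes a feature-map stride of 16 pixels, the value commonly associated with a VGG-16 based Faster R-CNN, and uses hypothetical object sizes. The abstract states neither strides nor object dimensions, so all numbers here are illustrative assumptions.

```python
def cells_covered(size_px: float, stride: int) -> float:
    """Approximate number of feature-map cells an object spans along one axis.

    `stride` is the total downsampling factor between the input image and
    the feature map used for detection (assumed, not taken from the paper).
    """
    return size_px / stride


def footprint(width_px: float, height_px: float, stride: int) -> tuple:
    """Approximate (width, height) of an object measured in feature-map cells."""
    return (cells_covered(width_px, stride), cells_covered(height_px, stride))


# Hypothetical object sizes in pixels (width, height); illustrative only.
objects = {
    "small vehicle in an aerial image": (30, 12),
    "object in a ground-level benchmark image": (200, 150),
}

# Stride 16 is the assumed baseline; 8 and 4 stand in for architectures
# that keep a finer feature-map resolution.
for stride in (16, 8, 4):
    for name, (w, h) in objects.items():
        fw, fh = footprint(w, h, stride)
        print(f"stride {stride:2d} | {name}: {fw:.2f} x {fh:.2f} cells")
```

Under these assumptions, the small vehicle spans less than two cells in width and less than one cell in height at stride 16, which is consistent with the poor localization and missed detections described in the abstract. A smaller stride gives the same object a larger footprint on the feature map.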