Fraunhofer-Gesellschaft

Publica

Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

An approach to improve detection in scenes with varying object densities in remote sensing

 
: Michel, Andreas; Mispelhorn, Jonas; Schenkel, Fabian; Gross, Wolfgang; Middelmann, Wolfgang

:

Bruzzone, L. ; Society of Photo-Optical Instrumentation Engineers -SPIE-, Bellingham/Wash.:
Image and Signal Processing for Remote Sensing XXVI : 21-25 September 2020, Online Only, United Kingdom
Bellingham, WA: SPIE, 2020 (Proceedings of SPIE 11533)
ISBN: 978-1-5106-3879-2
ISBN: 978-1-5106-3880-8
Paper 115330I, 8 S.
Conference "Image and Signal Processing for Remote Sensing" <26, 2020, Online>
Englisch
Konferenzbeitrag
Fraunhofer IOSB ()
Density Region Proposal; Crowd Counting; object detection

Abstract
In the last decades, the amount of data obtained from electro-optical sensor systems has been steadily increasing in remote sensing (RS). Manual analysis of remote sensing images is a time-consuming task. Therefore, machine learning methods for detection and classification have become an appealing field of RS. In particular, the family of region convolutional neural networks (R-CNN) shows considerable success in different RS tasks. Advanced RCNN methods are multistage approaches, where first objects are detected and secondly classified with an optional segmentation step. However, the detection performance of advanced R-CNN algorithms suffers in areas with noticeably varying object densities and scales. Advanced R-CNN architectures usually consist of a detector stage and multiple heads. In the detector stage, regions of interest (ROI) are proposed and filtered by a non-maximum suppression (NMS) layer. In an area with a high density of objects, a strictly adjusted NMS may lead to missed detections. In contrast, a low threshold value for NMS can cause multiple overlapping detections for large objects. To address this challenge, we present our approach improving the results of object detector methods in scenes with varying densities of objects. Therefore, we add an encoder-decoder based density estimation network to our detector network to obtain the location of high-density areas. For these locations, additional fine detection of objects is performed. In order to exhibit the effectiveness of our approach, we evaluate our method on common crowd counting and object detection datasets.

: http://publica.fraunhofer.de/dokumente/N-618707.html