Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Residual vs. inception vs. classical networks for low-resolution face recognition

: Herrmann, C.; Willersinn, Dieter; Beyerer, Jürgen

Postprint urn:nbn:de:0011-n-4460088 (1.1 MByte PDF)
MD5 Fingerprint: d0f9f2e451138ffba8de280861aaa2f3
Created on: 7.6.2017

Sharma, Puneet (Ed.):
Image Analysis : 20th Scandinavian Conference, SCIA 2017 : Tromsø, Norway, June 12-14, 2017, Proceedings, Part I
Cham: Springer International Publishing, 2017 (Lecture Notes in Computer Science 10269)
ISBN: 978-3-319-59125-4
ISBN: 3-319-59125-8
Scandinavian Conference on Image Analysis (SCIA) <20, 2017, Tromsø/Norway>
Conference Paper
Fraunhofer IOSB ()

When analyzing surveillance footage, low-resolution face recognition is still a challenging task. While high-resolution face recognition experienced impressive improvements by Convolutional Neural Network (CNN) approaches, the benefit to low-resolution face recognition remains unclear as only few work has been done in this area. This paper adapts three popular high-resolution CNN designs to the low-resolution (LR) domain to find the most suitable architecture. Namely, the classical AlexNet/VGG architecture, Google’s inception architecture and Microsoft’s residual architecture are considered. While the inception and residual concept have been proven to be useful for very deep networks, it is shown in our case that shallower networks than for high-resolution recognition are sufficient. This leads to an advantage of the classical network architecture. Final evaluation on a downscaled version of the public YouTube Faces Database indicates a comparable performance to the high-resolution domain. Results with faces extracted from the SoBiS surveillance dataset indicate a superior performance of the trained networks in the LR domain.