Identifying and Generating Edge Cases

Bunzel, Niklas; Göller, Nicolas; Frick, Raphael

doi:10.1145/3665451.3665529

2024

Conference Paper

Abstract

One of the main issues for deploying neural networks in fully autonomous applications, such as self-driving cars, is rarely occurring edge cases. These edge cases are underrepresented or non-existent in both the training and test sets. We implemented an automatic and a semi-automatic pipeline for identifying and generating underrepresented edge cases without requiring any specific domain knowledge or prior information by utilizing diffusion models. By enriching the data set with the generated samples we can train a more robust classifier. With our automatic approach, the accuracy of the classifiers increases by up to 20.84% on the edge case data of the Oxford-IIIT Pet data set (OPD), while achieving improvements of up to 54.16% for individual classes, and decreasing the standard deviation by up to 2.07%. Even on the entire OPD, the accuracy of the classifiers improves slightly. With our semi-automatic pipeline, we achieve improvements of up to 12.87% on a subset of manually generated edge cases, with individual classes gaining up to 37%. Our automated pipeline also achieves up to 8.52% improvement on the edge case data of the CIFAR-100 dataset.

Author(s)

Bunzel, Niklas

Fraunhofer-Institut für Sichere Informationstechnologie SIT

Göller, Nicolas

Frick, Raphael

Fraunhofer-Institut für Sichere Informationstechnologie SIT

Mainwork

Proceedings of the 2nd ACM Workshop on Secure and Trustworthy Deep Learning Systems. Part of: Asia CCS 24

Conference

Workshop on Secure and Trustworthy Deep Learning Systems 2024

Options

Identifying and Generating Edge Cases