English
Deutsch

Log In

Password Login

or

Research Outputs
Projects
Researchers
Institutes
Statistics

Fraunhofer-Gesellschaft

Home
Fraunhofer-Gesellschaft
Person
Schwaiger, Adrian

Options

M.Sc.

Schwaiger, Adrian

0000-0001-7692-104X

Profile

Publications

Advisor

Metrics

1 results

Filters

Roscher, Karsten

Henne, Maximilian 1

Schwaiger, Adrian 1

Search author name

Submit

Fraunhofer-Institut für Kognitive Systeme IKS 1

search.filters.filter.organization.label

Submit

artificial intelligence 1

computer vision 1

deep learning 1

Search subject

Submit

conference paper 1

Search type

Submit

Settings

Sort By

Results per page

Now showing 1 - 1 of 1

Benchmarking Uncertainty Estimation Methods for Deep Learning with Safety-Related Metrics

( 2020)
Henne, Maximilian
;
Schwaiger, Adrian
;
Roscher, Karsten
;
Weiß, Gereon

Deep neural networks generally perform very well on giving accurate predictions, but they often lack in recognizing when these predictions may be wrong. This absence of awareness regarding the reliability of given outputs is a big obstacle in deploying such models in safety-critical applications. There are certain approaches that try to address this problem by designing the models to give more reliable values for their uncertainty. However, even though the performance of these models are compared to each other in various ways, there is no thorough evaluation comparing them in a safety-critical context using metrics that are designed to describe trade-offs between performance and safe system behavior. In this paper we attempt to fill this gap by evaluating and comparing several state-of-the-art methods for estimating uncertainty for image classifcation with respect to safety-related requirements and metrics that are suitable to describe the models performance in safety-critical domains. We show the relationship of remaining error for predictions with high confidence and its impact on the performance for three common datasets. In particular, Deep Ensembles and Learned Confidence show high potential to significantly reduce the remaining error with only moderate performance penalties.

Cookie settings
Imprint
Privacy policy
Api
Send Feedback

© 2022