• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Can you trust your Agent? The Effect of Out-of-Distribution Detection on the Safety of Reinforcement Learning Systems
 
  • Details
  • Full
Options
2024
Conference Paper
Title

Can you trust your Agent? The Effect of Out-of-Distribution Detection on the Safety of Reinforcement Learning Systems

Abstract
Deep Reinforcement Learning (RL) has the potential to revolutionize the automation of complex sequential decision-making problems. Although it has been successfully applied to a wide range of tasks, deployment to real-world settings remains challenging and is often limited. One of the main reasons for this is the lack of safety guarantees for conventional RL algorithms, especially in situations that substantially differ from the learning environment. In such situations, state-of-the-art systems will fail silently, producing action sequences without signalizing any uncertainty regarding the current input. Recent works have suggested Out-of-Distribution (OOD) detection as an additional reliability measure when deploying RL in the real world. How these mechanisms benefit the safety of the entire system, however, is not yet fully understood. In this work, we study how OOD detection contributes to the safety of RL systems by describing the challenges involved with detecting unknown situations. We derive several definitions for unknown events and explore potential avenues for a successful safety argumentation, building on recent work for safety assurance of Machine Learning components. In a series of experiments, we compare different OOD detectors and show how difficult it is to distinguish harmless from potentially unsafe OOD events in practice, and how standard evaluation schemes can lead to deceptive conclusions, depending on which definition of unknown is applied.
Author(s)
Haider, Tom  
Fraunhofer-Institut für Kognitive Systeme IKS  
Roscher, Karsten  
Fraunhofer-Institut für Kognitive Systeme IKS  
Herd, Benjamin  orcid-logo
Fraunhofer-Institut für Kognitive Systeme IKS  
Schmoeller da Roza, Felippe
Fraunhofer-Institut für Kognitive Systeme IKS  
Burton, Simon  
Fraunhofer-Institut für Kognitive Systeme IKS  
Mainwork
SAC 2024, 39th ACM/SIGAPP Symposium on Applied Computing  
Project(s)
IKS-Ausbauprojekt  
Funder
Bayerisches Staatsministerium für Wirtschaft, Landesentwicklung und Energie  
Conference
Symposium on Applied Computing 2024  
Open Access
File(s)
Download (2.11 MB)
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.1145/3605098.3635931
10.24406/h-469635
Additional link
Full text
Language
English
Fraunhofer-Institut für Kognitive Systeme IKS  
Fraunhofer Group
Fraunhofer-Verbund IUK-Technologie  
Keyword(s)
  • reinforcement learning

  • RL

  • out of distribution

  • OOD

  • ood detection

  • artificial intelligence

  • AI

  • safety

  • AI safety

  • safe AI

  • sequential decision making

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024