• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Coverage of LLM Trustworthiness Metrics in the Current Tool Landscape
 
  • Details
  • Full
Options
December 16, 2025
Conference Paper
Title

Coverage of LLM Trustworthiness Metrics in the Current Tool Landscape

Abstract
The increasing prevalence of AI systems that are build with Large Language Model (LLM) components raises the requirement for a dedicated tool stack that allows to monitor such systems, covering training, development and inference environments. Beside technical performance metrics like latency and throughput, regulations like the EU AI Act require the monitoring of trustworthiness related metrics like fairness and transparency during operation. In this paper, we describe the results of an investigation we conducted to gain an overview of the current landscape of LLM trustworthiness metrics and their coverage in monitoring tools. Based on an in-depth analysis of available catalogs and additional research, we identified 43 metrics and 23 tools. Furthermore, we highlight existing gaps and potential areas for further research. The results support practitioners and researchers in making informed decisions about the most appropriate tech stack for their AI systems.
Author(s)
Helmer, Lennard  orcid-logo
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Stein, Benny Jörg  orcid-logo
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Ufer, Tim
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Fernandes, Elanton
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Abdelwahab, Hammam
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Pareek, Abhinav
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Woll, Joshua
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Mainwork
Proceedings of TRUST-AI 2025 - The European Workshop on Trustworthy AI  
Project(s)
Aufbau eines Gaia-X Knotens für große KI-Sprachmodelle und innovative Sprachapplikations-Services; Teilvorhaben: Entwicklung von Sprachmodellen, Interoperabilitäts- und Nutzungskonzepten  
Funder
Bundesministerium für Wirtschaft und Energie  
Conference
European Workshop on Trustworthy AI 2025  
European Conference on Artificial Intelligence 2025  
Open Access
File(s)
Download (1.02 MB)
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.24406/publica-6891
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • Large Language Models

  • Artificial Intelligence

  • Generative AI

  • Trustworthy AI

  • Responsible AI,

  • MLOps

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024