PARMA: a Platform Architecture to enable Automated, Reproducible, and Multi-party Assessments of AI Trustworthiness

Pintz, Maximilian Alexander; Becker, Daniel; Mock, Michael

doi:10.1145/3643691.3648585

2024

Conference Paper

Abstract

As AI applications are emerging in diverse fields - e.g., industry, healthcare or finance - weaknesses and failures of such applications might bare unacceptable risks which need to be rigorously assessed, quantified and, if necessary, mitigated. One crucial component of an effective AI trustworthiness assessment and risk management are systematic evaluations of the AI application based on properly chosen and executed tests. In addition to the known requirements of providing facilities for automated and reproducible tests, an assessment platform for Trustworthy AI must support the integration of different AI models and data sets, must be extensible for AI risk specific metrics and test tools, and should facilitate collaboration between model providers, assessment tool developers and auditors. In this paper, we develop an architecture of a platform for automated, reproducible and collaborative assessments of AI applications, based on an in-depth requirements analysis that maps use cases and collaboration scenarios to technical requirements.

Author(s)

Pintz, Maximilian Alexander

Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Becker, Daniel

Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Mock, Michael

Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Mainwork

IEEE/ACM 2nd International Workshop on Responsible AI Engineering, RAIE 2024. Proceedings

Conference

International Workshop on Responsible AI Engineering 2024

Options

PARMA: a Platform Architecture to enable Automated, Reproducible, and Multi-party Assessments of AI Trustworthiness