• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. GG-BBQ: German Gender Bias Benchmark for Question Answering
 
  • Details
  • Full
Options
August 2025
Conference Paper
Title

GG-BBQ: German Gender Bias Benchmark for Question Answering

Abstract
Within the context of Natural Language Processing (NLP), fairness evaluation is often associated with the assessment of bias and reduction of associated harm. In this regard, the evaluation is usually carried out by using a benchmark dataset, for a task such as Question Answering, created for the measurement of bias in the model’s predictions along various dimensions, including gender identity. In our work, we evaluate gender bias in German Large Language Models (LLMs) using the Bias Benchmark for Question Answering by Parrish et al. (2022) as a reference. Specifically, the templates in the gender identity subset of this English dataset were machine translated into German. The errors in the machine translated templates were then manually reviewed and corrected with the help of a language expert. We find that manual revision of the translation is crucial when creating datasets for gender bias evaluation because of the limitations of machine translation from English to a language such as German with grammatical gender. Our final dataset is comprised of two subsets: Subset-I, which consists of group terms related to gender identity, and Subset-II, where group terms are replaced with proper names. We evaluate several LLMs used for German NLP on this newly created dataset and report the accuracy and bias scores. The results show that all models exhibit bias, both along and against existing social stereotypes.
Author(s)
Satheesh, Shalaka  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Klug, Katrin  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Beckh, Katharina  orcid-logo
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Allende-Cid, Héctor  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Houben, Sebastian
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Hassan, Teena
Hochschule Bonn-Rhein-Sieg
Mainwork
GeBNLP 2025, 6th Workshop on Gender Bias in Natural Language Processing. Proceedings  
Project(s)
The Lamarr Institute for Machine Learning and Artificial Intelligence  
Funder
Bundesministerium für Bildung und Forschung -BMBF-
Conference
Workshop on Gender Bias in Natural Language Processing 2025  
Association for Computational Linguistics (ACL Annual Meeting) 2025  
Open Access
File(s)
Download (280.34 KB)
Rights
Use according to copyright law
DOI
10.18653/v1/2025.gebnlp-1.14
10.24406/publica-5793
Additional link
Full text
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • Gender Bias

  • German Large Language Models

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024