Fraunhofer-Gesellschaft
March 12, 2025
Journal Article
Title

Youth language and emerging slurs: tackling bias in BERT-based hate speech detection

Abstract
With the increasing presence of adolescents and children online, it is crucial to evaluate algorithms designed to protect them from physical and mental harm. This study measures the bias introduced by emerging slurs found in youth language on existing BERT-based hate speech detection models. The research establishes a novel framework to identify language bias within trained networks, introducing a technique to detect emerging hate phrases and evaluate the unintended bias associated with them. As a result, three bias test sets are constructed: one for emerging hate speech terms, another for established hate terms, and one to test for overfitting. Based on these test sets, three scientific and one commercial hate speech detection models are assessed and compared. For comprehensive evaluation, the research introduces a novel Youth Language Bias Score. Finally, the study applies fine-tuning as a mitigation strategy for youth language bias, rigorously testing and evaluating the newly trained classifier. To summarize, the research introduces a novel framework for bias detection, highlights the influence of adolescent language on classifier performance in hate speech classification, and presents the first-ever hate speech classifier specifically trained for online youth language. This study focuses only on slurs in hateful speech, offering a foundational perspective for the field.
Author(s)
Fillies, Jan
Paschke, Adrian  
Freie Universität Berlin  
Journal
AI and Ethics
Open Access
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.1007/s43681-025-00701-z
10.24406/publica-7161
Language
English
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Keyword(s)
  • Bias
  • Hate speech detection
  • NLP
  • Youth language
