• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Adept: AI-Generated Text Detection Based on Phrasal Category N-Grams
 
  • Details
  • Full
Options
2025
Conference Paper
Title

Adept: AI-Generated Text Detection Based on Phrasal Category N-Grams

Abstract
With the advent of large language models (LLMs), the generation of artificial text has become remarkably accessible and is increasingly integrated into everyday applications. As the use of LLMs to produce content becomes more widespread, the ability to distinguish between AI-generated and human-written texts has grown in importance. This year’s PAN competition focuses on this specific challenge: Based on a text, participants must determine whether it was written by a human or generated by an AI system (more specifically, an LLM). We propose a classification approach called Adept, which explicitly leverages constituent trees to model the grammatical structure of texts. For each sentence, we generate a constituent tree and represent the entire text by aggregating the distribution of syntactic n-grams, defined as paths of a fixed length within these trees. Using these structural representations, we train a multilayer perceptron (MLP) to classify authorship. Adept achieves a mean score of 0.843 on the test dataset, evaluated by the organizers of the competition. This ranks us on rank 16 out of 24 with a score difference of 0.056 to the first place and 0.036 to the third place.
Author(s)
Völpel, Felix
Fraunhofer-Institut für Sichere Informationstechnologie SIT  
Halvani, Oren  
Fraunhofer-Institut für Sichere Informationstechnologie SIT  
Mainwork
Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2025)  
Conference
Conference and Labs of the Evaluation Forum 2025  
Open Access
File(s)
Download (2.05 MB)
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.24406/publica-5953
Language
English
Fraunhofer-Institut für Sichere Informationstechnologie SIT  
Keyword(s)
  • AI-generated Text Detection

  • Constituent Tree n-grams

  • PAN 2025

  • Subtask 1: Voight-Kampff AI Detection Sensitivity

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024