• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. LOG-AID: Logit-Based Statistical Features for AI Text Detection
 
  • Details
  • Full
Options
2025
Conference Paper
Title

LOG-AID: Logit-Based Statistical Features for AI Text Detection

Title Supplement
Notebook for PAN at CLEF 2025
Abstract
This submission addresses Subtask 1 of the Voight-Kampff Generative AI Detection task, which is part of the PAN 2025 lab. The goal of the subtask is to distinguish AI-generated texts from human-written ones, even when the machine-generated texts have been intentionally obfuscated to appear more human-like. As Large Language Models (LLMs) continue to improve in fluency and coherence, this distinction becomes increasingly difficult and requires robust detection strategies. This submission introduces a zero-shot method based on token-level statistics, which are extracted from two pre-trained LLMs: a base model and an instruction-tuned model. This method LOG-AID computes five core features: mean surprisal under each model, Jensen-Shannon divergence between their predictive distributions, average entropy difference, the mean entropy of the base model and the average logarithmic rank of the ground-truth tokens. These features are combined into a fixed-size vector and classified using a logistic regression model. On the official test set, the proposed system achieved a mean score of 0.827 across five metrics, surpassing strong baselines such as Binoculars (0.818) and PPMd Compression (0.758). In particular, the combination of uncertainty-based measures (surprisal, entropy) and rank-based features (log-rank) enhances discriminative power. This contribution offers a simple, interpretable and self-contained classification approach that does not require any fine-tuning. The method relies solely on internal probability structures of pre-trained models and may serve as a lightweight baseline for future work in AI text detection.
Author(s)
Titze, Sophie
Fraunhofer-Institut für Sichere Informationstechnologie SIT  
Halvani, Oren  
Fraunhofer-Institut für Sichere Informationstechnologie SIT  
Mainwork
Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2025)  
Conference
Conference and Labs of the Evaluation Forum 2025  
Open Access
File(s)
Download (1.1 MB)
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.24406/publica-5954
Language
English
Fraunhofer-Institut für Sichere Informationstechnologie SIT  
Keyword(s)
  • AI Text detection

  • GenAI Detection

  • LLM Text Detection

  • PAN 2025

  • Voight-Kampff Generative AI Detection

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024