• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Anderes
  4. Designing Usable Interfaces for Human Evaluation of LLM-Generated Texts: UX Challenges and Solutions
 
  • Details
  • Full
Options
2025
Paper (Preprint, Research Paper, Review Paper, White Paper, etc.)
Title

Designing Usable Interfaces for Human Evaluation of LLM-Generated Texts: UX Challenges and Solutions

Abstract
Human evaluations remain important for assessing large language models (LLMs) due to the limitations of automated metrics. However, flawed methodologies and poor user interface (UI) design can compromise the validity and reliability of such evaluations. This pre-registered study investigates usability challenges and proposes solutions for UI design in evaluating LLM-generated texts. By comparing common evaluation methods such as Direct Quality Estimation, AB-Testing, Agreement with Quality Criterion, and Best-Worst Scaling, insights were gained into user experience challenges, including inefficient information transfer and poor visibility of evaluation materials. Iterative redesigns improved discoverability, accessibility, and user interaction through modifications of page layout and content presentation. Testing these enhancements revealed increased clarity and usability, with higher response rates and more consistent ratings. This work highlights the importance of UI design in enabling reliable and meaningful human evaluation, providing actionable recommendations to enhance the integrity and usability of NLP evaluation frameworks.
Author(s)
Mertsiotaki, Androniki
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Hofmann, Stephanie
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keck, Sarah
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Kratsch, Emily
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Daum, Alexander
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Popp, Birgit
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Conference
International Joint Conference on Artificial Intelligence 2025  
Open Access
DOI
10.24406/publica-4219
File(s)
Designing_usable_interfaces_for_human_evaluation_of_LLM_Generated_Texts.pdf (920.77 KB)
Rights
CC BY-NC-ND 4.0: Creative Commons Attribution-NonCommercial-NoDerivatives
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • Evaluation

  • LLM

  • Large Language Models

  • NLP

  • Design

  • Usability

  • User Interface

  • Natural Language Processing

  • Human

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024