• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Towards Integrating ChatGPT into Satellite Image Annotation Workflows. A Comparison of Label Quality and Costs of Human and Automated Annotators
 
  • Details
  • Full
Options
2025
Journal Article
Title

Towards Integrating ChatGPT into Satellite Image Annotation Workflows. A Comparison of Label Quality and Costs of Human and Automated Annotators

Abstract
High-quality annotations are a critical success factor for machine learning (ML) applications. To achieve this, we have traditionally relied on human annotators, navigating the challenges of limited budgets and the varying task-specific expertise, costs, and availability. Since the emergence of Large Language Models (LLMs), their popularity for generating automated annotations has grown, extending possibilities and complexity of designing an efficient annotation strategy. Increasingly, computer vision capabilities have been integrated into general-purpose LLMs like ChatGPT. This raises the question of how effectively LLMs can be used in satellite image annotation tasks and how they compare to traditional annotator types. This study presents a comprehensive investigation and comparison of various human and automated annotators for image classification. We evaluate the feasibility and economic competitiveness of using the ChatGPT4-V model for a complex land usage annotation task and compare it with alternative human annotators. A set of satellite images is annotated by a domain expert and 15 additional human and automated annotators, differing in expertise and costs. Our analyses examine the annotation quality loss between the expert and other annotators. This comparison is conducted through (1) descriptive analyses, (2) fitting linear probability models, and (3) comparing F1-scores. Ultimately, we simulate annotation strategies where samples are split according to an automatically assigned certainty score. Routing low-certainty images to human annotators can cut total annotation costs by over 50% with minimal impact on label quality. We discuss implications regarding the economic competitiveness of annotation strategies, prompt engineering and the task-specificity of expertise.
Author(s)
Beck, Jacob
Ludwig-Maximilians-Universität München
Kemeter, Lukas Malte
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Dürrbeck, Konrad
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Abdalla, Mohamed Hesham Ibrahim
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Kreuter, Frauke
Ludwig-Maximilians-Universität München
Journal
IEEE journal of selected topics in applied earth observations and remote sensing  
Open Access
DOI
10.1109/JSTARS.2025.3528192
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • Automated Annotations

  • ChatGPT

  • Label Quality

  • LLMs

  • Satellite Image Annotation

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024