2025
Conference Paper
Title

Open-Vocabulary Object Detectors: Robustness Challenges Under Distribution Shifts

Abstract
Out-Of-Distribution (OOD) robustness remains a critical hurdle to deploying deep vision models. Vision-Language Models (VLMs) have recently achieved groundbreaking results. VLM-based open-vocabulary object detection extends the capabilities of traditional object detection frameworks, enabling the recognition and classification of objects beyond predefined categories. Investigating OOD robustness in recent open-vocabulary object detection is essential to increase the trustworthiness of these models. This study presents a comprehensive robustness evaluation of the zero-shot capabilities of three recent open-vocabulary (OV) foundation object detection models: OWL-ViT, YOLO World, and Grounding DINO. Experiments were carried out on the robustness benchmarks COCO-O, COCO-DC, and COCO-C, which encompass distribution shifts due to information loss, corruption, adversarial attacks, and geometrical deformation. The results highlight the challenges to the models' robustness and are intended to foster research in this field. Project webpage: https://prakashchhipa.github.io/projects/ovod_robustness.
Author(s)
Chhipa, Prakash Chandra
Luleå University of Technology
De, Kanjar
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Chippa, Meenakshi Subhash
Luleå University of Technology
Saini, Rajkumar
Luleå University of Technology
Liwicki, Marcus
Luleå University of Technology
Mainwork
Computer Vision - ECCV2024 Workshops. Proceedings. Part XVIII  
Conference
European Conference on Computer Vision 2024  
DOI
10.1007/978-3-031-91672-4_5
Language
English
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Keyword(s)
  • distribution shift
  • foundation model
  • open-vocabulary object detection
  • robustness
  • zero-shot
