• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Anderes
  4. Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying
 
  • Details
  • Full
Options
2024
Paper (Preprint, Research Paper, Review Paper, White Paper, etc.)
Title

Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying

Title Supplement
Published on arXiv
Abstract
Deep Neural Networks (DNNs) require large amounts of annotated training data for a good performance. Often this data is generated using manual labeling (error-prone and time-consuming) or rendering (requiring geometry and material information). Both approaches make it difficult or uneconomic to apply them to many small-scale applications. A fast and straightforward approach of acquiring the necessary training data would allow the adoption of deep learning to even the smallest of applications. Chroma keying is the process of replacing a color (usually blue or green) with another background. Instead of chroma keying, we propose luminance keying for fast and straightforward training image acquisition. We deploy a black screen with high light absorption (99.99%) to record roughly 1-minute long videos of our target objects, circumventing typical problems of chroma keying, such as color bleeding or color overlap between background color and object color. Next we automatically mask our objects using simple brightness thresholding, saving the need for manual annotation. Finally, we automatically place the objects on random backgrounds and train a 2D object detector. We do extensive evaluation of the performance on the widely-used YCB-V object set and compare favourably to other conventional techniques such as rendering, without needing 3D meshes, materials or any other information of our target objects and in a fraction of the time needed for other approaches. Our work demonstrates highly accurate training data acquisition allowing to start training state-of-the-art networks within minutes.
Author(s)
Pöllabauer, Thomas  orcid-logo
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Knauthe, Volker
TU Darmstadt, Fachgebiet Graphisch-Interaktive Systeme  
Boller, Andre
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Kuijper, Arjan  orcid-logo
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Fellner, Dieter
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Conference
International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision 2024  
DOI
10.48550/arXiv.2405.07653
10.24406/publica-3204
File(s)
2405.07653v1.pdf (4.14 MB)
Rights
Under Copyright
Language
English
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Keyword(s)
  • Branche: Automotive Industry

  • Branche: Cultural und Creative Economy

  • Research Line: Computer graphics (CG)

  • Research Line: Computer vision (CV)

  • Research Line: Modeling (MOD)

  • Research Line: Machine learning (ML)

  • LTA: Machine intelligence, algorithms, and data structures (incl. semantics)

  • LTA: Generation, capture, processing, and output of images and 3D models

  • Computer vision

  • Machine learning

  • Pattern recognition

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024