• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Active Data Collection and Management for Real-World Continual Learning via Pretrained Oracle
 
  • Details
  • Full
Options
2024
Conference Paper
Title

Active Data Collection and Management for Real-World Continual Learning via Pretrained Oracle

Abstract
Incremental Learning (IL) deals with learning from continuous streams of data while minimising catastrophic forgetting. This field of Machine Learning (ML) research has introduced several novel approaches and methodologies for varying configurations. However, academic Continual Learning setups generally work with well-curated datasets under predefined conditions, which do not hold for practical applications. In real-world scenarios, the problem of ML starts with data collection and curation. Depending on the application, different challenges are posed w.r.t. data management, such as similar objects, unbalanced data containing sparse samples, visual artefacts, digitisation, and camera setup. This becomes an incrementally compounding issue in Continual Learning projects with data drift and varying conditions. We propose Active Data Collection and Management (ADCM), a straight-forward and effective general framework for data collection, coreset/exemplar selection, and analysis. A pretrained Oracle model provides ground truth distribution for the other model that learns incrementally. We couple ADCM with traditional ML/IL setups and demonstrate its suitability for real-world tasks, such as fine-grained classification and anomaly detection. A baseline implementation of ADCM for Class-IL matches state-of-the-art exemplar selection strategies, providing an improvement in average incremental accuracy of 1.5% with Dynamically Expandable Representation (DER) and 4.1% with PODNet against Herding, and 0.8% on old class data against Reinforced Memory Management (RMM); and shows improved performance for general coreset selection. Our code is available at: https://github.com/Vivek9Chavan/ADCM
Author(s)
Chavan, Vivek
Fraunhofer-Institut für Produktionsanlagen und Konstruktionstechnik IPK  
Koch, Paul
Fraunhofer-Institut für Produktionsanlagen und Konstruktionstechnik IPK  
Schlüter, Marian
Fraunhofer-Institut für Produktionsanlagen und Konstruktionstechnik IPK  
Briese, Clemens
Fraunhofer-Institut für Produktionsanlagen und Konstruktionstechnik IPK  
Krüger, Jörg  
Fraunhofer-Institut für Produktionsanlagen und Konstruktionstechnik IPK  
Mainwork
IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
Funder
Bundesministerium für Bildung und Forschung  
Conference
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
DOI
10.1109/CVPRW63382.2024.00412
Language
English
Fraunhofer-Institut für Produktionsanlagen und Konstruktionstechnik IPK  
Keyword(s)
  • Continual Learning

  • Data Collection

  • Data Management

  • Exemplar Selection

  • Incremental Learning

  • Real-world Scenarios

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024