• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. DiffProb: Data Pruning for Face Recognition
 
  • Details
  • Full
Options
2025
Conference Paper
Title

DiffProb: Data Pruning for Face Recognition

Abstract
Face recognition models have made substantial progress due to advances in deep learning and the availability of large-scale datasets. However, reliance on massive annotated datasets introduces challenges related to training computational cost and data storage, as well as potential privacy concerns regarding managing large face datasets. This paper presents DiffProb, the first data pruning approach for the application of face recognition. DiffProb assesses the prediction probabilities of training samples within each identity and prunes the ones with identical or close prediction probability values, as they are likely reinforcing the same decision boundaries, and thus contribute minimally with new information. We further enhance this process with an auxiliary cleaning mechanism to eliminate mislabeled and label-flipped samples, boosting data quality with minimal loss. Extensive experiments on CASIA-WebFace with different pruning ratios and multiple benchmarks, including LFW, CFP-FP, and IJB-C, demonstrate that DiffProb can prune up to 50% of the dataset while maintaining or even, in some settings, improving the verification accuracies. Additionally, we demonstrate DiffProb’s robustness across different architectures and loss functions. Our method significantly reduces training cost and data volume, enabling efficient face recognition training and reducing the reliance on massive datasets and their demanding management. The code, pretrained models, and pruned datasets are publicly released: https://github.com/EduardaCaldeira/DiffProb.
Author(s)
Loureiro Caldeira, Maria Eduarda
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Kolf, Jan Niklas  
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Damer, Naser  
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Boutros, Fadi  orcid-logo
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Mainwork
IEEE 19th International Conference on Automatic Face and Gesture Recognition, FG 2025  
Project(s)
Next Generation Biometric Systems  
Next Generation Biometric Systems  
Funder
Bundesministerium für Bildung und Forschung -BMBF-  
Hessisches Ministerium für Wissenschaft und Kunst -HMWK-  
Conference
International Conference on Automatic Face and Gesture Recognition 2025  
DOI
10.1109/FG61629.2025.11099256
Language
English
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Keyword(s)
  • Branche: Information Technology

  • Research Line: Computer vision (CV)

  • Research Line: Human computer interaction (HCI)

  • Research Line: Machine learning (ML)

  • LTA: Machine intelligence, algorithms, and data structures (incl. semantics)

  • LTA: Generation, capture, processing, and output of images and 3D models

  • Deep learning

  • Machine learning

  • Face recognition

  • Biometrics

  • ATHENE

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024