• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. ID-Booth: Identity-consistent Face Generation with Diffusion Models
 
  • Details
  • Full
Options
2025
Conference Paper
Title

ID-Booth: Identity-consistent Face Generation with Diffusion Models

Abstract
Recent advances in generative modeling have enabled the generation of high-quality synthetic data that is applicable in a variety of domains, including face recognition. Here, state-of-the-art generative models typically rely on conditioning and fine-tuning of powerful pretrained diffusion models to facilitate the synthesis of realistic images of a desired identity. Yet, these models often do not consider the identity of subjects during training, leading to poor consistency between generated and intended identities. In contrast, methods that employ identity-based training objectives tend to overfit on various aspects of the identity, and in turn, lower the diversity of images that can be generated. To address these issues, we present in this paper a novel generative diffusion-based framework, called ID-Booth. ID-Booth consists of a denoising network responsible for data generation, a variational auto-encoder for mapping images to and from a lower-dimensional latent space and a text encoder that allows for prompt-based control over the generation procedure. The framework utilizes a novel triplet identity training objective and enables identity-consistent image generation while retaining the synthesis capabilities of pretrained diffusion models. Experiments with a state-of-the-art latent diffusion model and diverse prompts reveal that our method facilitates better intra-identity consistency and inter-identity separability than competing methods, while achieving higher image diversity. In turn, the produced data allows for effective augmentation of small-scale datasets and training of betterperforming recognition models in a privacy-preserving manner. The source code for the ID-Booth framework is publicly available at https://github.com/dariant/ID-Booth.
Author(s)
Tomašević, Darian
University of Ljubljana  
Boutros, Fadi  orcid-logo
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Lin, Chenhao
Xi'an Jiaotong University
Damer, Naser  
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Štruc, Vitomir
University of Ljubljana  
Peer, Peter
University of Ljubljana  
Mainwork
IEEE 19th International Conference on Automatic Face and Gesture Recognition, FG 2025  
Project(s)
Next Generation Biometric Systems  
Next Generation Biometric Systems  
Funder
Bundesministerium für Bildung und Forschung -BMBF-  
Hessisches Ministerium für Wissenschaft und Kunst -HMWK-  
Conference
International Conference on Automatic Face and Gesture Recognition 2025  
DOI
10.1109/FG61629.2025.11099217
Language
English
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Keyword(s)
  • Branche: Information Technology

  • Research Line: Computer vision (CV)

  • Research Line: Human computer interaction (HCI)

  • Research Line: Machine learning (ML)

  • LTA: Machine intelligence, algorithms, and data structures (incl. semantics)

  • LTA: Generation, capture, processing, and output of images and 3D models

  • Machine learning

  • Deep learning

  • Face recognition

  • ATHENE

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024