• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation
 
  • Details
  • Full
Options
2025
Conference Paper
Title

STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation

Abstract
In layout-to-image (L2I) synthesis, controlled complex scenes are generated from coarse information like bounding boxes. Such a task is exciting to many downstream applications because the input layouts offer strong guidance to the generation process while remaining easily reconfigurable by humans. In this paper, we proposed STyled LAYout Diffusion (STAY Diffusion), a diffusion-based model that produces photo-realistic images and provides fine-grained control of stylized objects in scenes. Our approach learns a global condition for each layout, and a self-supervised semantic map for weight modulation using a novel Edge-Aware Normalization (EA Norm). A new Styled-Mask Attention (SM Attention) is also introduced to cross-condition the global condition and image feature for capturing the objects' relationships. These measures provide consistent guidance through the model, enabling more accurate and controllable image generation. Extensive benchmarking demonstrates that our STAY Diffusion presents high-quality images while surpassing previous state-of-the-art methods in generation diversity, accuracy, and controllability.
Author(s)
Wang, Ruyu
Bosch Center for Artificial Intelligence
Hou, Xuefeng
Bosch Center for Artificial Intelligence
Schmedding, Sabrina
Bosch Center for Artificial Intelligence
Huber, Marco  
Fraunhofer-Institut für Produktionstechnik und Automatisierung IPA  
Mainwork
IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025. Proceedings  
Conference
Winter Conference on Applications of Computer Vision 2025  
DOI
10.1109/WACV61041.2025.00379
Language
English
Fraunhofer-Institut für Produktionstechnik und Automatisierung IPA  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024