• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Text2Autochrome: Text Guided Autochrome Synthesis Using Generative Models
 
  • Details
  • Full
Options
2025
Conference Paper
Title

Text2Autochrome: Text Guided Autochrome Synthesis Using Generative Models

Abstract
Autochrome is an early color photography technique that is highly sensitive and prone to deterioration, limiting their public display. A limited collection of digitized autochromes exists, often with defects due to their fragile nature. We applied generative AI methods, specifically Low-Rank Adaptation (LoRA), to fine-tune diffusion models, enabling efficient use of computational resources. Our curated dataset of vintage digitized autochromes showcased various styles and served as the basis for training the LoRA model, resulting in the generation of digitized autochromes that preserved the original color filter effects and characteristic granularity. By leveraging generative AI, we can utilize the multi-modal capabilities of the model, allowing each user to generate images through concept-based prompts. This approach empowers users to creatively interact with the model, producing personalized images while maintaining the historical color fidelity and structure of autochromes. This capability also enables us to generate defect-free autochromes, which can be utilized for synthetic training in autochrome restoration efforts. We evaluated our approach using the CLIPScore metric for quantitative similarity and conducted a user study for qualitative feedback on the generated images. Our results show that the fine-tuned LoRA model effectively captures the essence of autochromes, producing visually appealing images that respect the historical aesthetic. Considering the potential for misinterpretation and ethical concerns surrounding text-to-image methods using deep learning with historical photographs, we are committed to enhancing transparency by releasing our model weights and training datasets, thereby empowering the community to better understand, evaluate, and address these important issues. Further we release an interactive demo together with the fine-tuned weights available via huggingface.
Author(s)
Kühn, Julius
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Sinha, Saptarshi Neil
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Nguyen, Duc Anh
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Horst, Robin
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Kuijper, Arjan  orcid-logo
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Fellner, Dieter
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Mainwork
DH 2025 - Digital Heritage International Congress  
Conference
Digital Heritage International Congress 2025  
Open Access
File(s)
Download (20.24 MB)
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.2312/dh.20253061
10.24406/publica-5520
Language
English
Fraunhofer-Institut für Graphische Datenverarbeitung IGD  
Keyword(s)
  • Branche: Cultural and Creative Economy

  • Research Line: Computer vision (CV)

  • Research Line: Machine learning (ML)

  • LTA: Generation, capture, processing, and output of images and 3D models

  • Synthetic data

  • Generative AI

  • Image restoration

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024