• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Privacy Risk Assessment for Synthetic Longitudinal Health Data
 
  • Details
  • Full
Options
August 30, 2024
Conference Paper
Title

Privacy Risk Assessment for Synthetic Longitudinal Health Data

Abstract
Introduction:
A modern approach to ensuring privacy when sharing datasets is the use of synthetic data generation methods, which often claim to outperform classic anonymization techniques in the trade-off between data utility and privacy. Recently, it was demonstrated that various deep learning-based approaches are able to generate useful synthesized datasets, often based on domain-specific analyses. However, evaluating the privacy implications of releasing synthetic data remains a challenging problem, especially when the goal is to conform with data protection guidelines.
Methods:
Therefore, the recent privacy risk quantification framework Anonymeter has been built for evaluating multiple possible vulnerabilities, which are specifically based on privacy risks that are considered by the European Data Protection Board, i.e. singling out, linkability, and attribute inference. This framework was applied to a synthetic data generation study from the epidemiological domain, where the synthesization replicates time and age trends previously found in data collected during the DONALD cohort study (1312 participants, 16 time points). The conducted privacy analyses are presented, which place a focus on the vulnerability of outliers.
Results:
The resulting privacy scores are discussed, which vary greatly between the different types of attacks.
Conclusion:
Challenges encountered during their implementation and during the interpretation of their results are highlighted, and it is concluded that privacy risk assessment for synthetic data remains an open problem.
Author(s)
Schneider, Julian
Walter, Marvin
Otte, Karen
Meurers, Thierry
Perrar, Ines
Nöthlings, Ute
Adams, Tim  
Fraunhofer-Institut für Algorithmen und Wissenschaftliches Rechnen SCAI  
Fröhlich, Holger  
Fraunhofer-Institut für Algorithmen und Wissenschaftliches Rechnen SCAI  
Prasser, Fabian
Fluck, Juliane
Kühnel, Lisa
Mainwork
German Medical Data Sciences 2024. Health - Thinking, Researching and Acting Together  
Conference
German Association of Medical Informatics, Biometry, and Epidemiology (GMDS Annual Meeting) 2024  
DOI
10.3233/SHTI240867
Language
English
Fraunhofer-Institut für Algorithmen und Wissenschaftliches Rechnen SCAI  
Keyword(s)
  • Data sharing

  • Epidemiological study

  • Privacy risk assessment

  • Synthetic data

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024