• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Artikel
  4. dsMTL: a computational framework for privacy-preserving, distributed multi-task machine learning
 
  • Details
  • Full
Options
November 1, 2022
Journal Article
Title

dsMTL: a computational framework for privacy-preserving, distributed multi-task machine learning

Abstract
Motivation:
In multi-cohort machine learning studies, it is critical to differentiate between effects that are reproducible across cohorts and those that are cohort-specific. Multi-task learning (MTL) is a machine learning approach that facilitates this differentiation through the simultaneous learning of prediction tasks across cohorts. Since multi-cohort data can often not be combined into a single storage solution, there would be the substantial utility of an MTL application for geographically distributed data sources.
Results:
Here, we describe the development of ‘dsMTL’, a computational framework for privacy-preserving, distributed multi-task machine learning that includes three supervised and one unsupervised algorithms. First, we derive the theoretical properties of these methods and the relevant machine learning workflows to ensure the validity of the software implementation. Second, we implement dsMTL as a library for the R programming language, building on the DataSHIELD platform that supports the federated analysis of sensitive individual-level data. Third, we demonstrate the applicability of dsMTL for comorbidity modeling in distributed data. We show that comorbidity modeling using dsMTL outperformed conventional, federated machine learning, as well as the aggregation of multiple models built on the distributed datasets individually. The application of dsMTL was computationally efficient and highly scalable when applied to moderate-size (n < 500), real expression data given the actual network latency.
Availability and implementation:
dsMTL is freely available at https://github.com/transbioZI/dsMTLBase (server-side package) and https://github.com/transbioZI/dsMTLClient (client-side package).
Supplementary information:
Supplementary data are available at Bioinformatics online.
Author(s)
Cao, Han
Zhang, Youcheng
Baumbach, Jan
Burton, Paul R.
Dwyer, Dominic
Koutsouleris, Nikolaos
Matschinske, Julian
Marcon, Yannick
Rajan, Sivanesan
Rieg, Thilo
Ryser-Welch, Patricia
Späth, Julian
Herrmann, Carl
Schwarz, Emanuel
Person Involved
Hofmann-Apitius, Martin  
Fraunhofer-Institut für Algorithmen und Wissenschaftliches Rechnen SCAI  
Tom Kodamullil, Alpha
Fraunhofer-Institut für Algorithmen und Wissenschaftliches Rechnen SCAI  
Journal
Bioinformatics  
Project(s)
IMPLEMENT - Verbesserung der Personalisierten Medizin in der Psychiatrie durch Techniken des Maschinellen Lernens - Identifikation von Subgruppen der Schizophrenie anhand von Multi-OMICS-Daten  
Funder
Bundesministerium für Bildung und Forschung -BMBF-  
Open Access
DOI
10.1093/bioinformatics/btac616
Language
English
Fraunhofer-Institut für Algorithmen und Wissenschaftliches Rechnen SCAI  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024