Swarm Learning for decentralized and confidential clinical machine learning

Stefanie Warnat-Herresthal
Hartmut Schultze
Krishnaprasad Lingadahalli Shastry
Sathyanarayanan Manamohan
Saikat Mukherjee
Vishesh Garg
Ravi Sarveswara
Kristian Händler
Peter Pickkers
N. Ahmad Aziz
Sofia Ktena
Florian Tran
Michael Bitzer
Stephan Ossowski
Nicolas Casadei
Christian Herr
Daniel Petersheim
Uta Behrends
Fabian Kern
Tobias Fehlmann
Philipp Schommers
Clara Lehmann
Max Augustin
Jan Rybniker
Janine Altmüller
Neha Mishra
Joana P. Bernardes
Benjamin Krämer
Lorenzo Bonaguro
Jonas Schulte-Schrepping
Elena De Domenico
Christian Siever
Michael Kraut
Milind Desai
Bruno Monnet
Maria Saridaki
Charles Martin Siegel
Anna Drews
Melanie Nuesch-Germano
Heidi Theis
Jan Heyckendorf
Stefan Schreiber
Sarah Kim-Hellmuth
COVID-19 Aachen Study (COVAS)
Paul Balfanz
Thomas Eggermann
Peter Boor
Ralf Hausmann
Hannah Kuhn
Susanne Isfort
Julia Carolin Stingl
Günther Schmalzing
Christiane K. Kuhl
Rainer Röhrig
Gernot Marx
Stefan Uhlig
Edgar Dahl
Dirk Müller-Wieland
Michael Dreher
Nikolaus Marx
Jacob Nattermann
Dirk Skowasch
Ingo Kurth
Andreas Keller
Robert Bals
Peter Nürnberg
Olaf Rieß
Philip Rosenstiel
Mihai G. Netea
Fabian Theis
Sach Mukherjee
Michael Backes
Anna C. Aschenbrenner
Thomas Ulas
Deutsche COVID-19 Omics Initiative (DeCOI)
Angel Angelov
Alexander Bartholomäus
Anke Becker
Daniela Bezdan
Conny Blumert
Ezio Bonifacio
Peer Bork
Bunk Boyke
Helmut Blum
Thomas Clavel
Maria Colome-Tatche
Markus Cornberg
Inti Alberto De La Rosa Velázquez
Andreas Diefenbach
Alexander Dilthey
Nicole Fischer
Konrad Förstner
Sören Franzenburg
Julia-Stefanie Frick
Gisela Gabernet
Julien Gagneur
Tina Ganzenmueller
Marie Gauder
Janina Geißert
Alexander Goesmann
Siri Göpel
Adam Grundhoff
Hajo Grundmann
Torsten Hain
Frank Hanses
Ute Hehr
André Heimbach
Marius Hoeper
Friedemann Horn
Daniel Hübschmann
Michael Hummel
Thomas Iftner
Angelika Iftner
Thomas Illig
Stefan Janssen
Jörn Kalinowski
René Kallies
Birte Kehr
Oliver T. Keppler
Christoph Klein
Michael Knop
Oliver Kohlbacher
Karl Köhrer
Jan Korbel
Peter G. Kremsner
Denise Kühnert
Markus Landthaler
Yang Li
Kerstin U. Ludwig
Oliwia Makarewicz
Manja Marz
Alice C. McHardy
Christian Mertes
Maximilian Münchhoff
Sven Nahnsen
Markus Nöthen
Francine Ntoumi
Jörg Overmann
Silke Peter
Klaus Pfeffer
Isabell Pink
Anna R. Poetsch
Ulrike Protzer
Alfred Pühler
Nikolaus Rajewsky
Markus Ralser
Kristin Reiche
Stephan Ripke
Ulisses Nunes da Rocha
Antoine-Emmanuel Saliba
Leif Erik Sander
Birgit Sawitzki
Simone Scheithauer
Philipp Schiffer
Jonathan Schmid-Burgk
Wulf Schneider
Eva-Christina Schulte
Alexander Sczyrba
Mariam L. Sharaf
Yogesh Singh
Michael Sonnabend
Oliver Stegle
Jens Stoye
Janne Vehreschild
Thirumalaisamy P. Velavan
Jörg Vogel
Sonja Volland
Max von Kleist
Andreas Walker
Jörn Walter
Dagmar Wieczorek
Sylke Winkler
John Ziebuhr
Monique M. B. Breteler
Evangelos J. Giamarellos-Bourboulis
Matthijs Kox
Matthias Becker
Sorin Cheran
Michael S. Woodacre
Eng Lim Goh
Joachim L. Schultze

This article has been Reviewed by the following groups

Read the full article

Listed in

Evaluated articles (ScreenIT)

Abstract

Fast and reliable detection of patients with severe and heterogeneous illnesses is a major goal of precision medicine ^1,2 . Patients with leukaemia can be identified using machine learning on the basis of their blood transcriptomes ³ . However, there is an increasing divide between what is technically possible and what is allowed, because of privacy legislation ^4,5 . Here, to facilitate the integration of any medical data from any data owner worldwide without violating privacy laws, we introduce Swarm Learning—a decentralized machine-learning approach that unites edge computing, blockchain-based peer-to-peer networking and coordination while maintaining confidentiality without the need for a central coordinator, thereby going beyond federated learning. To illustrate the feasibility of using Swarm Learning to develop disease classifiers using distributed data, we chose four use cases of heterogeneous diseases (COVID-19, tuberculosis, leukaemia and lung pathologies). With more than 16,400 blood transcriptomes derived from 127 clinical studies with non-uniform distributions of cases and controls and substantial study biases, as well as more than 95,000 chest X-ray images, we show that Swarm Learning classifiers outperform those developed at individual sites. In addition, Swarm Learning completely fulfils local confidentiality regulations by design. We believe that this approach will notably accelerate the introduction of precision medicine.

Version published to 10.1038/s41586-021-03583-3
May 26, 2021
ScreenIT
Jul 2, 2020
SciScore for 10.1101/2020.06.25.171009: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
Institutional Review Board Statement not detected.
Randomization not detected.
Blinding not detected.
Power Analysis not detected.
Sex as a biological variable not detected.
Table 2: Resources
No key resources detected.
Results from OddPub: Thank you for sharing your data.
Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:
- Than…
SciScore for 10.1101/2020.06.25.171009: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
Institutional Review Board Statement not detected.
Randomization not detected.
Blinding not detected.
Power Analysis not detected.
Sex as a biological variable not detected.
Table 2: Resources
No key resources detected.
Results from OddPub: Thank you for sharing your data.
Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:
Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.
About SciScore
SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.
Read the original source
Version published to 10.1101/2020.06.25.171009 on bioRxiv
Jun 26, 2020

Machine Learning for Privacy Threat Classification: A Systematic Review

This article has 4 authors:
1. L.D.C.S. Subhashini
2. Yuefeng Li
3. Xiaohui Tao
4. Jianming Yong
This article has no evaluationsLatest version Jul 2, 2025
A Systematic Review of Machine Learning in Credit Card Fraud Detection

This article has 3 authors:
1. Fatemeh Moradi
2. Mehran Tarif Hokmabadi
3. MohammadHossein Homaei
This article has no evaluationsLatest version Jul 14, 2025
The Adaptive Ensemble Learning-Based Intrusion Detection System for Enhanced Cybersecurity in Networked Environments

This article has 2 authors:
1. Kuldeep Kumar
2. Namrta Tanwar
This article has no evaluationsLatest version Aug 19, 2025

Institutional Review Board Statement	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.
Sex as a biological variable	not detected.

This article has been Reviewed by the following groups

Listed in

Abstract

Article activity feed

Related articles

Machine Learning for Privacy Threat Classification: A Systematic Review

A Systematic Review of Machine Learning in Credit Card Fraud Detection

The Adaptive Ensemble Learning-Based Intrusion Detection System for Enhanced Cybersecurity in Networked Environments