The FeatureCloud AI Store for Federated Learning in Biomedicine and Beyond
Authors:
Julian Matschinske,
Julian Späth,
Reza Nasirigerdeh,
Reihaneh Torkzadehmahani,
Anne Hartebrodt,
Balázs Orbán,
Sándor Fejér,
Olga Zolotareva,
Mohammad Bakhtiari,
Béla Bihari,
Marcus Bloice,
Nina C Donner,
Walid Fdhila,
Tobias Frisch,
Anne-Christin Hauschild,
Dominik Heider,
Andreas Holzinger,
Walter Hötzendorfer,
Jan Hospes,
Tim Kacprowski,
Markus Kastelitz,
Markus List,
Rudolf Mayer,
Mónika Moga,
Heimo Müller
, et al. (7 additional authors not shown)
Abstract:
Machine Learning (ML) and Artificial Intelligence (AI) have shown promising results in many areas and are driven by the increasing amount of available data. However, this data is often distributed across different institutions and cannot be shared due to privacy concerns. Privacy-preserving methods, such as Federated Learning (FL), allow for training ML models without sharing sensitive data, but t…
▽ More
Machine Learning (ML) and Artificial Intelligence (AI) have shown promising results in many areas and are driven by the increasing amount of available data. However, this data is often distributed across different institutions and cannot be shared due to privacy concerns. Privacy-preserving methods, such as Federated Learning (FL), allow for training ML models without sharing sensitive data, but their implementation is time-consuming and requires advanced programming skills. Here, we present the FeatureCloud AI Store for FL as an all-in-one platform for biomedical research and other applications. It removes large parts of this complexity for developers and end-users by providing an extensible AI Store with a collection of ready-to-use apps. We show that the federated apps produce similar results to centralized ML, scale well for a typical number of collaborators and can be combined with Secure Multiparty Computation (SMPC), thereby making FL algorithms safely and easily applicable in biomedical and clinical environments.
△ Less
Submitted 12 May, 2021;
originally announced May 2021.
Privacy-preserving Artificial Intelligence Techniques in Biomedicine
Authors:
Reihaneh Torkzadehmahani,
Reza Nasirigerdeh,
David B. Blumenthal,
Tim Kacprowski,
Markus List,
Julian Matschinske,
Julian Späth,
Nina Kerstin Wenke,
Béla Bihari,
Tobias Frisch,
Anne Hartebrodt,
Anne-Christin Hausschild,
Dominik Heider,
Andreas Holzinger,
Walter Hötzendorfer,
Markus Kastelitz,
Rudolf Mayer,
Cristian Nogales,
Anastasia Pustozerova,
Richard Röttger,
Harald H. H. W. Schmidt,
Ameli Schwalber,
Christof Tschohl,
Andrea Wohner,
Jan Baumbach
Abstract:
Artificial intelligence (AI) has been successfully applied in numerous scientific domains. In biomedicine, AI has already shown tremendous potential, e.g. in the interpretation of next-generation sequencing data and in the design of clinical decision support systems. However, training an AI model on sensitive data raises concerns about the privacy of individual participants. For example, summary s…
▽ More
Artificial intelligence (AI) has been successfully applied in numerous scientific domains. In biomedicine, AI has already shown tremendous potential, e.g. in the interpretation of next-generation sequencing data and in the design of clinical decision support systems. However, training an AI model on sensitive data raises concerns about the privacy of individual participants. For example, summary statistics of a genome-wide association study can be used to determine the presence or absence of an individual in a given dataset. This considerable privacy risk has led to restrictions in accessing genomic and other biomedical data, which is detrimental for collaborative research and impedes scientific progress. Hence, there has been a substantial effort to develop AI methods that can learn from sensitive data while protecting individuals' privacy. This paper provides a structured overview of recent advances in privacy-preserving AI techniques in biomedicine. It places the most important state-of-the-art approaches within a unified taxonomy and discusses their strengths, limitations, and open problems. As the most promising direction, we suggest combining federated machine learning as a more scalable approach with other additional privacy preserving techniques. This would allow to merge the advantages to provide privacy guarantees in a distributed way for biomedical applications. Nonetheless, more research is necessary as hybrid approaches pose new challenges such as additional network or computation overhead.
△ Less
Submitted 6 November, 2020; v1 submitted 22 July, 2020;
originally announced July 2020.