Abstract
PURPOSE
Image analysis is one of the most promising applications of artificial intelligence (AI) in health care, potentially improving prediction, diagnosis, and treatment of diseases. Although scientific advances in this area critically depend on the accessibility of large-volume and high-quality data, sharing data between institutions faces various ethical and legal constraints as well as organizational and technical obstacles.
METHODS
The Joint Imaging Platform (JIP) of the German Cancer Consortium (DKTK) addresses these issues by providing federated data analysis technology in a secure and compliant way. Using the JIP, medical image data remain in the originator institutions, but analysis and AI algorithms are shared and jointly used. Common standards and interfaces to local systems ensure permanent data sovereignty of participating institutions.
RESULTS
The JIP is established in the radiology and nuclear medicine departments of 10 university hospitals in Germany (DKTK partner sites). In multiple complementary use cases, we show that the platform fulfills all relevant requirements to serve as a foundation for multicenter medical imaging trials and research on large cohorts, including the harmonization and integration of data, interactive analysis, automatic analysis, federated machine learning, and extensibility and maintenance processes, which are elementary for the sustainability of such a platform.
CONCLUSION
The results demonstrate the feasibility of using the JIP as a federated data analytics platform in heterogeneous clinical information technology and software landscapes, solving an important bottleneck for the application of AI to large-scale clinical imaging data.
INTRODUCTION
Medical imaging plays an essential role in nearly all aspects of high-quality cancer care, from preventive measures, including screening and early detection, through diagnosis, treatment planning and monitoring, and follow-up. Most patients with cancer undergo repeated imaging during the course of their treatment. In combination with clinical and laboratory data, quantitative imaging biomarkers are of fundamental importance to improve standardized therapy monitoring in clinical multicenter studies.1-7
CONTEXT
Key Objective
To create a digital infrastructure for federated artificial intelligence (AI)–based medical image analysis with the goal of facilitating and enabling multicenter trials between the partner sites of the German Cancer Consortium and beyond.
Knowledge Generated
The decentralized local execution of data analyses can solve many obstacles of cross-site collaboration. This work tackles organizational, legal, and technical challenges of distributed data analysis and shows its value in several use cases and studies.
Relevance
Translating new developments into clinical practice is the ultimate goal of medical imaging research, and using these new technologies will yield enormous benefits for patients. The open-source Joint Imaging Platform presented in this work is realizing this step from the research laboratory into a real multicenter clinical study setting, supporting and enabling the translation of cutting-edge AI-based technologies into clinical practice.
Medical images are more than pictures; they are data characterizing the patient.8 As such, they are rightly subject to strict data protection, as well as ethical and moral requirements for scientific secondary use, which, in turn, can impede the exchange of biomedical imaging data across clinical sites. Current research strongly indicates that data anonymization is not only difficult to perform but also generally ineffective in practice.9-11 De-identification or anonymization methods considered safe today might potentially fail in the future.9 Data ownership, insufficient personal incentives for data collectors to share their data, and remaining technical challenges present further hurdles to data sharing in the medical research domain.12 In addition, the clinical landscape is composed of heterogeneous information technology (IT) systems as well as different scanners and acquisition parameters, making joint projects and data sharing cumbersome but also necessary for generalizable image-based biomarkers and artificial intelligence (AI)–based image analysis.
Recent improvements in machine learning (ML) have enabled algorithms that can achieve results for specialized tasks equivalent to those of physicians and that are able to support human experts in improving their performance and efficiency.4,13-27 These accomplishments elevated medical imaging to one of the most promising fields for practical application of ML in health care, aiming at better prediction, diagnosis, and treatment of diseases.13 However, to enable a broad application of these techniques, a number of challenges still need to be overcome. The common denominator of all previous success stories is an extensive investment in collecting and curating a substantial amount of multicentric imaging data, which is critical to establish the required robustness of ML models. Thus, the obstacles for data sharing represent a bottleneck for medical research in general and for cancer research in particular.
To overcome this bottleneck, several projects and initiatives are working on facilitating, accelerating, and promoting collaboration in larger scientific networks. Existing platforms such as KETOS,28 which is based on DataSHIELD29,30 and targeted to perform statistical analysis on textual clinical information, were among the first to adopt the concept of federated on-site execution of algorithms to enable decentralized analysis of clinical data. The field of bioinformatics has brought up widely used platforms for standardized data analysis. Among these, Galaxy31 is the most prominent solution for platform-based genome analyses. The Personal Health Train,32 on the other hand, envisions federated scenarios that include the continuous federated training of ML models. In the area of medical imaging, Sharma et al33 presented the Platform for Imaging in Precision Medicine (PRISM). PRISM focuses on the curation, management, and exploration of radiologic, pathologic, and clinical data collections, such as The Cancer Imaging Archive,34 not on the decentralized processing of imaging data and the realization of multicenter trials.
To fill this gap, the strategic initiative Joint Imaging Platform (JIP) was established by the German Cancer Consortium (DKTK).35 The DKTK is a long-term initiative by the German Federal Ministry of Education and Research connecting more than 20 academic research institutes and university hospitals with the German Cancer Research Center to foster multicenter clinical trials for improved cancer diagnosis and treatment (Appendix Table A1). The consortium provides a unique opportunity for collecting large-scale high-quality imaging data from several institutions. The JIP is designed to facilitate collaborative imaging projects across institutions by addressing the typical technical, organizational, and legal challenges associated with the sharing of imaging data, acquisition parameters, analysis algorithms, or processing results. By enabling training, evaluation, and application of algorithms in large-scale federated clinical settings, the platform builds a solid and extensible foundation for federated learning scenarios. Leveraging open-source technologies, the JIP has the potential to serve as a promoter of prospective cross-center radiologic studies at unprecedented cohort sizes, not only within the DKTK but also beyond.
METHODS
The JIP is designed as a federated data analysis and processing system (ie, for delivering methods and tools to the image data instead of collecting the data for processing and analysis). Strict on-site data processing mitigates common problems with data protection regulations because no personal data have to leave the clinic.
Platform Requirements
We conducted a requirement analysis at each DKTK site to gather information regarding their specific demands and expectations. The collected responses revealed a quite heterogeneous landscape considering patient count, modalities, and IT systems. Furthermore, the analysis revealed two main requirements for the JIP. First, enabling and supporting multicenter imaging studies was of utmost importance, including access to larger case numbers for retrospective data analyses and the facilitation of collaborations. Second, an improved integration of data processing, annotation, and sharing tools into the clinical environment was of interest, particularly ML and federated data analytics. These results were translated into the following individual aspects that should be realized in the platform.
Integrability.
The fundamental principle of the JIP is based on the local execution of analysis methods as an extension of the existing clinical infrastructure. As a result, the platform should seamlessly integrate with existing local clinical systems, and the interaction of physicians with the JIP should be as close as possible to the established clinical workflows and tools.
Data accessibility.
To achieve high compatibility with existing hospital systems, the widely established standard for Digital Imaging and Communications in Medicine (DICOM) should be used whenever possible. It should also be possible to view stored images and segmentations within the platform. Results of computations (eg, segmentations or parametric images) should be stored in DICOM to ensure high compatibility.
Algorithmic accessibility.
The platform should facilitate the sharing and distribution of algorithmic developments between research groups across different sites. This requires a versatile and efficient integration path for in-house developments into the platform, supporting arbitrary processing steps using different programming languages and input and output formats.
Data sovereignty.
Although the platform should enable joint projects within DKTK, it must put mechanisms into place that ensure full control over each site’s local data.
Data exploration.
Because algorithms are usually designed for specific types of input data, image properties such as modality, protocol, examined body parts, or patient characteristics are essential for selecting suitable training and test cases. The platform should enable users to filter appropriate data easily from existing image collections. The filters that identify a specific cohort should be shareable with partners.
Scalability.
As a service-oriented application, the platform should ensure a high level of scalability. For example, an increasing demand should be responded by spawning more instances of a service that is under high load.
Maintenance.
To ensure long-term sustainability of the platform, all platform instances in the consortium need to be kept up to date. Thus, to meet new developments and the continuous change in requirements, the platform must be easy to maintain, update, and expand. The installation and maintenance of each individual instance must be possible by nonspecialized technical staff of each site.
Platform Architecture
In response to these requirements, we designed a system that is structured into five building blocks (Fig 1). JIP SYSTEM realizes the technical basis and acts like an operating system of the platform. It takes care of provision, monitoring, and communication of services within the platform. JIP BASE consists of the more task-related components (eg, for the main user interface and authentication). JIP STORE contains components for data handling, management, and storage; JIP META contains components for metadata management, subject and image search, and selection; and JIP FLOW contains components for the controlled execution of processing sequences. Each of the functional units was realized based on open-source technologies. We have designed the system to be located within the protected hospital IT infrastructure. This allows for processing of the entire available patient data and facilitates integration into local procedures. The Data Supplement provides more detailed information about the individual components.
Because methods, technologies, and requirements in research are constantly evolving and the JIP is designed as an open platform for the community, extensibility of the platform for additional tools to explore, examine, or analyze medical data is crucial. Open interfaces, which are provided within the platform, allow a high degree of flexibility. This even extends to services that were originally not developed for use within a Web environment; the JIP offers Virtual Network Computing as an interface able to stream complete desktop applications to a Web browser. Some of the already integrated extensions (Fig 2D) are described in more detail in the Data Supplement. The seamless interaction of the JIP with other platform initiatives is also detailed in the Data Supplement.
RESULTS
In this section, exemplary and complementary use cases that are realizable within the JIP and that cover all aspects of the previously defined platform requirements are described. The successful implementation of the JIP and its capabilities are further demonstrated in an overview of the current clinical and technical site involvements.
Use Case 1: Data Harmonization and Integration
To enable comparability of study results, mutually agreeable scanner configurations or guidelines that achieve a more standardized imaging of certain patient groups are desirable in multicenter studies. The JIP supports such harmonization of data by enabling the development of algorithms that robustly handle multicenter data, including all the protocol and quality variations.
Within the DKTK consortium, the imaging protocols for diffusion-weighted magnetic resonance imaging of the brain and the prostate were standardized and validated using the JIP. The effect of the harmonization on the quantitative apparent diffusion coefficient measurements was demonstrated in comparison with canonical in-house sequences before and after standardization, exemplarily shown for prostate measurements (Fig 2A). Levene’s test revealed that there were significant differences between the variances before and after harmonization at 3 T (peripheral zone [PZ], P = .004; cyst, P = .00005) but not at 1.5 T (PZ, P = .072; cyst, P = .076).
Use Case 2: Automatic Large-Scale Radiomics Analysis
In this scenario, a large number of images are processed using fully automatic image quantification algorithms, more specifically a shape model–based organ segmentation covering segmentations for kidneys, liver, and spleen in abdominal computed tomography (CT) scans36 followed by a radiomics analysis of the resulting objects.37
Complex workflows are realized in the JIP by concatenating individual processing steps and pipelines. The generated segmentations are pushed into JIP STORE and trigger the subsequent radiomics workflow, which automatically extracts the radiomics features from the provided organ masks. All metadata of the generated DICOM-SEGs are automatically extracted and pushed into JIP META. The combination of Kubernetes and Apache Airflow allows automatic parallel execution of each pipeline instance across the configured computing cluster, reducing computation time through transparent parallelization. Figure 2B illustrates the fully automated processing workflow, which is formally defined as a directed acyclic graph with Docker containers as nodes.
Use Case 3: Interactive Analysis
This scenario demonstrates that interactive desktop applications can be shipped via containers and integrated into otherwise automatic JIP processing workflows. The desktop application qPSMA37a for the interactive quantification of whole-body tumor volume of patients with prostate cancer using prostate-specific membrane antigen–positron emission tomography/CT images, developed by the Department of Nuclear Medicine at the Technical University Munich, was integrated into such a semiautomatic workflow. In this specific workflow, time-consuming preprocessing steps are automatically performed before the manual annotation step is triggered (Fig 2C). After the expert’s manual interaction, the automatic processing pipeline continues. No software needs to be installed on the annotators’ workstations, and multiple instances can be started simultaneously, also within a workflow, allowing parallel and more complex annotation workflows that might involve multiple users.
Use Case 4: Federated Data Analysis
This use case focuses on the cross-site distribution and application of analysis tools (Fig 3). As shown in the previous use cases, once a tool is packaged in a container, it can be executed on the platform independently of its grade of automation or interactivity level, building the foundation for a federated data analysis. The Helmholtz incubator project Trustworthy Federated Data Analysis,38 based on the JIP technology, is investigating trustworthy and regulatory compliant federated computing concepts. A hurdle in federated data analysis is the heterogeneity of data and metadata across sites, even when using standards such as DICOM. To approach this challenge, there is ongoing work to semiautomatically create mappings between different local conventions to a standardized metadata format. As a proof of concept, the development of this project will be validated in a federated radiation therapy study, aiming to reproduce results that were already generated and published by traditional means (ie, by pooling the data from all sites in the context of the DKTK Radiation Oncology Group study).39
Site User Engagement and Projects
Recently, developers at local sites have started to become involved and to migrate their processing workflows into the JIP. Measures to further encourage community involvement include a detailed developer guide and documentation,40 together with an open-source release of the codebase.41 The first JIP tech workshop has recently taken place, and further events such as hackathons and workshops are planned.
Since the initial release, several projects have started to or plan to use the platform. A subset of projects is listed in Table 1. The spectrum of applications reaches from simple data management to complex application of modern ML algorithms. For example, the surgical ARMANI trial will use the platform to investigate imaging associated with different resection strategies of liver metastases. In the course of the project outlined in case 3, the lesion load in prostate cancer will be examined.
TABLE 1.
DISCUSSION
The JIP provides a unified infrastructure across radiology and nuclear medicine departments of 10 university hospitals in Germany. The platform leverages state-of-the-art industry standards for cloud computing while adhering to on-premise hosting and execution. The technology stack is typical in modern cloud systems and can also easily be deployed using one of the leading commercial providers of cloud services.
The JIP offers standardized image processing by leveraging successfully established mechanisms from other fields. For example, we implemented a strictly browser-based interaction as in Galaxy31 and decentralized execution of algorithms as it was suggested by KETOS for textual data,28 while enabling the potential of container-based federated learning as suggested by the Personal Health Train.32
The decentralized approach enables compliance with data protection rules according to the European General Data Protection Regulation (GDPR) and the Health Insurance Portability and Accountability Act (HIPAA). We provided support in the handling of data protection questions to the sites by distributing technical and organizational measures as required by the recently introduced GDPR. The decentralized data storage facilitates additional GDPR requirements such as consent management (including the handling of consent withdrawal), data transparency, and the right to rectification and erasure. Depending on the study protocol, data pseudonymization and anonymization can be designed directly into the processing workflow, facilitating the duties of the data controller as required by the GDPR. These principles naturally extend to the requirements of the HIPAA, such as central processing steps for personal health information management.
Because of regional differences, site-specific data protection concepts often represent specific challenges. For example, some clinical IT networks do not grant permanent external communication. This complicates scenarios like methods exchange or federated learning by adding manual steps and preventing automated updates. In addition, the server must be installed and maintained independently at each site. This significantly increases the need for reliability and automation of maintenance routines. We have addressed this issue on several levels, from the basic system architecture up to documentation and support measures. The first release cycles have shown that our measures were successful.
The integration of electronic health records (EHRs) will be evaluated in upcoming versions. The contained information might serve as an important pretest probability for certain medical conditions and should be available as input for ML algorithms. Additional nonimaging parameters can, for instance, be taken into account by linking to the DKTK CCP-IT.42,43 Including radiologic reports from the radiologic information system presents an additional data source and allows for an even more powerful stratification of patient subgroups.
In the context of electronic case report forms and EHRs, the important aspect of data quality has been investigated extensively.44-47 With a similar intention but with a focus on imaging, future work on the JIP will include the development of automatic AI-based data quality assessment methods based on image metadata as well as the actual image data itself.
For a clinical application of AI-based methodologies, the trustworthiness of such techniques is also a key challenge. Trust can only be generated by a deep understanding of an algorithm's decision making, which can be promoted by new techniques of explainable AI, where the JIP as a research platform located in multiple university hospitals is the ideal tool to develop and test such approaches.27
To conclude, we have established a flexible decentralized analysis platform for medical images that respects data sovereignty and protects privacy by sharing algorithms instead of data. We observed that the availability of the JIP led to an unprecedented level of communication and collaboration within the radiologic and nuclear medicine research community of the DKTK. An increasing number of clinical studies have committed to use the JIP (Table 1). In addition, several requests for further extensions have been made (eg, supporting histopathology data).
Because DKTK is not the only collaborative network that is in need of a research imaging platform, the core implementation of the JIP will be available as an open-source software project named Kaapana.48 By providing the platform and source code, we hope to mitigate the compatibility gap between systems in the heterogeneous clinical IT landscapes and lay the foundation for unprecedented research opportunities in data-driven medicine.
Appendix
TABLE A1.
EQUAL CONTRIBUTION
M.N., J.K., J.M., K.K., H-P.S., and K.M-H. contributed equally to this work.
PRIOR PRESENTATION
Presented in part at the 101st Deutscher Röntgenkongress Presentation at the 1st German Cancer Research Congress, Heidelberg, Germany, February 4-5, 2019.
SUPPORT
Supported by the German Cancer Consortium.
AUTHOR CONTRIBUTIONS
Conception and design: Jonas Scherer, Marco Nolden, Jens Kleesiek, Jasmin Metzger, Klaus Kades, Verena Schneider, Oliver Sedlaczek, Andreas M. Bucher, Thomas J. Vogl, Jens-Peter Kühn, Gerald Antoch, Konstantin Nikolaou, Wolfgang G. Kunz, Balthasar Schachtner, Jens Ricke, Alexander Radbruch, Lale Umutlu, Michael Forsting, Robert Seifert, Ken Herrmann, Philipp Mayer, Hans-Ulrich Kauczor, Winfried Brenner, Roman Kloeckner, Mathias Schreckenberger, Rickmer Braren, Georgios Kaissis, Marcus Makowski, Matthias Eiber, Andrei Gafita, Wolfgang A. Weber, Michael Bock, Fabian Bamberg, Jürgen Hennig, Tristan Kuder, Peter Neher, Ralf Floca, Heinz-Peter Schlemmer, Klaus Maier-Hein
Administrative support: Frank Grünwald, Jens-Peter Kühn, Christian la Fougère, Rickmer Braren, Jakob Neubauer, Heinz-Peter Schlemmer
Provision of study materials or patients: Jens-Peter Kühn, Lars Schimmöller, Christian la Fougère, Peter Bartenstein, Bernd Hamm, Rickmer Braren, Jürgen Hennig, Uwe Haberkorn, Heinz-Peter Schlemmer
Collection and assembly of data: Marco Nolden, Jens Kleesiek, Klaus Kades, Michael Bach, Oliver Sedlaczek, Andreas M. Bucher, Thomas J. Vogl, Frank Grünwald, Jens-Peter Kühn, Ralf-Thorsten Hoffmann, Oliver Bethge, Hans-Wilhelm Müller, Andreas Daul, Christian la Fougère, Wolfgang G. Kunz, Michael Ingrisch, Peter Bartenstein, Felix Nensa, Alexander Radbruch, Michael Forsting, Tobias Penzkofer, Bernd Hamm, Christoph Düber, Rickmer Braren, Rupert Trager, Jakob Neubauer, Jürgen Hennig, Philipp Tobias Meyer, Juri Ruf, Uwe Haberkorn, Klaus Maier-Hein
Data analysis and interpretation: Marco Nolden, Jens Kleesiek, Jasmin Metzger, Klaus Kades, Michael Bach, Andreas M. Bucher, Jens-Peter Kühn, Jörg Kotzerke, Lars Schimmöller, Christian la Fougère, Wolfgang G. Kunz, Alexander Radbruch, Lale Umutlu, Michael Forsting, Ken Herrmann, Tobias Penzkofer, Bernd Hamm, Roman Kloeckner, Rickmer Braren, Georgios Kaissis, Marco Reisert, Stefan O. Schoenberg, Peter Neher, Klaus Maier-Hein
Manuscript writing: All authors
Final approval of manuscript: All authors
Accountable for all aspects of the work: All authors
AUTHORS' DISCLOSURES OF POTENTIAL CONFLICTS OF INTEREST
The following represents disclosure information provided by authors of this manuscript. All relationships are considered compensated unless otherwise noted. Relationships are self-held unless noted. I = Immediate Family Member, Inst = My Institution. Relationships may not relate to the subject matter of this manuscript. For more information about ASCO's conflict of interest policy, please refer to www.asco.org/rwc or ascopubs.org/cci/author-center.
Open Payments is a public database containing information reported by companies about payments made to US-licensed physicians (Open Payments).
Andreas M. Bucher
Honoraria: Bayer, Guebert
Travel, Accommodations, Expenses: Bayer, Guebert
Frank Grünwald
Honoraria: Henning
Konstantin Nikolaou
Honoraria: Siemens Healthineers, Bayer Schering Pharma
Consulting or Advisory Role: Siemens Healthineers
Speakers' Bureau: Siemens Healthineers, Siemens Healthineers (Inst)
Research Funding: Bayer Schering Pharma (Inst)
Travel, Accommodations, Expenses: Siemens Healthineers, Bayer Schering Pharma
Michael Ingrisch
Stock and Other Ownership Interests: Siemens Healthineers
Jens Ricke
Honoraria: Sirtex Medical
Consulting or Advisory Role: Ipsen
Research Funding: Sirtex Medical (Inst)
Travel, Accommodations, Expenses: BTG
Alexander Radbruch
Honoraria: Bayer (Inst), Guerbet (Inst), Bayer, Guerbet, Novartis
Consulting or Advisory Role: Guerbet, Bayer
Speakers' Bureau: Bayer, Guerbet
Research Funding: Bayer (Inst), Guerbet (Inst)
Travel, Accommodations, Expenses: Bayer, Guerbet
Lale Umutlu
Consulting or Advisory Role: Siemens Healthineers
Speakers' Bureau: Siemens Healthineers, Bayer
Research Funding: Siemens Healthineers (Inst)
Travel, Accommodations, Expenses: Bayer, Siemens Healthineers
Other Relationship: Vara
Uncompensated Relationships: Vara
Ken Herrmann
Leadership: Sofie Biosciences
Stock and Other Ownership Interests: Sofie Biosciences
Consulting or Advisory Role: Novartis, Bain Capital, Bayer, Adacap, Amgen, BTG, Ipsen, ITG, ROTOP (Inst), Siemens Healthinee, GE Healthcare
Hans-Ulrich Kauczor
Speakers' Bureau: Philips Healthcare, Boehringer Ingelheim, MSD, AstraZeneca
Research Funding: Siemens Healthineers (Inst), Philips Healthcare (Inst), Bayer Schering Pharma (Inst)
Tobias Penzkofer
Research Funding: AGO (Inst), Aprea AB (Inst), ARCAGY-GINECO (Inst), Astellas Pharma Global (APGD) (Inst), AstraZeneca (Inst), Clovis Oncology (Inst), Dohme (Inst), Holaira (Inst), Incyte (Inst), Karyopharm Therapeutics (Inst), Lion Biotechnologies (Inst), MedImmune (Inst), Merck Sharp (Inst), Millennium (Inst), Morphotec (Inst), NovoCure (Inst), PharmaMar S.A. and PharmaMar USA (Inst), Roche (Inst), Siemens Healthineers (Inst), Tesaro (Inst)
Bernd Hamm
Honoraria: Bayer Schering Pharma
Consulting or Advisory Role: Canon, InnoRa GMBH
Research Funding: Abbot/AbbVie (I), Bracco Diagnostics (I), Guerbet (I), Novartis (I), GE Healthcare (I), Siemens (I)
Other Relationship: Board member of professional societies
Roman Kloeckner
Consulting or Advisory Role: Boston Scientific, Bristol Myers Squibb, Guerbet, Roche
Speakers' Bureau: BTG, Bristol Myers Squibb, Guerbet, Siemens Healthineers
Mathias Schreckenberger
Honoraria: Takeda
Matthias Eiber
Consulting or Advisory Role: Blue Earth Diagnostics, ABX Advanced Biochemical Compounds
Research Funding: Siemens, Blue Earth Diagnostics, ABX Advanced Biochemical Compounds
Patents, Royalties, Other Intellectual Property: Patent application for rhPSMA
Travel, Accommodations, Expenses: Bayer Schering Pharma
Wolfgang A. Weber
Consulting or Advisory Role: ITG, Blue Earth Diagnostics (Inst), Pentixapharm (Inst)
Research Funding: Blue Earth Diagnostics (Inst), ITG (Inst)
Patents, Royalties, Other Intellectual Property: Patent for PARP imaging agent
Travel, Accommodations, Expenses: Blue Earth Diagnostics, Pentixapharm
Michael Bock
Research Funding: Siemens Healthineers
Fabian Bamberg
Speakers' Bureau: Siemens Healthineers, Bracco Diagnostics, Bayer Health
Research Funding: Siemens Healthineers (Inst), Bayer Health (Inst)
Philipp Tobias Meyer
Consulting or Advisory Role: GE Healthcare, OPASCA
Juri Ruf
Employment: Gilead Sciences (I)
Uwe Haberkorn
Patents, Royalties, Other Intellectual Property: Patent for FAP inhibitors for nuclear medicine imaging and therapy (Inst)
Stefan O. Schoenberg
Other Relationship: Siemens Healthineers (Inst)
Tristan Kuder
Patents, Royalties, Other Intellectual Property: Patents for magnetic resonance diffusion-weighted imaging phantoms.
Uncompensated Relationships: HQ Imaging GmbH
Heinz-Peter Schlemmer
Honoraria: Bayer/Vital, Siemens Healthineers, Bracco Diagnostics, Curagita
Consulting or Advisory Role: Siemens Healthineers, Bracco Diagnostics
Research Funding: Siemens Healthineers (Inst), Profound (Inst)
Travel, Accommodations, Expenses: Siemens Healthineers, Bayer/Vital, Curagita, Bracco Diagnostics
Klaus Maier-Hein
Research Funding: Siemens Healthineers
No other potential conflicts of interest were reported.
REFERENCES
- 1.Savadjiev P, Chong J, Dohan A, et al. Image-based biomarkers for solid tumor quantification. Eur Radiol. 2019;29:5431–5440. doi: 10.1007/s00330-019-06169-w. [DOI] [PubMed] [Google Scholar]
- 2.Parmar C, Grossmann P, Bussink J, et al. Machine learning methods for quantitative radiomic biomarkers. Sci Rep. 2015;5:13087. doi: 10.1038/srep13087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Kurtz DM, Esfahani MS, Scherer F, et al. Dynamic risk profiling using serial tumor biomarkers for personalized outcome prediction. Cell. 2019;178:699–713.e19. doi: 10.1016/j.cell.2019.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kickingereder P, Isensee F, Tursunova I, et al. Automated quantitative tumour response assessment of MRI in neuro-oncology with artificial neural networks: A multicentre, retrospective study. Lancet Oncol. 2019;20:728–740. doi: 10.1016/S1470-2045(19)30098-1. [DOI] [PubMed] [Google Scholar]
- 5.Aerts HJWL. The potential of radiomic-based phenotyping in precision medicine: A review. JAMA Oncol. 2016;2:1636–1642. doi: 10.1001/jamaoncol.2016.2631. [DOI] [PubMed] [Google Scholar]
- 6.Amin S, Bathe OF. Response biomarkers: Re-envisioning the approach to tailoring drug therapy for cancer. BMC Cancer. 2016;16:850. doi: 10.1186/s12885-016-2886-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Harry VN, Semple SI, Parkin DE, et al. Use of new imaging techniques to predict tumour response to therapy. Lancet Oncol. 2010;11:92–102. doi: 10.1016/S1470-2045(09)70190-1. [DOI] [PubMed] [Google Scholar]
- 8.Gillies RJ, Kinahan PE, Hricak H. Radiomics: Images are more than pictures, they are data. Radiology. 2016;278:563–577. doi: 10.1148/radiol.2015151169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Rocher L, Hendrickx JM, de Montjoye Y-A. Estimating the success of re-identifications in incomplete datasets using generative models. Nat Commun. 2019;10:3069. doi: 10.1038/s41467-019-10933-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Bartling S, Friesike S (eds): Challenges of open data in medical research, in Opening Science: The Evolving Guide on How the Internet is Changing Research, Collaboration and Scholarly Publishing. Cham, Switzerland, Springer International Publishing, 2014, pp 297-307. [Google Scholar]
- 11. Ravindra V, Grama A: De-anonymization attacks on neuroimaging datasets. http://arxiv.org/abs/1908.03260.
- 12.van Panhuis WG, Paul P, Emerson C, et al. A systematic review of barriers to data sharing in public health. BMC Public Health. 2014;14:1144. doi: 10.1186/1471-2458-14-1144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Topol EJ. High-performance medicine: The convergence of human and artificial intelligence. Nat Med. 2019;25:44–56. doi: 10.1038/s41591-018-0300-7. [DOI] [PubMed] [Google Scholar]
- 14. Rajpurkar P, Irvin J, Zhu K, et al: CheXNet: Radiologist-level pneumonia detection on chest x-rays with deep learning. http://arxiv.org/abs/1711.05225.
- 15. Gale W, Oakden-Rayner L, Carneiro G, et al: Detecting hip fractures with radiologist-level performance using deep neural networks. http://arxiv.org/abs/1711.06504.
- 16.Schelb P, Kohl S, Radtke JP, et al. Classification of cancer at prostate MRI: Deep learning versus clinical PI-RADS assessment. Radiology. 2019;293:607–617. doi: 10.1148/radiol.2019190938. [DOI] [PubMed] [Google Scholar]
- 17.Esteva A, Kuprel B, Novoa RA, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;542:115–118. doi: 10.1038/nature21056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kuo W, Hӓne C, Mukherjee P, et al. Expert-level detection of acute intracranial hemorrhage on head computed tomography using deep learning. Proc Natl Acad Sci USA. 2019;116:22737–22745. doi: 10.1073/pnas.1908021116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hekler A, Utikal JS, Enk AH, et al. Deep learning outperformed 11 pathologists in the classification of histopathological melanoma images. Eur J Cancer. 2019;118:91–96. doi: 10.1016/j.ejca.2019.06.012. [DOI] [PubMed] [Google Scholar]
- 20.Campanella G, Hanna MG, Geneslaw L, et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat Med. 2019;25:1301–1309. doi: 10.1038/s41591-019-0508-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ardila D, Kiraly AP, Bharadwaj S, et al. End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat Med. 2019;25:954–961. doi: 10.1038/s41591-019-0447-x. [DOI] [PubMed] [Google Scholar]
- 22. doi: 10.1016/j.cell.2018.03.040. Christiansen EM, Yang SJ, Ando DM, et al: In silico labeling: Predicting fluorescent labels in unlabeled images. Cell 173:792-803.e19, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tomašev N, Glorot X, Rae JW, et al. A clinically applicable approach to continuous prediction of future acute kidney injury. Nature. 2019;572:116–119. doi: 10.1038/s41586-019-1390-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kleesiek J, Morshuis JN, Isensee F, et al. Can virtual contrast enhancement in brain MRI replace gadolinium? A feasibility study. Invest Radiol. 2019;54:653–660. doi: 10.1097/RLI.0000000000000583. [DOI] [PubMed] [Google Scholar]
- 25.Poplin R, Varadarajan AV, Blumer K, et al. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat Biomed Eng. 2018;2:158–164. doi: 10.1038/s41551-018-0195-0. [DOI] [PubMed] [Google Scholar]
- 26.Lu MT, Ivanov A, Mayrhofer T, et al. Deep learning to assess long-term mortality from chest radiographs. JAMA Netw Open. 2019;2:e197416. doi: 10.1001/jamanetworkopen.2019.7416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Towards trustable machine learning. Nat Biomed Eng. 2018;2:709–710. doi: 10.1038/s41551-018-0315-x. [DOI] [PubMed] [Google Scholar]
- 28.Gruendner J, Schwachhofer T, Sippl P, et al. KETOS: Clinical decision support and machine learning as a service—A training and deployment platform based on Docker, OMOP-CDM, and FHIR Web Services. PLoS One. 2019;14:e0223010. doi: 10.1371/journal.pone.0223010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Gaye A, Marcon Y, Isaeva J, et al. DataSHIELD: Taking the analysis to the data, not the data to the analysis. Int J Epidemiol. 2014;43:1929–1944. doi: 10.1093/ije/dyu188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Wilson RC, Butters OW, Avraam D, et al. DataSHIELD: New directions and dimensions. Data Sci J. 2017;16:21. [Google Scholar]
- 31.Goecks J, Nekrutenko A, Taylor J. Galaxy: A comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010;11:R86. doi: 10.1186/gb-2010-11-8-r86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.van Soest J, Sun C, Mussmann O, et al. Using the personal health train for automated and privacy-preserving analytics on vertically partitioned data. Stud Health Technol Inform. 2018;247:581–585. [PubMed] [Google Scholar]
- 33.Sharma A, Tarbox L, Kurc T, et al. PRISM: A platform for imaging in precision medicine. JCO Clin Cancer Inform. 2020;4:491–499. doi: 10.1200/CCI.20.00001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Clark K, Vendt B, Smith K, et al. The Cancer Imaging Archive (TCIA): Maintaining and operating a public information repository. J Digit Imaging. 2013;26:1045–1057. doi: 10.1007/s10278-013-9622-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Joos S, Nettelbeck DM, Reil-Held A, et al. German Cancer Consortium (DKTK): A national consortium for translational cancer research. Mol Oncol. 2019;13:535–542. doi: 10.1002/1878-0261.12430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Norajitra T, Maier-Hein KH. 3D statistical shape models incorporating landmark-wise random regression forests for omni-directional landmark detection. IEEE Trans Med Imaging. 2017;36:155–168. doi: 10.1109/TMI.2016.2600502. [DOI] [PubMed] [Google Scholar]
- 37.Götz M, Nolden M, Maier-Hein K. MITK phenotyping: An open-source toolchain for image-based personalized medicine with radiomics. Radiother Oncol. 2019;131:108–111. doi: 10.1016/j.radonc.2018.11.021. [DOI] [PubMed] [Google Scholar]
- 37a.Gafita A, Bieth M, Krönke M, et al. qPSMA: Semiautomatic software for whole-body tumor burden assessment in prostate cancer using 68Ga-PSMA11 PET/CT. J Nucl Med Off Publ Soc Nucl Med. 2019;60(9):1277–1283. doi: 10.2967/jnumed.118.224055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Helmholtz Center for Information Security, German Cancer Research Center: Trustworthy Federated Data Analytics (TFDA). https://tfda.hmsp.center.
- 39.Lohaus F, Linge A, Tinhofer I, et al. HPV16 DNA status is a strong prognosticator of loco-regional control after postoperative radiochemotherapy of locally advanced oropharyngeal carcinoma: Results from a multicentre explorative study of the German Cancer Consortium Radiation Oncology Group (DKTK-ROG) Radiother Oncol. 2014;113:317–323. doi: 10.1016/j.radonc.2014.11.011. [DOI] [PubMed] [Google Scholar]
- 40. German Cancer Consortium: Joint Imaging Platform. https://jip.dktk.dkfz.de/jiphomepage/
- 41. Github: Kaapana repository. https://github.com/kaapana/kaapana.
- 42. Lablans M: Konzept der CCP-IT des DKTK. https://dktk.dkfz.de/application/files/1814/6235/8457/Konzept_CCP-IT.pdf.
- 43.Lablans M, Schmidt EE, Ückert F. An architecture for translational cancer research as exemplified by the German Cancer Consortium. JCO Clin Cancer Inform. 2018;2:1–8. doi: 10.1200/CCI.17.00062. [DOI] [PubMed] [Google Scholar]
- 44.Zaccaria GM, Ferrero S, Rosati S, et al. Applying data warehousing to a phase III clinical trial from the Fondazione Italiana Linfomi Ensures superior data quality and improved assessment of clinical outcomes. JCO Clin Cancer Inform. 2019;3:1–15. doi: 10.1200/CCI.19.00049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Weiskopf NG, Bakken S, Hripcsak G, et al. A data quality assessment guideline for electronic health record data reuse. EGEMS (Wash DC) 2017;5:14. doi: 10.5334/egems.218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Terry AL, Stewart M, Cejic S, et al. A basic model for assessing primary health care electronic medical record data quality. BMC Med Inform Decis Mak. 2019;19:30. doi: 10.1186/s12911-019-0740-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. doi: 10.13063/2327-9214.1244. Kahn MG, Callahan TJ, Barnard J, et al: A harmonized data quality assessment terminology and framework for the secondary use of electronic health record data. EGEMS (Wash DC) 4:1244, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Kaapana. https://kaapana.ai/
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
To achieve high compatibility with existing hospital systems, the widely established standard for Digital Imaging and Communications in Medicine (DICOM) should be used whenever possible. It should also be possible to view stored images and segmentations within the platform. Results of computations (eg, segmentations or parametric images) should be stored in DICOM to ensure high compatibility.