Abstract
Background
Real-world data (RWD) are increasingly used in health research and regulatory decision-making to assess the effectiveness, safety, and value of interventions in routine care. However, the heterogeneity of European health care systems, data capture methods, coding standards, and governance structures poses challenges for generating robust and reproducible real-world evidence. The European Health Data & Evidence Network (EHDEN) was established to address these challenges by building a large-scale federated data infrastructure that harmonizes RWD across Europe.
Objective
This study aims to describe the composition and characteristics of the databases harmonized within EHDEN as of September 2024. We seek to provide transparency regarding the types of RWD available and their potential to support collaborative research and regulatory use.
Methods
EHDEN recruited data partners through structured open calls. Selected data partners received funding and technical support to harmonize their data to the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM), with assistance from certified small-to-medium enterprises trained through the EHDEN Academy. Each data source underwent an extract-transform-load process and data quality assessment using the data quality dashboard. Metadata—including country, care setting, capture method, and population criteria—were compiled in the publicly accessible EHDEN Portal.
Results
As of September 1, 2024, the EHDEN Portal includes 210 harmonized data sources from 30 countries. The highest representation comes from Italy (13%), Great Britain (12.5%), and Spain (11.5%). The mean number of persons per data source is 2,147,161, with a median of 457,664 individuals. Regarding care setting, 46.7% (n=98) of data sources reflect data exclusively from secondary care, 42.4% (n=89) from mixed care settings (both primary and secondary), and 11% (n=23) from primary care only. In terms of population inclusion criteria, 55.7% (n=117) of data sources include individuals based on health care encounters, 32.9% (n=69) through disease-specific data collection, and 11.4% (n=24) via population-based sources. Data capture methods also vary, with electronic health records (EHRs) being the most common. A total of 74.7% (n=157) of data sources use EHRs, and more than half of those (n=85) rely on EHRs as their sole method of data collection. Laboratory data are used in 29.5% (n=62) of data sources, although only one relies exclusively on laboratory data. Most laboratory-based data sources combine this method with other forms of data capture.
Conclusions
EHDEN is the largest federated health data network in Europe, enabling standardized, General Data Protection Regulation–compliant analysis of RWD across diverse care settings and populations. This descriptive summary of the network’s data sources enhances transparency and supports broader efforts to scale federated research. These findings demonstrate EHDEN’s potential to enable collaborative studies and generate trusted evidence for public health and regulatory purposes.
Introduction
Real-world data (RWD) has become a cornerstone in health care research, especially in regulatory science, due to its ability to capture insights from diverse patient populations and clinical settings. Unlike data generated through traditional randomized controlled trials, which often have stringent inclusion criteria, RWD reflects the everyday health care experiences of a broader patient base [1-5]. This breadth offers a richer context for understanding drug safety and effectiveness, guiding postauthorization safety monitoring, informing risk-benefit evaluations, and supporting regulatory decisions [6]. Regulators, industry, and academics alike rely on real-world evidence (RWE) derived from RWD to answer critical questions about health care interventions in clinical care settings that are more representative of routine practice [7-9].
Europe’s health care landscape presents both challenges and opportunities for generating RWD [10]. Its diversity spans many different health systems, terminology systems, and data collection practices, with variability in health care delivery and data availability across countries. This heterogeneity complicates large-scale representative research but also offers a unique opportunity to study diverse populations [8,11-13,undefined,undefined]. However, capturing this potential requires overcoming technical, operational, and methodological barriers to ensure data harmonization and quality. A federated network is particularly well suited to Europe’s fragmented health care landscape, where legal, linguistic, and governance diversity necessitate a model that supports local control while enabling cross-border collaboration.
Federated data networks, like the European Health Data & Evidence Network (EHDEN), are well-suited for Europe’s decentralized data landscape [14-19]. In the context of EHDEN, a federated network refers to a collaboration of independently governed data sources that retain full control of their data locally, preserving the autonomy and governance policies of individual data holders. This approach allows for multidatabase studies across diverse populations without requiring centralized data access or query execution, ensuring that personal health information does not leave its original source. By design, this method complies with the General Data Protection Regulation, as it avoids centralizing or transferring personal data and supports the principles of data minimization, purpose limitation, and local control. It is important to note that privacy-preserving practices are also in place for studies conducted using the network. Data partners (DPs) only share aggregate results, typically high-level outputs such as hazard ratios, after applying a minimum cell size threshold (k-anonymity, commonly set to 5) to suppress potentially re-identifiable results.
EHDEN was established as an Innovative Medicines Initiative (IMI), now Innovative Health Initiative, public-private partnership in November 2018 to overcome the challenges and transform how health data is used in Europe [20,21]. The project built a federated data network that standardizes health data across participating sources, making data analysis more feasible and consistent. By harmonizing data and implementing quality assurance protocols, EHDEN enhances the usability and comparability of RWD across Europe. This paper provides an overview of the EHDEN network, examining its data harmonization efforts, quality control processes, and the range of data sources included in the network. Through this discussion, we aim to highlight the scope of RWD available across Europe and its potential for advancing health care research and regulatory decision-making.
Methods
Common Data Model
As the foundation for its network, EHDEN adopted the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) [22-24]. The OMOP CDM is widely recognized for its “structure + content” approach whereby the tables and fields (structure) as well as the vocabulary (content) are standardized, allowing for integration of data across multiple systems while maintaining data integrity. The model also supports a wide range of data types, including electronic health records (EHRs), claims data, and patient registries.
The OMOP CDM is maintained by the Observational Health Data Science and Informatics (OHDSI) community, an open science effort that aims to improve health by empowering a community to collaboratively generate the evidence that promotes better health decisions and better care [25]. The open-source nature of OHDSI allows for continuous community-driven improvements, making it adaptable to emerging health care needs [26-28].
Data Partner Calls
Any organization with access to a data source in Europe could apply to be included in the EHDEN network. In this context, a data source is defined as a distinct repository of health care-related data pertaining to a specific set of individuals. Except for the COVID-19 Rapid Collaboration Call, the 7 DP calls executed between September 2019 and October 2022 were aligned to similar timelines for DP identification, grant awarding, and initiation of data harmonization (Figure 1). In each call, candidate partner organizations with access to one or more electronic health care databases applied to the EHDEN Harmonization Fund for a grant to implement or enhance their database (Multimedia Appendix 1). DPs were selected based on 3 criteria: data impact (size, coverage, quality), network impact (track record, uniqueness within network), and readiness (willingness to participate, governance) (Multimedia Appendix 2), reviewed by a Data Source Prioritization Committee. Each application was reviewed and scored by 2 reviewers, and the top applicants per round were awarded a grant.
Figure 1. Timeline of data call 7 (2022‐2023) within the European Health Data & Evidence Network. The figure illustrates key milestones in the selection, contracting, and data harmonization process for data partners onboarded during this call. SME: small-to-medium enterprise.
Data Standardization
Once DPs were identified and grants awarded, each data source underwent standardization to the OMOP CDM. A crucial factor in EHDEN’s long-term sustainability and success was the recruitment and training of local small-to-medium enterprises (SMEs). These SMEs were brought on board through separate calls from the EHDEN consortium and certified via the EHDEN Academy education program concluded by an onsite or web-based training. SMEs played a pivotal role in supporting DPs throughout the extract, transform, and load (ETL) process by providing guidance and expertise. In total, 64 SMEs across 22 countries were certified by EHDEN to support DPs.
The ETL process followed by the DPs and supported by the SMEs was largely uniform, as outlined by Voss et al [24], and involved four key steps: (1) summarizing the native data, (2) creating the ETL specification, (3) mapping source vocabulary codes, and (4) implementing the ETL. This standardized approach ensured transparency in the followed procedure and adherence to the conventions in converting data sources to the CDM, while also allowing DPs to benefit from the SMEs’ specialized knowledge.
To promote semantic interoperability across heterogeneous health care systems, all source codes for diagnoses, medications, procedures, and measurements were mapped to standardized vocabularies (eg, SNOMED CT [Systematized Nomenclature of Medicine – Clinical Terms], RxNorm [Prescription Normalized Names], LOINC [Logical Observation Identifiers Names and Codes]) as required by the OMOP CDM. For example, the UK Biobank contributed data that included SNOMED CT-coded diagnoses from EHRs alongside custom-coded fields for self-reported conditions and blood pressure measurements. Mapping these nonstandard elements involved a combination of automated matching and manual curation followed by expert validation. This process facilitated consistent interpretation of clinical concepts across countries and enhanced the analytical interoperability of the network [29].
Payments were structured based on output; to receive full funding, DPs were required to meet 3 different milestones. The ETL specification document entitled DPs to 30%, ETL implementation and infrastructure released the next 40%, and the final 30% was received by the DP after final inspection of the harmonized data (Multimedia Appendix 3).
Data Quality
Each milestone was reviewed by an EHDEN consortium member who was part of the Milestone review committee. The ETL specification document required by milestone 1 was evaluated to ensure the mapping adhered to the OMOP CDM conventions and that the DPs or SMEs had a good understanding of the CDM and their own native data [23,30]. Milestone 2, the ETL implementation, had multiple review steps. The infrastructure was investigated to be sure the DPs were using a supported database platform [31]. The vocabulary mapping was evaluated to ensure most, if not all, source codes were included. The data quality dashboard (DQD) was developed by EHDEN Work Package 5 to provide a standard structure for quality assurance [32]. It was used by DPs throughout the ETL process to continually improve the standardized data sources [33]. As described by Voss et al [24], the median number of times a DP ran the DQD was 3 (IQR 2‐7). In general, conformance issues to the OMOP CDM were identified and addressed in initial runs, with more complex site-specific vocabulary mapping issues addressed in subsequent runs. DPs used the default failure thresholds in the first run. These were updated to reflect the nuances of each data source in later runs of the software. In milestone 3, final DQD results as well as the CDM Inspection Report were reviewed [34]. Once approved, the DP then entered their information into the EHDEN portal, an online platform open to the public designed to catalog metadata on each data source [35].
Analyses
An individual data source was considered one entry in the EHDEN portal. Data sources were categorized based on country, person count, the levels of care represented (primary, secondary, or mixed), why a person was included, and how data were captured. These categories were ascertained from the data source description and metadata provided to the EHDEN portal and verified with the DPs.
There are 3 reasons persons could be included in an EHDEN data source (person inclusion), as defined by population, where a person enters the data source because they live in a certain geographical location or because they are registered with a practice or insurer; encounter, where a person enters the data source upon a visit to a health care provider for any medical reason; or disease, where a person enters the data source when satisfying specific criteria (ie, a person has a specific medical condition). The most restrictive reason was chosen as the classification for each data source.
We identified which types of data capture methods each source contained; it could be one or more of the following: EHR, a bill or adjudicated claim record for health services rendered (claim), measurements taken and results recorded (laboratory), a set of required information collected about participants in a registry (case report form), patient-reported data (survey), documents analyzed by pulling structured data from unstructured data using a natural language processing algorithm, or death information from an official source or government entity (death certificate). If the data source did not provide this information, they were categorized as unknown.
Ethical Considerations
Patients or the public were not involved in the design, conduct, reporting, or dissemination plans of this research.
Results
As of September 1, 2024, there are 210 data sources in the EHDEN portal. Figure 2 shows all countries and the number of data sources available in each. The data sources span 30 countries, with the largest representation from Italy, Great Britain, and Spain, with 13%, 12.5%, and 11.5% of the total data sources in the network, respectively. The mean number of persons per data source is 2,147,161 and the median number of persons is 457,664.
Figure 2. Geographic distribution and characteristics of data sources included in the European Health Data & Evidence Network as of September 1, 2024. The map displays country-level data density based on the number of data sources relative to national population size. Overlaid symbols represent key metadata for each source, including total person count, care setting (primary, secondary, or mixed), data capture methods (eg, electronic health records, laboratory, or claims), and the reason for person inclusion (eg, encounter-based, disease-specific, or population-based).
Table 1 provides the complete list of data sources and their attributes. One row in the table equates to 1 data source. The first column lists the DP, which is the name of the institution or organization that is the custodian of the data source. The individual data sources are identified by an acronym, which is also how they are identified in the EHDEN portal. Country of origin is represented by the 2-digit country code. The number of persons, the person inclusion method, and care level are also provided. Each data capture category has its own column in the table. If a data source uses one of the capture methods, that box is filled with a check mark symbol in the table.
Table 1. Overview of 210 standardized real-world data sources in the European Health Data & Evidence Network as of September 1, 2024. The table includes the full list of data sources by country, total number of persons represented, person inclusion, care level, and data capture methods. These attributes provide essential context for understanding the scope and scale of data available for real-world evidence generation within the European Health Data & Evidence Network.
| Data partner | Data source acronym | Country | Person count | Person inclusion | Care level | EHRa | Claim | Lab | Case report form | Survey | NLPb | Death certificate | Unknown |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Centro Clínico Académico – Braga, Associação (2CA-Braga) | 2CA-Braga | Portugal | 10,70,217 | Encounter | Secondary | ✓ | |||||||
| INCLIVA | ABUCASIS | Spain | 40,14,819 | Encounter | Mixed | ✓ | ✓ | ✓ | |||||
| The wellbeing services county of Southwest Finland, VarHa | ACI | Finland | 7,65,000 | Encounter | Secondary | ✓ | ✓ | ||||||
| Innovative Medical Research SA | ADWH IMR | Greece | 6,00,000 | Encounter | Primary | ✓ | |||||||
| Fondazione Casa Sollievo della Sofferenza | aGMS | Italy | 2140 | Disease | Secondary | ✓ | |||||||
| Akrivia Health | AKRDB | Great Britain | 30,85,560 | Encounter | Secondary | ✓ | |||||||
| Amsterdam UMC | AmsterdamUMCdb | Netherlands | 20,109 | Encounter | Secondary | ✓ | ✓ | ||||||
| Stichting VUmc | AMYPAD PNHS | Netherlands | 3368 | Disease | Mixed | ✓ | |||||||
| AZIENDA OSPEDALIERO UNIVERSITARIA SAN LUIGI GONZAGA | AOU-SANLUIGI | Italy | 1,84,520 | Encounter | Secondary | ✓ | ✓ | ||||||
| University Hospital of Parma | AOUPR | Italy | 5,73,205 | Encounter | Secondary | ✓ | ✓ | ||||||
| Azienda Ospedaliera Universitaria Integrata Verona | AOVR | Italy | 5,12,000 | Encounter | Secondary | ✓ | ✓ | ✓ | |||||
| Assistance Publique - Hopitaux de Marseille | AP-HM | France | 27,92,497 | Encounter | Mixed | ✓ | |||||||
| APDP | APDP | Portugal | 2,42,000 | Encounter | Secondary | ✓ | |||||||
| Azienda Ospedaliero-Universitaria di Modena | APUM | Italy | 3272 | Disease | Mixed | ✓ | ✓ | ✓ | |||||
| Servei Català de la Salut | AQUAS - CatSalut CMBD | Spain | 68,81,752 | Encounter | Secondary | ✓ | |||||||
| FONDAZIONE TOSCANA GABRIELE MONASTERIO PER LA RICERCA MEDICA E DI SANITA PUBBLICA (FTGM) | ARCA | Italy | 4,64,194 | Encounter | Mixed | ✓ | ✓ | ||||||
| ASL Roma 1 | ASL Roma 1 | Italy | 11,98,036 | Encounter | Mixed | ✓ | |||||||
| Assuta medical centers | AssutaSurgical | Israel | 7,76,538 | Encounter | Secondary | ✓ | ✓ | ||||||
| ATS Bergamo | ATS-BG | Italy | 13,00,000 | Population | Mixed | ✓ | ✓ | ||||||
| Institute of Rheumatology | ATTRA | Czech Republic | 8006 | Disease | Mixed | ✓ | ✓ | ||||||
| Marco Massari (IRCSSE) | AUSL-RE | Italy | 43,564 | Disease | Mixed | ✓ | ✓ | ||||||
| Az Oostende | AZ Oostende | Belgium | 3,71,097 | Encounter | Secondary | ✓ | |||||||
| AZ Delta | AZDDB | Belgium | 9,90,559 | Encounter | Secondary | ✓ | ✓ | ✓ | ✓ | ||||
| VZW AZ Groeninge | AZG | Belgium | 18,571 | Encounter | Secondary | ✓ | ✓ | ||||||
| AZ Klina | AZK | Belgium | 5,06,770 | Encounter | Secondary | ✓ | |||||||
| AZ Maria Middelares | AZMM | Belgium | 95,341 | Encounter | Secondary | ✓ | |||||||
| Servicio Navarro de Salud Osasunbidea (SNS-O) | BARDENA | Spain | 19,72,272 | Encounter | Mixed | ✓ | |||||||
| Barts Health NHS Trust | Barts | Great Britain | 23,12,983 | Encounter | Secondary | ✓ | |||||||
| National Scientific Program “E-Health in Bulgaria” | BDR | Bulgaria | 5,01,065 | Disease | Mixed | ✓ | ✓ | ||||||
| Agencia Española de Medicamentos y Productos Sanitarios, AEMPS | BIFAP | Spain | ######## | Population | Primary | ✓ | |||||||
| Instituto Aragonés de Ciencias de la Salud (IACS) | BIGAN | Spain | 22,91,148 | Encounter | Mixed | ✓ | |||||||
| Instituto de Medicina Molecular | Biobank_iMM_Reuma | Portugal | 592 | Disease | Mixed | ✓ | ✓ | ||||||
| Bnai Zion Medical Research Foundation and Infrastructure Development Health Services | BZMC | Israel | 10,68,599 | Encounter | Secondary | ✓ | ✓ | ✓ | |||||
| Inspire-srl | CasertaDB | Italy | 13,02,318 | Population | Mixed | ✓ | ✓ | ||||||
| Connected Bradford | cBradford | Great Britain | 12,00,677 | Encounter | Mixed | ✓ | |||||||
| Clinical Center of Montenegro | CCME | Montenegro | 2,02,322 | Encounter | Secondary | ✓ | ✓ | ||||||
| Casa di Cura Privata del Policlinico (CCPP) | CCPP | Italy | 16,218 | Encounter | Mixed | ✓ | ✓ | ||||||
| ISMETT | cdm_ismett | Italy | 24,269 | Encounter | Secondary | ✓ | |||||||
| Bordeaux University Hospital | CDWbordeaux | France | 19,85,011 | Encounter | Secondary | ✓ | ✓ | ✓ | |||||
| Charité - Universitätsmedizin | CHA-CAN | Germany | 2,14,443 | Encounter | Secondary | ✓ | |||||||
| Charité - Universitätsmedizin | CHA-DIA | Germany | 60,138 | Encounter | Secondary | ✓ | ✓ | ✓ | |||||
| Charité - Universitätsmedizin | CHA-IBD | Germany | 2471 | Encounter | Secondary | ✓ | |||||||
| Institute of Social and Preventive Medicine, University of Bern | ChCR and SCCSS | Switzerland | 12,000 | Disease | Mixed | ✓ | ✓ | ||||||
| Clinical-hospital center Zvezdara | CHCZ | Serbia | 5,15,000 | Encounter | Secondary | ✓ | ✓ | ||||||
| Clinical Hospital Dubrava | CHDubrava–IN2 | Croatia | 3,11,754 | Encounter | Secondary | ✓ | |||||||
| Centro Hospitalar Universitário de Coimbra (CHUC) | CHUC Ophtalmology | Portugal | 31,507 | Encounter | Secondary | ✓ | |||||||
| Center Hospitalier Universitaire de Toulouse | CHUT | France | 30,59,340 | Encounter | Secondary | ✓ | |||||||
| Modena Oncology Center - Azienda Ospedaliera Modena | COMNet | Italy | 89,300 | Encounter | Secondary | ✓ | |||||||
| Clinical Practice Research Datalink (CPRD) | CPRD AURUM | Great Britain | ######## | Population | Primary | ✓ | |||||||
| Clinical Practice Research Datalink (CPRD) | CPRD HESAPC AURUM | Great Britain | ######## | Population | Mixed | ✓ | |||||||
| The Norwegian Cancer Registry | CRN | Norway | 11,56,806 | Disease | Secondary | ✓ | ✓ | ||||||
| Basilicata Cancer Registry | CROB | Italy | 54,265 | Disease | Primary | ✓ | |||||||
| Krebsregister Rheinland-Pfalz | CRRLP | Germany | 2,16,174 | Disease | Mixed | ✓ | |||||||
| CUF | CUF_CRC | Portugal | 1485 | Disease | Secondary | ✓ | |||||||
| DataLoch | DataLoch | Great Britain | 4,14,038 | Disease | Secondary | ✓ | ✓ | ✓ | |||||
| Center for Surgical Science (CSS) | DCCG | Denmark | 76,849 | Disease | Secondary | ✓ | |||||||
| Amsterdam UMC | DDW | Netherlands | 1834 | Disease | Secondary | ✓ | |||||||
| Stockholm CREAtinine Measurements Project | DH-SCREAM | Sweden | 30,85,764 | Encounter | Mixed | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| University of Southern Denmark | DHCR | Denmark | 99,930 | Population | Secondary | ||||||||
| DIGITAL HEALTH SOLUTIONS SA | DHS BIO | Greece | 21,02,509 | Encounter | Primary | ✓ | ✓ | ||||||
| German Cancer Society (DKG) | DKG EDIUM | Germany | 8680 | Disease | Mixed | ✓ | ✓ | ||||||
| German Cancer Society (DKG) | DKG PCO | Germany | 49,300 | Disease | Mixed | ✓ | ✓ | ||||||
| Hospital de Denia | DptoSalud-DENIA | Spain | 3,56,723 | Encounter | Mixed | ✓ | |||||||
| University of Ulm, ZIBMT | DPV | Germany | 6,38,031 | Disease | Mixed | ✓ | |||||||
| Research Institute - Hospital de la Santa Creu i Sant Pau | DW HSCSP | Spain | 13,15,128 | Encounter | Mixed | ✓ | ✓ | ||||||
| Primary Healthcare Center Zemun | DZ Zemun | Serbia | 3,55,000 | Population | Primary | ✓ | ✓ | ||||||
| EBMT: The European Society for Blood and Marrow Transplantation | EBMT | Netherlands | 8,99,425 | Disease | Secondary | ✓ | |||||||
| European Clinical Research Alliance on Infectious Diseases (ECRAID) | ECRAID-Base POS VAP | Netherlands | 563 | Disease | Secondary | ✓ | |||||||
| Center Hospitalier Universitaire de Montpellier | eDOL Entrepôt de DOnnées du Languedoc | France | 19,30,844 | Encounter | Secondary | ✓ | |||||||
| Fondazione IRCCS Policlinico San Matteo | ELISA | Italy | 4,37,482 | Encounter | Mixed | ✓ | ✓ | ✓ | |||||
| EGAS MONIZ HEALTH ALLIANCE | EMHA ULSEDV | Portugal | 563 | Encounter | Secondary | ✓ | |||||||
| EGAS MONIZ HEALTH ALLIANCE | EMHA ULSGE | Portugal | 728 | Encounter | Secondary | ✓ | |||||||
| EGAS MONIZ HEALTH ALLIANCE | EMHA ULSRA | Portugal | 5,14,000 | Encounter | Secondary | ✓ | |||||||
| European Rare Kidney Disease Registry (ERKReg) | ERKReg | Germany | 17,079 | Disease | Mixed | ✓ | |||||||
| University of Tartu | Estonian Biobank | Estonia | 2,02,102 | Population | Mixed | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| FIIBAP | FIIBAP-COVID19 | Spain | 3,38,303 | Encounter | Primary | ✓ | |||||||
| Fondazione IRCCS Istituto Neurologico Carlo Besta | FINCB - Dataset | Italy | 1,32,408 | Encounter | Mixed | ✓ | ✓ | ✓ | |||||
| Fondazione IRCCS Istituto Neurologico Carlo Besta FINCB | FINCB-COVID19 | Italy | 766 | Disease | Secondary | ✓ | |||||||
| FinRegistry (Institute of Molecular Medicine Finland (FIMM), University of Helsinki) | FinRegistry | Finland | 53,43,204 | Population | Mixed | ✓ | |||||||
| Queen Mary University of London | FLS | Great Britain | 27,000 | Disease | Mixed | ✓ | |||||||
| Fondazione Poliambulanza Istituto Ospedaliero | FPIO | Italy | 23,116 | Disease | Secondary | ✓ | ✓ | ||||||
| Geneva Cancer Registry | GCR | Switzerland | 1,48,929 | Disease | Mixed | ✓ | ✓ | ✓ | |||||
| Telavi Regional Hospital | GE Telavi | Georgia | 41,059 | Encounter | Secondary | ✓ | ✓ | ||||||
| GENERAL HOSPITAL OF KAVALA | GHK | Greece | 1,83,024 | Encounter | Secondary | ✓ | |||||||
| MS Forschungs- und Projektentwicklungs-GmbH | GMSR | Germany | 82,300 | Disease | Mixed | ✓ | ✓ | ||||||
| Grande Ospedale Metropolitano “Bianchi-Melacrino-Morelli” | GOM-RC | Italy | 1,99,645 | Population | Mixed | ✓ | ✓ | ||||||
| GOSH | GOSH DRE | Great Britain | 1,35,511 | Encounter | Mixed | ✓ | |||||||
| Fundacion de Investigacion Biomedica del Hospital Universitario 12 de Octubre | H12O | Spain | 28,09,436 | Encounter | Mixed | ✓ | ✓ | ✓ | |||||
| Hadassah OBGYN | HadassahOBGYN | Israel | 1,19,753 | Encounter | Secondary | ✓ | |||||||
| Hospital Distrital de Santarém (HDS) | HDS Oncology and Obesity EHR | Portugal | 5000 | Disease | Mixed | ✓ | |||||||
| Harvey Walsh Ltd | HES | Great Britain | ######## | Encounter | Secondary | ✓ | |||||||
| Health Informatics Center (HIC) | HIC | Great Britain | 12,53,625 | Encounter | Mixed | ✓ | ✓ | ✓ | |||||
| SIMG | HSD | Italy | 23,99,088 | Encounter | Primary | ✓ | |||||||
| Hospital Sant Joan de Déu | HSJD | Spain | 12,47,603 | Encounter | Secondary | ✓ | |||||||
| Fundación para la Investigación del Hospital Universitario La Fe de la Comunidad Valenciana (HULAFE) | HULAFE | Spain | 22,74,159 | Encounter | Mixed | ✓ | ✓ | ✓ | ✓ | ||||
| Hospital District of Helsinki and Uusimaa | HUS | Finland | 33,33,798 | Encounter | Secondary | ✓ | |||||||
| Virgen Macarena University Hospital | HUVM | Spain | 10,89,615 | Encounter | Mixed | ✓ | ✓ | ✓ | |||||
| DIAGNOSTIC & THERAPEUTIC CENTER OF ATHENS “HYGEIA” SINGLE MEMBER SOCIETE ANONYME | HYGEIA-EHDEN | Greece | 5,66,798 | Encounter | Mixed | ✓ | ✓ | ||||||
| Icometrix | Icometrix | Belgium | 4595 | Encounter | Mixed | ✓ | |||||||
| Lancashire and South Cumbria Integrated Care Board | IDRIL-1 | Great Britain | 14,99,205 | Encounter | Secondary | ✓ | |||||||
| Fundacio Institut d’Investigacions Mèdiques (FIMIM) | IMASIS | Spain | 18,00,000 | Encounter | Mixed | ✓ | ✓ | ✓ | ✓ | ||||
| Lille University Hospital | INCLUDE | France | 1,914.68 | Encounter | Secondary | ✓ | |||||||
| InGef - Institute for Applied Health Research Berlin GmbH | InGef RDB | Germany | 91,11,064 | Population | Mixed | ✓ | |||||||
| Institut Català d’Oncologia | Institut Català d’Oncologia | Spain | 4,06,877 | Disease | Mixed | ✓ | |||||||
| Fondazione Istituto Nazionale dei Tumori | INT | Italy | 7,21,861 | Disease | Mixed | ✓ | ✓ | ||||||
| NO GRANT | IPCI | Netherlands | 28,70,221 | Population | Primary | ✓ | ✓ | ✓ | |||||
| Consorci Corporació Sanitària Parc Taulí | IRIS | Spain | 12,86,363 | Encounter | Mixed | ✓ | |||||||
| Istanbul University | ITF | Turkey | 8,99,515 | Encounter | Mixed | ✓ | |||||||
| IUC Cerrahpaşa TIP Fakületesi | IU-CTF | Turkey | 5,84,043 | Encounter | Mixed | ✓ | |||||||
| E-MEDIT D.O.O. & Hospital Travnik | JU Travnik | Bosnia and Herzegovina | 52,479 | Encounter | Secondary | ✓ | ✓ | ||||||
| IN2 d.o.o. & Clinical Hospital Center Osijek | KBC Osijek | Croatia | 3,81,105 | Encounter | Mixed | ✓ | ✓ | ✓ | |||||
| IGEA d.o.o. & University Hospital Center Sestre milosrdnice | KBC SM | Croatia | 6,00,000 | Encounter | Primary | ✓ | ✓ | ✓ | |||||
| Hierarchia & University Hospital Center Zagreb | KBCZg | Croatia | 9,61,568 | Encounter | Mixed | ✓ | |||||||
| MEB KI | KI-MEB | Sweden | 13,00,000 | Population | Mixed | ✓ | |||||||
| Bács-Kiskun Megyei Kórház a Szegedi Tudományegyetem Általános Orvostudományi Kar Oktató Kórháza | Kkh_EMR | Hungary | 5,00,000 | Encounter | Secondary | ✓ | ✓ | ||||||
| The Directorate of Government Medical Centers at the Israeli Ministry Of Health | KMC-EHR | Israel | 65,85,681 | Encounter | Secondary | ✓ | ✓ | ✓ | |||||
| Lambeth DataNet | LDN | Great Britain | 13,50,835 | Encounter | Primary | ||||||||
| Hospital da Luz Learning Health | LH_MMO | Portugal | 7245 | Disease | Secondary | ✓ | ✓ | ✓ | ✓ | ||||
| Leeds Teaching Hospitals | LTHT | Great Britain | 19,25,447 | Encounter | Secondary | ✓ | ✓ | ||||||
| OAKS Consulting s.r.o. | LUCAS | Czech Republic | 8507 | Disease | Mixed | ✓ | |||||||
| MCS Grupa d.o.o. & Health Care Center of Primorje-Gorski Kotar County | M-DZPGZ | Croatia | 2,77,128 | Encounter | Primary | ✓ | ✓ | ||||||
| Azienda Ospedaliera SS Antonio e Biagio e Cesare Arrigo | MACADAM | Italy | 879 | Disease | Mixed | ✓ | |||||||
| Medaman | MHD | Belgium | 1,17,105 | Encounter | Secondary | ✓ | ✓ | ✓ | |||||
| Hanover Medical School | MHH | Germany | 22,52,576 | Encounter | Secondary | ✓ | |||||||
| CancerDataNet GmbH | MM_CDN | Germany | 7006 | Disease | Mixed | ✓ | ✓ | ✓ | |||||
| University MS Center | MSDC | Belgium | 872 | Disease | Mixed | ✓ | |||||||
| Medical University of Vienna | MUV | Austria | 33,948 | Encounter | Secondary | ✓ | |||||||
| Medical University of Vienna | MUV-H2O-BC | Austria | 3508 | Disease | Secondary | ✓ | |||||||
| Medical University of Vienna | MUV-H2O-DM | Austria | 170 | Disease | Secondary | ✓ | |||||||
| IKNL | NCR | Netherlands | 23,39,983 | Disease | Mixed | ✓ | ✓ | ✓ | |||||
| National Institute of Health Insurance Fund Management Hungary | NEAK | Hungary | 14,140,982 | Population | Mixed | ✓ | |||||||
| AO Card. G. Panico - Center for Neurodegenerative Diseases and Aging Brain | Neurage-DB | Italy | 552 | Disease | Secondary | ✓ | |||||||
| Queen Mary University of London | NHFD | Great Britain | 5,90,584 | Disease | Secondary | ✓ | |||||||
| King’s College London | NHIC RENAL GSTT | Great Britain | 2149 | Disease | Secondary | ✓ | ✓ | ||||||
| University of Oslo | NHR@UiO | Norway | 73,43,868 | Population | Mixed | ✓ | |||||||
| National Intensive Care Evaluation foundation | NICE | Netherlands | 10,72,259 | Encounter | Secondary | ✓ | ✓ | ||||||
| UK National Neonatal Research Database | NNRD | Great Britain | 11,80,103 | Encounter | Secondary | ✓ | |||||||
| Szabolcs-Szatmár-Bereg Megyei Kórházak és Egyetemi Oktatókórház | Nyir_EMR | Hungary | 9,67,000 | Encounter | Secondary | ✓ | ✓ | ||||||
| Bambino Gesù Children’s Hospital | OBG-POHD | Italy | 3901 | Disease | Secondary | ✓ | |||||||
| Onze-Lieve-Vrouwziekenhuis Aalst-Asse-Ninove | OLVZ_LUNG | Belgium | 364 | Disease | Mixed | ✓ | ✓ | ✓ | |||||
| GermanOncology | OncalizerReg | Germany | 4159 | Disease | Secondary | ✓ | |||||||
| Optimum Patient Care Limited | OPCRD | Great Britain | 25,953,068 | Encounter | Primary | ✓ | ✓ | ||||||
| Royal College of General Practitioners (RCGP) | ORCHID | Great Britain | 80,00,000 | Population | Primary | ✓ | |||||||
| Rioja Salud | PASCAL | Spain | 7,89,371 | Encounter | Mixed | ✓ | |||||||
| University of Turku (Prostate Cancer Registry of South West Finland) | PcaSF | Finland | 22,232 | Disease | Mixed | ✓ | ✓ | ✓ | |||||
| ASST Papa Giovanni XXIII | PG23 | Italy | 4,51,135 | Encounter | Mixed | ✓ | ✓ | ||||||
| Papageorgiou General Hospital | PGH | Greece | 14,12,857 | Encounter | Secondary | ✓ | |||||||
| STIZON | PHARMO | Netherlands | 44,41,048 | Encounter | Primary | ✓ | |||||||
| BCB Medical Ltd | Pirha BCB IBD | Finland | 4516 | Disease | Secondary | ✓ | ✓ | ✓ | |||||
| Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico | POLIMI | Italy | 14,70,942 | Encounter | Secondary | ✓ | ✓ | ||||||
| UZ Brussel | PRIMUZ | Belgium | 5594 | Encounter | Secondary | ✓ | ✓ | ||||||
| Fundació Institut d´Investigació Sanitària Illes Balears | PRISIB | Spain | 24,98,226 | Encounter | Mixed | ✓ | ✓ | ||||||
| IRCCS Policlinico San Donato | PSD | Italy | 4,85,174 | Encounter | Mixed | ✓ | ✓ | ||||||
| Finnish Clinical Biobank Tampere | PSHP Oncology | Finland | 1,14,697 | Disease | Secondary | ✓ | ✓ | ||||||
| Parc Sanitari Sant Joan de Déu | PSSJD | Spain | 6,59,817 | Encounter | Mixed | ✓ | |||||||
| Harm Slijper | PulseHandWrist | Netherlands | 49,903 | Disease | Secondary | ✓ | ✓ | ||||||
| Quironsalud | QuirónSalud | Spain | 2,98,839 | Encounter | Secondary | ✓ | |||||||
| Registo Portugues de Doentes Reumaticos | Reuma.pt | Portugal | 28,325 | Disease | Secondary | ✓ | |||||||
| Czech Myeloma Group | RMG | Czech Republic | 9802 | Disease | Mixed | ✓ | |||||||
| Registre National du Cancer du Luxembourg | RNC | Luxembourg | 8892 | Disease | Mixed | ✓ | |||||||
| Vaud Cancer Registry | RVT | Switzerland | 1,54,043 | Disease | Mixed | ✓ | |||||||
| LynxCare | RWEHub_Cardiology | Belgium | 18,296 | Encounter | Secondary | ✓ | |||||||
| LynxCare | RWEHubHF | Belgium | 26,500 | Disease | Secondary | ✓ | |||||||
| SAIL Databank | SAIL - ADDE | Great Britain | 7,93,300 | Population | Secondary | ✓ | |||||||
| SAIL Databank | SAIL - NCCH | Great Britain | 19,07,900 | Population | Primary | ✓ | |||||||
| SAIL Databank | SAIL - PATD | Great Britain | 11,05,100 | Disease | Primary | ✓ | |||||||
| SAIL Databank | SAIL - PEDW | Great Britain | 34,23,200 | Encounter | Secondary | ✓ | |||||||
| SAIL Databank | SAIL - WDSD | Great Britain | 55,69,400 | Population | Primary | ✓ | ✓ | ✓ | |||||
| SAT Health | SATHEALTH | Bulgaria | 12,451 | Encounter | Secondary | ✓ | |||||||
| Gothenburg University | SCIFI-PEARL | Sweden | 11,700,000 | Population | Mixed | ✓ | ✓ | ||||||
| Servicio Cántabro de Salud and IDIVAL | SCIVAL | Spain | 13,77,099 | Encounter | Mixed | ✓ | ✓ | ||||||
| HUG and SCQM | SCQM | Switzerland | 20,355 | Disease | Mixed | ✓ | ✓ | ✓ | |||||
| Consellería de Sanidade | SERGAS | Spain | 29,16,773 | Encounter | Mixed | ✓ | ✓ | ||||||
| University of Edinburgh | SESCD | Great Britain | 35,395 | Disease | Secondary | ✓ | ✓ | ||||||
| SIDIAP - The Information System for Research in Primary Care | SIDIAP | Spain | 78,87,308 | Encounter | Primary | ✓ | ✓ | ||||||
| King’s College London | SLSR | Great Britain | 6242 | Disease | Mixed | ✓ | ✓ | ✓ | |||||
| Health Data Hub | SNDS | France | 671,610c | Population | Mixed | ✓ | |||||||
| Bordeaux PharmacoEpi | SNDS (BPE) | France | 72,520c | Population | Mixed | ✓ | |||||||
| SWIBREG | SWIBREG | Sweden | 60,000 | Disease | Mixed | ✓ | ✓ | ✓ | |||||
| Pirkanmaa Hospital District | TaUH | Finland | 8,93,817 | Encounter | Secondary | ✓ | ✓ | ||||||
| CEGEDIM HEALTH DATA | THIN FRANCE | France | ######## | Encounter | Primary | ✓ | ✓ | ✓ | |||||
| CEGEDIM HEALTH DATA | THIN Romania | Romania | 10,48,994 | Encounter | Primary | ✓ | ✓ | ||||||
| CEGEDIM HEALTH DATA | THIN UK | Great Britain | ######## | Encounter | Primary | ✓ | ✓ | ✓ | |||||
| Finnish Institute of Health and Welfare | THL-AVOHILMO | Finland | 72,94,000 | Encounter | Primary | ✓ | |||||||
| Finnish Institute for Health and Welfare (THL) | THL-HILMO | Finland | 71,02,953 | Encounter | Mixed | ✓ | |||||||
| Trinity St James’s Cancer Institute | TSJCI BRE | Ireland | 1020 | Disease | Mixed | ✓ | |||||||
| Trinity St James’s Cancer Institute | TSJCI COL | Ireland | 624 | Disease | Mixed | ✓ | |||||||
| Trinity St James’s Cancer Institute | TSJCI GYN | Ireland | 922 | Disease | Mixed | ✓ | |||||||
| Trinity St James’s Cancer Institute | TSJCI HAN | Ireland | 1363 | Disease | Mixed | ✓ | |||||||
| Trinity St James’s Cancer Institute | TSJCI LNG | Ireland | 1920 | Disease | Mixed | ✓ | |||||||
| Trinity St James’s Cancer Institute | TSJCI SKN | Ireland | 663 | Disease | Mixed | ✓ | |||||||
| Trinity St James’s Cancer Institute | TSJCI UGI | Ireland | 1785 | Disease | Mixed | ✓ | |||||||
| Trinity St James’s Cancer Institute | TSJCI URO | Ireland | 2047 | Disease | Mixed | ✓ | |||||||
| Clinical center of Nis | UCCNis | Serbia | 3400 | Encounter | Secondary | ✓ | ✓ | ||||||
| Clinical Center of Serbia | UCCS | Serbia | 8,60,000 | Encounter | Secondary | ✓ | ✓ | ✓ | |||||
| University College London Hospitals | UCLH | Great Britain | 1,88,970 | Encounter | Secondary | ✓ | |||||||
| University College London (UCL) (UK Biobank) | UK Biobank | Great Britain | 5,02,504 | Population | Mixed | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| University Medicine Dresden | UKDresden | Germany | 6,24,697 | Encounter | Secondary | ✓ | ✓ | ||||||
| National Cancer Institute | ULR | Ukraine | 1112 | Disease | Mixed | ✓ | |||||||
| ULS AC Cardiovascular | ULS AC Cardiovascular | Portugal | 2180 | Encounter | Secondary | ✓ | |||||||
| ULSM | ULSM COVID | Portugal | 9750 | Disease | Secondary | ✓ | |||||||
| Unidade Local de Saúde de Matosinhos | ULSM RT-DB | Portugal | 6,79,804 | Encounter | Secondary | ✓ | |||||||
| University of Pécs | UP-HCDB | Hungary | 10,12,198 | Encounter | Secondary | ✓ | ✓ | ✓ | ✓ | ||||
| Semmelweis University | USN_EMR | Hungary | 20,75,672 | Encounter | Secondary | ✓ | |||||||
| University Hospital Antwerp | UZA_NLP_ONCO | Belgium | 4562 | Disease | Secondary | ✓ | |||||||
| Universitaire Ziekenhuizen KU Leuven | UZLDB | Belgium | 5,82,709 | Disease | Secondary | ✓ | ✓ | ✓ | |||||
| Vall d’Hebrón Hospital Campus | VH | Spain | 17,99,398 | Encounter | Secondary | ✓ | ✓ | ||||||
| FISABIO-HSRU | VID-CONSIGN | Spain | 19,64,588 | Population | Mixed | ✓ | ✓ | ✓ | |||||
| VieCuri Medisch Centrum | Viecuri | Netherlands | 4918 | Encounter | Secondary | ✓ | ✓ | ||||||
| Ziekenhuis Oost-Limburg | ZOL-EPDexport-DB | Belgium | 12,209 | Encounter | Secondary | ✓ | ✓ | ✓ |
EHR: electronic health record.
NLP: natural language processing.
This is a subset of the full data source.
Looking at care settings, 46.7% (98/210) of data sources represent data from the secondary setting only, while 42.4% (89/210) represent data from mixed settings (primary and secondary). A comparatively smaller set of 11.0% (23/210) represents data only from the primary care setting (Table 2). Looking at the ways in which persons are included in the data sources, 55.7% (n=117) do so through health care encounters, 32.9% (n=69) through disease-specific data collection, and 11.4% (n=24) through population-based sources.
Table 2. Stratification of data sources in the European Health Data & Evidence Network by method of person inclusion and care level. Person inclusion reflects the basis by which individuals are represented in the database: health care encounters, disease-specific inclusion, or population-based inclusion. Care settings indicate whether data were captured in primary care, secondary (hospital) care, or across both (mixed).
| Care level | Person inclusion | Values, n (%) | ||
|---|---|---|---|---|
| Disease | Encounter | Population | ||
| Mixed, n | 40 | 34 | 15 | 89 (42.4) |
| Primary, n | 2 | 14 | 7 | 23 (11) |
| Secondary, n | 27 | 69 | 2 | 98 (46.7) |
| Total, n (%) | 69 (32.9) | 117 (55.7) | 24 (11.4) | 210 (100) |
Figure 3 shows the number of data sources that receive information through each capture method and each combination of capture methods. EHR is the most common, with 74.7% (157/210) of data sources reporting at least 1 capture method as EHR. Over half of those data sources (85/210) report EHR as their only method for receiving data. Laboratory is the second most common way data sources capture information, as it is reported in 29.5% (62/210) of data sources. Unlike EHR, laboratory data is more likely to be coupled with another data capture method, as only one data source lists laboratory as the singular way they receive information.
Figure 3. The frequency and overlap of different data capture methods used across 210 standardized real-world data sources in the European Health Data & Evidence Network as of September 1, 2024. EHR: electronic health record; NLP: natural language processing.

Discussion
Principal Findings
The varied health care data across Europe, as demonstrated by the summary of 210 data sources in EHDEN from 30 countries, underscores the critical need to generate evidence from more than one data source to comprehensively represent the health care needs or experiences of the entire European population. Across the person inclusion and care levels represented in the network, the data sources are well distributed, emphasizing how health care systems, populations, and data capture methods can differ substantially. While 74.7% (157/210) of the data sources report EHR as at least one of their data capture methods, only 40.4% (85/210) report EHR as their only data capture method. The other 34.3% (72/210) report some combination of EHR, laboratory, case report form, claim, natural language processing, and death register data, showcasing the tremendous heterogeneity of data available in Europe.
Prior Initiatives
Prior initiatives like European Union–Adverse Drug Reactions (EU-ADR) and Innovative Medicines Initiative–European Medical Information Framework (IMI-EMIF) laid the groundwork for EHDEN, with learnings from those projects directly impacting this project [15,18,36,37]. EU-ADR demonstrated the feasibility of building a federated data network for large-scale drug safety monitoring in Europe using common data analysis files. IMI-EMIF made the first transition from using common input files like those in EU-ADR to the OMOP CDM, but it was not scalable due to the lack of funds and need for trained SMEs, both problems which EHDEN addressed.
Sustainability and Success of EHDEN
The sustainability of the EHDEN initiative has been achieved through a combination of mechanisms that foster shared leadership, collaboration, and long-term value creation. One key factor has been the stimulation and enablement of both national and European collaborations. The establishment of OHDSI National Nodes has provided a platform for DPs within individual countries to collaborate, share best practices, and enhance data quality [38]. These nodes facilitate national-level harmonization while ensuring compliance with local regulations and coding systems, thereby strengthening the network’s integrity. Beyond this, EHDEN’s adoption in multiple European projects has further expanded its influence, including its pivotal role in enabling large-scale initiatives such as the Data Analysis and Real World Interrogation Network (DARWIN EU). This has also influenced how the European Federation of Pharmaceutical Industries and Associations (EFPIA) is standardizing its data, demonstrating EHDEN’s impact across sectors.
EHDEN has also delivered economic value by creating local ecosystems that support SMEs and DPs. Through the Harmonization Fund, EHDEN has injected resources into the European health care data landscape, with the return on investment yielding a multiplier effect. By recruiting and training SMEs through the EHDEN Academy, the initiative has built local expertise to support DPs throughout the ETL process, ensuring decentralized and sustainable support for the network.
One of the goals of EHDEN has been to standardize health data, akin to utilities like electricity or the internet, essential and accessible to a rapidly growing number of stakeholders across Europe. Now that EHDEN has transitioned from a project under IHI to the nonprofit EHDEN Foundation, the focus has shifted to sustaining, expanding, and improving the network while leveraging the harmonized data for evidence generation. This next phase aims to generate meaningful RWE for research and regulatory purposes. A recent report by The European Commission on the future of European competitiveness highlights EHDEN’s foundational role in shaping the future of the European Health Data Space, further solidifying its legacy as a critical driver of innovation and collaboration in European health care [39].
The success of EHDEN in harmonizing data to the OMOP CDM has led to significant advances in methods research and evidence generation [40-43]. Many of the DPs involved in EHDEN have used their standardized data to conduct analyses across a broad spectrum of use cases. For instance, several studies have been conducted to describe the natural history of diseases, the safety and effectiveness of treatments, and health care utilization patterns across diverse populations [44-46]. One clear example is a multinational network cohort study by Li et al [45], which used evidence generated from EHDEN DPs to characterize background incidence rates of adverse events of special interest related to COVID-19 vaccines. EHDEN’s standardized data has also enabled and improved the development of predictive models, allowing for personalized predictions of treatment outcomes and disease progression [47,48]. The harmonization of data has facilitated large-scale population-level studies, which are crucial for understanding trends in public health and informing health care policy decisions [49,50]. These examples of evidence generation illustrate the broad applicability of the data in the network, which serves both academic researchers and regulatory agencies. A regularly updated list of EHDEN-supported studies and publications is maintained on the EHDEN website, providing a comprehensive overview of the diverse applications of the network in regulatory, clinical, and methodological research.
A notable demonstration of EHDEN’s success is the network’s use in providing timely information on medicines under surveillance due to shortages in multiple European countries. More than 50 DPs contributed data to a study titled “Incidence, Prevalence, and Characterization of Medicines with Suggested Drug Shortages in Europe” [51]. This study represents the largest observational database study conducted across Europe, both in terms of the number of databases involved and its geographic scope. The findings will support European efforts to monitor the use of critical medicines, contributing to the global fight against medicine shortages.
The EHDEN data network has also affected other European collaboratives around RWD. IMI projects like PIONEER, BigData@Heart, EU-PEARL, and HARMONY also use the OMOP CDM and have partly continued the mapping work done in EHDEN [52-55]. In the EMA-commissioned DARWIN EU initiative, among the 20 DPs onboarded in the first 2 years, 16 are also EHDEN DPs [56].
Future Directions
Building on the progress achieved through the EHDEN network, several key areas offer opportunities for future development. One priority is fostering sustained engagement with DPs. Continuous collaboration will be essential to ensure that DPs remain active contributors to the network by regularly updating and improving their data contributions. Strategies to incentivize engagement, provide ongoing support, and ensure mutual value will be vital for the network’s long-term success, particularly as efforts shift toward more robust evidence generation and ongoing enhancements in data quality.
Expanding the network’s reach and optimizing its databases for specific research use cases are also key areas for growth. With 210 data sources currently included, there is a significant opportunity to onboard additional DPs and expand the network’s coverage across Europe. Future studies will also help identify gaps where further data optimization is required, such as refining mappings or addressing specific quality issues to ensure that the evidence generated is robust, reproducible, and generalizable.
Finally, the newly established EHDEN Foundation will play a critical role in these efforts. By securing funding and fostering collaborations, the Foundation can drive the onboarding of new DPs, address emerging research questions, and ensure that EHDEN continues to adapt to the evolving health care landscape. It also serves as a point of entry for external researchers, who may engage with the network and propose studies through its federated framework. These directions will position the network to remain a cornerstone for RWE generation in Europe, supporting both research and regulatory innovation.
Conclusions
The results of this study demonstrate that the identification, harmonization, and standardization of data sources through EHDEN have contributed significantly to understanding the diverse RWD landscape and advancement of evidence generation across Europe. These efforts are not only improving observational health research but are also influencing broader regulatory initiatives, such as DARWIN EU, which builds on the foundational work of EHDEN to leverage RWD for regulatory decision-making. Now that the initiative has transitioned to the EHDEN Foundation, there is an opportunity to focus even more on generating high-quality evidence, further solidifying the role of real-world data in improving health care and informing policy decisions across Europe.
Supplementary material
Acknowledgments
The authors would like to express their deepest gratitude to all European Health Data & Evidence Network (EHDEN) data partners who contributed their time, effort, and expertise in harmonizing their data to the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM). Their commitment to data quality and transparency has been fundamental to the success of this initiative. We extend our appreciation to the small-to-medium enterprises that played a pivotal role in supporting data partners through the extract-transform-load process. Their expertise in data mapping and technical implementation has been invaluable in ensuring high-quality and standardized data across the network. We would also like to acknowledge the contributions of the EHDEN work packages and their leads, whose dedication to data harmonization, quality assessment, infrastructure development, training, and sustainability efforts has made this project possible. Their leadership and vision have shaped EHDEN into a robust and scalable federated data network. Finally, we extend our thanks to all members of the EHDEN Consortium who have worked tirelessly to build and sustain this network. This includes researchers, data scientists, software engineers, governance and regulatory experts, and the broader Observational Health Data Sciences and Informatics (OHDSI) community, whose open-science collaboration and innovation continue to drive the success of EHDEN. We recognize that the continued success of EHDEN is the result of collective contributions from numerous stakeholders across Europe, and we sincerely appreciate the efforts of everyone involved. This project has received support from the EHDEN project. EHDEN received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement 806968. The Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation program and European Federation of Pharmaceutical Industries and Associations.
Abbreviations
- DARWIN EU
Data Analysis and Real World Interrogation Network
- DP
data partner
- DQD
data quality dashboard
- EFPIA
European Federation of Pharmaceutical Industries and Associations Declarations
- EHDEN
European Health Data & Evidence Network
- EHR
electronic health record
- EMIF
European Medical Information Framework
- ETL
extract-transform-load
- EU-ADR
European Union–Adverse Drug Reactions
- IMI
Innovative Medicines Initiative
- LOINC
Logical Observation Identifiers Names and Codes
- OHDSI
Observational Health Data Sciences and Informatics
- OMOP CDM
Observational Medical Outcomes Partnership Common Data Model
- RWD
real-world data
- RWE
real-world evidence
- RxNorm
Prescription Normalized Names
- SME
small-to-medium enterprise
- SNOMED CT
Systematized Nomenclature of Medicine – Clinical Terms
Footnotes
Authors’ Contributions: All authors (CB, MJS, MM, EAV, MC, PRR, and PBR) were involved in data collection. CB, MJS, PBR, MM, and PRR were involved in the study design, analysis, and interpretation of results. CB, MJS, PBR, and PRR contributed to writing, and all authors revised and approved the final draft.
Data Availability: All data supporting this work can be found on the European Health Data & Evidence Network (EHDEN) portal at [57].
Conflicts of Interest: CB, MJS, EAV, and PBR are employees of Johnson & Johnson and hold stock and stock options. PRR works for a department that receives/received unconditional research grants from Amgen, Chiesi, Johnson & Johnson, UCB Biopharma, the European Medicines Agency, and the Innovative Medicines Initiative.
References
- 1.Framework for FDA’s real-world evidence program. US Food and Drug Administration. 2018. [25-07-2025]. https://www.fda.gov/media/120060/download URL. Accessed.
- 2.HMA-EMA joint big data taskforce summary report. European Medicines Agency. 2019. [31-07-2025]. https://www.ema.europa.eu/en/documents/minutes/hma-ema-joint-task-force-big-data-summary-report_en.pdf URL. Accessed.
- 3.Methodological guide. The European Network of Centres for Pharmacoepidemiology and Pharmacovigilance (ENCePP) Guide on Methodological Standards in Pharmacoepidemiology. [06-09-2024]. https://encepp.europa.eu/encepp-toolkit/methodological-guide_en URL. Accessed.
- 4.Jonker CJ, Bakker E, Kurz X, Plueschke K. Contribution of patient registries to regulatory decision making on rare diseases medicinal products in Europe. Front Pharmacol. 2022;13:924648. doi: 10.3389/fphar.2022.924648. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Eskola SM, Leufkens HGM, Bate A, De Bruin ML, Gardarsdottir H. The role of real-world data and evidence in oncology medicines approved in EU in 2018-2019. J Cancer Policy. 2023 Jun;36:100424. doi: 10.1016/j.jcpo.2023.100424. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 6.Bolislis WR, Fay M, Kühler TC. Use of real-world data for new drug applications and line extensions. Clin Ther. 2020 May;42(5):926–938. doi: 10.1016/j.clinthera.2020.03.006. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 7.Guidance on good pharmacovigilance practices (GVP) module VIII post-authorisation safety studies. European Medicines Agency. [06-08-2024]. https://www.ema.europa.eu/en/documents/scientific-guideline/guideline-good-pharmacovigilance-practices-gvp-module-viii-post-authorisation-safety-studies-rev-3_en.pdf URL. Accessed.
- 8.Santoro A, Genov G, Spooner A, Raine J, Arlett P. Promoting and protecting public health: how the European Union pharmacovigilance system works. Drug Saf. 2017 Oct;40(10):855–869. doi: 10.1007/s40264-017-0572-8. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Goedecke T, Morales DR, Pacurariu A, Kurz X. Measuring the impact of medicines regulatory interventions - systematic review and methodological considerations. Br J Clin Pharmacol. 2018 Mar;84(3):419–433. doi: 10.1111/bcp.13469. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Health at a glance: Europe 2022: state of health in the EU cycle. Organisation for Economic Co-operation and Development. 2022. [22-08-2024]. https://www.oecd-ilibrary.org/social-issues-migration-health/health-at-a-glance-europe-2022_507433b0-en URL. Accessed.
- 11.Cave A, Kurz X, Arlett P. Real-world data for regulatory decision making: challenges and possible solutions for Europe. Clin Pharmacol Ther. 2019 Jul;106(1):36–39. doi: 10.1002/cpt.1426. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Bakker E, Plueschke K, Jonker CJ, Kurz X, Starokozhko V, Mol PGM. Contribution of real-world evidence in European Medicines Agency’s regulatory decision making. Clin Pharmacol Ther. 2023 Jan;113(1):135–151. doi: 10.1002/cpt.2766. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Flynn R, Plueschke K, Quinten C, et al. Marketing authorization applications made to the European Medicines Agency in 2018-2019: what was the contribution of real-world evidence? Clin Pharmacol Ther. 2022 Jan;111(1):90–97. doi: 10.1002/cpt.2461. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Gini R, Schuemie M, Brown J, et al. Data extraction and management in networks of observational health care databases for scientific research: a comparison of EU-ADR, OMOP, Mini-Sentinel and MATRICE strategies. EGEMS (Wash DC) 2016;4(1):1189. doi: 10.13063/2327-9214.1189. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Coloma PM, Schuemie MJ, Trifirò G, et al. Combining electronic healthcare databases in Europe to allow for large-scale drug safety monitoring: the EU-ADR Project. Pharmacoepidemiol Drug Saf. 2011 Jan;20(1):1–11. doi: 10.1002/pds.2053. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 16.Trifirò G, Coloma PM, Rijnbeek PR, et al. Combining multiple healthcare databases for postmarketing drug and vaccine safety surveillance: why and how? J Intern Med. 2014 Jun;275(6):551–561. doi: 10.1111/joim.12159. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 17.Stang PE, Ryan PB, Racoosin JA, et al. Advancing the science for active surveillance: rationale and design for the Observational Medical Outcomes Partnership. Ann Intern Med. 2010 Nov 2;153(9):600–606. doi: 10.7326/0003-4819-153-9-201011020-00010. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 18.Reynolds RF, Kurz X, de Groot MCH, et al. The IMI PROTECT project: purpose, organizational structure, and procedures. Pharmacoepidemiol Drug Saf. 2016 Mar;25 Suppl 1(5–10):5–10. doi: 10.1002/pds.3933. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 19.EFPIA recommendations on a connected data system in Europe. European Federation of Pharmaceutical Industries and Associations. [06-09-2024]. https://www.efpia.eu/news-events/the-efpia-view/efpia-news/efpia-recommendations-on-a-connected-data-system-in-europe/ URL. Accessed.
- 20.European Health Data & Evidence Network. [12-03-2024]. https://www.ehden.eu/ URL. Accessed.
- 21.Innovative Health Initiative. 2024. [12-09-2024]. http://www.ihi.europa.eu/ URL. Accessed.
- 22.Voss EA, Makadia R, Matcho A, et al. Feasibility and utility of applications of the common data model to multiple, disparate observational health databases. J Am Med Inform Assoc. 2015 May;22(3):553–564. doi: 10.1093/jamia/ocu023. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.OMOP Common Data Model. [25-07-2025]. https://ohdsi.github.io/CommonDataModel/ URL. Accessed.
- 24.Voss EA, Blacketer C, van Sandijk S, et al. European Health Data & Evidence Network – learnings from building out a standardized international health data network. J Am Med Inform Assoc. 2023 Dec 22;31(1):209–219. doi: 10.1093/jamia/ocad214. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Hripcsak G, Duke JD, Shah NH, et al. Observational Health Data Sciences and Informatics (OHDSI): opportunities for observational researchers. Stud Health Technol Inform. 2015;216(574–578):574–578. Medline. [PMC free article] [PubMed] [Google Scholar]
- 26.Schuemie M, Reps J, Black A, et al. Health-Analytics Data to Evidence Suite (HADES): open-source software for observational research. Stud Health Technol Inform. 2024 Jan 25;310:966–970. doi: 10.3233/SHTI231108. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.The Book of OHDSI. [25-07-2025]. Chapter 4: the common data model.https://ohdsi.github.io/TheBookOfOhdsi/ URL. Accessed. [Google Scholar]
- 28.Reich C, Ostropolets A, Ryan P, et al. OHDSI standardized vocabularies – a large-scale centralized reference ontology for international data harmonization. J Am Med Inform Assoc. 2024 Feb 16;31(3):583–590. doi: 10.1093/jamia/ocad247. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Papez V, Moinat M, Voss EA, et al. Transforming and evaluating the UK Biobank to the OMOP Common Data Model for COVID-19 research and beyond. J Am Med Inform Assoc. 2022 Dec 13;30(1):103–111. doi: 10.1093/jamia/ocac203. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.THEMIS convention repository for the OMOP CDM. THEMIS. [13-09-2024]. https://ohdsi.github.io/Themis/ URL. Accessed.
- 31.OHDSI/databaseconnector. GitHub. 2025. [07-05-2024]. https://github.com/OHDSI/DatabaseConnector URL. Accessed.
- 32.Blacketer C, Defalco FJ, Ryan PB, Rijnbeek PR. Increasing trust in real-world evidence through evaluation of observational data quality. J Am Med Inform Assoc. 2021 Sep 18;28(10):2251–2257. doi: 10.1093/jamia/ocab132. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Blacketer C, Voss EA, DeFalco F, et al. Using the data quality dashboard to improve the EHDEN network. Appl Sci (Basel) 2021 Jan;11(24):11920. doi: 10.3390/app112411920. doi. [DOI] [Google Scholar]
- 34.EHDEN/cdminspection: r package to support quality control inspection of an OMOP-CDM instance. GitHub. [13-09-2024]. https://github.com/EHDEN/CdmInspection URL. Accessed.
- 35.EHDEN Portal. [13-09-2024]. https://portal.ehden.eu/wordpress/index.php/landing-page-2/ URL. Accessed.
- 36.Klungel OH, Kurz X, de Groot MCH, et al. Multi-centre, multi-database studies with common protocols: lessons learnt from the IMI PROTECT project. Pharmacoepidemiol Drug Saf. 2016 Mar;25 Suppl 1:156–165. doi: 10.1002/pds.3968. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 37.Lovestone S, EMIF Consortium The European medical information framework: a novel ecosystem for sharing healthcare data across Europe. Learn Health Syst. 2020 Apr;4(2):e10214. doi: 10.1002/lrh2.10214. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Observational health data sciences and informatics national nodes. Observational Health Data Sciences and Informatics. [27-09-2024]. https://www.ohdsi-europe.org/index.php/national-nodes URL. Accessed.
- 39.The Draghi report on EU competitiveness. European Union. 2024. [25-07-2025]. https://commission.europa.eu/topics/strengthening-european-competitiveness/eu-competitiveness-looking-ahead_en#paragraph_47059 URL. Accessed.
- 40.Yang C, Fridgeirsson EA, Kors JA, Reps JM, Rijnbeek PR. Impact of random oversampling and random undersampling on the performance of prediction models developed using observational health data. J Big Data. 2024 Jan 3;11(1):7. doi: 10.1186/s40537-023-00857-7. doi. [DOI] [Google Scholar]
- 41.Fridgeirsson EA, Williams R, Rijnbeek P, Suchard MA, Reps JM. Comparing penalization methods for linear models on large observational health data. J Am Med Inform Assoc. 2024 Jun 20;31(7):1514–1521. doi: 10.1093/jamia/ocae109. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Williams RD, den Otter S, Reps JM, Rijnbeek PR. The DELPHI library: improving model validation, transparency and dissemination through a centralised library of prediction models. Stud Health Technol Inform. 2023 May 18;302:139–140. doi: 10.3233/SHTI230085. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 43.Markus AF, Kors JA, Rijnbeek PR. The role of explainability in creating trustworthy artificial intelligence for health care: a comprehensive survey of the terminology, design choices, and evaluation strategies. J Biomed Inform. 2021 Jan;113:103655. doi: 10.1016/j.jbi.2020.103655. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 44.Morales DR, Ostropolets A, Lai L, et al. Characteristics and outcomes of COVID-19 patients with and without asthma from the United States, South Korea, and Europe. J Asthma. 2023 Jan;60(1):76–86. doi: 10.1080/02770903.2021.2025392. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 45.Li X, Ostropolets A, Makadia R, et al. Characterising the background incidence rates of adverse events of special interest for Covid-19 vaccines in eight countries: multinational network cohort study. BMJ. 2021 Jun 14;373:n1435. doi: 10.1136/bmj.n1435. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Voss EA, Shoaibi A, Yin Hui Lai L, et al. Contextualising adverse events of special interest to characterise the baseline incidence rates in 24 million patients with COVID-19 across 26 databases: a multinational retrospective cohort study. EClinicalMedicine. 2023 Apr;58:101932. doi: 10.1016/j.eclinm.2023.101932. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Williams RD, Reps JM, OHDSI/EHDEN Knee Arthroplasty Group. Rijnbeek PR, Ryan PB, Prieto-Alhambra D. 90-Day all-cause mortality can be predicted following a total knee replacement: an international, network study to develop and validate a prediction model. Knee Surg Sports Traumatol Arthrosc. 2022 Sep;30(9):3068–3075. doi: 10.1007/s00167-021-06799-y. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Reps JM, Williams RD, You SC, et al. Feasibility and evaluation of a large-scale external validation approach for patient-level prediction in an international data network: validation of models predicting stroke in female patients newly diagnosed with atrial fibrillation. BMC Med Res Methodol. 2020 May 6;20(1):102. doi: 10.1186/s12874-020-00991-3. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Lane JCE, Weaver J, Kostka K, et al. Risk of depression, suicide and psychosis with hydroxychloroquine treatment for rheumatoid arthritis: a multinational network cohort study. Rheumatology (Oxford) 2021 Jul 1;60(7):3222–3234. doi: 10.1093/rheumatology/keaa771. doi. Medline. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Burn E, Weaver J, Morales D, et al. Opioid use, postoperative complications, and implant survival after unicompartmental versus total knee replacement: a population-based network study. Lancet Rheumatol. 2019 Dec;1(4):e229–e236. doi: 10.1016/S2665-9913(19)30075-X. doi. Medline. [DOI] [PubMed] [Google Scholar]
- 51.Pineda-Moncusí M, Rekkas A, Pérez ÁM, et al. Trends of drug use with suggested shortages and their alternatives across 41 real world data sources and 18 countries in Europe and North America. medRxiv. 2024 doi: 10.1101/2024.08.28.24312695. Preprint posted online on. doi. [DOI]
- 52.PIONEER - European Network of Excellence for Big Data in Prostate Cancer. [07-11-2024]. https://prostate-pioneer.eu/ URL. Accessed.
- 53.BigData@Heart > Home. [07-11-2024]. https://www.bigdata-heart.eu/ URL. Accessed.
- 54.HARMONY is part of the IMI “Big data for better outcomes. HARMONY Alliance. [07-11-2024]. https://www.harmony-alliance.eu/en/harmony-is-part-of-the-imi-big-data-for-better-outcomes-program URL. Accessed.
- 55.Innovative patient centric clinical trial platforms. EU-PEARL. [07-11-2024]. https://eu-pearl.eu/ URL. Accessed.
- 56.Darwin EU. European Medicines Agency. [26-09-2024]. https://www.darwin-eu.org/ URL. Accessed.
- 57.European Health Data & Evidence Network. EHDEN platform. [15-07-2025]. https://portal.ehden.eu/summary URL. Accessed.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


