Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2022 Nov 22;41(1):251–262. doi: 10.1016/j.vaccine.2022.11.031

Background rates of 41 adverse events of special interest for COVID-19 vaccines in 10 European healthcare databases - an ACCESS cohort study

C Willame a, C Dodd a, CE Durán a, RJHJ Elbers b, R Gini c, C Bartolini c, O Paoletti c, L Wang d, V Ehrenstein d, J Kahlert d, U Haug e, T Schink f, J Diez-Domingo g, A Mira-Iglesias g, JJ Carreras g, C Vergara-Hernández g, C Giaquinto h, E Barbieri h, L Stona i, C Huerta j, M Martín-Pérez k, P García-Poza k, A de Burgos k, M Martínez-González k, V Bryant k, F Villalobos l, M Pallejà-Millán l, M Aragón m, JJ Carreras g, P Souverein n, NH Thurin o, D Weibel a, OH Klungel n, MCJM Sturkenboom a,
PMCID: PMC9678835  PMID: 36446653

Abstract

Background

In May 2020, the ACCESS (The vACCine covid-19 monitoring readinESS) project was launched to prepare real-world monitoring of COVID-19 vaccines. Within this project, this study aimed to generate background incidence rates of 41 adverse events of special interest (AESI) to contextualize potential safety signals detected following administration of COVID-19 vaccines.

Methods

A dynamic cohort study was conducted using a distributed data network of 10 healthcare databases from 7 European countries (Italy, Spain, Denmark, The Netherlands, Germany, France and United Kingdom) over the period 2017 to 2020. A common protocol (EUPAS37273), common data model, and common analytics programs were applied for syntactic, semantic and analytical harmonization. Incidence rates (IR) for each AESI and each database were calculated by age and sex by dividing the number of incident cases by the total person-time at risk. Age-standardized rates were pooled using random effect models according to the provenance of the events.

Findings

A total number of 63,456,074 individuals were included in the study, contributing to 211.7 million person-years. A clear age pattern was observed for most AESIs, rates also varied by provenance of disease diagnosis (primary care, specialist care). Thrombosis with thrombocytopenia rates were extremely low ranging from 0.06 to 4.53/100,000 person-years for cerebral venous sinus thrombosis (CVST) with thrombocytopenia (TP) and mixed venous and arterial thrombosis with TP, respectively.

Interpretation

Given the nature of the AESIs and the setting (general practitioners or hospital-based databases or both), background rates from databases that show the highest level of completeness (primary care and specialist care) should be preferred, others can be used for sensitivity. The study was designed to ensure representativeness to the European population and generalizability of the background incidence rates.

Funding

The project has received support from the European Medicines Agency under the Framework service contract nr EMA/2018/28/PE.

Keywords: Background rates, Vaccine safety, Adverse events of special interest, COVID-19

1. Background

On 11 January 2020, the release of the genetic sequence of SARS-CoV-2 triggered the rapid development of COVID-19 vaccines on a global level [1]. More than two hundred vaccine candidates were in the development pipeline. One year later, 26 vaccines were in use across the world [2], and as of January 10th, 2022, 9.46 billion COVID-19 vaccine doses have been administrated worldwide, and about half of the world population has been vaccinated [3]. Due to the rapid development of new COVID-19 vaccines, questions arose about the benefits and risks of the vaccines at individual and population levels. Several emerging safety signals have been detected soon after COVID-19 vaccines launches. Researchers have reported case series with unusual thrombotic events after immunization with ChAdOx1nCov-19 (Oxford/AstraZeneca) [4], [5], [6] and Ad26.COV2.S (Janssen/Johnson & Johnson) [7] vaccines, which led to several regulatory measures, mainly in Europe and in the United States [8], [9]. These thrombotic events were shown to occur, in most instances, in co-occurrence with thrombocytopenia. This new phenomenon, named thrombosis with thrombocytopenia syndrome (TTS), was further characterized with the initiation of the development of a case definition by the Brighton Collaboration Working Group [10]. Furthermore, the spectrum of adverse events has been expanded to conditions such as myocarditis and pericarditis [11] with series of cases initially reported after vaccination with Comirnaty (Pfizer) in Israel [12]. Other very rare events of capillary leak syndrome were reported after vaccination with adenovector viral vaccines [13] and more recently, Guillain Barré Syndrome (GBS) has been detected as a potential safety concern following administration with Ad26.COV2.S vaccine [14]. The experience with COVID-19 vaccines highlights once more the importance and the need for robust surveillance systems and collaboration to carefully monitor adverse effects even after regulatory approvals for timely adoption of public health measures. The same conclusion, made after the 2009 H1N1 pandemic, had led to the Innovative Medicines Initiative funded project that designed and tested a system in Europe, which was implemented by the Vaccine Monitoring Collaboration for Europe (VAC4EU) in January 2020 [15]. In May 2020, ACCESS (The vACCine covid-19 monitoring readinESS), a project funded by the European Medicines Agency (EMA) leveraging expertise in the European Pharmacoepidemiology & Pharmacovigilance research network and the VAC4EU, was launched to prepare real-world monitoring of COVID-19 vaccines [16]. This ACCESS study aimed to generate background incidence rates of adverse events of special interest (AESI) that would allow contextualization of potential safety signals detected following administration of COVID-19 vaccines.

2. Methods

2.1. Study design and setting

A multi-database dynamic cohort study was conducted in 10 healthcare databases from 7 European countries: Italy, Spain, Denmark, Netherlands, Germany, France and United Kingdom (UK). The study protocol (EUPAS37273) is publicly available on the European Network of Centers for Pharmacoepidemiology and Pharmacovigilance (ENCePP) register [17]. The study was conducted over the period 2017 to 2020, except for two databases in which the study ran over the years 2010–2013 for Danish registries (DCE-AU) and 2014–2017 for German Pharmacoepidemiological Research Database (GePaRD). The 10 population-based healthcare databases included data from ARS, PEDIANET (Italy), FISABIO, BIFAP and SIDIAP (Spain), PHARMO (Netherlands), CPRD (UK), GePaRD (Germany), SNDS (France) and Danish Registries. The databases differed in terms of population size, provenance of the diagnosis (e.g., emergency room, in and/or outpatient, specialist or general practitioners (GP)) and diagnostic coding systems (International Classification of Diseases (ICD), Ninth Revision, Clinical Modification (ICD-9-CM), and ICD, Tenth Revision, Clinical Modification (ICD-10-CM), ICD-10 German Modification (ICD10-GM), CIM10 (Classification Internationale des Maladies), Read, SNOMED CT US Edition and Spanish Edition (SCTSPA)). Table 1 provides a summary of the main characteristics of the data sources. For three of them (BIFAP, SIDIAP and PHARMO), subpopulations were defined which included individuals with both primary care and hospital medical records (BIFAP_PC_HOSP, SIDIAP_PC_HOSP and PHARMO versus BIFAP_PC, SIDIAP_PC and PHARMO_PC_HOSP). The creation of subpopulation was necessary when diagnosis records from hospital discharge data and primary care had different source populations and/or lag times.

Table 1.

Characteristics of the datasources.

Country Datasources Population covered Active population Type of datasource Provenance of events Coding system Categories for analysis based on provenance of events
Denmark Danish registries National 5.9 million Record linkage In-and out-patient diagnoses ICD-10 IN-OUTPATIENT
France SNDS National 7.5 million* Insurance claims Health insurance, inpatient (hospital discharge) and outpatient (from long-term disease registration) diagnoses CIM-10 IN-OUTPATIENT
Germany GePaRD National 10.5 million** Insurance claims Health insurance, In-and out-patient*** diagnoses ICD-10-GM INPATIENT only
Italy ARS Tuscany 3.0 million Record linkage Inpatient diagnoses and emergency room department ICD-9-CM INPATIENT & EMR
PEDIANET National 0.2 million Family pediatricians medical records GP records ICD-9-CM GP only
Spain FISABIO Valencia 5.8 million Record linkage GP records, in-and out-patient discharge and emergency room ICD-9-CM / ICD-10 GP & IN-OUTPATIENT
BIFAP 8 regions† 10.3 million GP medical records GP records, in-hospital discharge diagnoses ICD-9-CM/ ICD-10 / SCTSPA GP only
GP & IN-OUTPATIENT (subpopulation)
SIDIAP Catalonia 6.2 million Record linkage GP records, in-and out-patient specialist diagnoses emergency room ICD-10-CM GP only
GP & IN-OUTPATIENT (subpopulation)
The Netherlands PHARMO Sub-sample 9.2 million Record linkage GP records, in-patient specialist diagnoses ICD-9-CM INPATIENT only
GP & IN-OUTPATIENT (subpopulation)
UK CPRD Sub-sample 4.7 million GP medical records GP records Read code GP only

ICD: International Classification of Diseases; CIM: Classification Internationale des Maladies; CM: Clinical Modification; GM: German Modification; SNOMED: US Edition of SNOMED Clinical Terms; SCTSPA: SNOMED Clinical Terms, Spanish Language Edition.

*A 1/10th sample of the SNDS representative of the French population over the study period was used.

†The 8 regions do not include Catalonia nor Valencia.

**Only one large statutory health insurance provider was included which represents 10.5 million people out of 16 million.

***Outpatient diagnoses were not used in this project. GePaRD was only used for a limited number of AESIs diagnosed in inpatient setting (Guillain Barre Syndrome, acute respiratory disease syndrome, heart failure, coronary artery disease, generalized convulsion, acute kidney injury, acute liver injury, anaphylaxis, multisystem inflammatory syndrome).

IN-OUTPATIENT: hospitalization including in and/or-outpatient setting; GP: general practitioners; EMR: Emergency Room.

2.2. Study population

The source population comprised all individuals observed in one of the participating databases for at least one day during the study period and who had at least one year of data availability before study entry, except for individuals with data available since birth. Individuals were included in the study according to pre-defined inclusion and exclusion criteria. Reasons for exclusion were: invalid or missing birth date or missing sex record, exit before study entry (01 January 2017; 01 January 2010 for DCE-AU; 01 January 2014 for GePaRD), and less than one year of look-back period prior to study entry. Individuals at increased risk of severe COVID-19 disease were identified according to the presence of at least one of the following underlying conditions in the look-back period or during the study follow-up: cardiovascular disease, cancer, chronic lung disease, HIV, chronic kidney disease, type 2 diabetes, severe obesity (BMI ≥ 30), sickle cell disease or use of immunosuppressants.

2.3. Adverse events of special interest (AESI)

As part of the harmonization of COVID-19 vaccine safety monitoring during clinical development phase, the Coalition for Epidemic Preparedness Innovations (CEPI) has created a preliminary list of AESIs for COVID-19 vaccine safety monitoring together with the Brighton Collaboration [18]. This list of AESIs has been defined based on events that are related or potentially related to marketed vaccines, events related to vaccine platforms or adjuvants, and events that may be associated with COVID-19. This preliminary list has been further extended and was reviewed and accepted by the European Medicines Agency advisory group monitoring committee. The final list included a total of 41 AESIs, see Box.

2.4. Data management workflow and data analysis

This study was conducted in a distributed manner using a common protocol, a ConcePTION common data model (CDM) [19] for syntactic harmonization, a common analytics program for semantic harmonization and data transformation/analysis [20]. Each data access provider (DAP) applied the Extract-Transform-Load process which led to a syntactic harmonization. The syntactic foundation transforms the structure of the data sets held by each DAP to a common format. To create the study variables semantic harmonization was needed to reconcile differences across different terminologies. A shared semantic foundation was built for each AESI by using a standardized event definition form. For each AESI and underlying condition, medical code lists have been created using the ADVANCE code mapper tool [21] and integrated coding systems: ICD-9-CM, ICD-10-CM/GM, CIM10, READv2, SNOMED CT US Edition and SCTSPA. DAPs were asked to review and update the proposed medical codes based on local coding habits and prior experience. Narrow and broad algorithms were established for most AESIs allowing, respectively, for a specific and a sensitive clinical case definition. Event definition forms including medical code lists were made publicly available through the VAC4EU Zenodo community (https://www.zenodo.org/communities/vac4eu/?page = 1&size = 20). R scripts that included semantic harmonization and transformation of data in the CDM into incidence rates were coded in R using version ≥ 3.1.0 and distributed to the DAPs for local deployment. Aggregated data were uploaded by each DAP on the Digital Research Environment (https://www.andrea-cloud.eu/azure-dre), a secured Microsoft Azure cloud-based research environment, for final analysis and pooling. Demographic characteristics including age and person-time of follow-up were computed in each data source. Incidence rates (IR) and 95 % exact confidence interval (95 %CI) for each AESI and for each database were calculated for the study period, by year, age and sex and by dividing the number of incident cases by the total person-time at risk. Age-standardized IRs (according to the European population [22]) for the period 2017–2019 (or 2010–2013 for DCE-AU or 2014–2017 for GePaRD) were pooled using the DerSimonian and Laird meta-analytic approach for random effects models according to the provenance of the events (Table 1). Incidence rates were expressed per 100,000 person-years (PY). Percentage change between the years 2017–2019 versus 2020 were also computed to assess the change in health care utilization during the COVID-19 pandemic. Statistical analyses were performed in SAS v9.4 and STATA v17.

3. Results

3.1. Characteristics of the population

A total number of 63,456,074 individuals were included in the study, contributing to 211.7 million person-years. Demographic characteristics are available in Table 2 . The largest contributions in person-time were from GePaRD (17.1 %) and BIFAP (16.3 %), followed by SNDS (13.7 %), PHARMO (12.6 %), Danish registries (10.6 %), FISABIO (9.9 %), SIDIAP (9.4 %), CPRD (6.0 %), ARS (4.3 %) and PEDIANET (0.2 %). Subpopulation sizes of data sources with both GP and hospital diagnoses were available (through linkage for part of the population) for 100 %, 43.1 %, 28.3 % and 5.4 % of the full included population for FISABIO, BIFAP, SIDIAP and PHARMO, respectively. The study flowchart is available in Supplementary materials (Fig. 1 ). The proportion of individuals affected by conditions which increase the severity of COVID-19 varied across databases, cardiovascular diseases and chronic lung diseases were the most prevalent risk factor for serious COVID-19 (Fig. 2 in Supplementary materials). (see Fig. 2 ).

Table 2.

Demographic characteristics.

Denmark
France
Germany
Italy
Spain
The Netherlands
UK
DCE-AU
SNDS
GePaRD
ARS
PEDIANET
FISABIO
BIFAP_PC
BIFAP_PC_HOSP
SIDIAP_PC
SIDIAP_PC_HOSP
PHARMO_HOSP
PHARMO_PC_HOSP
CPRD
Value % Value % Value % Value % Value % Value % Value % Value % Value % Value % Value % Value % Value %
Total number of subjects 5 955 360 . 7 479 708 . 10 539 971 . 3 067 602 . 181 290 . 5 886 560 . 10 266 468 . 4 423 843 . 6 205 573 . 1 758 239 . 9 184 832 . 496 197 . 4 688 710 .
Person-time (in years) across follow-up period (per age group)    
Overall 22 490 217 29 025 408 36 127 076 . 9 065 271 . 356 184 . 20 911 202 . 34 525 598 . 8 494 349 . 19 845 706 . 5 355 124 . 26 568 111 . 1 164 810 . 12 762 349 .
0–19 5 417 245 24.09 7 037 843 24.25 7 024 252 19.44 1 396 241 15.4 356 184 100 4 103 324 19.62 6 368 798 18.45 1 475 624 17.37 3 878 177 19.54 1 002 231 18.72 4 285 628 16.13 221 438 19.01 2 863 126 22.43
20–29 2 639 524 11.74 3 635 959 12.53 4 913 004 13.60 796 689 8.79 2 073 398 9.92 3 294 295 9.54 780 125 9.18 2 014 182 10.15 536 922 10.03 3 201 274 12.05 134 601 11.56 1 437 077 11.26
30–39 2 890 365 12.85 3 784 253 13.04 5 388 329 14.91 908 683 10.02 2 746 498 13.13 4 412 677 12.78 1 070 247 12.6 2 801 475 14.12 747 012 13.95 3 507 399 13.2 133 829 11.49 1 709 558 13.4
40–49 3 294 106 14.65 3 747 827 12.91 5 458 071 15.11 1 391 120 15.35 3 460 336 16.55 5 490 532 15.9 1 319 480 15.53 3 447 041 17.37 933 846 17.44 3 862 330 14.54 160 034 13.74 1 754 888 13.75
50–59 2 923 975 13 3 692 979 12.72 6 153 604 17.03 1 478 656 16.31 3 012 592 14.41 5 095 982 14.76 1 255 375 14.78 2 782 154 14.02 762 148 14.23 3 934 798 14.81 182 981 15.71 1 854 253 14.53
60–69 2 778 684 12.36 3 270 947 11.27 3 626 015 10.04 1 180 125 13.02 2 399 175 11.47 4 003 342 11.6 995 612 11.72 2 129 361 10.73 589 617 11.01 3 153 788 11.87 157 884 13.55 1 446 057 11.33
70–79 1 618 270 7.2 2 203 655 7.59 2 682 491 7.43 1 068 981 11.79 1 887 325 9.03 3 034 922 8.79 765 961 9.02 1 577 614 7.95 447 477 8.36 2 368 347 8.91 115 293 9.9 1 074 750 8.42
80+ 928 047 4.13 1 651 944 5.69 881 311 2.44 844 776 9.32 1 228 553 5.88 2 825 050 8.18 831 925 9.79 1 215 702 6.13 335 871 6.27 2 254 547 8.49 58 751 5.04 622 638 4.88
Person-time (in years) across follow-up period (per sex)
Female 11 319 983 50.33 14 774 535 50.90 18 146 362 50.23 4 706 024 51.91 170 278 47.81 10 641 698 50.89 17 701 930 51.27 4 347 542 51.18 10 067 763 50.73 2 713 540 50.67 13 967 221 52.57 589 204 50.58 6 408 155 50.21
Male 11 170 234 49.67 14 250 873 49.10 17 980 714 49.77 4 359 247 48.09 185 906 52.19 10 269 503 49.11 16 823 667 48.73 4 146 807 48.82 9 777 943 49.27 2 641 584 49.33 12 600 891 47.43 575 606 49.42 6 354 194 49.79

Note: PEDIANET includes only pediatric population up to 16 years old. BIFAP_PC_HOSP, SIDIAP_PC_HOSP and PHARMO_PC_HOSP represents subpopulations for BIFAP_PC, SIDIAP_PC and PHARMO_HOSP, respectively.

Fig. 1.

Fig. 1

Fig. 1

Fig. 1

Fig. 1

Age-stratified incidence rates for AESIs per body system over the study period* (per 100,000 person-years) *The study period includes the year 2017 to 2019, except for for two databases in which the study ran over the years 2010–2013 for Danish registries (DCE-AU) and 2014–2017 for German Pharmacoepidemiological Research Database (GePaRD). Note: Log scale was used to highlight the age-pattern. The log scale was automatically generated based on the magnitude of the incidence rates and varied across AESIs. No narrow clinical definition for acute aseptic arthritis available. ADEM: Acute Disseminated Encephalomyelitis, AKI: Acute Kidney Injury, ALI: Acute Liver Injury, ARDS: Acute Respiratory Distress Syndrome, CVST: Cerebral Venous Sinus Thrombosis, DIC: Disseminated Intravascular Coagulation, MIS: Multisystem Inflammatory Syndrome, SOCV: Single Organ Cutaneous Vasculitis, TP: Thrombocytopenia, VTE: Venous Thromboembolism.

Fig. 2.

Fig. 2

Distribution (%) of known risk factors for COVID-19 disease severity, per database.

3.2. Incidence rates of AESIs

IRs per database, per year, age and sex are detailed in the final study report available on Zenodo website [23] and on the VAC4EU dashboard [24]. Incidence rates that are presented in this paper used the narrow clinical definitions and are for time periods that exclude the year 2020. Table 3 presents age-standardized pooled incidence rates for all AESIs according to the provenance of events over the study period. Plots in Fig. 1 depict the incidence rates per age and according to the provenance of events. Age-and sex-stratified incidence rates for all AESIs according to the ACCESS recommendations are presented in Supplementary materials (Table 1).

Table 3.

Pooled incidence rates and 95% confidence intervals for all AESIs (narrow definition) over the study period* according to the provenance of the events databases.


Incidence rate (/100,000 person-years) (95 %CI)
Body System AESIs GP only INPATIENT only INPATIENT & EMR GP & IN-OUTPATIENT IN-OUTPATIENT
Autoimmune diseases ADEM 0.05 (0.00–0.14) 0.08 (0.00–0.38) 0.33 (0.06–0.59)
Acute Aseptic Arthritis**
Guillain Barre Syndrome 1.25 (0.27–2.23) 2.09 (0.46–3.74) 3.39 (2.16–4.63) 3.21 (1.00–4.42) 3.33 (2.48–4.17)
Narcolepsy 1.44 (0.81–2.07) 0.31 (0.06–0.56) 0.57 (0.00–1.14) 1.58 (0.91–2.26) 2.29 (0.77–3.80)
Thrombocytopenia 38.99 (7.23–70.76) 18.01 (16.31–19.70) 29.55 (26.03–33.08) 92.09 (42.47–141.71) 63.16 (0.00–147.83)
Type 1 Diabetes Mellitus 9.14 (4.27–14.01) 7.12 (6.41–7.83) 7.11 (5.83–8.39) 13.12 (9.88–16.36) 19.85 (18.86–20.84)
Cardiovascular system Arrhythmia 536.72 (82.87–990.58) 288.71 (282.78–294.64) 802.32 (786.92–817.72) 1199.31 (899.09–1499.53) 880.86 (662.45–1099.28)
Coronary Artery Disease 113.17 (71.44–154.91) 139.93 (51.26–228.61) 218.81 (211.51–226.11) 162.45 (119.08–205.81) 201.96 (191.14–212.77)
Heart Failure 154.28 (11.27–297.29) 189.99 (47.17–332.81) 453.08 (443.65–462.51) 416.76 (270.89–562.64) 404.93 (222.19–587.67)
Microangiopathy 0.32 (0.03–0.61) 1.11 (0.66–1.56) 0.62 (0.01–1.22) 3.39 (0.00–7.31) 7.13 (0.10–14.16)
Stress Cardiomyopathy 0.24 (0.10–0.37) 5.60 (4.39–6.80) 2.90 (0.99–4.80)
Myocarditis 1.43 (0.38–2.48) 1.28 (0.80–1.77) 6.61 (4.81–8.40) 3.18 (1.73–4.62) 5.30 (2.00–8.61)
Myocarditis or Pericarditis 11.86 (1.42–22.31) 3.96 (3.15–4.78) 30.04 (26.34–33.74) 20.63 (12.91–28.35) 21.24 (11.61–30.87)
Respiratory system ARDS 18.58 (3.73–33.44) 13.20 (0.00–30.20) 23.86 (20.99–26.72) 68.33 (16.10–120.56) 86.24 (32.27–140.21)
Circulatory system Arterial or VTE with TP 0.20 (0.09–0.30) 1.42 (0.96–1.88) 1.85 (1.02–2.68) 4.53 (0.00–9.94) 5.43 (0.00–14.79)
Arterial or VTE without TP 328.12 (40.52–615.71) 194.81 (189.92–199.70) 532.38 (520.19–544.56) 606.15 (510.17–702.13) 580.75 (532.39–629.11)
Arterial with TP 0.09 (0.01–0.16) 0.77 (0.45–1.08) 0.91 (0.34–1.49) 2.73 (0.00–6.12) 2.04 (0.00–5.35)
Arterial without TP 187.94 (32.56–343.32) 157.57 (153.34–161.80) 380.58 (370.84–390.32) 384.18 (279.11–489.24) 381.92 (318.93–444.92)
CVST with TP 0.00 (0.00–0.15) 0.00 (0.00–0.08) 0.03 (0.00–0.27) 0.06 (0.00–0.19) 0.00 (0.00–0.02)
CVST without TP 0.13 (0.00–0.27) 0.81 (0.42–1.19) 1.29 (0.47–2.11) 0.82 (0.50–1.14) 1.98 (0.08–3.88)
CVST 0.13 (0.00–0.27) 0.81 (0.42–1.19) 1.31 (0.49–2.14) 0.85 (0.53–1.17) 2.04 (0.04–4.04)
Disseminated Intravascular Coagulation 0.11 (0.03–0.20) 0.68 (0.32–1.04) 1.47 (0.69–2.26) 2.65 (0.00–5.85) 5.68 (0.00–11.61)
Hemorrhagic stroke 10.54 (0.91–20.17) 16.08 (14.67–17.49) 60.84 (6.76–64.93) 43.58 (26.44–60.73) 52.96 (41.27–64.64)
Ischemic stroke 104.69 (14.84–194.53) 66.28 (63.55–69.00) 171.75 (165.30–178.20) 229.29 (159.11–299.46) 187.73 (138.78–236.69)
SOCV 8.16 (5.09–11.23) 1.50 (0.99–2.01) 8.04 (6.28–9.79) 14.58 (2.47–26.68) 11.16 (4.74–17.59)
Thrombotic microangiopathy 0.32 (0.00–0.65) 0.47 (0.18–0.76) 0.62 (0.20–1.03) 1.03 (0.75–1.32) 1.54 (0.34–2.74)
VTE with TP 0.11 (0.02–0.19) 0.68 (0.34–1.02) 0.97 (0.32–1.62) 1.95 (0.00–4.24) 3.45 (0.00–9.59)
VTE without TP 141.68 (9.68–273.68) 40.12 (37.73–42.51) 160.94 (153.74–168.14) 228.48 (206.26–250.71) 209.55 (88.61–230.48)
VTE 141.77 (9.70–273.84) 40.66 (38.25–43.07) 161.66 (154.44–168.88) 229.67 (207.52–251.83) 211.85 (186.82–236.87)
Hepato-gastrointestinal and renal system Acute Kidney Injury 190.25 (87.45–293.04) 138.47 (53.67–223.26) 222.62 (215.48–229.76) 544.24 (156.52–931.96) 421.03 (40.51–801.55)
Acute Liver Injury 12.87 (1.54–24.19) 8.37 (5.82–10.91) 25.16 (21.91–28.42) 35.09 (19.87–50.31) 32.96 (8.38–57.55)
Nerves and central nervous system Generalized Convulsion 73.64 (43.77–103.51) 80.68 (9.66–151.70) 142.56 (134.83–150.30) 152.35 (78.67–226.02) 194.28 (175.77–212.79)
Meningoencephalitis 2.26 (0.33–4.19) 1.30 (0.81–1.78) 5.74 (4.11–7.36) 7.34 (4.19–10.49) 5.94 (2.43–9.45)
Transverse Myelitis 0.40 (0.06–0.74) 0.27 (0.03–0.50) 1.11 (0.34–1.88) 0.48 (0.23–0.72) 0.69 (0.00–1.72)
Other system Anaphylaxis 9.39 (3.16–15.61) 6.82 (0.00–13.71) 7.44 (5.54–9.33) 14.17 (7.46–20.87) 11.30 (10.04–12.56)
Anosmia, Ageusia 13.48 (2.95–24.01) 0.12 (0.00–0.29) 0.08 (0.00–0.37) 24.50 (15.09–33.91) 1.70 (1.25–2.14)
Death 642.94 (87.87–1198.01) 1150.09 (1134.88–1165.30) 812.34 (720.03–904.64) 1022.92 (661.34–1384.50)
MIS 0.52 (0.33–0.72) 0.59 (0.41–0.78) 1.08 (0.56–1.59) 1.18 (0.23–2.13) 0.79 (0.31–1.28)
Sudden death 52.96 (41.25–64.67) 1.97 (1.21–2.73) 81.62 (34.21–129.03) 24.30 (0.00–64.32)
Skin and mucous membrane system Chilblain like lesions 13.66 (1.01–26.32) 0.01 (0.00–0.09) 0.17 (0.00–0.54) 22.25 (2.70–41.79) 0.21 (0.00–0.65)
Erythema multiforme 5.99 (2.23–9.76) 0.31 (0.06–0.56) 9.65 (7.52–11.78) 8.72 (4.33–13.11) 2.64 (0.00–5.87)

*Study period for Danish registries: 2010–2013 and for GePaRD: 2014–2017. **No narrow clinical definition for acute aseptic arthritis available. GePaRD: only the following events were included in the study: GBS, ARDS, HF, CAD, Generalized convulsion, Acute Kidney Injury, Acute Liver Injury, anaphylaxis, MIS. GPs: general practitioners. ADEM: Acute Disseminated Encephalomyelitis, ARDS: Acute Respiratory Distress Syndrome, CVST: Cerebral Venous Sinus Thrombosis, DIC: Disseminated Intravascular Coagulation, MIS: Multisystem Inflammatory Syndrome, SOCV: Single Organ Cutaneous Vasculitis, TP: Thrombocytopenia, VTE: Venous Thromboembolism. GP only: PEDIANET, BIFAP, SIDIAP, CPRD; INPATIENT only: PHARMO; INPATIENT & EMR: ARS; GP & IN-OUTPATIENT: FISABIO, BIFAP subpopulation, SIDIAP subpopulation, PHARMO subpopulation; IN-OUTPATIENT: Danish registries, SNDS.

For the autoimmune diseases, pooled IRs for ADEM, GBS and narcolepsy were the lowest rates. Provenance of diagnoses impacted substantially the observed rates with diagnoses of narcolepsy, GBS and diabetes most frequently reported in settings including in-outpatient and/or GPs records. A clear age and sex-pattern was shown for GBS and TP. IRs for GBS and TP were slightly elevated in males. Cardiovascular disorders were more frequently reported in the hospital setting. Microangiopathy and stress cardiomyopathy showed the lowest rates compared to other cardiovascular disorders. A peak of myocarditis and myocarditis/pericarditis was observed in the 20–29 age category and higher rates of SOCV were observed in the younger population (0–19). IRs for myocarditis and coronary artery disease were higher in males, while IR for stress cardiomyopathy was higher in females. For circulatory disorders, IRs were higher for diagnoses from hospital records as compared to diagnoses in GPs records. Rates ranged from 229.67/100,000 PY for venous thromboembolism (VTE) to 0.85/100,000 PY for CVST in databases including GPs and hospital medical records. TTS rates were extremely low ranging from 0.06/100,000 PY for CVST with TP to 4.53/100,000 PY for mixed venous and arterial thrombosis with TP. Circulatory disorders were shown to increase with age, except for CVST for which an age-pattern was not detected. IRs for disseminated intravascular coagulation, arterial thrombosis and microangiopathy were higher in males compared to females. Hepato-gastrointestinal disorders were more frequently reported in hospitals or GPs settings with rates increasing with age and a slight decrease for acute liver injury in the elderly (80 + ). Similarly, nerves and central nervous disorders were more frequently reported in hospital setting. Rates for generalized convulsion peaked in the younger population (20–29) and in the elderly (80 + ). No clear age-pattern was observed for meningoencephalitis and rates for transverse myelitis dropped in the 80 +. IR for transverse myelitis was slightly elevated in females. Anaphylaxis and anosmia-ageusia diagnoses were more frequently reported in settings including GPs records. Rates for anaphylaxis and multisystem inflammatory disorders peaked in the younger ages (0–19). Rates for death and sudden death showed a clear age-pattern across study setting but could not be detected in settings including exclusively inpatient medical records. ARDS was more frequently reported in the hospital setting with rates declining in 20–29 year-olds and increasing again with age. Chilblain-like lesion and erythema multiforme diagnoses were more frequently reported in settings including GPs medical records. Rates of chilblain-like lesion peaked in 20–29 year-olds; erythema multiforme peaked in ages 0–19. Both AESIs showed higher rates in females compared to males.

3.3. Incidence rates in 2020 and in population with underlying conditions

For the year 2020, all AESIs, except anaphylaxis and ARDS, were less frequently reported in setting with emergency room visit. Anosmia-ageusia, sudden death, ARDS and thrombosis (CVST and VTE) were more frequently reported in settings with both GP and hospital medical records (Figure 3, Supplementary materials). IRs in population with underlying conditions showed significantly higher rates for all AESIs compared to the general population (data not shown).

4. Discussion

Based on data from 63 million European individuals, this cohort study generated age-and-sex specific background incidence rates with high precision for a pre-specified list of 41 AESIs, necessary for monitoring the safety of COVID-19 vaccines. We generated background incidence rates using a distributed data network with common protocol, common data model and common analytics using 10 diverse healthcare databases across 7 European countries. These rates have been reported from January 2021 onwards, periodically and were used throughout 2021 by the European Medicines Agency and vaccine manufacturers for observed/expected analyses (personal communication).

Gubernot et al. (2021)[25] recently conducted a literature review of incidence rates of 22 AESIs, as well as the Brighton Collaboration (9 events); our overall rates are consistent with the literature derived data although we could not compare age and sex strata. Li et al. (2021)[26] recently published a study on AESI incidence rates from the OHDSI network, which covered general practice or claims data from eight countries (USA, UK, Australia, France, Germany, Spain, Netherlands and Japan) and reported on 15 AESIs. In general, our results differed substantially for several of the 12 common AESIs, which may be explained by the fact that incidence rates from the 8 countries were pooled regardless that the provenance of the events that went into the numerator differed substantially: 5 of the OHDSI data sources only captured GP recorded diagnosis data, whereas US and Japanese data captured claims. Our approach and strength were to pool results only across similar provenance of the event and to present the rates by provenance, independently. We considered the different provenances in the analysis as this is crucial for the correct interpretation of real-world evidence derived from heterogeneous data sources, several AESIs are only diagnosed in secondary care and are underestimated in primary care medical records, such as cardiovascular and thrombotic events. We would recommend that rates are presented by provenance and that this diversity is preserved in the pooling for the observed/expected analyses.

To give examples we briefly describe and compare the rates for selected AESIs with published data focusing on AESI that have been identified as safety risks following administration of COVID-19 vaccines. More detailed contextualization of the rates for each of the different AESI against published references is available in our final study report [23]. Our VTE rates (pulmonary embolism and deep vein thrombosis) were of similar magnitude compared to literature data retrieved by Gubernot et al. (2021) (108–117/100,000 PY)(25), Huang et al. (2014) (133/100,000 PY)[27] and Pottegard et al. (2021) (126–158/100,000 PY)[28], with a notable increase of incidence with age. For TTS, we operationalized the Brighton Collaboration case definition after a public webinar by VAC4EU & Brighton Collaboration (https://youtu.be/-Sp5GKfzB2I) establishing four subcategories of thromboembolic events, i.e., venous thrombosis (VTE), arterial thrombosis (AMI and stroke), CVST and the combination of all (mixed venous and arterial), each of the thromboembolic conditions was stratified by the co-occurrence of a thrombocytopenia diagnosis within 10 days around the thromboembolic diagnosis. Our observations suggested that CVST is extremely rare, as are any of the combinations with thrombocytopenia, with rates estimated at < 1 to 5/100,000 PY. These observations are consistent with the recent study from Burn (2021)[29]. Our clinical definition for thrombocytopenia included both immune thrombocytopenia and secondary thrombocytopenia and showed higher rates compared to other published references such as Li et al. [26]which restricted the concept definition to immune diseases (448/100,000 PY versus 56/100,000 PY in males of older ages). Our incidence rates for the composite endpoint myocarditis/pericarditis were slightly lower compared to data from Li et al. (2021)[26], since we excluded chronic conditions and causes such as rheumatism. Our rates of myocarditis were much lower than our composite of myocarditis/pericarditis, showing that the composite endpoint was mainly driven by pericarditis medical conditions. Our rates for myocarditis differed by age and sex and were comparable with Gubernot et al. (2021) [25] which reported rates ranging from 1 to 10 cases/100,000 PY. The impact of the COVID-19 pandemic on healthcare seeking and recording was clearly highlighted in the year 2020 with a sharp increase in rates in medical events directly related to COVID-19 such as ARDS, sudden death and anosmia-ageusia.

4.1. How to use the background rates

In the context of readiness for real-world monitoring of COVID-19 vaccines, the background incidence rates, which had been released periodically and openly, have been proven useful for observed-to-expected (O/E) analyses by EMA and vaccine manufacturers. In vaccine pharmacoepidemiology, signal detection methods are preliminary assessments allowing identification of potential safety concerns, but background rates are required to interpret them [30], [31]. Health authorities usually request O/E analysis to refine detected safety signals before implementing any further assessments [32]. The O/E analysis relies on exposure data and published background incidence rates. Since mass vaccinations campaigns usually roll out in a channeled manner, it is of crucial importance to have rates stratified by age, sex, and underlying comorbidity, which usually is poorly documented in the literature. In this study, age, sex-stratified and comorbidity specific rates were generated from 10 existing large electronic healthcare databases in 7 European countries, with semantically harmonized data. Because each data source has its own characteristics with regards to provenance of the events (GPs only, in or outpatient settings, emergency room visit or specialist referrals), we provided pooled estimates according to the provenance of the events. Background rates can be generated prior to vaccination roll-out when electronic health data are available, but the users should be aware of the nature of the event, the setting in which it is diagnosed, and evaluate whether the data source appropriately captures the data. Data sources that contain data from the setting where the disease is typically diagnosed should be preferred. In this study, given the nature of the AESIs included in this study, we recommend using background rates from data banks that show the highest level of completeness of identification of these events in terms of the type of diagnoses (such as GP and hospital-based data sources) which includes all data sources with such subpopulation. For some events, such as CVST, data sources including emergency and outpatient visits may be preferred, while for anosmia-ageusia or chilblain-like lesions, data sources including GPs setting would be recommended (see Table 1 in Supplementary materials).

4.2. Strengths and Limitations

ACCESS was a project funded by the EMA to prepare European infrastructure to monitor COVID-19 vaccines. The project ran from May 2020 to July 2021 and delivered background rates of AESIs, template protocols for implementation of observational studies, and feasibility assessments in each country to participate in studies and analyses performed by EMA for its Scientific Committees and ECDC. All deliverables have been made publicly available immediately to the scientific community through the EU PAS register (https://www.encepp.eu/encepp/studiesDatabase.jsp), the VAC4EU website (https://vac4eu.org) and Zenodo (https://www.zenodo.org). European data sources are quite heterogeneous because of different coding systems, health care practices, provenance of diagnosis and systems. To standardize the analytical process, we applied a two-step approach, first a syntactic harmonization, putting all data in the same structure, and secondly a semantic harmonization, which was conducted transparently and centrally through a R-script. Semantic harmonization is complex, and infinite. It comprises harmonization of different coding systems with different granularity levels, coding practices in different settings. The harmonization process across terminologies was organized through the use of the Unified Medical Language System using the Codemapper [21] followed by extensive review of codes by the DAPs. This harnesses the expertise of the local data sources. It is acknowledged that while a rigorous harmonization process has been applied, residual heterogeneity may persist within and between data sources which would impact pooled results. Therefore, it is our recommendation to consider, in addition to pooled incidence rates, data source-specific incidence rates for further used of the generated data. The question on heterogeneity paths the way for the development of metrics to measure heterogeneity in data sources and the development of guidance to define acceptable thresholds when conducting distributed data network studies. Our study stressed the importance of an appropriate study setting to conduct future safety research’s studies. Due to the nature and resource constraints for this study, case validation could not be conducted; we attempted to assess and reveal the impact of potential misclassification by using narrow and broad clinical definitions for most of the AESIs, and by stratifying by the provenance of diagnosis. In some instances, the governance approval can be a lengthy process, especially in a pandemic situation. For this reason, the Danish DAP decided to prioritize the use of a set of data for which ethics approval was previously obtained. For the other DAPs, we obtained governance approvals from scientific and ethics committees within a few weeks after submission of the protocol. Access to data was also facilitated with pre-agreement with DAPs. Ultimately, we could generate background incidence rates for newly identified syndrome like TTS in a few days showing the strength of the network in rapid response to specific research questions.

5. Conclusion

The ACCESS project started at an early stage of the COVID-19 pandemic as a component of the EMA readiness strategy for the times where vaccines would be authorized. ACCESS was successful in delivering these data on time as the first set of background rates were made available to EMA in December 2020, providing support to the safety monitoring of vaccines as soon as they were available in the EU. A large population of 63 million European individuals was included in the study, without restrictions beyond study period, to ensure representativeness to the European population and generalizability of the background incidence rates.

Declaration of Competing Interest

The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: [Corinne Willame is conducting a PhD at the University Medical Center Utrecht, as part of which she coordinated this project from January 2021. She works full time for Janssen Pharmaceutica and allocated working-time to finalize this project, this should be considered an in-kind contribution. Janssen Pharmaceutica was not a member of the consortium or project. Rosa Gini, Claudia Bartolini and Olga Paoletti are employed by ARS Toscana, a public research center that conducts or participates in pharmacoepidemiology studies compliant with the ENCePP Code of Conduct. The budget of ARS is partially sustained by such studies. Nicolas Thurin works for Bordeaux PharmacoEpi, an independent research platform of the Bordeaux University and its subsidiary the ADERA SAS, which performs financially supported studies for public and private partners, in line with the ENCePP Code of Conduct. Lei Wang, Vera Ehrenstein and Johnny Kahlert are salaried employees of their organization, which receives institutional research funding from pharmaceutical companies and regulatory agencies, administered by Aarhus University. Miriam Sturkenboom, Daniel Weibel, Carlos E. Durán, Roel Elbers, are salaried employees by University Medical Center Utrecht, which receives institutional research funding from pharmaceutical companies and regulatory agencies, administered by University Medical Center Utrecht. All these studies follow the ENCePP code of conduct. Miriam Sturkenboom is a consultant to the Task Force for Global Health for the Safety Platform for Emergency vACcines (SPEAC) project. Caitlin Dodd currently works for Panalgo. Felipe Villalobos, Meritxell Pallejà-Millán and María Aragón are salaried employees at Fundació Institut Universitari per a la recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), which receives institutional research funding from public and private partners, pharmaceutical companies and regulatory agencies, administered by IDIAPJGol. Mar Martín-Pérez, Patricia García-Poza, Airam de Burgos, Maria Martínez-González and Verónica Bryant are employees at the Spanish Agency for Medicines and Medical Devices which fully finance the BIFAP database, a non-profit database for performing independent pharmacoepidemiologic research in the public research Spanish setting. Consuelo Huerta was employee at the Spanish Agency for Medicines during the time in which this study was performed. She is currently assistant professor of the Public Health Department of the Faculty of Medicine in the Complutense University of Madrid. Ulrike Haug and Tania Schink are working at an independent, non-profit research institute, the Leibniz Institute for Prevention Research and Epidemiology – BIPS. Unrelated to this study, BIPS occasionally conducts studies financed by the pharmaceutical industry. Almost exclusively, these are post-authorization safety studies (PASS) requested by health authorities. The design and conduct of these studies as well as the interpretation and publication are not influenced by the pharmaceutical industry and performed in line with the ENCePP Code of Conduct. All other authors have no conflict of interest.].

Acknowledgments

Acknowledgement

The research leading to these results was conducted as part of the activities of the EU PE&PV (Pharmacoepidemiology and Pharmacovigilance) Research Network which is a public academic partnership coordinated by the Utrecht University, The Netherlands. EU PE&PV collaborated with the Vaccine Monitoring Collaboration for Europe (VAC4EU) association The project has received support from the European Medicines Agency under the Framework service contract nr EMA/2018/28/PE. The content of this paper expresses the opinion of the authors and may not be understood or quoted as being made on behalf of or reflecting the position of the European Medicines Agency or one of its committees or working parties.

Approvals

Access to databases was approved by local scientific and ethics committees. Use of CPRD data in this study was approved by the Independent Scientific Advisory Committee for MHRA database research (protocol 20_000177).

Authors contribution

All authors contributed equally to the conduct of the study. All authors reviewed and approved the study design and study protocol. All authors approved the final version of the manuscript before submission.

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.vaccine.2022.11.031.

Appendix A. Supplementary material

The following are the Supplementary data to this article:

Supplementary data 1
mmc1.pdf (331.4KB, pdf)
Supplementary data 2
mmc2.xlsx (749.6KB, xlsx)
Supplementary data 3
mmc3.pptx (40.1KB, pptx)

Data availability

The authors are unable or have chosen not to specify which data has been used.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary data 1
mmc1.pdf (331.4KB, pdf)
Supplementary data 2
mmc2.xlsx (749.6KB, xlsx)
Supplementary data 3
mmc3.pptx (40.1KB, pptx)

Data Availability Statement

The authors are unable or have chosen not to specify which data has been used.


Articles from Vaccine are provided here courtesy of Elsevier

RESOURCES