Robustness of outcomes in trials evaluating sodium–glucose co‐transporter 2 inhibitors for heart failure

Muhammad Shariq Usman; Muhammad Shahzeb Khan; Gregg C Fonarow; Stephen J Greene; Tim Friede; Muthiah Vaduganathan; Gerasimos Filippatos; Andrew J Stewart Coats; Stefan D Anker; Javed Butler

doi:10.1002/ehf2.13785

. 2022 Jan 13;9(2):885–893. doi: 10.1002/ehf2.13785

Robustness of outcomes in trials evaluating sodium–glucose co‐transporter 2 inhibitors for heart failure

Muhammad Shariq Usman ^1,^{^†}, Muhammad Shahzeb Khan ^2,^{^†}, Gregg C Fonarow ³, Stephen J Greene ², Tim Friede ^4,⁵, Muthiah Vaduganathan ⁶, Gerasimos Filippatos ⁷, Andrew J Stewart Coats ^8,⁹, Stefan D Anker ¹⁰, Javed Butler ^1,^✉

PMCID: PMC8934993 PMID: 35029056

Abstract

Aims

Recent trials have evaluated sodium–glucose co‐transporter 2 inhibitors in patients with heart failure (HF). We sought to assess the robustness of findings from these trials using the fragility index (FI).

Methods and results

Fragility index is defined as the minimum number of patients that must be moved from the ‘non‐event’ to the ‘event’ group to turn a statistically significant result to non‐significant. In addition to FI, fragility quotient [(FQ); FI divided by the sample size] was calculated to assess the proportion of events that must be moved to change the significance. For statistically non‐significant outcomes, reverse fragility index (RFI) and reverse fragility quotient (RFQ) were calculated. Robustness of findings after pooling data from all three trials was also assessed. A robust reduction in first HF hospitalization or cardiovascular mortality was seen with dapagliflozin (FI = 62 and FQ = 0.013), empagliflozin (FI = 50 and FQ = 0.013), and sotagliflozin (FI = 60 and FQ = 0.049). Dapagliflozin nominally improved all‐cause and cardiovascular mortality, with modest FI (n = 8 and 5) and FQ (0.002 and 0.001). Empagliflozin and sotagliflozin did not demonstrate statistically significant reductions in all‐cause mortality, with modest RFI (empagliflozin: RFI = 26 and RFQ = 0.007; sotagliflozin: RFI = 6 and RFQ = 0.005). A similar trend was seen with cardiovascular mortality (empagliflozin: RFI = 24 and RFQ = 0.006; sotagliflozin: RFI = 7 and RFQ = 0.006). Upon meta‐analysis, the result for first HF hospitalization or cardiovascular mortality was robust (FI = 95 and FQ = 0.010). The reductions in all‐cause (FI = 12 and FQ = 0.001) and cardiovascular mortality (FI = 9 and FQ = 0.001), while statistically significant, were fragile.

Conclusion

Improvement in the composite outcome of first HF hospitalization or cardiovascular death was highly concordant and robust across sodium–glucose co‐transporter 2 inhibitor trials. In contrast, secondary endpoints of all‐cause and cardiovascular mortality were statistically fragile, underscoring the need to power trials for mortality to fully understand the benefit of therapies on fatal events.

Keywords: Fragility index, Robustness, Sodium–glucose co‐transporter 2 inhibitors, Cardiac failure

Introduction

Recent trials show that sodium–glucose co‐transporter 2 (SGLT2) inhibitors improve heart failure (HF) outcomes. ¹ , ² , ³ The ‘Dapagliflozin and Prevention of Adverse‐Outcomes in HF’ (DAPA‐HF) ¹ showed that dapagliflozin reduced the composite endpoint of first HF hospitalization, urgent HF visit, or cardiovascular mortality among patients with HF with reduced ejection fraction (HFrEF). The ‘Empagliflozin Outcome Trial in Patients With Chronic HFrEF’ (EMPEROR‐Reduced) enrolled higher risk HFrEF patients and demonstrated a similar benefit. ² Recently, the ‘Effect of Sotagliflozin on Cardiovascular Events in Patients With Type 2 Diabetes Post Worsening HF’ (SOLOIST‐WHF) trial evaluated the effects of sotagliflozin in HF patients with diabetes and recent hospitalization for worsening HF ³ and included patients with HFrEF as well as HF with preserved ejection fraction (HFpEF). While this trial was terminated early, SOLOIST‐WHF also showed a reduction in first HF hospitalization or cardiovascular mortality.

While the magnitude of benefit for the primary endpoint was consistent in all trials, the observed effects on the secondary endpoint of mortality varied. Mortality benefit was nominally significant with dapagliflozin but not with empagliflozin or sotagliflozin, raising questions regarding differences in baseline risk profiles or play of chance. None of these trials however were designed to assess mortality impact by itself. Robustness of statistically significant and non‐significant dichotomous outcomes can be evaluated using fragility index (FI) and reverse fragility index (RFI). ⁴ , ⁵ FI can help evaluate trial results in addition to P‐values and effect size estimates. In this study, we sought to assess the robustness of the results across the SGLT2 inhibitors trials in HF by assessing the FI and RFI for clinical outcomes.

Methods

Study populations, definitions, and outcomes of interest

Publicly available data were utilized, and thus, institutional review board approval was not applicable. All placebo‐controlled trials designed to evaluate outcomes in HF patients using SGLT2 inhibitors were included. Three randomized controlled trials met these criteria. DAPA‐HF and EMPEROR‐Reduced included HFrEF outpatients with or without diabetes. ¹ , ² EMPEROR‐Reduced enrolled a higher risk population with lower ejection fraction (EF) and estimated glomerular filtration rate, and higher natriuretic peptides. SOLOIST‐WHF enrolled patients with diabetes hospitalized for worsening HF, regardless of EF. ³ Sotagliflozin, studied in SOLOIST‐WHF, differs from other SGLT2 inhibitors as it also has SGLT1‐inhibiting activity. ³

The primary outcome varied in all three trials. In EMPEROR‐Reduced, it was a composite of first HF hospitalization or cardiovascular mortality. DAPA‐HF had a similar primary composite but included urgent outpatient visits for intravenous HF therapy. The number of urgent visits was few, and excluding them resulted in no meaningful change in the effect size. The primary outcome in SOLOIST‐WHF was a composite of total (first and recurrent) HF hospitalizations, urgent HF visits, and cardiovascular mortality. We could not evaluate the FI/RFI for this outcome without patient‐level data access, but SOLOIST‐WHF also reported the composite of first HF hospitalization or cardiovascular mortality.

Fragility index and RFI were assessed for the (i) composite of first HF hospitalization or cardiovascular mortality, (ii) first HF hospitalization, (iii) cardiovascular mortality, and (iv) all‐cause mortality. FI for subgroup of the primary outcome was also assessed, but this was not possible for the SOLOIST‐WHF trial because the primary outcome included recurrent events.

Statistical analysis

For significant outcomes, FI was calculated in the manner described by Walsh et al. ⁶ In the treatment arm with a lower event rate, patients were added to the event group while subtracting patients from the non‐event group. Fisher's exact test was used to recalculate the two‐sided P‐value, while iteratively adding events until the P‐value became ≥0.05. For non‐significant outcomes, RFI was calculated. The total number of events in each group over the entire follow‐up was considered. Lower FI/RFI indicates less statistical robustness; however, there is no standardized cut‐off defined for acceptable fragility. Loss of follow‐up was compared with FI/RFI for each trial as it affects both the number of participants at risk and the number of events. When loss to follow‐up exceeds the FI or RFI, results should be cautiously interpreted as events of interest may occur in patients lost to follow‐up and factoring these may shift the results.

Fragility quotient (FQ), ⁷ , ⁸ which is the FI divided by the sample size, was also calculated to assess what proportion of patients must change status to change the significance of results. For instance, trial X has an FI of 2 and sample size of 500 while trial Y has an FI of 2 and sample size of 1000. Although both trials have the same FI, FQ can gauge which trial is ‘relatively’ more fragile. Trial X has an FQ of 0.004, meaning that four events per 1000 patients will be needed to change the results significance; while trial Y has an FQ of 0.002, indicating that the non‐significance of trial Y is contingent on ~2 events per 1000 patients, suggesting trial Y as more fragile. For statistically non‐significant outcomes, RFQ was calculated by dividing the RFI by the sample size. FI, RFI, FQ, and RFQ were calculated using the R Version 3.51 (R Project for Statistical Computing) and Excel, Version 14.1.3 (Microsoft 160 Corp).

The FI was also calculated after pooling data from all three trials. The previous meta‐analysis utilized HRs for which FI or RFI cannot be calculated. ⁹ For this study, (logarithm of the) risk ratios (RRs) were pooled from each study, which were calculated from dichotomous endpoints, ignoring the event times. A random‐effects model was used for meta‐analysis. Weights were assigned using the Mantel–Haenszel method. Fragility of meta‐analysis results was assessed using the technique described by Atal et al. ¹⁰ Review Manager (V.5.3) was used to conduct the meta‐analysis, and the calculator available at http://clinicalepidemio.fr/fragility_ma/ was used to calculate FI of meta‐analysis.

Results

Patient population

The baseline characteristics of patients are shown in Table 1 . The three studies included a total of 9696 patients (DAPA‐HF, n = 4744; EMPEROR‐Reduced, n = 3730; and SOLOIST‐WHF, n = 1222). The median follow‐up time was 18 months in DAPA‐HF, 16 months in EMPEROR‐Reduced, and 9 months in SOLOIST‐WHF. The number of patient's lost to follow‐up in DAPA‐HF, EMPEROR‐Reduced, and SOLOIST‐WHF was 36, 42, and 43, respectively.

Table 1.

Baseline characteristics

	EMPEROR‐Reduced		DAPA‐HF		SOLOIST‐WHF
	Empagliflozin	Placebo	Dapagliflozin	Placebo	Sotagliflozin	Placebo
Number of participants	1863	1867	2373	2371	608	614
Age, years (SD)	67.2 (10.8)	66.5 (11.2)	66.2 (11.0)	66.5 (10.8)	69 (63–76)	70 (64–76)
Sex, n (%)
Male	1426 (76.5)	1411 (75.6)	1809 (76.2)	1826 (77.0)	410 (67.4)	400 (65.1)
Female	437 (23.5)	456 (24.4)	564 (23.8)	545 (23.0)	198 (32.6)	214 (34.9)
NYHA functional classification (%)
II	75.1	75.0	67.7	67.4
III	24.4	24.4	31.5	31.7
IV	0.5	0.6	0.8	1.0
Mean LVEF (%)	27.7 (6.0)	27.2 (6.1)	31.2 (6.7)	30.9 (6.9)	35 (28–47)	35 (28–45)
HFpEF (%)					127 (20.9)	129 (21.0)
HFrEF (%)					481 (79.1)	485 (79.0)
NT‐proBNP (pg/mL)	1887 (1077–3429)	1926 (1153–3525)	1428 (857–2655)	1446 (857–2641)	1817 (845–3659)	1741 (843–3582)
Hospitalization for heart failure	577 (31.0)	574 (30.7)	1124 (47.4)	1127 (47.5)
Diabetes	927 (49.8)	929 (49.8)	1075 (45.3)	1064 (44.9)	25 (4.1)	20 (3.3)
Duration of diabetes (years)
eGFR (mL/min/1.73 m²)	61.8 (21.7)	62.2 (21.5)	66.0 (19.6)	65.5 (19.3)	49.2 (39.5–61.2)	50.5 (40.5–64.6)
Heart failure medications, n (%)
Renin–angiotensin inhibitor	1644 (88.8)	1673 (90.6)	>90%	>90%	553 (91.0)	563 (91.7)
ACE inhibitor	867 (46.5)	836 (44.8)	1332 (56.1)	1329 (56.1)	254 (41.8)	241 (39.3)
ARB	451 (24.2)	457 (24.5)	675 (28.4)	632 (26.7)	245 (40.3)	270 (44.0)
ARNI	340 (18.3)	387 (20.7)	250 (10.5)	258 (10.9)	93 (15.3)	112 (18.2)
Mineralocorticoid receptor antagonist	1306 (70.1)	1355 (72.6)	1696 (71.5)	1674 (70.6)	403 (66.3)	385 (62.7)
Beta‐blocker	1765 (94.7)	1768 (94.7)	2278 (96.0)	2280 (96.2)	564 (92.8)	561 (91.4)
ICD or CRT‐D	578 (31.0%)	593 (31.8%)	622 (26.2%)	620 (26.1%)
CRT‐D or CRT‐P	220 (11.8%)	222 (11.9%)	190 (8.0%)	164 (6.9%)

Open in a new tab

ACE, angiotensin‐converting enzyme; ARB, angiotensin receptor blocker; ARNI, angiotensin receptor/neprilysin inhibitor; CRT‐D, cardiac resynchronization therapy defibrillator; CRT‐P, cardiac resynchronization therapy pacemaker; eGFR, estimated glomerular filtration rate; HF, heart failure; HFpEF, heart failure with preserved ejection fraction; HFrEF, heart failure with reduced ejection fraction; ICD, implantable cardiac defibrillator; IQR, inter‐quartile range; LVEF, left ventricular ejection fraction; NT‐proBNP, N‐terminal pro‐B‐type natriuretic peptide; NYHA, New York Heart Association; SD, standard deviation.

Data are reported as n (%), mean (SD), or median (IQR).

Fragility index, reverse fragility index, and fragility quotient

Table 2 summarizes the findings from each trial and meta‐analysis and displays the FI/RFI and FQ. Figures 1 and 2 visually represent the FI/RFI and FQ for each outcome of interest.

Table 2.

Fragility of findings from SGLT2i trials and meta‐analysis of these trials

	DAPA‐HF					EMPEROR‐Reduced					SOLOIST‐WHF					Meta‐analysis
	Events/total SGLT2i	Events/total placebo	HR [CI]	FI/RFI	FQ	Events/total SGLT2i	Events/total placebo	HR [CI]	FI/RFI	FQ	Events/total SGLT2i	Events/total placebo	HR [CI]	FI/RFI	FQ	Events/total SGLT2i	Events/total placebo	RR [CI]	FI/RFI	FQ
HFH or CVM	386/2373	502/2371	0.74 [0.65–0.85]	62	0.0130	361/1863	462/1867	0.75 [0.65–0.86]	50	0.0130	201/608	298/614	0.71 [0.56–0.89]	60	0.0490	948/4844	1262/4852	0.75 [0.69–0.81]	95	0.0098
First HFH	231/2373	318/2371	0.70 [0.59–0.83]	43	0.0090	246/1863	342/1867	0.69 [0.59–0.81]	50	0.0130	N/A	N/A	N/A	N/A	N/A	477/4236	660/4238	0.72 [0.65–0.81]	61	0.0072
CVM	227/2373	273/2371	0.82 [0.69–0.98]	5	0.0010	187/1863	202/1867	0.92 [0.75–1.12]	24 ^a	0.0060	51/608	58/614	0.84 [0.58–1.22]	7 ^a	0.0060	465/4844	533/4852	0.87 [0.78–0.98]	9	0.0009
ACM	276/2373	329/2371	0.83 [0.71–0.97]	8	0.0020	249/1863	266/1867	0.92 [0.77–1.10]	26 ^a	0.0070	65/608	76/614	0.82 [0.59–1.14]	6 ^a	0.0050	590/4844	671/4852	0.88 [0.79–0.98]	12	0.0012

Open in a new tab

ACM, all‐cause mortality; CI, confidence interval; CVM, cardiovascular mortality; FI, fragility index; FQ, fragility quotient; HF, heart failure; HFH, heart failure hospitalization; HR, hazard ratio; N/A, not available; RFI, reverse fragility index; SGLT2i, sodium–glucose co‐transporter 2 inhibitor.

^{^a}

Reverse fragility index/reverse fragility quotient.

Fragility index and reverse fragility index of outcomes in heart failure (HF)‐specific sodium–glucose co‐transporter 2 inhibitor trials. CV, cardiovascular. *Reverse fragility index.

Fragility and reverse fragility quotients of outcomes in heart failure (HF)‐specific sodium–glucose co‐transporter 2 inhibitor trials. CV, cardiovascular. *Reverse fragility quotient.

Composite of first heart failure hospitalization or cardiovascular mortality

SGLT‐2 inhibitors reduced the risk of the composite endpoint of first HF hospitalization or cardiovascular mortality in DAPA‐HF (hazard ratio [HR] 0.74 [0.65–0.85]), EMPEROR‐Reduced (HR: 0.75 [0.65–0.86]), and SOLOIST‐WHF (HR: 0.71 [0.56–0.89]) trials. The results were robust (dapagliflozin: FI = 62 and FQ = 0.013; empagliflozin: FI = 50 and FQ = 0.013; and sotagliflozin: FI = 60 and FQ = 0.049), and FI was greater than patients lost to follow‐up. Table 3 shows the FI/RFI of the primary outcomes of DAPA‐HF and EMPEROR‐Reduced stratified according to different subgroups. Meta‐analysis demonstrated significant (RR: 0.75 [0.69–0.81]; P < 0.001; I ² = 20%) and robust (FI = 95 and FQ = 0.010) benefit.

Table 3.

Fragility index and reverse fragility index for subgroup analyses of the primary endpoint in DAPA‐HF and EMPEROR‐Reduced

Subgroup	DAPA‐HF					EMPEROR‐Reduced
Subgroup	Events/total in SGLT2 inhibitor group	Events/total in placebo group	HR [95% CI]	FI/RFI	FQ/RFQ	Events/total in SGLT2 inhibitor group	Events/total in placebo group	HR [95% CI]	FI/RFI	FQ/RFQ
Diabetes	215/1075	271/1064	0.75 [0.63–0.90]	20	0.009	200/927	265/929	0.72 [0.60–0.87]	27	0.015
No diabetes	171/1298	231/1307	0.73 [0.60–0.88]	22	0.008	161/936	197/938	0.78 [0.64–0.97]	2	0.001
Men	307/1809	406/1826	0.73 [0.63–0.85]	47	0.013	294/1426	353/1411	0.80 [0.68–0.93]	18	0.006
Women	79/564	96/545	0.79 [0.59–1.06]	3 ^a	0.003 ^a	67/437	109/456	0.59 [0.44–0.80]	14	0.016
Receiving ARNI	41/250	56/258	0.75 [0.50–1.13]	3 ^a	0.006 ^a	51/430	93/387	0.64 [0.45–0.89]	11	0.013
Not receiving ARNI	345/2123	446/2113	0.74 [0.65–0.86]	52	0.012	310/1523	369/1480	0.77 [0.66–0.90]	24	0.008
Age <65 years	162/1032	196/998	0.78 [0.63–0.96]	7	0.003	128/675	193/740	0.71 [0.57–0.89]	18	0.013
Age >65 years	224/1341	306/1373	0.72 [0.60–0.85]	34	0.013	233/1188	269/1127	0.78 [0.66–0.93]	10	0.004
History of HF hospitalization	195/1124	279/1127	0.67 [0.56–0.80]	44	0.020	153/577	177/574	0.79 [0.64–0.99]	1	0.001
No history of HF hospitalization	191/1249	223/1244	0.84 [0.69–1.01]	3 ^a	0.001 ^a	208/1286	285/1293	0.71 [0.60–0.85]	35	0.014
eGFR < 60	191/962	254/964	0.72 [0.59–0.86]	26	0.013	202/893	237/906	0.83 [0.69–1.00]	1 ^a	0.001 ^a
eGFR > 60	195/1410	248/1406	0.76 [0.63–0.92]	15	0.005	159/969	224/960	0.67 [0.55–0.83]	31	0.016

Open in a new tab

ARNI, angiotensin receptor/neprilysin inhibitor; CI, confidence interval; eGFR, estimated glomerular filtration rate; FI, fragility index; FQ, fragility quotient; HF, heart failure; HR, hazard ratio; RFI, reverse fragility index; SGLT2, sodium–glucose co‐transporter 2.

^{^a}

Reverse fragility index/reverse fragility quotient.

First heart failure hospitalization

Both DAPA‐HF (HR: 0.70 [0.59–0.83]) and EMPEROR‐Reduced (HR: 0.69 [0.59–0.81]) showed a significant reduction in HF hospitalization. The FI was high in DAPA‐HF (n = 43; FQ = 0.009) and EMPEROR‐Reduced (n = 50; FQ = 0.013), and FI was higher than the number of patients lost to follow‐up. This outcome was not reported in SOLOIST‐WHF. Meta‐analysis demonstrated a significant reduction in HF hospitalization with SGLT2 inhibitors (RR: 0.72 [0.65–0.81]; P < 0.001; I ² = 0%) with an FI of 61, and the FQ was 0.007.

Mortality

In DAPA‐HF, nominally significant reduction in all‐cause (HR: 0.83 [0.71–0.97]) and cardiovascular mortality (HR: 0.82 [0.69–0.98]) was observed. The FI was 8 for all‐cause and 5 for cardiovascular mortality, with FQs of 0.002 and 0.001, respectively. In EMPEROR‐Reduced, no statistically significant reduction in all‐cause (HR 0.92 [0.77–1.10]) or cardiovascular mortality (HR 0.92 [0.75–1.12]) was seen (RFI and RFQ were 26 and 0.007 for all‐cause and 24 and 0.006 for cardiovascular mortality). In SOLOIST‐WHF trial, no statistically significant difference in all‐cause (HR: 0.82 [0.59–1.14]) and cardiovascular mortality (0.84 [0.58–1.22]) was seen; results were fragile for in both cases (RFI = 6 and RFQ = 0.005 for all‐cause mortality; RFI = 7 and RFQ = 0.006 for cardiovascular mortality). Meta‐analysis demonstrated a significant, but fragile, reduction in all‐cause (RR: 0.88 [0.79–0.98]; P = 0.02; I ² = 0%; FI = 12 and FQ = 0.001) and cardiovascular mortality (RR: 0.87 [0.78–0.98]; P = 0.02; I ² = 0%; FI = 9 and FQ = 0.001).

Discussion

The DAPA‐HF, EMPEROR‐Reduced and SOLOIST‐WHF trials all reported highly concordant and statistically robust results for the primary endpoint of time to first HF hospitalization or cardiovascular death. ¹ , ² , ³ The FI for this endpoint was higher than the FI for outcomes in trials of other drugs, for example, therapies referenced in diabetes treatment guidelines (FI = 16) and anti‐thrombotic therapy (FI = 5) in venous thromboembolism guidelines. ¹¹ , ¹² The FI and RFIs of outcomes in landmark HF trials are displayed in Table 4 .

Table 4.

Fragility index of outcomes in key heart failure medication trials

Trial	Sample size	Primary endpoint	Primary endpoint			HF hospitalization/CV death			All‐cause mortality			CV mortality			HF hospitalization
Trial	Sample size	Primary endpoint	P‐value	FI/RFI	FQ/RFQ	P‐value	FI/RFI	RFQ/RFQ	P‐value	FI/RFI	RFQ/RFQ	P‐value	FI/RFI	RFQ/RFQ	P‐value	FI/RFI	RFQ/RFQ
SGLT2 inhibitors
DAPA‐HF	4744	HF hospitalization, urgent care visits due to HF or CV death	Sig	62	0.0131	Sig	62	0.0130	Sig	8	0.0020	Sig	5	0.0010	Sig	43	0.0090
EMPEROR‐Reduced	3730	HF hospitalization/CV death	Sig	50	0.0134	Sig	50	0.0134	NS	26	0.0070	NS	24	0.0060	Sig	50	0.0130
SOLOIST‐WHF	1222	Total (first and recurrent) HF hospitalization/CV death	Sig	NA	NA	Sig	60	0.0490	NS	6	0.0050	NS	7	0.0060	NA	NA	NA
ACE‐I
CONSENSUS	253	All‐cause mortality at 6 months	Sig	7	0.0277	NA	NA	NA	Sig	3	0.0120	Sig	4	0.0158	NA	NA	NA
SOLVD‐Treatment	2569	All‐cause mortality	Sig	10	0.0039	NA	NA	NA	Sig	10	0.0040	Sig	15	0.0058	Sig	91	0.0354
ARBs
Val‐HeFT	5010	All‐cause mortality/HF hospitalization/resuscitated cardiac arrest/administration of i.v. inotropic or vasodilator drugs for 4 or more hours	Sig	17	0.0034	NA	NA	NA	NS	63	0.0130	NS	58	0.0116	Sig	59	0.0118
CHARM‐Alternative	2028	HF hospitalization/CV death	Sig	29	0.0143	Sig	29	0.0143	NS	10	0.0049	NS	6	0.0030	Sig	40	0.0197
CHARM‐Added	2548	HF hospitalization/CV death	Sig	8	0.0031	Sig	8	0.0031	NS	10	0.0039	Sig	3	0.0012	Sig	5	0.0020
Beta‐blockers
CIBIS II	2647	All‐cause mortality	Sig	37	0.0140	NA	NA	NA	Sig	37	0.0140	Sig	11	0.0042	Sig	37	0.0140
MERIT‐HF	3991	All‐cause mortality	Sig	34	0.0085	NA	NA	NA	Sig	34	0.0085	Sig	38	0.0095	Sig	50	0.0125
COPERNICUS	2289	All‐cause mortality	Sig	30	0.0131	NA	NA	NA	Sig	30	0.0131	Sig	22	0.0096	Sig	37	0.0162
SENIORS	2128	All‐cause mortality/CV hospital admission	Sig	2	0.0009	NA	NA	NA	NS	11	0.0052	NS	7	0.0033	NS	31	0.0146
MRA
RALES	1663	All‐cause mortality	Sig	54	0.0325	NA	NA	NA	Sig	54	0.0325	Sig	46	0.0277	Sig	41	0.0247
EMPHASIS‐HF	2737	CV death/HF hospitalization	Sig	61	0.0223	Sig	61	0.0223	Sig	5	0.0018	Sig	3	0.0011	Sig	49	0.0179
H‐ISDN
A‐HeFT	1050	Composite score	Sig	NA	NA	NA	NA	NA	Sig	3	0.0029	Sig	2	0.0019	Sig	15	0.0143
Digoxin
DIG	6800	All‐cause mortality	NS	61	0.0090	NA	NA	NA	NS	61	0.008971	NS	87	0.0128	Sig	191	0.0281
Ivabradine
SHIFT	6505	HF hospitalization/CV death	Sig	67	0.0103	Sig	67	0.0103	NS	13	0.0019	NS	18	0.0028	Sig	91	0.0140
ARNI
PARADIGM‐HF	8399	HF hospitalization/CV death	Sig	118	0.0140	Sig	118	0.0140	Sig	49	0.0058	Sig	66	0.0079	Sig	54	0.0064
Vericiguat
VICTORIA	5050	HF hospitalization/CV death	Sig	8	0.0016	Sig	8	0.0016	NS	34	0.0067	NS	25	0.0050	NS	1	0.0002
Omecamtiv mecarbil
GALACTIC‐HF	8442	HF hospitalization, urgent care visits due to HF or CV death	Sig	1	0.0001	NS	33	0.0040	NS	78	0.0092	NS	79	0.0094	NS	40	0.0047

Open in a new tab

ACE‐I, angiotensin‐converting enzyme inhibitor; ARBs, angiotensin receptor blockers; ARNI, angiotensin receptor/neprilysin inhibitor; CV, cardiovascular; FI, fragility index; FQ, fragility quotient; HF, heart failure; H‐ISDN, hydralazine–isosorbide dinitrate; MRA, mineralocorticoid antagonist; RFI, reverse fragility index; RFQ, reverse fragility quotient; SGLT2, sodium–glucose co‐transporter 2.

In the SOLOIST‐WHF trial, the initial primary endpoint was the composite of first HF hospitalization or cardiovascular mortality; however, this was later changed to a composite of first and recurrent HF hospitalization, urgent HF visits, and cardiovascular mortality. Despite enrolling only 1222 patients, this trial showed a significant and robust (FI = 60) reduction in first HF hospitalization or cardiovascular mortality, reinforcing the benefit of SGLT2 inhibitors in HF. These effects remained consistent across a range of subgroups including patients with reduced and preserved EF, in‐hospital vs. post‐discharge initiation, and use of sacubitril/valsartan. ³ , ¹³

The risk of all‐cause and cardiovascular mortality was nominally reduced in DAPA‐HF but not in EMPEROR‐Reduced or SOLOIST‐WHF. Mortality outcomes were secondary endpoints and statistically fragile, with significance dependent on less than 10 events per 1000 patients. Meta‐analysis showed a robust FI for the composite endpoint of first HF hospitalization or cardiovascular mortality was 95 (FQ = 0.010) but did not result in higher FI for cardiovascular (FI = 9) or all‐cause mortality (FI = 12). These findings highlight that although combining studies via random‐effects meta‐analysis can help detect a statistically significant treatment effect by increasing power, this does not necessarily result in an increased FI and that the FI and RFI are not strictly linked to statistical significance, confidence intervals, and power. While the aforementioned are related concepts, FI offers additional value in interpretation of clinical trials and meta‐analyses.

The fact that a few events could change the statistical significance of the all‐cause and cardiovascular mortality underscores the importance of powering trials for mortality outcomes. When the primary outcome is a composite including non‐fatal events, neither the total number of fatal events nor the duration of trial follow‐up supports deriving definitive conclusions regarding mortality. While it is not possible to power trials for all secondary endpoints, considering the risk for mortality in HF, strong consideration should be given to designing trials for confirming mortality results independently by either larger sample size or longer follow‐up or both.

Background therapy can potentially influence the FI and RFI of clinical trials. Similar to most contemporary HF trials, patients in EMPEROR‐Reduced, DAPA‐HF, and SOLOIST‐WHF were well treated with guideline‐directed medical therapy at baseline (Table 1 ). In all three trials, over 90% of the participants were using beta‐blockers and renin–angiotensin–aldosterone inhibitors, and over two‐thirds were using mineralocorticoid receptor antagonists. There was some variation in the proportion of patients using a neprilysin inhibitor in addition to a renin–angiotensin–aldosterone inhibitor, with the highest rate of use in EMPEROR‐Reduced (20%), followed by SOLOIST‐WHF (17%) and then DAPA‐HF (11%). Overall, the background therapies across trials were similar and unlikely to influence the FI or RFI.

There are several limitations of this study. FI does not account for the difference in time to event and can give fragile results when the number of events in each group is the same but have a difference in the timing of these events. However, studies have shown no difference when FI was applied to time‐to‐event vs. frequency data. ⁴ , ⁵ , ⁶ Because trials are powered to detect the effect on primary outcome, interpretability of FI for secondary outcomes and subgroup analyses is limited. The use of Fisher's exact test in calculation of FI and RFI may be limited as DAPA‐HF and EMPEROR‐Reduced trials analysed data in models with covariates and time‐to‐event techniques in which the original data if analysed with Fisher's exact test may not yield the same P‐value as the published trial. While FI may perpetuate the dichotomous P‐value‐oriented data interpretation, it provides a more circumspect view of assessing results than based solely on P. Bayesian approaches may provide an alternate option, but the majority of trials currently are based on frequentist approaches.

In conclusion, findings for the composite endpoint of first HF hospitalization or cardiovascular death were highly consistent and robust across trials with dapagliflozin, empagliflozin, and sotagliflozin. In contrast, findings for the all‐cause and cardiovascular mortality were overall significant but fragile when meta‐analytically assessed, underscoring the need to design trials with adequate power and follow‐up to definitely assess the impact of novel interventions on mortality.

Conflict of interest

G.C.F. reports consulting for Abbott, Amgen, AstraZeneca, Bayer, CHF Solutions, Janssen, Medtronic, Merck, and Novartis. S.J.G. was supported by the Heart Failure Society of America/Emergency Medicine Foundation Acute Heart Failure Young Investigator Award funded by Novartis; has received research support from the American Heart Association, AstraZeneca, Amgen, Bristol Myers Squibb, Merck, and Novartis; has served on advisory boards for Amgen and Cytokinetics; and serves as a consultant for Amgen and Merck. M.V. is supported by the KL2/Catalyst Medical Research Investigator Training Award from Harvard Catalyst (NIH/NCATS Award UL 1TR002541); serves on advisory boards for Amgen, AstraZeneca, Baxter Healthcare, Bayer AG, Boehringer Ingelheim, Cytokinetics, and Relypsa; and participates on clinical endpoint committees for studies sponsored by Galmed, Novartis, and the NIH. T.F. reports personal fees from Novartis, Bayer, Janssen, SGS, Roche, Boehringer Ingelheim, Daiichi Sankyo, Galapagos, Penumbra, Parexel, Vifor, Biosense Webster, CSL Behring, Fresenius Kabi, Coherex Medical, and LivaNova. G.F. participated in committees of trials and registries sponsored by BI, Bayer, Medtronic, Servier, Novartis, and Vifor. A.J.S.C. reports honoraria and/or lecture fees from AstraZeneca, Bayer, Menarini, Novartis, Nutricia, Servier, Vifor, Actimed, Cardiac Dimensions, CVRx, Enopace, Faraday, Gore, Impulse Dynamics, Respicardia, Stealth Peptides, V‐Wave, Corvia, Arena, and ESN Cleer. S.D.A. has received research support from Vifor International and Abbott Vascular, and fees for consultancy and/or speaking from AstraZeneca, Bayer, Boehringer Ingelheim, Respicardia, Impulse Dynamics, Janssen, Novartis, Servier, and Vifor International. J.B. is a consultant to Abbott, Amgen, Applied Therapeutics, Array, AstraZeneca, Bayer, Boehringer Ingelheim, CVRx, Eli Lilly, G3 Pharmaceutical, Impulse Dynamics, Innolife, Janssen, Luitpold, Medtronic, Merck, Novartis, Novo Nordisk, Relypsa, Sequana, Stealth Peptides, and Vifor. All other authors report no disclosures.

Funding

None.

Usman, M. S. , Khan, M. S. , Fonarow, G. C. , Greene, S. J. , Friede, T. , Vaduganathan, M. , Filippatos, G. , Coats, A. J. S. , Anker, S. D. , and Butler, J. (2022) Robustness of outcomes in trials evaluating sodium–glucose co‐transporter 2 inhibitors for heart failure. ESC Heart Failure, 9: 885–893. 10.1002/ehf2.13785.

References

1. McMurray JJV, Solomon SD, Inzucchi SE, Køber L, Kosiborod MN, Martinez FA, Ponikowski P, Sabatine MS, Anand IS, Bělohlávek J, Böhm M, Chiang CE, Chopra VK, de Boer RA, Desai AS, Diez M, Drozdz J, Dukát A, Ge J, Howlett JG, Katova T, Kitakaze M, Ljungman CEA, Merkely B, Nicolau JC, O'Meara E, Petrie MC, Vinh PN, Schou M, Tereshchenko S, Verma S, Held C, DeMets DL, Docherty KF, Jhund PS, Bengtsson O, Sjöstrand M, Langkilde AM. Dapagliflozin in patients with heart failure and reduced ejection fraction. N Engl J Med 2019; 381: 1995–2008. [DOI] [PubMed] [Google Scholar]
2. Packer M, Anker SD, Butler J, Filippatos G, Pocock SJ, Carson P, Januzzi J, Verma S, Tsutsui H, Brueckmann M, Jamal W, Kimura K, Schnee J, Zeller C, Cotton D, Bocchi E, Böhm M, Choi DJ, Chopra V, Chuquiure E, Giannetti N, Janssens S, Zhang J, Gonzalez Juanatey JR, Kaul S, Brunner‐La Rocca HP, Merkely B, Nicholls SJ, Perrone S, Pina I, Ponikowski P, Sattar N, Senni M, Seronde MF, Spinar J, Squire I, Taddei S, Wanner C, Zannad F. Cardiovascular and renal outcomes with empagliflozin in heart failure. N Engl J Med 2020; 383: 1413–1424. [DOI] [PubMed] [Google Scholar]
3. Bhatt DL, Szarek M, Steg PG, Cannon CP, Leiter LA, McGuire DK, Lewis JB, Riddle MC, Voors AA, Metra M, Lund LH, Komajda M, Testani JM, Wilcox CS, Ponikowski P, Lopes RD, Verma S, Lapuerta P, Pitt B. Sotagliflozin in patients with diabetes and recent worsening heart failure. N Engl J Med 2020; 384: 117–128. [DOI] [PubMed] [Google Scholar]
4. Khan MS, Ochani RK, Shaikh A, Usman MS, Yamani N, Khan SU, Murad MH, Mandrola J, Doukky R, Krasuski RA. Fragility index in cardiovascular randomized controlled trials. Circ Cardiovasc Qual Outcomes 2019; 12: e005755. [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Khan MS, Fonarow GC, Friede T, Lateef N, Khan SU, Anker SD, Harrell FE Jr, Butler J. Application of the reverse fragility index to statistically nonsignificant randomized clinical trial results. JAMA Netw Open 2020; 3: e2012469. [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Walsh M, Srinathan SK, McAuley DF, Mrkobrada M, Levine O, Ribic C, Molnar AO, Dattani ND, Burke A, Guyatt G, Thabane L, Walter SD, Pogue J, Devereaux PJ. The statistical significance of randomized controlled trial results is frequently fragile: a case for a Fragility Index. J Clin Epidemiol 2014; 67: 622–628. [DOI] [PubMed] [Google Scholar]
7. Wayant C, Meyer C, Gupton R, Som M, Baker D, Vassar M. The fragility index in a cohort of HIV/AIDS randomized controlled trials. J Gen Intern Med 2019; 34: 1236–1243. [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Bowers A, Meyer C, Tritz D, Cook C, Fuller K, Smith C, Diener B, Vassar M. Assessing quality of randomized trials supporting guidelines for laparoscopic and endoscopic surgery. J Surg Res 2018; 224: 233–239. [DOI] [PubMed] [Google Scholar]
9. Butler J, Usman MS, Khan MS, Greene SJ, Friede T, Vaduganathan M, Filippatos G, Coats AJ, Anker SD. Efficacy and safety of SGLT2 inhibitors in heart failure: systematic review and meta‐analysis. ESC Heart Fail 2020; 7: 3298–3309. [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Atal I, Porcher R, Boutron I, Ravaud P. The statistical significance of meta‐analyses is frequently fragile: definition of a fragility index for meta‐analyses. J Clin Epidemiol 2019; 111: 32–40. [DOI] [PubMed] [Google Scholar]
11. Chase Kruse B, Matt Vassar B. Unbreakable? An analysis of the fragility of randomized trials that support diabetes treatment guidelines. Diabetes Res Clin Pract 2017; 134: 91–105. [DOI] [PubMed] [Google Scholar]
12. Edwards E, Wayant C, Besas J, Chronister J, Vassar M. How fragile are clinical trial outcomes that support the CHEST clinical practice guidelines for VTE? Chest 2018; 154: 512–520. [DOI] [PubMed] [Google Scholar]
13. Zannad F, Ferreira JP, Pocock SJ, Anker SD, Butler J, Filippatos G, Brueckmann M, Ofstad AP, Pfarr E, Jamal W, Packer M. SGLT2 inhibitors in patients with heart failure with reduced ejection fraction: a meta‐analysis of the EMPEROR‐Reduced and DAPA‐HF trials. Lancet 2020; 396: 819–829. [DOI] [PubMed] [Google Scholar]

[ehf213785-bib-0001] 1. McMurray JJV, Solomon SD, Inzucchi SE, Køber L, Kosiborod MN, Martinez FA, Ponikowski P, Sabatine MS, Anand IS, Bělohlávek J, Böhm M, Chiang CE, Chopra VK, de Boer RA, Desai AS, Diez M, Drozdz J, Dukát A, Ge J, Howlett JG, Katova T, Kitakaze M, Ljungman CEA, Merkely B, Nicolau JC, O'Meara E, Petrie MC, Vinh PN, Schou M, Tereshchenko S, Verma S, Held C, DeMets DL, Docherty KF, Jhund PS, Bengtsson O, Sjöstrand M, Langkilde AM. Dapagliflozin in patients with heart failure and reduced ejection fraction. N Engl J Med 2019; 381: 1995–2008. [DOI] [PubMed] [Google Scholar]

[ehf213785-bib-0002] 2. Packer M, Anker SD, Butler J, Filippatos G, Pocock SJ, Carson P, Januzzi J, Verma S, Tsutsui H, Brueckmann M, Jamal W, Kimura K, Schnee J, Zeller C, Cotton D, Bocchi E, Böhm M, Choi DJ, Chopra V, Chuquiure E, Giannetti N, Janssens S, Zhang J, Gonzalez Juanatey JR, Kaul S, Brunner‐La Rocca HP, Merkely B, Nicholls SJ, Perrone S, Pina I, Ponikowski P, Sattar N, Senni M, Seronde MF, Spinar J, Squire I, Taddei S, Wanner C, Zannad F. Cardiovascular and renal outcomes with empagliflozin in heart failure. N Engl J Med 2020; 383: 1413–1424. [DOI] [PubMed] [Google Scholar]

[ehf213785-bib-0003] 3. Bhatt DL, Szarek M, Steg PG, Cannon CP, Leiter LA, McGuire DK, Lewis JB, Riddle MC, Voors AA, Metra M, Lund LH, Komajda M, Testani JM, Wilcox CS, Ponikowski P, Lopes RD, Verma S, Lapuerta P, Pitt B. Sotagliflozin in patients with diabetes and recent worsening heart failure. N Engl J Med 2020; 384: 117–128. [DOI] [PubMed] [Google Scholar]

[ehf213785-bib-0004] 4. Khan MS, Ochani RK, Shaikh A, Usman MS, Yamani N, Khan SU, Murad MH, Mandrola J, Doukky R, Krasuski RA. Fragility index in cardiovascular randomized controlled trials. Circ Cardiovasc Qual Outcomes 2019; 12: e005755. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ehf213785-bib-0005] 5. Khan MS, Fonarow GC, Friede T, Lateef N, Khan SU, Anker SD, Harrell FE Jr, Butler J. Application of the reverse fragility index to statistically nonsignificant randomized clinical trial results. JAMA Netw Open 2020; 3: e2012469. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ehf213785-bib-0006] 6. Walsh M, Srinathan SK, McAuley DF, Mrkobrada M, Levine O, Ribic C, Molnar AO, Dattani ND, Burke A, Guyatt G, Thabane L, Walter SD, Pogue J, Devereaux PJ. The statistical significance of randomized controlled trial results is frequently fragile: a case for a Fragility Index. J Clin Epidemiol 2014; 67: 622–628. [DOI] [PubMed] [Google Scholar]

[ehf213785-bib-0007] 7. Wayant C, Meyer C, Gupton R, Som M, Baker D, Vassar M. The fragility index in a cohort of HIV/AIDS randomized controlled trials. J Gen Intern Med 2019; 34: 1236–1243. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ehf213785-bib-0008] 8. Bowers A, Meyer C, Tritz D, Cook C, Fuller K, Smith C, Diener B, Vassar M. Assessing quality of randomized trials supporting guidelines for laparoscopic and endoscopic surgery. J Surg Res 2018; 224: 233–239. [DOI] [PubMed] [Google Scholar]

[ehf213785-bib-0009] 9. Butler J, Usman MS, Khan MS, Greene SJ, Friede T, Vaduganathan M, Filippatos G, Coats AJ, Anker SD. Efficacy and safety of SGLT2 inhibitors in heart failure: systematic review and meta‐analysis. ESC Heart Fail 2020; 7: 3298–3309. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ehf213785-bib-0010] 10. Atal I, Porcher R, Boutron I, Ravaud P. The statistical significance of meta‐analyses is frequently fragile: definition of a fragility index for meta‐analyses. J Clin Epidemiol 2019; 111: 32–40. [DOI] [PubMed] [Google Scholar]

[ehf213785-bib-0011] 11. Chase Kruse B, Matt Vassar B. Unbreakable? An analysis of the fragility of randomized trials that support diabetes treatment guidelines. Diabetes Res Clin Pract 2017; 134: 91–105. [DOI] [PubMed] [Google Scholar]

[ehf213785-bib-0012] 12. Edwards E, Wayant C, Besas J, Chronister J, Vassar M. How fragile are clinical trial outcomes that support the CHEST clinical practice guidelines for VTE? Chest 2018; 154: 512–520. [DOI] [PubMed] [Google Scholar]

[ehf213785-bib-0013] 13. Zannad F, Ferreira JP, Pocock SJ, Anker SD, Butler J, Filippatos G, Brueckmann M, Ofstad AP, Pfarr E, Jamal W, Packer M. SGLT2 inhibitors in patients with heart failure with reduced ejection fraction: a meta‐analysis of the EMPEROR‐Reduced and DAPA‐HF trials. Lancet 2020; 396: 819–829. [DOI] [PubMed] [Google Scholar]

PERMALINK

Robustness of outcomes in trials evaluating sodium–glucose co‐transporter 2 inhibitors for heart failure

Muhammad Shariq Usman

Muhammad Shahzeb Khan

Gregg C Fonarow

Stephen J Greene

Tim Friede

Muthiah Vaduganathan

Gerasimos Filippatos

Andrew J Stewart Coats

Stefan D Anker

Javed Butler

Abstract

Aims

Methods and results

Conclusion

Introduction

Methods

Study populations, definitions, and outcomes of interest

Statistical analysis

Results

Patient population

Table 1.

Fragility index, reverse fragility index, and fragility quotient

Table 2.

Figure 1.

Figure 2.

Composite of first heart failure hospitalization or cardiovascular mortality

Table 3.

First heart failure hospitalization

Mortality

Discussion

Table 4.

Conflict of interest

Funding

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases