Key Summary Points
The network meta-analysis (NMA) of Ismaila et al. contains limitations in design and reporting that could strongly influence the validity of the findings and conclusions. |
Heterogeneity between studies was not adequately accounted for; only results from a fixed-effects model are reported, which contradicts guidance on NMA methodology for comparing studies with considerable heterogeneity and can yield misleading results. |
Due to fundamental differences between the FULFIL and KRONOS studies, and a single common comparator that performed differently in the two studies, these should not have been relied upon to establish a network connection to compare fluticasone furoate/umeclidinium/vilanterol (FF/UMEC/VI) and budesonide/glycopyrronium bromide/formoterol fumarate (BUD/GLY/FOR). |
Given these methodological issues and selective reporting of fixed-effects model results, we do not agree with Ismaila et al.’s assertion that FF/UMEC/VI showed statistically significant improvements in the annualized rate of combined moderate/severe exacerbations versus BUD/GLY/FOR. |
The NMA findings and conclusions are not consistent with evidence from four other published NMAs of triple therapies in patients with chronic obstructive pulmonary disease (including two from independent researchers), all of which used random-effects models. |
Dear Editor,
We read with interest the publication by Ismaila et al. in Advances in Therapy entitled “Fluticasone Furoate/Umeclidinium/Vilanterol (FF/UMEC/VI) Triple Therapy Compared with Other Therapies for the Treatment of COPD: A Network Meta-Analysis” [1]. Network meta-analysis (NMA) is an important statistical technique, often used to make indirect treatment comparisons when no direct head-to-head studies are available [2]. However, if not robustly conducted, NMAs can yield misleading results [3, 4].
Although clinical and methodological differences between studies are inevitable, valid NMAs rely on the assumption that the included studies are sufficiently homogenous [5, 6]. Per best practice, when NMAs are undertaken in the presence of heterogeneity, random-effects models are recommended to account for heterogeneity [3–6]. Ismaila et al. noted a large amount of between-study heterogeneity for their severe exacerbation analysis; these results were therefore not deemed robust and not reported [1]. I2 is a standard statistical measure of homogeneity/heterogeneity [4, 7], and Ismaila et al. report an I2 of 86.59% for the network of studies reporting combined moderate/severe exacerbation rates [1]; this is classed as considerable heterogeneity, per Cochrane (I2 of 75–100%, highest category) [4], but the authors did not question the robustness of these findings [1]. Ismaila et al. subsequently analyzed a smaller group of studies with ≥ 24 weeks’ follow-up “to account for heterogeneity induced by differences in length of follow-up”, but I2 increased to 94.93% [1], suggesting that heterogeneity was not substantially caused by differences in follow-up length but by other factors. However, the authors did not further explore possible reasons for this heterogeneity, nor, as detailed below, did they report the results from a random-effects model, which would account for the heterogeneity.
Ismaila et al.’s methodology states that they implemented both fixed- and random-effects models; however, they only reported results and conclusions based on the fixed-effects model [1]. The approach of reporting only the fixed-effects model findings is not clearly described nor explained. Ismaila et al. acknowledged that, where heterogeneity was present, as in the moderate/severe exacerbation analysis, the random-effects model “automatically accounted for this” [1]. However, these random-effects model results were not reported, even in cases of considerable heterogeneity, and this could alter any conclusions reached. Fixed-effects models assume similar effect sizes across studies and ignore heterogeneity; confidence intervals (CIs) from such models do not reflect the extent of heterogeneity [4].
To better understand if their results are supported by a random-effects analysis, we independently replicated the annualized moderate/severe exacerbation rate analysis with the same software and data sources, using both fixed- and random-effects models. The fixed-effects model gave similar results to Ismaila et al.’s, confirming that their reported results are from a fixed-effects model (which was not made clear). However, the random-effects model, as recommended per best practice, showed no significant treatment–effect differences, due to wide CIs driven by between-study heterogeneity. Given the methodological issues, selective reporting of fixed-effects model results, and failure to disclose the random-effects model results, which we believe to be not significant per our replicate analysis, we do not agree with Ismaila et al. that FF/UMEC/VI showed statistically significant improvements in annualized moderate/severe exacerbation rates versus budesonide/glycopyrronium bromide/formoterol fumarate (BUD/GLY/FOR) [1]. Notably, their finding contradicts four other peer-reviewed NMA publications, including two from independent researchers [8, 9], all of which used random-effects models to account for heterogeneity and reported no significant exacerbation rate differences between triple therapies in patients with chronic obstructive pulmonary disease [8–11].
Aside from the appropriateness of the model, there is the fundamental issue of whether Ismaila et al.’s network approach should be used at all when studies are discernibly heterogenous, a point highlighted by Cochrane and others [4, 12]. NMA validity can be susceptible to network structure, and when indirect comparisons between disparate parts of a network rely solely on links consisting of single studies or a single treatment informed by only two studies, close attention must be paid to these studies, because study-specific biases will be reflected in the relative effect estimates between network segments [13]. It is difficult to determine common comparators and connections in Ismaila et al.’s network diagrams. Replotting (Fig. 1) helps to identify that the connection between FF/UMEC/VI and BUD/GLY/FOR hinges solely on KRONOS and FULFIL, two lung function studies, and their single common comparator, BUD/FOR (400/12) [14, 15]. There are important differences in study design between KRONOS and FULFIL, including blinding, run-in treatments, participating countries, and exacerbation history criteria. BUD/FOR was open-label in KRONOS and blinded in FULFIL [14, 15], and whether or not a treatment is blinded could conceivably impact study results. It is also likely that between-study differences in run-in treatments impacted baseline lung function measurements and subsequent changes from baseline in lung function, evidenced by the least squares mean change from baseline in pre-dose forced expiratory volume in 1 s (FEV1) at Week 24 being + 62 mL with BUD/FOR in KRONOS [14, 16] and − 29 mL with BUD/FOR in FULFIL [15]. Furthermore, FULFIL was conducted primarily in Eastern Europe, whereas KRONOS was conducted primarily in the US, Japan, and China [14, 15]. The countries in which studies are conducted are known to affect exacerbation results [17]. Additionally, prior exacerbations are well-recognized as a strong predictor of future exacerbation risk [18]. KRONOS did not require patients to have an exacerbation in the previous year (74% of patients did not) [14]; FULFIL required patients to have ≥ 2 moderate exacerbations or ≥ 1 severe exacerbation in the previous year if their FEV1 was ≥ 50–< 80%, but did not require exacerbations in the previous year if their FEV1 was < 50% (35% of patients did not) [15, 19]. Considering these studies are supposedly similar in other key aspects according to Ismaila et al., it is surprising and contrary to accepted scientific knowledge on the impact of prior exacerbations that the annual moderate/severe exacerbation rate was > 50% higher for BUD/FOR in KRONOS versus FULFIL (0.55 vs. 0.34), when 74% of KRONOS patients had no exacerbations in the previous year [14, 15]. Taken together, this highlights that there are fundamental differences between the studies, well beyond the key patient characteristics that Ismaila et al. extracted from the studies, and that these had a significant influence on the results, biasing the NMA findings.
Such differences are not limited to KRONOS and FULFIL. Ismaila et al. claim that “the similarity assumption held” for all studies in the network, including FULFIL and IMPACT, which are directly connected with the common comparator of FF/UMEC/VI [1]. One difference was that IMPACT only included patients with recent exacerbations, and the annual moderate/severe exacerbation rate with FF/UMEC/VI was 0.91 [20]. However, in a post hoc analysis of FULFIL, the annual moderate/severe exacerbation rate in patients with recent exacerbations before study entry (aligned with IMPACT) was 0.19 with FF/UMEC/VI [19], approximately 80% lower than in IMPACT. This again highlights that FULFIL is somewhat of an outlier, with fundamental differences to its connected studies; this questions how these studies could be considered similar and why heterogeneity was not further explored. FULFIL cannot be relied upon to form network connections to KRONOS or IMPACT, as these connections bias all wider relative treatment–effect estimates. Given the substantial differences in the linking studies connecting FF/UMEC/VI with BUD/GLY/FOR, Ismaila et al.’s analysis provides misleading results.
One way to ascertain whether a network of indirect relationships is reasonable is to compare direct and indirect evidence for relative treatment effects [13]. In their moderate/severe exacerbation rate comparison, there is a 25% difference (incidence rate ratio [IRR] 0.75) for FF/UMEC/VI versus UMEC/VI (per IMPACT results [20]), but a 66% difference (IRR 0.44) for FF/UMEC/VI versus GLY/FOR, where no direct comparison data exist [1]. The analysis therefore shows indirect evidence of a ~ 40% difference for UMEC/VI versus GLY/FOR. However, a 24-week head-to-head study comparing GLY/FOR versus UMEC/VI, with exacerbations as an exploratory endpoint, reported no appreciable difference in moderate/severe exacerbation rate (16.7% vs. 17.6%) or time to first moderate/severe exacerbation (hazard ratio [95% CI] 0.97 [0.73, 1.29]) [21]. The inconsistent estimates from these direct and indirect evidence sources indicate that Ismaila et al.’s network produces misleading indirect results regarding relative efficacy between FF/UMEC/VI and BUD/GLY/FOR or its components. Ismaila et al. did not report inconsistency testing of direct and indirect evidence [1], which is recommended per best practice when a network has loops [2, 22–24]. In our replicate analysis, we performed an inconsistency assessment [22] of the moderate/severe exacerbation network, which indicated statistical inconsistency between direct and indirect estimates among all network loops, further highlighting the potential for misleading results.
While other NMAs have included KRONOS and FULFIL, they employed different methods to avoid making a connection solely through KRONOS and FULFIL, thus avoiding the bias that Ismaila et al. introduce with their approach [8, 10, 11]. Heterogeneity between KRONOS and FULFIL makes them unsuitable as a critical network connection to analyze relative effects of BUD/GLY/FOR versus FF/UMEC/VI. Hence, we cast doubt on all the conclusions and statements of superior efficacy which pertain to these.
Acknowledgements
Funding
This work was funded by AstraZeneca. No funding or sponsorship was received for the publication of this article.
Medical Writing, Editorial, and Other Assistance
The authors thank Matt Chapman-Rounds (Quantics, Edinburgh, UK) for his independent statistical critique and editorial review. The authors also thank Anita Fitzgerald and Rachael McCool (York Health Economics Consortium, York, UK) for their independent scientific review. Medical writing support, under the direction of the authors, was provided by Sarah Piggott, MChem, CMC Connect, a division of IPG Health Medical Communications, and was funded by AstraZeneca, in accordance with Good Publication Practice (GPP 2022) guidelines [27].
Authorship
All named authors meet the International Committee of Medical Journal Editors (ICMJE) criteria for authorship of this article, take responsibility for the integrity of the work as a whole, and have given their approval for this version to be published.
Author Contributions
All named authors contributed to drafting and/or critically revising the letter. Barinder Singh contributed to the replication of the annualized exacerbation rate analysis.
Disclosures
Jonathan Marshall, Patrick Darken, Mario Ouwens, and Deniz Tansey-Dwyer are employees of AstraZeneca and hold stock and/or stock options in the company. Akanksha Sharma and Barinder Singh are employees of Pharmacoevidence Pvt. Ltd.
Compliance with Ethics Guidelines
This article is based on a previously conducted study and does not contain any study data with human participants or animals performed by any of the authors.
Data Availability
Data underlying our replication of the annualized exacerbation rate analysis referred to in this letter may be obtained in accordance with AstraZeneca’s data-sharing policy, described at https://astrazenecagrouptrials.pharmacm.com/ST/Submission/Disclosure.
Contributor Information
Jonathan Marshall, Email: jonathan.marshall1@astrazeneca.com.
Akanksha Sharma, Email: akanksha.sharma@pharmacoevidence.com.
Patrick Darken, Email: patrick.darken@astrazeneca.com.
Mario Ouwens, Email: mario.ouwens@astrazeneca.com.
Barinder Singh, Email: barinder.singh@pharmacoevidence.com.
Deniz Tansey-Dwyer, Email: deniz.tansey-dwyer@astrazeneca.com.
References
- 1.Ismaila AS, Haeussler K, Czira A, et al. Fluticasone furoate/umeclidinium/vilanterol (FF/UMEC/VI) triple therapy compared with other therapies for the treatment of COPD: a network meta-analysis. Adv Ther. 2022;39:3957–3978. doi: 10.1007/s12325-022-02231-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Chaimani A, Caldwell DM, Li T, Higgins JPT, Salanti G. Chapter 11: Undertaking network meta-analyses. In: Higgins JPT, Thomas J, Chandler J, et al (editors). Cochrane handbook for systematic reviews of interventions. Version 6.3 (updated February 2022). Cochrane; 2022. https://training.cochrane.org/handbook.
- 3.Jansen JP, Trikalinos T, Cappelleri JC, et al. Indirect treatment comparison/network meta-analysis study questionnaire to assess relevance and credibility to inform health care decision making: an ISPOR-AMCP-NPC Good Practice Task Force report. Value Health. 2014;17:157–173. doi: 10.1016/j.jval.2014.01.004. [DOI] [PubMed] [Google Scholar]
- 4.Deeks JJ, Higgins JPT, Altman DG, on behalf of the Cochrane Statistical Methods Group. Chapter 10: Analysing data and undertaking meta-analyses. In: Higgins JPT, Thomas J, Chandler J, et al (editors). Cochrane handbook for systematic reviews of interventions. Version 6.3 (updated February 2022). Cochrane; 2022. Available from https://training.cochrane.org/handbook.
- 5.Tonin FS, Rotta I, Mendes AM, Pontarolo R. Network meta-analysis: a technique to gather evidence from direct and indirect comparisons. Pharm Pract (Granada) 2017;15:943. doi: 10.18549/PharmPract.2017.01.943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Dias S, Caldwell DM. Network meta-analysis explained. Arch Dis Child Fetal Neonatal Ed. 2019;104:F8–12. doi: 10.1136/archdischild-2018-315224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ahn E, Kang H. Concepts and emerging issues of network meta-analysis. Korean J Anesthesiol. 2021;74:371–382. doi: 10.4097/kja.21358. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lee HW, Kim HJ, Jang EJ, Lee CH. Comparisons of efficacy and safety between triple (Inhaled corticosteroid/long-acting muscarinic antagonist/long-acting beta-agonist) therapies in chronic obstructive pulmonary disease: systematic review and Bayesian network meta-analysis. Respiration. 2021;100:631–643. doi: 10.1159/000515133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Rogliani P, Ora J, Cavalli F, Cazzola M, Calzetta L. Comparing the efficacy and safety profile of triple fixed-dose combinations in COPD: a meta-analysis and IBiS score. J Clin Med. 2022;11:4491. doi: 10.3390/jcm11154491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ferguson GT, Darken P, Ballal S, et al. Efficacy of budesonide/glycopyrronium/formoterol fumarate metered dose inhaler (BGF MDI) versus other inhaled corticosteroid/long-acting muscarinic antagonist/long-acting β2-agonist (ICS/LAMA/LABA) triple combinations in COPD: a systematic literature review and network meta-analysis. Adv Ther. 2020;37:2956–2975. doi: 10.1007/s12325-020-01311-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bourdin A, Molinari N, Ferguson GT, et al. Efficacy and safety of budesonide/glycopyrronium/formoterol fumarate versus other triple combinations in COPD: a systematic literature review and network meta-analysis. Adv Ther. 2021;38:3089–3112. doi: 10.1007/s12325-021-01703-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Poole C, Greenland S. Random-effects meta-analyses are not always conservative. Am J Epidemiol. 1999;150:469–475. doi: 10.1093/oxfordjournals.aje.a010035. [DOI] [PubMed] [Google Scholar]
- 13.Rouse B, Chaimani A, Li T. Network meta-analysis: an introduction for clinicians. Intern Emerg Med. 2017;12:103–111. doi: 10.1007/s11739-016-1583-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ferguson GT, Rabe KF, Martinez FJ, et al. Triple therapy with budesonide/glycopyrrolate/formoterol fumarate with co-suspension delivery technology versus dual therapies in chronic obstructive pulmonary disease (KRONOS): a double-blind, parallel-group, multicentre, phase 3 randomised controlled trial. Lancet Respir Med. 2018;6:747–758. doi: 10.1016/S2213-2600(18)30327-8. [DOI] [PubMed] [Google Scholar]
- 15.Lipson DA, Barnacle H, Birk R, et al. FULFIL trial: once-daily triple therapy for patients with chronic obstructive pulmonary disease. Am J Respir Crit Care Med. 2017;196:438–446. doi: 10.1164/rccm.201703-0449OC. [DOI] [PubMed] [Google Scholar]
- 16.ClinicalTrials.gov. A randomized, double-blind, parallel-group, 24-week, chronic-dosing, multi-center study to assess the efficacy and safety of PT010, PT003, and PT009 compared with Symbicort® Turbuhaler® (Kronos) (KRONOS); study results. https://clinicaltrials.gov/ct2/show/results/NCT02497001. Accessed 29 Sep 2022.
- 17.Calverley PMA, Martinez FJ, Vestbo J, et al. International differences in the frequency of chronic obstructive pulmonary disease exacerbations reported in three clinical trials. Am J Respir Crit Care Med. 2022;206:25–33. doi: 10.1164/rccm.202111-2630OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hurst JR, Han MK, Singh B, et al. Prognostic risk factors for moderate-to-severe exacerbations in patients with chronic obstructive pulmonary disease: a systematic literature review. Respir Res. 2022;23:213. doi: 10.1186/s12931-022-02123-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Panettieri RA, Jr, Camargo CA, Jr, Cheema T, et al. Effect of recent exacerbation history on the efficacy of once-daily single-inhaler fluticasone furoate/umeclidinium/vilanterol triple therapy in patients with chronic obstructive pulmonary disease in the FULFIL trial. Int J Chron Obstruct Pulmon Dis. 2022;17:2043–2052. doi: 10.2147/COPD.S367701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lipson DA, Barnhart F, Brealey N, et al. Once-daily single-inhaler triple versus dual therapy in patients with COPD. N Engl J Med. 2018;378:1671–1680. doi: 10.1056/NEJMoa1713901. [DOI] [PubMed] [Google Scholar]
- 21.Maltais F, Ferguson GT, Feldman GJ, et al. A randomized, double-blind, double-dummy study of glycopyrrolate/formoterol fumarate metered dose inhaler relative to umeclidinium/vilanterol dry powder inhaler in COPD. Adv Ther. 2019;36:2434–2449. doi: 10.1007/s12325-019-01015-3. [DOI] [PubMed] [Google Scholar]
- 22.Higgins JP, Jackson D, Barrett JK, Lu G, Ades AE, White IR. Consistency and inconsistency in network meta-analysis: concepts and models for multi-arm studies. Res Synth Methods. 2012;3:98–110. doi: 10.1002/jrsm.1044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Hoaglin DC, Hawkins N, Jansen JP, et al. Conducting indirect-treatment-comparison and network-meta-analysis studies: report of the ISPOR Task Force on Indirect Treatment Comparisons Good Research Practices: part 2. Value Health. 2011;14:429–437. doi: 10.1016/j.jval.2011.01.011. [DOI] [PubMed] [Google Scholar]
- 24.Dias S, Welton NJ, Sutton AJ, Caldwell DM, Lu G, Ades AE. NICE DSU Technical Support Document 4: Inconsistency in networks of evidence based on randomised controlled trials. 2011; last updated April 2014. http://www.nicedsu.org.uk. Accessed 3 Nov 2022. [PubMed]
- 25.Siler TM, Kerwin E, Singletary K, Brooks J, Church A. Efficacy and safety of umeclidinium added to fluticasone propionate/salmeterol in patients with COPD: results of two randomized, double-blind studies. COPD. 2016;13:1–10. doi: 10.3109/15412555.2015.1034256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Sousa AR, Riley JH, Church A, Zhu CQ, Punekar YS, Fahy WA. The effect of umeclidinium added to inhaled corticosteroid/long-acting β2-agonist in patients with symptomatic COPD: a randomised, double-blind, parallel-group study. NPJ Prim Care Respir Med. 2016;26:16031. doi: 10.1038/npjpcrm.2016.31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.DeTora LM, Toroser D, Sykes A, et al. Good publication practice (GPP) guidelines for company-sponsored biomedical research: 2022 update. Ann Intern Med. 2022;175:1298–1304. doi: 10.7326/M22-1460. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data underlying our replication of the annualized exacerbation rate analysis referred to in this letter may be obtained in accordance with AstraZeneca’s data-sharing policy, described at https://astrazenecagrouptrials.pharmacm.com/ST/Submission/Disclosure.