Key Points
Question
Is the interpretation and reporting of noninferiority trials with primary end point results that are not statistically significant correct, and what are the associated factors of misleading reporting?
Findings
This systematic review of 52 noninferiority randomized clinical trials of cancer treatments with results for primary end points that are not statistically significant, 75% included misleading reporting. Multivariable analysis found that the prevalence of misleading reporting was significantly lower in reports with funding from for-profit sources and higher in reports of novel experimental treatments.
Meaning
These findings suggest that authors should carefully consider noninferiority cancer clinical trial result interpretation and reporting, especially for primary outcome results that are not statistically significant.
This systematic review examines misleading reporting and associated factors in noninferiority randomized clinical trials with primary results that are not statistically significant.
Abstract
Importance
Spin, the inaccurate reporting of randomized clinical trials (RCTs) with results that are not statistically significant for the primary end point, distorts interpretation of results and leads to misinterpretation. However, the prevalence of spin and related factors in noninferiority cancer RCTs remains unclear.
Objective
To examine misleading reporting, or spin, and the associated factors in noninferiority cancer RCTs through a systematic review.
Data Sources
A systematic search of the PubMed database was performed for articles published between January 1, 2010, and December 31, 2019, using the Cochrane Highly Sensitive Search Strategy.
Study Selection
Two investigators independently selected studies using the inclusion criteria of noninferiority parallel-group RCTs aiming to confirm effects to cancer treatments published between January 1, 2010, and December 31, 2019, reporting results that were not statistically significant for the primary end points.
Data Extraction and Synthesis
Standardized data abstraction was used to extract information concerning the trial characteristics and spin based on a prespecified definition. The main investigator extracted the trial characteristics while both readers independently evaluated the spin. The Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) reporting guideline was followed.
Main Outcomes and Measures
The main outcome was spin prevalence in any section of the report. Spin was defined as use of specific reporting strategies, from whatever motive, to highlight that the experimental treatment is beneficial, despite no statistically significant difference for the primary outcome, or to distract the reader from results that are not statistically significant. The associations (prevalence difference and odds ratios [ORs]) between spin and trial characteristics were also evaluated.
Results
The analysis included 52 of 2752 reports identified in the PubMed search. Spin was identified in 39 reports (75.0%; 95% CI, 61.6%-84.9%), including the abstract (34 reports [65.4%; 95% CI, 51.1%-76.9%]) and the main text (38 reports [73.1%; 95% CI, 59.7%-83.3%]). Univariate analysis found that the spin prevalence was higher in reports with data managers (prevalence difference, 27%; 95% CI, 1.1%-50.3%), reports without funding from for-profit sources (prevalence difference, 31.2%; 95% CI, 4.8%-53.8%), and reports of novel experimental treatments (prevalence difference, 37.5%; 95% CI, 5.8%-64.7%). Multivariable analysis found that novel experimental treatment (OR, 4.64; 95% CI, 0.98-22.02) and funding only from nonprofit sources only (OR, 5.20; 95% CI, 1.21-22.29) were associated with spin.
Conclusions and Relevance
In this systematic review, most noninferiority RCTs reporting results that were not statistically significant for the primary end points showed distorted interpretation and inaccurate reporting. The novelty of an experimental treatment and funding only from nonprofit sources were associated with spin.
Introduction
Randomized clinical trials (RCTs) are the criterion standard in research for hypothesis-based treatment efficacy and safety evaluation.1,2 RCTs must be performed according to predefined study protocols and statistical analysis plans.3 RCT results are typically interpreted based on the statistical significance of the primary end point analysis results. Trials with statistically significant results for the primary end point are known as positive trials, while those with results that are not significant results are negative trials. Both results are equally important for scientific progress if quality RCTs are planned, conducted, analyzed, and reported.4
However, problems can arise when reports mislead readers by distorting result interpretation and suggesting that positive results have been obtained, even if statistically significant differences have not been determined for the primary end point. This problem of misleading reporting is called spin.5,6,7,8,9,10 Boutron et al6 observed that RCT reports with spin were more likely to emphasize the benefit of the experimental treatment by focusing on statistically significant results, including those of secondary end points and subgroup analyses, which should be interpreted as exploratory results. Systematic reviews have previously classified spin in the field of oncology.7,8,9,10,11 Studies on the impact of spin have concluded that it might lead readers to overestimate result positivity.12,13,14
Spin in RCTs have mainly been discussed in superiority trials, with few reports assessing spin in noninferiority trials.15 Compared with superiority RCTs, noninferiority RCTs may have more factors that complicate interpretation, including noninferiority margin, assay sensitivity, and choice of analysis population.16 However, no systematic review has investigated spin prevalence in noninferiority RCTs in oncology. Given the complexities of interpretation inherent to noninferiority RCTs, we consider it is important to clarify how much spin exists and the factors associated with the spin when it is present.
Therefore, we performed a systematic review of the spin prevalence in negative noninferiority cancer RCTs published since 2010, when the US Food and Drug Administration guidelines for noninferiority trials were published.16 This systematic review of noninferiority RCTs investigated spin prevalence and its associated factors in oncology.
Methods
This systematic review follows the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) reporting guideline. Data were analyzed from March 22 to September 15, 2021.
Selection of Eligible Reports
The selection criteria were reports on an RCT, examining anticancer treatment, using a noninferiority design, and published between January 1, 2010, and December 31, 2019. Using PubMed, we created a search strategy reflecting the inclusion criteria and systematically collected reports. This search strategy is described in the eMethods in the Supplement. The Cochrane Highly Sensitive Search Strategy17 was used to identify the RCTs. After collecting reports identified by the search strategy, we read the abstract and main text of each report, and excluded reports based on our exclusion criteria, including reports other than direct anticancer efficacy assessments (eg, quality of life, cost-effectiveness, diagnosis method, and primary end point could not be identified); exploratory studies (eg, phase 1, phase 2) or pilot studies; superiority or equivalent studies; other study designs (eg, crossover study, multigroup or single-group study, and factorial design); studies that terminated early at an interim analysis; and reports not written in English. We defined negative trials as those that did not show that the effect of the experimental treatment, compared with the control treatment, was better than a prespecified noninferiority margin and included them in analysis (Figure 1).
Figure 1. Study Selection Flowchart .
RCT indicates randomized clinical trial.
To ensure the strategy of literature collection, we confirmed that the same reports were selected by 2 authors (C.I. and A.H.), independently conducting the search. The inclusion of reports not selected by both authors was discussed among all authors.
Extraction of Information From Selected Reports
From the selected reports, we extracted journal information (name, impact factor in 2018), general report information (publication year, number of citations in March 2021), author information (numbers of statisticians and other roles; eg, project and data managers), funding source (for-profit [eg, pharmaceutical or medical device industry], nonprofit [eg, public funds], both, no funding), rationale for conducting the study (novelty [ie, not used in clinical practice], safety, experimental treatment application simplicity), primary end point (efficacy measurement), noninferiority margin, number of patients or participants (planned and actual numbers of individuals enrolled), and analysis population. This information was recorded in a data collection form that was prepared in advance.
Spin and Evaluation Method Definition
We defined spin as the “use of specific reporting strategies, from whatever motive, to highlight that the experimental treatment is beneficial, despite a statistically nonsignificant difference for the primary endpoint, or to distract the reader from statistically nonsignificant results,” as proposed by Boutron et al.6 We used this definition to evaluate the presence of spin in the results and conclusion sections in the abstracts, as well as the results, discussion, and conclusion sections in the main text of each report. The spin strategies were identified according to the type of research,18,19,20,21 and we identified spin strategies for noninferiority RCTs. Spin was considered to be present if the report corresponded to the strategies of spin specified in advance and improperly emphasized the benefit of the experimental treatment.
The 8 spin strategies specified in advance claimed the benefit of the experimental treatment (1) by emphasizing trends in point estimates, despite lacking significance for the primary end point results, denoted as trend for primary end point; (2) based on the results of the secondary end point, denoted as secondary end point; (3) based on the results of subgroup analysis, denoted as subgroup analysis; (4) based on the secondary analysis results of the primary end point, such as by changing the analysis population or measuring the treatment effect, denoted as secondary analysis of the primary end point; (5) based on the intragroup comparison results, such as those before and after treatment, denoted as within-group comparisons; (6) with no mention in the discussion of the experimental treatment’s unclear safety profile in reports that stated the rationale for conducting the study was safety of the experimental treatment, denoted as no mention of safety profile; (7) based on safety alone, despite insignificant results of the primary end point analysis in the conclusion section, denoted as safety; or, (8) for any other situations deemed as spin by the reviewers, denoted as other. No report was found in the within-group comparison category, so the category is not presented in the results.
Spin Levels in the Conclusion Sections
The spin levels in the conclusions of the abstract and the main text were evaluated as none, low, moderate, and high. Low indicates that the report contained spin, despite mentioning that noninferiority was not confirmed; alternatively, the report did not mention that noninferiority was not confirmed but rather mentioned uncertainty in claiming the treatment benefit, as well as the need to perform additional confirmatory studies. Moderate indicates that the report did not mention that noninferiority was not confirmed and failed to mention either the presence of uncertainty to claim the experimental treatment benefit or the need for additional confirmatory studies. High indicates that the report did not mention noninferiority was not confirmed and recommended the clinical application of the experimental treatment, despite not mentioning the uncertainty in claiming the experimental treatment benefit and not mentioning the need for additional confirmatory studies.
Spin was evaluated independently by the main reviewer (C.I.) and 2 secondary reviewers (K.U. and K.O.). Disagreements were discussed to reach a final evaluation.
End Points
The primary end point of this study was the spin prevalence in any section of the reports, including results and conclusion of the abstract and the results, discussion, and conclusion in the main text. The secondary end points were the spin prevalence in the abstract and main text sections and the spin level in the abstract and main text conclusion.
Statistical Analysis
For discrete variables, we calculated the number of reports and percentages for each category. For continuous variables, we calculated the medians and IQRs. To determine the spin prevalence in any section of a report, we calculated the spin proportion and the 95% CI. To examine the associations between trial characteristics and prevalence of spin, we calculated the prevalence difference relative to the reference category, as well as the 95% CI. The Agresti-Coull22 or Agresti-Caffo methods23 were used for all 95% CI calculations. Logistic regression was performed to adjust for variables. Backward stepwise selection was used (P = .15 removed), and the Wald 95% CIs for odds ratios (ORs) were calculated. Journal specialty, publication year, journal impact factor, number of citations, presence of various author roles, funding from for-profit source, rationale of novelty, safety, application simplicity, primary outcome type (hazard ratio or survival proportion difference), noninferiority margin, planned sample size achievement, and analysis population were used as variables. All statistical analyses were performed using SAS statistical software version 9.4 (SAS Institute). Statistical significance was determined with 2-sided 95% CIs that do not cross 1.
Results
Study Characteristics of the Selected Reports
Our PubMed search identified 2752 studies, of which we selected 166 eligible parallel-group noninferiority RCTs, including 52 negative studies (Figure 1).24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75 Table 1 presents the selected study characteristics. The negative studies comprised 12 reports published from 2010 to 2013, 19 reports published from 2014 to 2016, and 21 reports published from 2017 to 2019. Two-thirds of the reports (34 reports [65.4%]) assessed drugs as experimental cancer treatments. More than half of the trials (29 reports [55.8%]) were funded by nonprofit sources only. No specific noninferiority margin was more frequently set as a trial design.
Table 1. Report Characteristics.
| Characteristic | Reports, No. (%) (N = 52) |
|---|---|
| Journal specialty | |
| Oncology | 35 (67.3) |
| Nononcology | 17 (32.7) |
| Publication year | |
| 2010-2013 | 12 (23.1) |
| 2014-2016 | 19 (36.5) |
| 2017-2019 | 21 (40.1) |
| Journal impact factor in 2018, median (IQR) | 14.2 (5.3-28.3) |
| Citations in March 2021, median (IQR), No. | 35 (13-95.5) |
| Cancer type | |
| Breast | 11 (21.2) |
| Blood | 9 (17.3) |
| Colon | 9 (17.3) |
| Lung | 3 (5.8) |
| Liver | 3 (5.8) |
| Prostate | 2 (3.9) |
| Gastro | 2 (3.9) |
| Uterine | 2 (3.9) |
| Kidney | 2 (3.9) |
| Other | 9 (17.3) |
| Experimental treatment | |
| Drug | 34 (65.4) |
| Surgical treatment or procedure | 10 (18.4) |
| Radiation | 4 (7.7) |
| Combination | 3 (5.8) |
| Other | 1 (1.9) |
| Comparator | |
| Drug | 30 (57.7) |
| Surgical treatment or procedure | 9 (17.3) |
| Radiation | 7 (13.5) |
| Combination (drug/surgery/procedure/radiation) | 6 (11.5) |
| Funding source | |
| None | 0 |
| For profit | 11 (21.2) |
| Nonprofit | 29 (55.8) |
| For profit and nonprofit | 9 (17.3) |
| Not reported | 3 (5.8) |
| Rationale for noninferiority trial (including duplicates) | |
| Novelty of experimental treatment | 41 (78.9) |
| Safety of experimental treatment | 32 (61.5) |
| Application simplicity of the experimental treatment | 8 (15.4) |
| Not reported | 9 (17.3) |
| Primary outcome | |
| OS | 12 (23.1) |
| PFS | 19 (36.5) |
| DFS | 7 (13.5) |
| CRR | 2 (3.9) |
| TTF | 1 (1.9) |
| Other | 11 (21.2) |
| Primary outcome (measure of treatment effect) | |
| HR | 35 (67.3) |
| Difference in survival proportion at specific time point | 14 (26.9) |
| Other | 2 (3.9) |
| Not reported | 1 (1.9) |
| Noninferiority margin, HR | |
| ≤1.00 to <1.25 | 10 (28.6) |
| 1.25 to <1.33 | 11 (31.4) |
| 1.33 to <1.5 | 8 (22.9) |
| ≥1.5 | 5 (14.3) |
| Not reported | 1 (2.86) |
| Noninferiority margin (difference in survival proportion at specific time point) | |
| ≤0% to <10% | 6 (42.9) |
| 10% to <15% | 5 (35.7) |
| ≤15% | 3 (21.4) |
| Total sample size, No. | |
| <200 | 7 (13.5) |
| 200-499 | 21 (40.4) |
| 500-999 | 11 (21.2) |
| ≥1000 | 13 (25.0) |
| Achievement of planned sample size | |
| Achieved | 32 (61.5) |
| Not achieved | 18 (34.6) |
| Not reported | 2 (3.9) |
| Power of sample size calculation | |
| <80% | 3 (5.8) |
| 80% to <90% | 38 (73.1) |
| ≥90% | 8 (15.4) |
| Not reported | 3 (5.8) |
| Analysis population | |
| ITT | 41 (78.8) |
| PPS | 11 (21.2) |
| Not reported | 0 |
Abbreviations: CRR, complete response rate; DFS, disease free survival; HR, hazard ratio; ITT, intention-to-treatment; OS, overall survival; PFS, progression free survival; PPS, per protocol set; TTF, time to treatment failure.
Spin Overview and Extent
Table 2 shows the spin prevalence in the 52 negative studies, according to each abstract and main text section. A total of 39 reports (75.0%; 95% CI, 51.7%-84.9%) contained spin in at least 1 part. Regarding the abstracts, 34 reports (65.4%; 95% CI, 51.8%-76.9%) contained spin in at least 1 section, and 10 reports (19.2%) contained spin in all sections. One report (1.9%) contained spin the results only, and 23 reports (44.2%) contained spin in the conclusions only. Regarding the main text, 38 reports (73.1%; 95% CI, 59.7%-83.3%) had spin in at least 1 section, and 6 reports (11.5%) had spin in all sections. Finally, 3 reports (5.8%) contained spin in the discussion only, and 13 reports (25.0%) contained spin in the conclusions only. In both the abstract and main text, the conclusions section tended to show spin more often.
Table 2. Overview of the Spin Prevalence, Strategy, and Level of Spin.
| Measure | Reports, No. (%) | |
|---|---|---|
| Abstract (n = 52) | Main text (n = 52) | |
| Presence of spin | ||
| Overall total, No. (%) [95% CI] | 39 (75.0) [61.7-84.9] | |
| Total, No. (%) [95% CI] | 34 (65.4) [51.8-76.9] | 38 (73.1) [59.7-83.3] |
| Section with spin | ||
| None | 18 (34.6) | 14 (26.9) |
| Result only | 1 (1.9) | 0 (0) |
| Discussion only | NA | 3 (5.8) |
| Conclusion only | 23 (44.2) | 13 (25.0) |
| In 2 sections | NA | 16 (30.8) |
| In all sections | 10 (19.2) | 6 (11.5) |
| Strategy of spin | ||
| Results | ||
| Any spin | 11 (21.2) | 12 (23.1) |
| Trend for primary end point | 1 (2.0) | 1 (1.9) |
| Secondary end point | 7 (13.5) | 4 (7.7) |
| Subgroup analysis | 4 (7.7) | 4 (7.7) |
| Secondary analysis of the primary end point | 2 (3.9) | 3 (5.8) |
| Within-group comparison | 0 | 0 |
| Other | 2 (3.9) | 3 (5.8) |
| Discussion | NA | 19 (36.5) |
| Any spin | NA | 19 (36.5) |
| Trend for primary end point | NA | 4 (7.7) |
| Secondary end point | NA | 6 (11.5) |
| Subgroup analysis | NA | 3 (5.8) |
| Secondary analysis of the primary end point | NA | 7 (13.5) |
| Within-group comparison | NA | 0 |
| No mention of safety profile | NA | 2 (3.9) |
| Other | NA | 1 (1.9) |
| Conclusions | ||
| Any spin | 33 (63.5) | 35 (67.3) |
| With no consideration statistically nonsignificant primary outcome | ||
| Trend for or secondary analysis of primary end point | 5 (9.6) | 5 (9.6) |
| Secondary end point or subgroup analysis | 9 (17.3) | 9 (17.3) |
| With acknowledgment statistically nonsignificant results for the primary outcome | ||
| Trend for or secondary analysis of primary end point | 4 (7.7) | 6 (11.5) |
| Secondary end point or subgroup analysis | 10 (19.2) | 11 (21.2) |
| Other | ||
| Safety | 6 (11.5) | 6 (11.5) |
| Other | 6 (11.5) | 8 (15.4) |
| Level of spin in the conclusion section | ||
| None | 19 (36.5) | 17 (32.7) |
| Low | 24 (46.2) | 25 (48.1) |
| Moderate | 1 (1.9) | 1 (1.9) |
| High | 8 (15.4) | 9 (17.3) |
Abbreviation: NA, not applicable.
Strategies and Spin Levels
According to our spin classification system, 7 reports (13.5%) contained secondary end point spin in the results of the abstract. Nine reports (17.3%) included secondary end point or subgroup analysis spin with no consideration of a primary outcome that was not statistically significant, and 10 reports (19.2%) had secondary end point or subgroup analysis spin that acknowledged results that were not statistically significant for the primary outcome in the conclusion of the abstract. Six reports (11.5%) contained safety spin in the same section. In the main text, 6 reports (11.5%) contained secondary end points spin, and 7 reports (13.5%) had secondary analysis of primary end point spin in the discussion of the main text. Regarding the conclusion of the main text, 9 reports (17.3%) showed secondary end point or subgroup analysis spin without mentioning that results related to the primary outcome were not statistically significant, 11 reports (21.2%) had secondary end point or subgroup analysis spin that mentioned that results related to the primary outcome were not statistically significant, and 6 reports (11.5%) had safety spin.
We identified 33 reports (63.5%) with spin in the conclusion of the abstract; of these, 24 reports (46.2%) had low spin levels and 8 reports (15.4%) had high spin levels. Similarly, 35 reports (67.3%) had spin in the main text, with 25 reports (48.1%) showing low spin levels and 9 reports (17.3%) showing high spin levels. Examples of the conclusions for each spin level in the conclusion of the abstract are shown in eTable 1 in the Supplement.
Factors Associated With Spin
We evaluated the association between spin and the assessed variables to consider their associations with spin prevalence (Figure 2). Lack of data managers, project managers, or similar roles among authors (prevalence difference, 27.0%; 95% CI, 1.1%-50.3%) and having no funding by for-profit sources (prevalence difference, 31.2%; 95% CI, 4.8%-53.8%) were associated with higher spin prevalence. Studies with novel study treatments showed more prevalent spin (prevalence difference, 37.5%; 95% CI, 5.8%-64.7%). Considering the backward stepwise multivariable logistic regression, of 52 reports included in this study, 3 reports were excluded with no reported efficacy measurement (hazard ratio or difference in survival proportion) and funding sources, and all variables shown in Figure 2 were entered; the final model included the novelty of the study treatment and source of funding. The analysis found that reports without funding by for-profit sources (OR, 5.20; 95% CI, 1.21-22.29) and reports on novel study treatments (4.64; 95% CI, 0.98-22.02) were associated with more prevalent spin.
Figure 2. Differences in Spin Prevalence According to Different Study Characteristics.

HR indicates hazard ratio; ITT, intention to treat; NA, not applicable; PPS, per protocol set.
We examined the associations of spin level in the conclusion of the abstract for the factors with large differences in spin prevalence and relatively large numbers of studies in those categories. We did not include as targets for examination factors that had relatively few studies (eTable 2 in the Supplement).
Discussion
This systematic review found that in noninferiority oncology RCTs with results that were not statistically significant, three-quarters of reports contained spin. Direct comparisons are complicated because of the differences in study periods and the definition of spin. However, the spin prevalence was higher in these reports than in studies reporting superiority RCTs for cancer treatments6,7,8,9,10 (Figure 3). Certain design features in noninferiority trials make interpretation difficult, and certain regulatory authorities16,76 or guidelines77 have made recommendations for interpretation. These complexities could lead to a high spin prevalence.
Figure 3. Comparison of Spin Prevalence by Clinical Study Type.
The x-axis shows the point estimates and 95% CIs of the prevalence of spin.
aSpin prevalence for the abstract only.
We identified specific spin strategies in noninferiority RCTs. Based on our spin strategies, approximately 10% of reports focused on safety, a spin unique to noninferiority trials. Safety itself is only the rationale, or merely a prerequisite for conducting a noninferiority trial. Researchers should consider the benefit of an experimental treatment based on the results of a confirmatory analysis of efficacy, not on the safety of the treatment.
As with superiority trials, the most prevalent spin strategy was focusing on the secondary end point or subgroup analysis. This common strategy was consistent across all sections. Some spin focusing on the secondary end point or subgroup analysis was based on claiming similar efficacy without statistical consideration (eg, only with the point estimation value).
Our analysis found that the conclusions of RCTs showed the highest spin prevalence compared with other sections, a finding consistent with other reports6,7,9 and with important implications, as most physicians are interested only in the conclusions sections of publications.78 Spin-based reporting of P values resulted in distorted conclusions. Moreover, readers might interpret studies to have positive results even with results for the primary outcome that are not significant.79
Few studies have explored spin-related factors. A study by Khan et al80 reported that a small number of citations per year, a primary end point of efficacy or nonbinary outcome, use of a drug as a control treatment, and publication in specific journals were factors associated with high spin level in reports on superiority trials in the field of cardiology. In our study, we generated 2 hypotheses for factors associated with spin prevalence. The first factor was the rationale for the experimental treatment being novelty. To our knowledge, this is the first study to suggest the possibility of an association between the experimental treatment novelty and spin. Even if authors report negative results for an experimental treatment that is already in clinical use, a single clinical trial might not be enough to eliminate the given treatment from clinical practice. However, if the experimental treatment is not used in clinical practice, not only would the experimental treatment not become the standard of care but the results might not even get published (ie, publication bias). The second factor was a lack of funding from for-profit sources. While clinical trials funded by for-profit sources tend to report data that favor for-profit sources,81 and a study by Vera-Badillo et al10 reported that funding did not appear to be significantly associated with spin, our study found a different trend. We suggest that this could be because the organization responsible for reviewing the contents of an RCT report is likely to be well designed if the study was conducted in an environment with an external funding source. The organizational structure of an RCT is important from the perspective of the study’s feasibility and the resulting data’s reliability.82 Our findings suggest that the organizational structure might also be important for ensuring proper reporting. In this context, data managers might contribute not only to improving data reliability but to reducing the spin incidence in the reports.
To improve the quality of RCT reports, the International Committee of Medical Journal Editors mandated in 2004 that RCTs need to be registered for them to be published as academic reports.83 This requirement has resulted in an increasing proportion of cancer clinical trials registering the primary end point,84 as well as studies disclosing all details of the protocol in addition to the primary end point.85 These sustained efforts, in addition to the efforts of journal editors to reduce spin, might contribute to the decreasing trend of spin prevalence.86 We anticipate that more studies are likely to be reported properly in the future.
Limitations
This study has some limitations. We only searched PubMed; therefore, this review potentially excludes studies not present in PubMed. As this review was limited to studies on cancer, the conditions of spin in reports on other diseases might differ. While we examined the associations between trial characteristics and spin prevalence, the number of reports included was too small and the characteristics were heterogenic to examine the associations between these characteristics in detail. Moreover, in the multivariable analysis, we selected variables by backward stepwise selection and the results are only useful for hypothesis generation; other factors could be confounders for the factors we found associated with spin and were impossible to adjust for.
Conclusions
This systematic review found that 75% of negative reports on noninferiority RCTs of cancer treatments contained spin. Our results suggest that lack of funding from for-profit sources and novel experimental treatments may be associated with a high spin prevalence.
eMethods. PubMed Detailed Search Strategy
eTable 1. Examples of Spin in the Abstract Conclusion Section
eTable 2. Factors Associated With Level of Spin
eReferences
References
- 1.Friedman LM, Furberg C, DeMets DL. Fundamentals of Clinical Trials. 5th ed. Springer; 2015. doi: 10.1007/978-3-319-18539-2 [DOI] [Google Scholar]
- 2.Murad MH, Asi N, Alsawas M, Alahdab F. New evidence pyramid. Evid Based Med. 2016;21(4):125-127. doi: 10.1136/ebmed-2016-110401 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.ICH . Efficacy guidelines. Accessed April 1, 2021. https://www.ich.org/page/efficacy-guidelines
- 4.Unger JM, Barlow WE, Ramsey SD, LeBlanc M, Blanke CD, Hershman DL. The scientific impact of positive and negative phase 3 cancer clinical trials. JAMA Oncol. 2016;2(7):875-881. doi: 10.1001/jamaoncol.2015.6487 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Fletcher RH, Black B. “Spin” in scientific writing: scientific mischief and legal jeopardy. Med Law. 2007;26(3):511-525. [PubMed] [Google Scholar]
- 6.Boutron I, Dutton S, Ravaud P, Altman DG. Reporting and interpretation of randomized controlled trials with statistically nonsignificant results for primary outcomes. JAMA. 2010;303(20):2058-2064. doi: 10.1001/jama.2010.651 [DOI] [PubMed] [Google Scholar]
- 7.Wayant C, Margalski D, Vaughn K, Vassar M. Evaluation of spin in oncology clinical trials. Crit Rev Oncol Hematol. 2019;144:102821. doi: 10.1016/j.critrevonc.2019.102821 [DOI] [PubMed] [Google Scholar]
- 8.Vera-Badillo FE, Shapiro R, Ocana A, Amir E, Tannock IF. Bias in reporting of end points of efficacy and toxicity in randomized, clinical trials for women with breast cancer. Ann Oncol. 2013;24(5):1238-1244. doi: 10.1093/annonc/mds636 [DOI] [PubMed] [Google Scholar]
- 9.Ghannad M, Olsen M, Boutron I, Bossuyt PM. A systematic review finds that spin or interpretation bias is abundant in evaluations of ovarian cancer biomarkers. J Clin Epidemiol. 2019;116:9-17. doi: 10.1016/j.jclinepi.2019.07.011 [DOI] [PubMed] [Google Scholar]
- 10.Vera-Badillo FE, Napoleone M, Krzyzanowska MK, et al. Bias in reporting of randomised clinical trials in oncology. Eur J Cancer. 2016;61:29-35. doi: 10.1016/j.ejca.2016.03.066 [DOI] [PubMed] [Google Scholar]
- 11.Altwairgi AK, Booth CM, Hopman WM, Baetz TD. Discordance between conclusions stated in the abstract and conclusions in the article: analysis of published randomized controlled trials of systemic therapy in lung cancer. J Clin Oncol. 2012;30(28):3552-3557. doi: 10.1200/JCO.2012.41.8319 [DOI] [PubMed] [Google Scholar]
- 12.Boutron I, Altman DG, Hopewell S, Vera-Badillo F, Tannock I, Ravaud P. Impact of spin in the abstracts of articles reporting results of randomized controlled trials in the field of cancer: the SPIIN randomized controlled trial. J Clin Oncol. 2014;32(36):4120-4126. doi: 10.1200/JCO.2014.56.7503 [DOI] [PubMed] [Google Scholar]
- 13.Boutron I, Haneef R, Yavchitz A, et al. Three randomized controlled trials evaluating the impact of “spin” in health news stories reporting studies of pharmacologic treatments on patients’/caregivers’ interpretation of treatment benefit. BMC Med. 2019;17(1):105. doi: 10.1186/s12916-019-1330-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Haneef R, Yavchitz A, Ravaud P, et al. Interpretation of health news items reported with or without spin: protocol for a prospective meta-analysis of 16 randomised controlled trials. BMJ Open. 2017;7(11):e017425. doi: 10.1136/bmjopen-2017-017425 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hernandez AV, Pasupuleti V, Deshpande A, Thota P, Collins JA, Vidal JE. Deficient reporting and interpretation of non-inferiority randomized clinical trials in HIV patients: a systematic review. PLoS One. 2013;8(5):e63272. doi: 10.1371/journal.pone.0063272 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Center for Drug Evaluation and Research; Center for Biologics Evaluation and Research . Non-inferiority clinical trials. Accessed April 1, 2021. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/non-inferiority-clinical-trials
- 17.Lefebvre C, Manheimer E, Glanville J. Chapter 6: searching for studies. In: Higgins J, Green S, eds. Cochrane Handbook for Systematic Reviews of Interventions. Version 5.1.0. The Cochrane Collaboration; 2011. [Google Scholar]
- 18.Ochodo EA, de Haan MC, Reitsma JB, Hooft L, Bossuyt PM, Leeflang MM. Overinterpretation and misreporting of diagnostic accuracy studies: evidence of “spin”. Radiology. 2013;267(2):581-588. doi: 10.1148/radiol.12120527 [DOI] [PubMed] [Google Scholar]
- 19.Lazarus C, Haneef R, Ravaud P, Boutron I. Classification and prevalence of spin in abstracts of non-randomized studies evaluating an intervention. BMC Med Res Methodol. 2015;15:85. doi: 10.1186/s12874-015-0079-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Yavchitz A, Ravaud P, Altman DG, et al. A new classification of spin in systematic reviews and meta-analyses was developed and ranked according to the severity. J Clin Epidemiol. 2016;75:56-65. doi: 10.1016/j.jclinepi.2016.01.020 [DOI] [PubMed] [Google Scholar]
- 21.Mouffak A, Lepelley M, Revol B, et al. High prevalence of spin was found in pharmacovigilance studies using disproportionality analyses to detect safety signals: a meta-epidemiological study. J Clin Epidemiol. 2021;138:73-79. doi: 10.1016/j.jclinepi.2021.06.022 [DOI] [PubMed] [Google Scholar]
- 22.Agresti A, Coull BA. Approximate is better than “exact” for interval estimation of binomial proportions. Am Statistician. 1998;52(2):119-126. doi: 10.2307/2685469 [DOI] [Google Scholar]
- 23.Agresti A, Caffo B. Simple and effective confidence intervals for proportions and differences of proportions result from adding two successes and two failures. Am Statistician. 2000;54(4):280-288. doi: 10.2307/2685779 [DOI] [Google Scholar]
- 24.Yakoub-Agha I, Mary J-Y, Hulin C, et al. ; Intergroupe Francophone du Myélome (IFM) . Low-dose vs. high-dose thalidomide for advanced multiple myeloma: a prospective trial from the Intergroupe Francophone du Myélome. Eur J Haematol. 2012;88(3):249-259. doi: 10.1111/j.1600-0609.2011.01729.x [DOI] [PubMed] [Google Scholar]
- 25.Donker M, van Tienhoven G, Straver ME, et al. Radiotherapy or surgery of the axilla after a positive sentinel node in breast cancer (EORTC 10981-22023 AMAROS): a randomised, multicentre, open-label, phase 3 non-inferiority trial. Lancet Oncol. 2014;15(12):1303-1310. doi: 10.1016/S1470-2045(14)70460-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Moon SH, Cho KH, Chung EJ, et al. A prospective randomized trial comparing hypofractionation with conventional fractionation radiotherapy for T1-2 glottic squamous cell carcinomas: results of a Korean Radiation Oncology Group (KROG-0201) study. Radiother Oncol. 2014;110(1):98-103. doi: 10.1016/j.radonc.2013.09.016 [DOI] [PubMed] [Google Scholar]
- 27.Stewart AK, Jacobus S, Fonseca R, et al. Melphalan, prednisone, and thalidomide vs melphalan, prednisone, and lenalidomide (ECOG E1A06) in untreated multiple myeloma. Blood. 2015;126(11):1294-1301. doi: 10.1182/blood-2014-12-613927 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Schuette W, Schneider C-P, Engel-Riedel W, et al. 65Plus: open-label study of bevacizumab in combination with pemetrexed or pemetrexed/carboplatin as first-line treatment of patients with advanced or recurrent nonsquamous non-small-cell lung cancer. Lung Cancer (Auckl). 2017;8:217-229. doi: 10.2147/LCTT.S142972 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Park C-K, Oh I-J, Kim K-S, et al. Randomized phase III study of docetaxel plus cisplatin versus pemetrexed plus cisplatin as first-line treatment of nonsquamous non-small-cell lung cancer: a TRAIL trial. Clin Lung Cancer. 2017;18(4):e289-e296. doi: 10.1016/j.cllc.2017.01.002 [DOI] [PubMed] [Google Scholar]
- 30.Lee K-W, Maeng CH, Kim T-Y, et al. A phase III study to compare the efficacy and safety of paclitaxel versus irinotecan in patients with metastatic or recurrent gastric cancer who failed in first-line therapy (KCSG ST10-01). Oncologist. 2019;24(1):18-e24. doi: 10.1634/theoncologist.2018-0142 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Paul SB, Acharya SK, Gamanagatti SR, Sreenivas V, Shalimar S, Gulati MS. Acetic acid versus radiofrequency ablation for the treatment of hepatocellular carcinoma: A randomized controlled trial. Diagn Interv Imaging. 2020;101(2):101-110. doi: 10.1016/j.diii.2019.06.011 [DOI] [PubMed] [Google Scholar]
- 32.Schrappe M, Bleckmann K, Zimmermann M, et al. Reduced-intensity delayed intensification in standard-risk pediatric acute lymphoblastic leukemia defined by undetectable minimal residual disease: Results of an international randomized trial (AIEOP-BFM ALL 2000). J Clin Oncol. 2018;36(3):244-253. doi: 10.1200/JCO.2017.74.4946 [DOI] [PubMed] [Google Scholar]
- 33.Donnelly BJ, Saliken JC, Brasher PMA, et al. A randomized trial of external beam radiotherapy versus cryoablation in patients with localized prostate cancer. Cancer. 2010;116(2):323-330. doi: 10.1002/cncr.24779 [DOI] [PubMed] [Google Scholar]
- 34.Roos DE, Smith JG, Stephens SW. Radiosurgery versus surgery, both with adjuvant whole brain radiotherapy, for solitary brain metastases: a randomised controlled trial. Clin Oncol (R Coll Radiol). 2011;23(9):646-651. doi: 10.1016/j.clon.2011.04.009 [DOI] [PubMed] [Google Scholar]
- 35.Van Poppel H, Da Pozzo L, Albrecht W, et al. A prospective, randomised EORTC intergroup phase 3 study comparing the oncologic outcome of elective nephron-sparing surgery and radical nephrectomy for low-stage renal cell carcinoma. Eur Urol. 2011;59(4):543-552. doi: 10.1016/j.eururo.2010.12.013 [DOI] [PubMed] [Google Scholar]
- 36.Díaz-Rubio E, Gómez-España A, Massutí B, et al. ; Spanish Cooperative Group for the Treatment of Digestive Tumors . First-line XELOX plus bevacizumab followed by XELOX plus bevacizumab or single-agent bevacizumab as maintenance therapy in patients with metastatic colorectal cancer: the phase III MACRO TTD study. Oncologist. 2012;17(1):15-25. doi: 10.1634/theoncologist.2011-0249 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Walker JL, Piedmonte MR, Spirtos NM, et al. Recurrence and survival after random assignment to laparoscopy versus laparotomy for comprehensive surgical staging of uterine cancer: Gynecologic Oncology Group LAP2 Study. J Clin Oncol. 2012;30(7):695-700. doi: 10.1200/JCO.2011.38.8645 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Johnson PJ, Qin S, Park J-W, et al. Brivanib versus sorafenib as first-line therapy in patients with unresectable, advanced hepatocellular carcinoma: results from the randomized phase III BRISK-FL study. J Clin Oncol. 2013;31(28):3517-3524. doi: 10.1200/JCO.2012.48.4410 [DOI] [PubMed] [Google Scholar]
- 39.Lück H-J, Du Bois A, Loibl S, et al. Capecitabine plus paclitaxel versus epirubicin plus paclitaxel as first-line treatment for metastatic breast cancer: efficacy and safety results of a randomized, phase III trial by the AGO Breast Cancer Study Group. Breast Cancer Res Treat. 2013;139(3):779-787. doi: 10.1007/s10549-013-2589-8 [DOI] [PubMed] [Google Scholar]
- 40.Hussain M, Tangen CM, Berry DL, et al. Intermittent versus continuous androgen deprivation in prostate cancer. N Engl J Med. 2013;368(14):1314-1325. doi: 10.1056/NEJMoa1212299 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hoskin PJ, Kirkwood AA, Popova B, et al. 4 Gy versus 24 Gy radiotherapy for patients with indolent lymphoma (FORT): a randomised phase 3 non-inferiority trial. Lancet Oncol. 2014;15(4):457-463. doi: 10.1016/S1470-2045(14)70036-1 [DOI] [PubMed] [Google Scholar]
- 42.Bath-Hextall F, Ozolins M, Armstrong SJ, et al. ; Surgery versus Imiquimod for Nodular Superficial basal cell carcinoma (SINS) study group . Surgical excision versus imiquimod 5% cream for nodular and superficial basal-cell carcinoma (SINS): a multicentre, non-inferiority, randomised controlled trial. Lancet Oncol. 2014;15(1):96-105. doi: 10.1016/S1470-2045(13)70530-8 [DOI] [PubMed] [Google Scholar]
- 43.Zaghloul MS, Eldebawy E, Ahmed S, et al. Hypofractionated conformal radiotherapy for pediatric diffuse intrinsic pontine glioma (DIPG): a randomized controlled trial. Radiother Oncol. 2014;111(1):35-40. doi: 10.1016/j.radonc.2014.01.013 [DOI] [PubMed] [Google Scholar]
- 44.Radford J, Illidge T, Counsell N, et al. Results of a trial of PET-directed therapy for early-stage Hodgkin’s lymphoma. N Engl J Med. 2015;372(17):1598-1607. doi: 10.1056/NEJMoa1408648 [DOI] [PubMed] [Google Scholar]
- 45.Yokomizo A, Kanimoto Y, Okamura T, et al. Randomized controlled study of the efficacy, safety and quality of life with low dose bacillus calmette-guérin instillation therapy for nonmuscle invasive bladder cancer. J Urol. 2016;195(1):41-46. doi: 10.1016/j.juro.2015.08.075 [DOI] [PubMed] [Google Scholar]
- 46.Cainap C, Qin S, Huang W-T, et al. Linifanib versus Sorafenib in patients with advanced hepatocellular carcinoma: results of a randomized phase III trial. J Clin Oncol. 2015;33(2):172-179. doi: 10.1200/JCO.2013.54.3298 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Johnson P, Federico M, Kirkwood A, et al. Adapted treatment guided by interim pet-ct scan in advanced hodgkin’s lymphoma. N Engl J Med. 2016;374(25):2419-2429. doi: 10.1056/NEJMoa1510093 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Eichhorst B, Fink A-M, Bahlo J, et al. ; international group of investigators; German CLL Study Group (GCLLSG) . First-line chemoimmunotherapy with bendamustine and rituximab versus fludarabine, cyclophosphamide, and rituximab in patients with advanced chronic lymphocytic leukaemia (CLL10): an international, open-label, randomised, phase 3, non-inferiority trial. Lancet Oncol. 2016;17(7):928-942. doi: 10.1016/S1470-2045(16)30051-1 [DOI] [PubMed] [Google Scholar]
- 49.Noronha V, Joshi A, Patil VM, et al. Once-a-week versus once-every-3-weeks cisplatin chemoradiation for locally advanced head and neck cancer: a phase iii randomized noninferiority trial. J Clin Oncol. 2018;36(11):1064-1072. doi: 10.1200/JCO.2017.74.9457 [DOI] [PubMed] [Google Scholar]
- 50.Kitano S, Inomata M, Mizusawa J, et al. Survival outcomes following laparoscopic versus open D3 dissection for stage II or III colon cancer (JCOG0404): a phase 3, randomised controlled trial. Lancet Gastroenterol Hepatol. 2017;2(4):261-268. doi: 10.1016/S2468-1253(16)30207-2 [DOI] [PubMed] [Google Scholar]
- 51.Conte P, Frassoldati A, Bisagni G, et al. Nine weeks versus 1 year adjuvant trastuzumab in combination with chemotherapy: final results of the phase III randomized Short-HER study. Ann Oncol. 2018;29(12):2328-2333. doi: 10.1093/annonc/mdy414 [DOI] [PubMed] [Google Scholar]
- 52.Joensuu H, Fraser J, Wildiers H, et al. Effect of adjuvant trastuzumab for a duration of 9 weeks vs 1 year with concomitant chemotherapy for early human epidermal growth factor receptor 2-positive breast cancer: the sold randomized clinical trial. JAMA Oncol. 2018;4(9):1199-1206. doi: 10.1001/jamaoncol.2018.1380 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Grothey A, Sobrero AF, Shields AF, et al. Duration of adjuvant chemotherapy for stage iii colon cancer. N Engl J Med. 2018;378(13):1177-1188. doi: 10.1056/NEJMoa1713709 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Kodaira T, Kagami Y, Shibata T, et al. ; Radiation Therapy Study Group of the Japan Clinical Oncology Group . Results of a multi-institutional, randomized, non-inferiority, phase III trial of accelerated fractionation versus standard fractionation in radiation therapy for T1-2N0M0 glottic cancer: Japan Clinical Oncology Group Study (JCOG0701). Ann Oncol. 2018;29(4):992-997. doi: 10.1093/annonc/mdy036 [DOI] [PubMed] [Google Scholar]
- 55.Barendse RM, Musters GD, de Graaf EJR, et al. ; TREND Study group . Randomised controlled trial of transanal endoscopic microsurgery versus endoscopic mucosal resection for large rectal adenomas (TREND Study). Gut. 2018;67(5):837-846. doi: 10.1136/gutjnl-2016-313101 [DOI] [PubMed] [Google Scholar]
- 56.Fujiwara Y, Mukai H, Saeki T, et al. A multi-national, randomised, open-label, parallel, phase III non-inferiority study comparing NK105 and paclitaxel in metastatic or recurrent breast cancer patients. Br J Cancer. 2019;120(5):475-480. doi: 10.1038/s41416-019-0391-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Modest DP, Fischer von Weikersthal L, Decker T, et al. ; XELAVIRI/AIO KRK0110 Investigators . Sequential versus combination therapy of metastatic colorectal cancer using fluoropyrimidines, irinotecan, and bevacizumab: a randomized, controlled study—XELAVIRI (AIO KRK0110). J Clin Oncol. 2019;37(1):22-32. doi: 10.1200/JCO.18.00052 [DOI] [PubMed] [Google Scholar]
- 58.Claessens AKM, Bos MEMM, Lopez-Yurda M, et al. ; Dutch Breast Cancer Research Group (BOOG) . Intermittent versus continuous first-line treatment for HER2-negative metastatic breast cancer: the Stop & Go study of the Dutch Breast Cancer Research Group (BOOG). Breast Cancer Res Treat. 2018;172(2):413-423. doi: 10.1007/s10549-018-4906-8 [DOI] [PubMed] [Google Scholar]
- 59.Gillison ML, Trotti AM, Harris J, et al. Radiotherapy plus cetuximab or cisplatin in human papillomavirus-positive oropharyngeal cancer (NRG Oncology RTOG 1016): a randomised, multicentre, non-inferiority trial. Lancet. 2019;393(10166):40-50. doi: 10.1016/S0140-6736(18)32779-X [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ramirez PT, Frumovitz M, Pareja R, et al. Minimally invasive versus abdominal radical hysterectomy for cervical cancer. N Engl J Med. 2018;379(20):1895-1904. doi: 10.1056/NEJMoa1806395 [DOI] [PubMed] [Google Scholar]
- 61.Retz M, Bedke J, Bögemann M, et al. SWITCH II: phase III randomized, sequential, open-label study to evaluate the efficacy and safety of sorafenib-pazopanib versus pazopanib-sorafenib in the treatment of advanced or metastatic renal cell carcinoma (AUO AN 33/11). Eur J Cancer. 2019;107:37-45. doi: 10.1016/j.ejca.2018.11.001 [DOI] [PubMed] [Google Scholar]
- 62.Miyake Y, Nishimura J, Kato T, et al. ; Multi-center Clinical Study Group of Osaka, Colorectal Cancer Treatment Group (MCSGO) . Phase III trial comparing UFT + PSK to UFT + LV in stage IIB, III colorectal cancer (MCSGO-CCTG). Surg Today. 2018;48(1):66-72. doi: 10.1007/s00595-017-1555-1 [DOI] [PubMed] [Google Scholar]
- 63.Fujita S, Mizusawa J, Kanemitsu Y, et al. ; Colorectal Cancer Study Group of Japan Clinical Oncology Group . Mesorectal excision with or without lateral lymph node dissection for clinical stage II/III lower rectal cancer (JCOG0212): a multicenter, randomized controlled, noninferiority trial. Ann Surg. 2017;266(2):201-207. doi: 10.1097/SLA.0000000000002212 [DOI] [PubMed] [Google Scholar]
- 64.Mavroudis D, Matikas A, Malamos N, et al. ; Breast Cancer Investigators of the Hellenic Oncology Research Group (HORG), Athens, Greece . Dose-dense FEC followed by docetaxel versus docetaxel plus cyclophosphamide as adjuvant chemotherapy in women with HER2-negative, axillary lymph node-positive early breast cancer: a multicenter randomized study by the Hellenic Oncology Research Group (HORG). Ann Oncol. 2016;27(10):1873-1878. doi: 10.1093/annonc/mdw274 [DOI] [PubMed] [Google Scholar]
- 65.Leyland-Jones B, Bondarenko I, Nemsadze G, et al. A randomized, open-label, multicenter, phase III study of epoetin alfa versus best standard of care in anemic patients with metastatic breast cancer receiving standard chemotherapy. J Clin Oncol. 2016;34(11):1197-1207. doi: 10.1200/JCO.2015.63.5649 [DOI] [PubMed] [Google Scholar]
- 66.Urata Y, Katakami N, Morita S, et al. Randomized phase III study comparing gefitinib with erlotinib in patients with previously treated advanced lung adenocarcinoma: wjog 5108l. J Clin Oncol. 2016;34(27):3248-3257. doi: 10.1200/JCO.2015.63.4154 [DOI] [PubMed] [Google Scholar]
- 67.Gelmon KA, Boyle FM, Kaufman B, et al. Lapatinib or trastuzumab plus taxane therapy for human epidermal growth factor receptor 2–positive advanced breast cancer: final results of NCIC CTG MA.31. J Clin Oncol. 2015;33(14):1574-1583. doi: 10.1200/JCO.2014.56.9590 [DOI] [PubMed] [Google Scholar]
- 68.Mavroudis D, Saloustros E, Malamos N, et al. ; Breast Cancer Investigators of Hellenic Oncology Research Group (HORG), Athens, Greece . Six versus 12 months of adjuvant trastuzumab in combination with dose-dense chemotherapy for women with HER2-positive breast cancer: a multicenter randomized study by the Hellenic Oncology Research Group (HORG). Ann Oncol. 2015;26(7):1333-1340. doi: 10.1093/annonc/mdv213 [DOI] [PubMed] [Google Scholar]
- 69.Korfel A, Thiel E, Martus P, et al. Randomized phase III study of whole-brain radiotherapy for primary CNS lymphoma. Neurology. 2015;84(12):1242-1248. doi: 10.1212/WNL.0000000000001395 [DOI] [PubMed] [Google Scholar]
- 70.Koeberle D, Betticher DC, von Moos R, et al. Bevacizumab continuation versus no continuation after first-line chemotherapy plus bevacizumab in patients with metastatic colorectal cancer: a randomized phase III non-inferiority trial (SAKK 41/06). Ann Oncol. 2015;26(4):709-714. doi: 10.1093/annonc/mdv011 [DOI] [PubMed] [Google Scholar]
- 71.Hirao M, Kurokawa Y, Fujita J, et al. ; Osaka University Clinical Research Group for Gastroenterological Study . Long-term outcomes after prophylactic bursectomy in patients with resectable gastric cancer: final analysis of a multicenter randomized controlled trial. Surgery. 2015;157(6):1099-1105. doi: 10.1016/j.surg.2014.12.024 [DOI] [PubMed] [Google Scholar]
- 72.Middleton G, Brown S, Lowe C, et al. A randomised phase III trial of the pharmacokinetic biomodulation of irinotecan using oral ciclosporin in advanced colorectal cancer: results of the Panitumumab, Irinotecan & Ciclosporin in Colorectal cancer therapy trial (PICCOLO). Eur J Cancer. 2013;49(16):3507-3516. doi: 10.1016/j.ejca.2013.06.017 [DOI] [PubMed] [Google Scholar]
- 73.Iwata H, Masuda N, Ohno S, et al. A randomized, double-blind, controlled study of exemestane versus anastrozole for the first-line treatment of postmenopausal Japanese women with hormone-receptor-positive advanced breast cancer. Breast Cancer Res Treat. 2013;139(2):441-451. doi: 10.1007/s10549-013-2573-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Thiel E, Korfel A, Martus P, et al. High-dose methotrexate with or without whole brain radiotherapy for primary CNS lymphoma (G-PCNSL-SG-1): a phase 3, randomised, non-inferiority trial. Lancet Oncol. 2010;11(11):1036-1047. doi: 10.1016/S1470-2045(10)70229-1 [DOI] [PubMed] [Google Scholar]
- 75.Fuchs M, Goergen H, Kobe C, et al. Positron emission tomography-guided treatment in early-stage favorable hodgkin lymphoma: final results of the international, randomized phase III HD16 trial by the German Hodgkin study group. J Clin Oncol. 2019;37(31):2835-2845. doi: 10.1200/JCO.19.00964 [DOI] [PubMed] [Google Scholar]
- 76.European Medicines Agency . Switching between superiority and non-inferiority. Accessed April 1, 2021. https://www.ema.europa.eu/en/switching-between-superiority-non-inferiority
- 77.Piaggio G, Elbourne DR, Pocock SJ, Evans SJW, Altman DG; CONSORT Group . Reporting of noninferiority and equivalence randomized trials: extension of the CONSORT 2010 statement. JAMA. 2012;308(24):2594-2604. doi: 10.1001/jama.2012.87802 [DOI] [PubMed] [Google Scholar]
- 78.Hisham R, Ng CJ, Liew SM, Hamzah N, Ho GJ. Why is there variation in the practice of evidence-based medicine in primary care: a qualitative study. BMJ Open. 2016;6(3):e010565. doi: 10.1136/bmjopen-2015-010565 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Ioannidis JPA. What have we (not) learnt from millions of scientific papers with P values? Am Stat. 2019;73(suppl 1):20-25. doi: 10.1080/00031305.2018.1447512 [DOI] [Google Scholar]
- 80.Khan MS, Lateef N, Siddiqi TJ, et al. Level and prevalence of spin in published cardiovascular randomized clinical trial reports with statistically nonsignificant primary outcomes: a systematic review. JAMA Netw Open. 2019;2(5):e192622. doi: 10.1001/jamanetworkopen.2019.2622 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Lexchin J, Bero LA, Djulbegovic B, Clark O. Pharmaceutical industry sponsorship and research outcome and quality: systematic review. BMJ. 2003;326(7400):1167-1170. doi: 10.1136/bmj.326.7400.1167 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Abebe KZ, Althouse AD, Comer D, et al. Creating an academic research organization to efficiently design, conduct, coordinate, and analyze clinical trials: the Center for Clinical Trials & Data Coordination. Contemp Clin Trials Commun. 2019;16:100488. doi: 10.1016/j.conctc.2019.100488 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.DeAngelis CD, Drazen JM, Frizelle FA, et al. ; International Committee of Medical Journal Editors . Clinical trial registration: a statement from the International Committee of Medical Journal Editors. JAMA. 2004;292(11):1363-1364. doi: 10.1001/jama.292.11.1363 [DOI] [PubMed] [Google Scholar]
- 84.You B, Gan HK, Pond G, Chen EX. Consistency in the analysis and reporting of primary end points in oncology randomized controlled trials from registration to publication: a systematic review. J Clin Oncol. 2012;30(2):210-216. doi: 10.1200/JCO.2011.37.0890 [DOI] [PubMed] [Google Scholar]
- 85.Song SY, Kim E. The clinical trial transparency in oncology significantly increased over the recent years. J Clin Epidemiol. 2020;119:100-108. doi: 10.1016/j.jclinepi.2019.11.018 [DOI] [PubMed] [Google Scholar]
- 86.Ghannad M, Yang B, Leeflang M, et al. A randomized trial of an editorial intervention to reduce spin in the abstract’s conclusion of manuscripts showed no significant effect. J Clin Epidemiol. 2021;130:69-77. doi: 10.1016/j.jclinepi.2020.10.014 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
eMethods. PubMed Detailed Search Strategy
eTable 1. Examples of Spin in the Abstract Conclusion Section
eTable 2. Factors Associated With Level of Spin
eReferences


