POINT
In the first quarter of 2014, the FDA appointed an independent expert panel that, after carefully reviewing the evidence, unanimously recommended that the human papillomavirus (HPV) primary-screening algorithm proposed in Roche's ATHENA trial be approved as safe and effective. The FDA in its own independent analysis concurred and recommended that the cobas HPV test be approved for use as a primary-screening test for cervical cancer using the proposed algorithm. As of this writing, the 3-year data from the ATHENA trial are nearing publication, as is the interim guidance commentary organized by the Society of Gynecologic Oncology (1, 2). As one who has been involved in most of the HPV diagnostic trials as well as the HPV vaccine trials, I have been asked to give my perspective on why I favor HPV primary screening rather than cotesting as the best approach to cervical cancer screening. Implicit in this request is the absence from the discussion of cytology alone as an option, yet it is the comparison of cytology to HPV testing, not cotesting, that has provided the data that many others and I have found so convincing that I, a long-practicing cytopathologist, now believe that the best way forward is primary HPV testing. The rationale for my viewpoint can be captured in three words: science, safety, and simplicity.
When I was president of the ASCP, one of the guiding strategies for the organization was the concept of patient-centered advocacy, and it remains a guiding strategy. Advocate for and do what is best for patients, based on the best available medical and scientific data, regardless of cost, including personal cost, cost to one's practice, or cost to one's institution, and that should always be the winning strategy (3). Knowing what is best may not always be simple, but the concept of balancing benefits and harms regardless of cost permeates the current guidelines for cervical cancer screening (4). As one examines the evidence, this emphasis on balance leads one to the nonintuitive concept that more is not always better. For cervical cancer, this means that excess screening, especially for a disease of such low prevalence in screened populations, may do more harm than good. Hence, contemporary approaches to cervical cancer screening seek to balance the benefits of screening with the harms of overreferral and overtreatment.
It is this balance of benefits versus harm that is a primary consideration in comparing the concept of HPV primary screening to that of cotesting. The data influencing these considerations are perhaps some of the most robust in all of medicine. Where else in medicine can one find combined data from published large-scale clinical trials that encompass approximately 1.2 million women in 6 countries (5–8)? There are six randomized controlled clinical trials, including studies performed in the United States, the Netherlands, Sweden, Italy, England, and Canada, all with a minimum of 3 years of follow-up and some with up to 12 years. Remarkably, the details of the trial designs vary considerably, yet the conclusions are fundamentally the same, cited as strong statistical evidence that we are indeed on the right track (5). HPV testing is superior to cytology alone and dominates in terms of clinical performance. Furthermore, the potential contribution of cotesting over HPV primary screening is extremely limited; i.e., more than 95% of the clinical utility in a cotesting scenario comes from the HPV test. The above-mentioned general conclusions are true not only for the detection of precancerous states (cervical intraepithelial neoplasia grade 2 or worse [CIN2+] or CIN3+) but also now for the prevention of invasive cancer, as detailed in the 2014 meta-analysis of the four European randomized controlled trials (5).
Prior pooled analyses from the European trials showed a 30 to 40% gain in the sensitivity of detection of CIN3+ for HPV over cytology, with minimal if any gain attributable to doing both tests, while suggesting an interval protection of at least 5 years (9). In the updated analysis with a median of 6.5 years of follow-up and data on 176,464 women covering 1,214,415 person years, the data demonstrate that HPV-based screening provided 60 to 70% greater protection against invasive cancer than other forms of screening (5). This expansion to invasive cancer required the pooling of data and long-term analysis, as the prevalence of invasive cancer is so low in screened populations. The effect of HPV testing was not seen in the first 2.5 years but was highly significant thereafter, supporting the concepts that cytology detects disease that the patient has now but that HPV testing not only detects current disease but also is better at predicting the risk of disease development. HPV testing was also superior for detection of adenocarcinoma, a finding not seen in most cytology studies. Importantly, the authors of the British ARTISTIC trial were also authors of this analysis; the initial ARTISTIC report is often cited as evidence that cytology can be as good as or better than HPV testing, a finding later thought to be related to the design of the study and the fact that liquid-based cytology (LBC) was just introduced in the population at the start of the trial (10, 11). This relative inexperience with LBC caused an initial apparent increase in sensitivity due to overdiagnosing of minor cytological abnormalities, and these diagnoses were not sustained in subsequent rounds of the trial (11).
The United States-based ATHENA trial was Roche's FDA prospective registration trial for the cobas HPV test (2, 7, 12–15). The trial was designed to potentially validate a unified and simplified HPV primary-screening algorithm for women who were 25 years old and older. The algorithm capitalizes on the sensitivity and specificity of genotyping to minimize loss to follow-up and maximize disease ascertainment. As noted above, independent reviews support the manufacturer's claims that the proposed primary-screening algorithm is superior to cytology alone and equivalent to or better than cotesting for the prevention of CIN3 and invasive cancer (K. Simon and M. Kondratovich, talk presented at the 2014 Microbiology Devices Advisory Committee Meeting [outline available at http://www.fda.gov/AdvisoryCommittees/CommitteesMeetingMaterials/MedicalDevices/MedicalDevicesAdvisoryCommittee/MicrobiologyDevicesPanel/ucm388531.htm]). Much of the improvement in performance relative to cotesting comes from the extension of HPV screening and genotyping to the age group 25 to 30 years, for which current guidelines do not advocate HPV testing except for ASCUS triage. Yet this is a critical age group, as there is more CIN3 in this group than in all of the women 40 years old and older in the ATHENA study (2; Simon and Kondratovich, talk presented at the 2014 Microbiology Devices Advisory Committee Meeting). The potential for overtreatment in this age group needs to be balanced against the fact that without treatment of precancer, there is virtually no benefit to screening. Furthermore, the potential harms of treatments may be overestimated in the literature (16).
Critics of ATHENA conclusions point to the low sensitivity of cytology in the study, but cytology was used such that it had maximum potential for sensitivity, as all patients with abnormal cytology were referred for colposcopy (Simon and Kondratovich, talk presented at the 2014 Microbiology Devices Advisory Committee Meeting). The verification bias-adjusted (VBA) sensitivities of cytology (42.6%) versus HPV primary algorithms (not tests, as in reference 17) at 58.3% seem low to some, relative to results in the literature (17), yet the reader should be cautioned to make apples-to-apples comparisons. The only other large VBA randomized controlled trial comparing cytology testing to HPV testing is the Canadian Cervical Cancer Screening Trial (CCCaST), which found essentially the same relative numbers as those found in the ATHENA study (6). All the other relevant studies include unadjusted numbers, including the original and oft-cited Duke review, where cytology was estimated to have a single-round sensitivity of 50% (18). In addition, verification bias adjustment, because it extrapolates from a population of cotest-negative patients, impacts both tests similarly, thereby not changing the rank order comparisons. The validity of verification bias adjustment for very rare events has also recently been called into question (7, 19). The truth about test performance is probably between the two estimates; the actual rates are more likely in the real world closer to the unadjusted rates. However, the relative performances are the same and, as in all the trials, favor HPV testing over cytology, with performance equivalent to or better than that of cotesting, depending mostly on the frequency of screening.
Since at least 2003, the National Cancer Institute (NCI) in collaboration with Kaiser Permanente of Northern California (KPNC) has been analyzing the screening histories of women who have been tested with both an HPV test and cytology. This data set, now including over a million women, has been widely cited in guideline development (4, 8, 20). As published in 2014, the reassurance provided by primary HPV testing every 3 years against the future risk of precancer and cancer was superior to that provided by Pap testing every 3 years as well as cotesting every 5 years, which serves as independent real-world confirmation of the ATHENA analysis (8). Suggestions in the United States that the 5-year interval for cotesting may be too long might be balanced by the fact that outside the United States, no country is considering any form of cotesting, and some think that 5 years may be too short! In the KPNC analysis, even small differences between very low risks can be statistically significant because of the size of the population. When the cancer risks after an HPV-negative result are compared with cancer risks after a cotest-negative result at the same time point, some of the risks have overlapping 95% confidence intervals. The 3-year cancer risk after an HPV-negative versus cotest-negative result is statistically significant (0.011 versus 0.007, P = 0.03). However, the 5-year cancer risk after an HPV-negative result versus a cotest-negative result is not (0.017 versus 0.014, P = 0.11). A 0.004% gain in performance translates to ∼1 additional case per 25,000 women, a level that most individual practices would never be able to perceive.
One other very recent independent analysis based on modeling and consideration of the impact of HPV vaccination comes from Australia. My cytopathology colleagues in Australia have proudly presented the quality of their cytology-based screening program and have relatively resisted the need for implementing HPV testing (21). Yet, based on their systematic review and modeling, effective in 2016, algorithmic primary HPV screening every 5 years will replace cytology screening every 2 years because it is deemed to be safer and more effective (National Screening Program Renewal [http://www.cancerscreening.gov.au/internet/screening/publishing.nsf/Content/future-changes-cervical]).
Much of the safety of HPV primary screening is inherent in the science above. Finding more, real disease and treating it to prevent cancer has always motivated the fight against cervical cancer. While some of the potential criticisms or concerns have been addressed, in the United States, many have observed that screening is more opportunistic and less organized, unlike in the much better organized health care settings in Europe. Yet for women who do get screened, should one not use a screening approach that is as safe, effective, and simple to comply with as possible? As they say, “all politics is local,” and in the United States, most women are in the hands of their local medical practitioner, who I believe ultimately will want to do the right thing based on the best science. I submit, as would any epidemiologist, that doing the most sensitive test first at the right interval is more likely to achieve these goals, and that demands a change in historical practice. Complexity is the enemy of compliance, and certainly simplicity in screening programs should improve adoption of guidelines (22). One should point out that cotesting was initiated in the United States, without any clinical trial, in order to compensate for the perceived lack of performance of cytology-based screening, yet one of the unintended consequences of cotesting is the creation of significant algorithmic complexity (20). Because we are coming from a cytology-based approach, the algorithms still account for the significance of the numerous Bethesda categories. If one simply multiplies the 7 main cytology interpretations by the 2 main possible HPV results, that yields 14 different outcomes, and that is without genotyping. In contrast, HPV primary screening treats cytology dichotomously, as positive or negative, greatly simplifying the colposcopy versus no-colposcopy decision point. So yes, adding primary HPV testing to the menu of possible approaches to cervical cancer screening, namely, cytology alone, ASCUS triage, and cotesting, each with potentially different frequencies or mixing of approaches depending on age, adds complexity. However, within a main strategy, primary HPV testing has the virtue of a single unified approach for virtually the entire population aged 25 years and older, with the simplest to follow within the algorithm process.
The final concern of many critics is that we have more than 60 years of experience with cytology-based screening, but ATHENA lasted only 3 years and we need more data and long-term follow-up in the United States. Do we really? Do we ignore the assembled wealth of the data discussed above and not appreciate a more global point of view? Do we in the United States, actually care more about our patients than our colleagues in Canada, Europe, and Australia? Or can we not all agree, based on the data, that there should be a reasonable and standard approach to screening for cervical cancer, at least in health care systems that can afford and support it? Furthermore, given the costs involved and the absence of government support, it is unlikely that there will ever be the kind of multiround 6- to 10-year trial in the United States. In my opinion, such a study is not needed because the existing worldwide data are all consistent with the results of the ATHENA study and sufficient.
In summary, the subject of this exchange of viewpoints is focused on the choice between two superior choices, cotesting and HPV primary screening. While the devil is always in the details, it is indeed remarkable that neither of us argues in favor of cytology alone as the primary screening test. However, it should be pointed out, as it was in the FDA review, that more than half of the women screened in the United States are still screened with Pap smears alone. Indeed, the recently published guidelines state that cytology alone every 3 years is acceptable medical practice even though much of the above data were already available before 2012. Perhaps the real debate should focus on moving the standard of care to what we now know is best and then implementing it for all who need it.
Mark H. Stoler