Cureus. 2019 Aug 29;11(8):e5513. doi: 10.7759/cureus.5513

Mining Google Trends Data for Health Information: The Case of the Irish “CervicalCheck” Screening Programme Revelations

Paul M Ryan 1, C Anthony Ryan 2
Editors: Alexander Muacevic, John R Adler
PMCID: PMC6818734  PMID: 31687289

Abstract

Background

In April 2018, the Irish cervical smear screening programme, “CervicalCheck”, came under intense scrutiny as the accuracy of hundreds of “negative” results was brought into question.

Aim

The goal of this brief report was to assess the impact of this real-life event on public information-seeking behaviour, using Google search anomalies as a proxy. Irish relative search volume data for several terms relating to cervical testing/cancer and human papillomavirus were extracted for a five-year period from February 2014 to January 2019 and analysed for the presence of anomalous spikes and shifts in the mean baseline.

Results

An unprecedented positive spike in searches relating to cervical testing/cancer was observed immediately after the CervicalCheck revelations, and search volume remained anomalous for the following month (p < 0.05). This public interest preceded a corresponding increase in uptake of complimentary consultations offered by the Department of Health to the women concerned. Despite this service engagement and interest in cervical health, the relative search volumes for the terms “human papillomavirus infection” and “HPV vaccine” were just 78% and 51% of their maximum search volume for the five-year period.

Conclusions

Anomaly analysis revealed an unprecedented spike in information-seeking behaviour following the CervicalCheck revelations. However, this was not associated with a comparable elevation in HPV interest. This suggests that more public education and promotion of the HPV vaccine is warranted, in the context of vastly reduced uptake in recent years. Finally, Google Trends data represent a free and open-source means by which to assess information-seeking behaviour of the public in relation to health and disease.

Keywords: cervicalcheck, smear test, hpv, google trends

Introduction

“CervicalCheck” is the Irish national screening programme, which allows women aged 25-60 years to avail of a free cervical smear test. The cytological analysis has been carried out by two Irish and two US laboratories since 2008. Until recently, the programme appeared to be functioning well and, as such, garnered little attention. In April 2018, however, it was revealed that roughly 220 women with cervical cancer were never informed that their negative smear test results were inaccurate. A subsequent inquiry into the revelations found that, although the laboratories contracted to perform the sample processing and testing were safe, there were serious failures in the governance and structure of the screening programme, including a grave lack of transparency within ongoing performance audits, which led to harmful delays in many diagnoses. This revelation and the lack of publicly available information combined to create a media frenzy, which further fuelled the considerable public anxiety around the issue.

Human papillomavirus (HPV) is associated with virtually all cervical cancers, and a significant proportion of sexually active individuals will come into contact with the virus at some stage [1]. In 2010, Ireland launched its school-based HPV vaccination programme for girls aged 12-13 years and initially enjoyed impressive compliance of 80%, with a peak of 86.9% in 2014-2015 [2]. This success occurred in a climate in which mothers were found to have minimal knowledge of HPV or the vaccine, which left space for anti-vaccination lobby groups to spread misinformation [3]. Regrettably, the influence of anti-vaccination lobby groups on parental choice became a considerable issue in 2016, resulting in a drop in compliance to ~50%.

Google Trends search data has proven to be a valuable complementary tool in epidemiological and public health research in recent years. Most recently, Google search data was used to demonstrate the increase in sexual assault and sexual harassment-related information-seeking behaviour in the wake of the #MeToo movement [4].

The present brief report aimed to utilise Irish Google search trends in the wake of the CervicalCheck revelations as a proxy for public interest. We predicted that Google searches relating to cervical testing and cancer would increase anomalously. We also expected that the increased public fear would, in turn, lead to increased information-seeking behaviour relating to primary prevention of cervical cancer, that is, the HPV vaccine.

Materials and methods

Data acquisition

This study monitored the volume of Google searches relating to the cervical smear test and HPV in Ireland over a five-year period (02/02/2014-20/01/2019). Data were extracted from the Google Trends website in January 2019 (https://trends.google.com/trends/). Initially, the term “cervical check” was investigated and several of the top related queries were then included in the analysis. These additional terms included “cervical smear test” and “cervical cancer”. In addition, trend data for “HPV vaccine” and “human papillomavirus infection” searches were assessed over the same time period. Google provides relative search volumes (RSV), which express search volume as a percentage of the highest search volume in the predetermined region and period. Although this limits the comparability of the data to other studies, it is entirely adequate for the purposes of the present study, as it is focused solely on the presence of positive peaks, or anomalies. Finally, General Practitioner (GP) complimentary consultation data in the aftermath of the CervicalCheck revelations were obtained from the CervicalCheck Steering Committee weekly reports to the Minister for Health [5].
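As a rough illustration of the scaling described above, the following Python sketch rescales a series so that its highest value becomes 100. It is not Google's actual procedure, which applies its own sampling and normalisation; the function name `relative_search_volume` and the weekly counts are hypothetical and chosen only for illustration.

```python
def relative_search_volume(raw_counts):
    """Rescale counts so the largest value in the window maps to 100.

    Illustrative only: this mimics the "percentage of the peak" idea,
    not Google Trends' internal normalisation.
    """
    peak = max(raw_counts)
    if peak <= 0:
        raise ValueError("at least one count must be positive")
    return [round(100 * count / peak) for count in raw_counts]


# Hypothetical weekly search counts for one term in one region.
weekly_counts = [120, 150, 90, 3000, 1800, 200]
print(relative_search_volume(weekly_counts))  # [4, 5, 3, 100, 60, 7]
```

Because every value is expressed relative to the peak within the chosen window, changing the region or period changes the whole series, which is why RSV values are hard to compare across studies.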

Statistical analysis

Data were imported into R through RStudio (v1.1.463) and prepared for analysis. The R packages “anomalize” (v0.1.1) and “tidyverse” (v1.2.1) were used for anomaly analysis and plotting, respectively [6-7]. “anomalize” allows the user to decompose a time series, detect anomalies in the dataset and create bands separating the non-anomalous data from the anomalous spikes. In this analysis, an alpha of 0.05 was considered significant. In addition, GraphPad Prism (v6 for Mac, GraphPad Software, San Diego, California) was utilised for plotting of data.
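The decompose-then-flag idea can be sketched in a few lines of Python. This is a simplified stand-in, not the R workflow used in the study: a centred rolling median replaces the package's time series decomposition, and the remainder is flagged when it falls outside the quartiles by more than a multiple of the interquartile range. The window length, the IQR factor of 3 and the weekly series are all assumptions made for illustration.

```python
import statistics


def rolling_median(values, window):
    """Centred rolling median; the window shrinks near the series edges."""
    half = window // 2
    trend = []
    for i in range(len(values)):
        start = max(0, i - half)
        stop = min(len(values), i + half + 1)
        trend.append(statistics.median(values[start:stop]))
    return trend


def flag_anomalies(values, window=13, iqr_factor=3.0):
    """Return the indices whose detrended value lies outside the IQR band.

    Assumed, simplified procedure: remainder = value - rolling median,
    band = [Q1 - k * IQR, Q3 + k * IQR] with k = iqr_factor.
    """
    trend = rolling_median(values, window)
    remainder = [v - t for v, t in zip(values, trend)]
    q1, _, q3 = statistics.quantiles(remainder, n=4)
    iqr = q3 - q1
    lower = q1 - iqr_factor * iqr
    upper = q3 + iqr_factor * iqr
    return [i for i, r in enumerate(remainder) if r < lower or r > upper]


# Hypothetical weekly RSV: a flat baseline with a sudden, decaying spike.
rsv = [3, 4, 3, 5, 4, 3, 4, 5, 4, 3, 100, 60, 20, 8, 4, 3, 5, 4, 3, 4]
print(flag_anomalies(rsv))  # [10, 11, 12]
```

A wider band (larger `iqr_factor`) flags fewer weeks, so the choice of threshold directly controls how long a spike is reported as anomalous.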

Results

The simplified timeline of the CervicalCheck revelations is presented in Figure 1, demonstrating the initial reporting in April 2018 and subsequent commencement of general practitioner consultations the following month. Several additional points of significance are represented in the timeline, including publication of the Scally Scoping Inquiry Progress Report in September 2018. This was a thorough external investigation examining the events surrounding the CervicalCheck failures, including the standards of laboratory testing and internal governance of the programme itself, led by Dr. Gabriel Scally, a Professor of Public Health at the University of the West of England and the University of Bristol. Finally, the timeline includes the death of a high-profile victim in October 2018, as well as re-ignition of public discontent in January 2019 due to lack of progress.

Figure 1. Major relevant events in the months following the disclosure of the CervicalCheck shortcomings.

GP, general practitioner

Before the revelations, Google searches relating to the CervicalCheck screening programme remained minimal, with modest non-anomalous peaks in January of each year. In response to the CervicalCheck failure disclosure, relative search volume for the terms “cervical check” and “cervical cancer” in Ireland increased sharply (p < 0.05; Figure 2A). This anomalous peak in interest was short-lived (4-5 weeks; Figures 2B-C), and interest in the terms did not reach anomalous heights again in the subsequent months. In turn, the Minister for Health announced complimentary GP visits and cervical smear tests for any women concerned about their cervical screening. In line with this, almost 350,000 smear tests were ordered in Ireland in 2018, representing a ~40% increase in the annual testing rate (Figure 2A; grey bars).

Figure 2. Anomaly analysis of cervical smear test-related Google searches and complimentary general practitioner visit uptake response.

[A] Detailed relative search volume of “cervical check” and commonly associated terms (lines), with reactionary uptake rates of free general practitioner consultations (bars). [B-E] Anomaly analysis of relative search volume for terms [B] “Cervical Check”, [C] “Cervical Cancer”, [D] “HPV Vaccine”, [E] “Human Papillomavirus Infection”, over the five-year period (2014-2019 inclusive) with anomalies identified by red data points. Relative search volume is standardised to the highest search volume within the timeframe for each individual plot. RSV, relative search volume; GP, general practitioner.

Due to the considerable media attention afforded to the anti-vaccination lobby groups in previous years, we observed a number of anomalous search weeks for “HPV vaccine” (Figure 2D) and “human papillomavirus infection” (Figure 2E) from August 2015 onwards. Searches for both terms peaked in September 2017 and failed to reach similar levels thereafter (Figure 2A). Despite the substantial peak in cervical testing/cancer-related searches, “HPV vaccine” and “human papillomavirus infection” searches reached just 56% (Figure 2D) and 78% (Figure 2E) of maximum search volume, respectively, in the period following the event. Furthermore, HPV-related search terms were never found within the top related queries for the terms “cervical check”, “cervical cancer” and “smear test” over the five-year period. This suggests that the Irish public did not seek information on HPV and cervical testing/cancer concurrently, and indicates that many may not entirely connect the two issues.
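The "percentage of maximum search volume in the period following the event" comparison amounts to dividing the highest value after the event by the highest value in the whole window. A minimal Python sketch, assuming a hypothetical series and event position (neither is taken from the study data):

```python
def post_event_peak_share(rsv, event_index):
    """Highest RSV at or after the event, as a percentage of the series maximum."""
    overall_peak = max(rsv)
    post_event_peak = max(rsv[event_index:])
    return 100 * post_event_peak / overall_peak


# Hypothetical RSV series: an earlier peak of 100, then the event at index 4.
hpv_rsv = [20, 35, 100, 40, 30, 45, 25]
print(post_event_peak_share(hpv_rsv, event_index=4))  # 45.0
```

A value well below 100 indicates that interest after the event never returned to its earlier peak, which is the pattern reported for the HPV-related terms.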

Discussion

The CervicalCheck controversy was a public event which was high on emotion and low on reliable information. This, in turn, led to a media storm which propagated anxiety amongst the population, as is indicated by the results of this study. In January 2019, a high-profile victim publicly criticised the government’s management of the issue, suggesting that there was a loss of momentum and interest in reform. Indeed, the results of this study indicate that public attention to the issue dwindled relatively rapidly to the baseline mean and did not reach comparable heights in the subsequent months, despite the publication of an expert scoping inquiry into the failings of the CervicalCheck screening programme in September 2018 (Figure 1). However, the tragic loss of one of the high-profile victims in October 2018 appears to have stirred moderate public interest, resulting in a single anomalous peak in “cervical cancer” searches (Figure 2A; pink line).

These data suggest that, although the CervicalCheck revelations triggered increased information-seeking behaviour with regard to cervical testing and cancer, the Irish public did not demonstrate a heightened awareness of the primary prevention scheme for the main etiological contributor to the disease. The HPV vaccination programme has encountered challenging times in the past two years and, as a result, the HPV Vaccine Alliance was established in 2017 to present the facts about HPV vaccination through factsheet and infographic-style outreach projects. Although efforts in educating the public appear to have stemmed the tide, the hangover from the anti-vaccination lobby groups is clear, as uptake rates remain wholly inadequate. The present brief report suggests that vaccine advocacy groups should address the apparent understanding deficit and utilise high-profile publicity to reiterate the connection between HPV and cervical cancer to the public, in order to promote the national primary prevention scheme. This type of advocacy should be clear, factual and presented in a manner which does not detract from the suffering of those who were failed by the CervicalCheck programme. Importantly, Ireland has recently announced intentions to begin offering universal HPV vaccination for both males and females during the first year of secondary school education beginning in September 2019, representing a step forward with prospective protection which should not be understated. Previous knowledge and the conclusions of the present study are summarised in Figure 3.

Figure 3. Current knowledge and report summary.

HPV, human papillomavirus

Google Trends data are a publicly available resource which appears to be a useful epidemiological and public health tool for assessing anomalous or seasonal information-seeking behaviour. The tool has recently been successfully applied to topics such as the seasonality of several diseases, as well as public interest in “cheap cigarettes” following US state increases in cigarette taxation [8-11]. These cases demonstrate just a few of the ways in which such a dataset may be interrogated and demonstrate the versatility of this open-access resource. However, there are several important limitations to this brief report. Firstly, Google search data are an approximation of public interest and, while they do not represent the sole source of public information, they are a significant one. Secondly, because internet use is more common among those <65 years of age, this metric may be biased by over-sampling of younger people. Finally, the data released by Google Trends are not absolute counts, but rather a representation of search volumes standardised relative to the highest search volume within the predetermined time and region.

Conclusions

In the wake of the CervicalCheck revelations, information-seeking behaviour regarding cervical testing and cancer was vastly increased in comparison to the previous five years. This anomalous spike in public interest immediately preceded a corresponding increase in uptake of complimentary GP consultations offered by the Department of Health to concerned women. Despite this increase in cervical testing/cancer interest and healthcare engagement, we did not observe a comparable and concurrent increase in searches relating to HPV infection and vaccination. This indicates that the public may not currently connect the two issues entirely and, therefore, further educational work is warranted to encourage primary prevention of HPV. Nonetheless, progress is being made, with universal HPV vaccination being offered to secondary school attendees of both sexes in Ireland from September 2019 onwards. To our knowledge, this is the first study to use Google Trends data to examine public information-seeking behaviour in relation to cervical cancer testing and HPV, or the temporal association thereof.

The authors have declared that no competing interests exist.

Human Ethics

Consent was obtained by all participants in this study

Animal Ethics

Animal subjects: All authors have confirmed that this study did not involve animal subjects or tissue.

References

  • 1. Human Papillomavirus-associated cancers - United States, 2008-2012. Viens LJ, Henley SJ, Watson M, et al. Morb Mortal Wkly Rep. 2016;65:661–666. doi: 10.15585/mmwr.mm6526a1.
  • 2. Rapid response to HPV vaccination crisis in Ireland. Corcoran B, Clarke A, Barrett T. Lancet. 2018;391:2103. doi: 10.1016/S0140-6736(18)30854-7.
  • 3. Irish mothers' intentions to have daughters receive the HPV vaccine. Fahy A, Desmond DM. Ir J Med Sci. 2010;179:427–430. doi: 10.1007/s11845-010-0501-7.
  • 4. Internet searches for sexual harassment and assault, reporting, and training since the #MeToo movement. Caputi TL, Nobles AL, Ayers JW. JAMA Intern Med. 2018;179:258–259. doi: 10.1001/jamainternmed.2018.5094.
  • 5. CervicalCheck Steering Committee: weekly report to the Minister for Health. 2018. https://www.gov.ie/en/collection/b7319a-cervicalcheck-steering-committee-weekly-reports-to-the-minister-for-/ Accessed: February 2019.
  • 6. Anomalize: tidy anomaly detection (package). 2018. https://cran.r-project.org/web/packages/anomalize/index.html Accessed: August 2019.
  • 7. ggplot2: Elegant Graphics for Data Analysis. Wickham H. New York, NY: Springer-Verlag; 2016.
  • 8. Is there seasonality in hypothyroidism? a Google Trends pilot study. Ilias I, Alexiou M, Meristoudis G. Cureus. 2019;11:3965. doi: 10.7759/cureus.3965.
  • 9. Seasonality of cellulitis: evidence from Google Trends. Zhang X, Dang S, Ji F, et al. Infect Drug Resist. 2018;11:689–693. doi: 10.2147/IDR.S163290.
  • 10. The utility of "Google Trends" for epidemiological research: Lyme disease as an example. Seifter A, Schwarzwalder A, Geis K, Aucott J. Geospat Health. 2010;4:135–137. doi: 10.4081/gh.2010.195.
  • 11. Google searches for "cheap cigarettes" spike at tax increases: evidence from an algorithm to detect spikes in time series data. Caputi TL. Nicotine Tob Res. 2018;20:779–783. doi: 10.1093/ntr/ntx143.

Articles from Cureus are provided here courtesy of Cureus Inc.