Skip to main content
Health and Quality of Life Outcomes logoLink to Health and Quality of Life Outcomes
. 2007 Jan 8;5:3. doi: 10.1186/1477-7525-5-3

Measuring the psychosocial consequences of screening

John Brodersen 1,, Stephen P McKenna 2, Lynda C Doward 2, Hanne Thorsen 1
PMCID: PMC1770907  PMID: 17210071

Abstract

The last three decades have seen a dramatic rise in the implementation of screening programmes for cancer in industrialised countries. However, in contrast to screening for infectious diseases, most cancer screening programmes only have the potential to reduce mortality; they cannot lower the incidence of cancer in a population. In fact, most cancer screening programmes have been shown to increase the incidence of the disease as a consequence of over-diagnosis. A further dilemma of cancer screening programmes is that they do not distinguish between healthy people and those with disease. Rather, they identify a continuum of disease severity. Consequently, many healthy people who have abnormal screening tests are wrongly diagnosed. Indeed, studies have demonstrated that for each screening-prevented death from cancer, at least 200 false-positive results are given. Therefore, screening has the potential to be harmful as well as beneficial. The psychosocial consequences of false-positive screening results cannot be determined by diagnostic tests or by other technical means. Instead, patient reported outcome measures must be employed. To measure the outcomes of screening accurately and comprehensively patient reported outcome measures have to capture; the nature and extent of the psychosocial consequences and how these change over time. The outcome measures used must have high content validity and their psychometric properties should be determined prior to their use in the specific population. In particular it is important to establish unidimensionality, additivity and item ordering through the application of Item Response Theory.

Background

The history of medical screening began with that for syphilis and later for tuberculosis. Screening for infectious diseases served a dual purpose; to cure the patient and to reduce the incidence of the disease in the general population [1]. The World Health Organization (WHO) screening criteria published in 1968 highlighted major concerns with the occurrence of false-negative screening results; as people with undetected disease continue to be a source of infection [2]. The past three decades have seen a dramatic increase in the implementation of screening programmes for cancer in industrialised countries. However, in contrast to screening for infectious diseases, most cancer screening programmes only have the potential to reduce mortality; they cannot lower the incidence of cancer in a population. In fact, most cancer screening programmes have been shown to increase the incidence of the disease as a consequence of over-diagnosis or increase the incidence of conditions that are characterised as pre-cancers, dysplasia or atypical cells. [3-7]. It is important to acknowledge that by far the majority of these are harmless conditions that will never become invasive cancers [8-12].

Discussion

Cancer screening in a general population does not distinguish between healthy people and those with disease. Rather, it identifies a continuum of disease severity [13]. For example, a recent study by Taupin et al found that 45% of participants screened for colorectal cancer had either hyperplastic (benign) or adematous (pre-cancerous) polyps which were removed [14]. Most lay people would probably see the presence of a growth as being indicative of cancer and its removal as a positive event. However, as polyps rarely become malignant, their removal could actually be viewed as over-treatment of a harmless condition.

Another dilemma with any mass cancer screening programme is that screening tests tend to have low predictive power. Consequently, many healthy people having abnormal screening tests are wrongly diagnosed (termed false alarms or "false-positive" results). Indeed, studies have demonstrated that for each screening-prevented death from cancer, at least 200 false-positive results are given [7,15,16]. In the case of mammography screening, studies have shown that the receipt of a false-positive result has substantial negative psychosocial consequences for women. These can persist for up to three years after the screening procedure [17,18]. Clearly, medical screening has the potential to be as harmful as it is beneficial [19].

In response to this dilemma the WHO-criteria for screening have recently been updated. New criteria have been added concerning ethical aspects of the screening process, the psychosocial consequences of false-positive screening results and the need for fully informed consent [20].

The psychosocial consequences of false-positive screening results cannot be determined by diagnostic tests or by other technical means. Instead, patient reported outcome (PRO) measures must be employed. To measure the outcomes of screening accurately and comprehensively, PRO measures have to capture:

• the nature of the psychosocial consequences,

• the extent of the psychosocial consequences, and

• changes in psychosocial consequences over time.

The measures used must have high content validity [21]. This means that they must both cover relevant aspects of the construct being measured and exclude issues that are irrelevant. Qualitative research has shown that abnormal and false-positive cancer screening results have a negative impact on the following psychosocial domains; anxiety, fear, mood, behaviour, sleep, sexuality and social functioning [22-26]. Unfortunately, studies reporting on psychosocial aspects of cancer screening have mostly employed questionnaires that have poor content validity and/or that have not been validated for this purpose [14,27-29].

For example, Taupin and colleagues used the SF-36 to assess the impact of screening for colorectal cancer [14]. There were several flaws in the design of this study which resulted in an underestimation of the negative consequences of the screening process. First, the authors did not test the adequacy of the content of the SF-36 for this study population. It is well established that generic PROs such as the SF-36 do not necessarily work in a consistent manner across different populations [30] and the instrument's psychometric properties should have been explored to justify its use. It is not unreasonable to suppose that the SF-36 would have low content validity in the setting of colorectal cancer screening as it does not cover many of the most important issues related to screening and because it contains a high number of irrelevant items. Taupin and colleagues recorded 30 minor and two major adverse events from the 231 colonoscopies undertaken. However, it is doubtful whether any of the SF-36 items would be capable of capturing the thoughts or feelings of a healthy person who experienced an adverse event.

A major problem with the SF-36 scales is that their items were selected in order to have high scale consistency. However, internal consistency does not ensure that a scale is unidimensional; that is, that all of the items measure a single underlying construct and so can be added together to yield a total scale score. Good internal consistency merely suggests that the items are correlated [31]. Modern PROs are required to establish unidimensionality (or, in the case of multidimensional PROs, unidimensionality of subscales), additivity and item ordering through the application of Item Response Theory (IRT) [32]. The Rasch model (an IRT model) provides formal representation of perfect measurement. Where items are shown to fit a Rasch model the measure can be shown to posses criterion-related construct validity [33], to be objective [34], sufficient [35] and, therefore, also reliable [36]. IRT evidence indicates that the SF-36 scales are not unidimensional and that items in the subscales cannot validly be added together [37].

Participants in the study (who were asymptomatic) were ineligible for inclusion if they had:

• gastrointestinal symptoms requiring attendance at a primary care physician in the previous year,

• significant co-morbidity,

• a prior diagnosis of cancer,

• previous colonic surgery or therapeutic anticoagulation.

Such a group would be expected to have a significantly better health status than that of an age matched general population. In fact, the SF-36 failed to show any such differences, confirming its lack of sensitivity. Only 37.3% of those invited chose to participate in the screening study suggesting that the sample consisted of those who were most positive about screening. Such people would be expected to underestimate any negative psychosocial experiences because of the perceived benefits of the procedure.

Taupin et al only found differences post colonoscopy on the mood domains of vitality, emotional role limitations and mental health. However, even for these domains mean scores only increased between 1.9 and 4.4 points, well below the 10 to 20 points needed for a clinically meaningful improvement in health status on the SF-36 subscales [38].

Despite the evidence presented in their paper the authors concluded that: "Average-risk persons benefit significantly from colon cancer screening with colonoscopy, by improving in Mental Health and Vitality domains of Quality of Life". Such a conclusion is not justified. First, the psychosocial consequences of screening are best investigated in a randomised design. Secondly, it would be necessary to employ a PRO with good psychometric and scaling properties. It is essential that the PRO used has high content validity in order to capture the psychosocial consequences of screening accurately. Evidence of the unidimensionality of the collected data should also be reported. Finally, it is pertinent to ask whether it is ethical to give participants in screening exercises the impression that they will benefit from the process itself, given the absence of evidence supporting this conclusion and the availability of proof that false-positive results are common and have an adverse effect on well-being and health status [26,27].

Conclusion

At present it is far from clear that cancer screening in a general population is effective. Such screening has the potential to be as harmful as it is beneficial. It is equally important to investigate the harms of a screening test as its benefits. For example, potential reductions in mortality from cancer need to be contrasted with the psychosocial consequences of false-positive screening results. When measuring the adverse effects of screening it is necessary to employ PRO's that are relevant to the population and that have good psychometric properties. It is recommended that the Rasch model should be adopted as the 'gold standard' for determining the adequacy of a PRO. Only where the data collected fit the Rasch model can they be verified as being objective, sufficient and reliable.

Competing interests

JB and HT have conducted considerable research into the adverse impact on participants of false-positive tests in cancer screening and have developed outcome measures to determine these impacts. JB recently published a PhD thesis on this topic and is an advisor to the Danish National Board of Health on informed consent in breast cancer screening.

SPM and LCD have no competing interests.

Authors' contributions

JB drafted the manuscript and coordinated the production of the article. SPM, LCD and HT assisted in drafting the manuscript. All authors read and approved the final manuscript.

Contributor Information

John Brodersen, Email: j.brodersen@gpmed.ku.dk.

Stephen P McKenna, Email: smckenna@galen-research.com.

Lynda C Doward, Email: ldoward@galen-research.com.

Hanne Thorsen, Email: h.thorsen@gpmed.ku.dk.

References

  1. Morabia A, Zhang FF. History of medical screening: from concepts to action. Postgrad Med J. 2004;80:463–469. doi: 10.1136/pgmj.2003.018226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Wilson JMG, Jungner G. Principles and practice of screening for disease Geneva: World Health Organization. 1968.
  3. Mayor S. Study highlights insensitivity of PSA screening. BMJ. 2005;331:67–a. doi: 10.1136/bmj.331.7508.67-a. [DOI] [Google Scholar]
  4. Pashayan N, Powles J, Brown C, Duffy SW. Incidence trends of prostate cancer in East Anglia, before and during the era of PSA diagnostic testing. Br J Cancer. 2006;95:398–400. doi: 10.1038/sj.bjc.6603247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Raffle AE, Quinn M. Harms and benefits of screening to prevent cervical cancer. Lancet. 2004;364:1483–1484. doi: 10.1016/S0140-6736(04)17260-7. [DOI] [PubMed] [Google Scholar]
  6. Zackrisson S, Andersson I, Janzon L, Manjer J, Garne JP. Rate of over-diagnosis of breast cancer 15 years after end of Malmo mammographic screening trial: Follow-up study. BMJ. 2006;332:689–692. doi: 10.1136/bmj.38764.572569.7C. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Gotzsche PC, Nielsen M. Screening for breast cancer with mammography. Cochrane Database Syst Rev. 2006:CD001877. doi: 10.1002/14651858.CD001877.pub2. Review. [DOI] [PubMed] [Google Scholar]
  8. Nielsen M, Thomsen JL, Primdahl S, Dyreborg U, Andersen JA. Breast cancer and atypia among young and middle-aged women: a study of 110 medicolegal autopsies. British Journal of Cancer. 1987:814–9. doi: 10.1038/bjc.1987.296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Ottesen GL, Graversen HP, Blichert-Toft M, Christensen IJ, Andersen JA. Carcinoma in situ of the female breast. 10 year follow-up results of a prospective nationwide study. Breast Cancer Res Treat. 2000;62:197–210. doi: 10.1023/A:1006453915590. [DOI] [PubMed] [Google Scholar]
  10. Zahl PH, Andersen JM, Maehlen J. Spontaneous regression of cancerous tumors detected by mammography screening. JAMA. 2004;292:2579–80. doi: 10.1001/jama.292.21.2579. [DOI] [PubMed] [Google Scholar]
  11. Zahl PH, Strand BH, Mahlen J. Incidence of breast cancer in Norway and Sweden during introduction of nationwide screening: prospective cohort study. BMJ. 2004 doi: 10.1136/bmj.38044.666157.63. bmj. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Ostor AG. Natural history of cervical intraepithelial neoplasia: a critical review. Int J Gynecol Pathol. 1993;12:186–92. doi: 10.1097/00004347-199304000-00018. [DOI] [PubMed] [Google Scholar]
  13. Rose G, Barker DJ. Epidemiology for the uninitiated. What is a case? Dichotomy or continuum? BMJ. 1978;2:873–874. doi: 10.1136/bmj.2.6141.873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Taupin D, Chambers SL, Corbett M, Shadbolt B. Colonoscopic screening for colorectal cancer improves quality of life measures: a population-based screening study. Health Qual Life Outcomes. 2006;4:82. doi: 10.1186/1477-7525-4-82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Towler BP, Irwig L, Glasziou P, Weller D, Kewenter J. Screening for colorectal cancer using the faecal occult blood test, hemoccult. Cochrane Database Syst Rev. 2000;2:CD001216. doi: 10.1002/14651858.CD001216. Review. [DOI] [PubMed] [Google Scholar]
  16. Raffle AE, Alden B, Quinn M, Babb PJ, Brett MT. Outcomes of screening to prevent cancer: analysis of cumulative incidence of cervical abnormality and modelling of cases and deaths prevented. BMJ. 2003;326:901. doi: 10.1136/bmj.326.7395.901. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Brett J, Austoker J. Women who are recalled for further investigation for breast screening: psychological consequences 3 years after recall and factors affecting re-attendance. J Public Health Med. 2001;23:292–300. doi: 10.1093/pubmed/23.4.292. [DOI] [PubMed] [Google Scholar]
  18. Brett J, Bankhead C, Henderson B, Watson E, Austoker J. The psychological impact of mammographic screening. A systematic review. Psychooncology. 2005;14:917–938. doi: 10.1002/pon.904. [DOI] [PubMed] [Google Scholar]
  19. Barratt AL. Cancer screening. Benefits, harms and making an informed choice. Aust Fam Physician. 2006;35:39–42. [PubMed] [Google Scholar]
  20. Det Etiske Råd. Screening – a report. The Danish Council of Ethics. http://www.etiskraad.dk/sw307.asp
  21. McCaffery KJ, Barratt AL. Assessing psychosocial/quality of life outcomes in screening: How do we do it better? J Epidemiol Community Health. 2004;58:968–970. doi: 10.1136/jech.2004.025114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Lunde IM. "Jeg håber det bedste...". [Danish] Ringkøbing: Den Medicinske Forskningsenhed, Ringkøbing. 1997.
  23. Cockburn J, De LT, Hurley S, Clover K. Development and validation of the PCQ: a questionnaire to measure the psychological consequences of screening mammography. Soc Sci Med. 1992;34:1129–1134. doi: 10.1016/0277-9536(92)90286-Y. [DOI] [PubMed] [Google Scholar]
  24. Posner TN, Vessey M. Prevention of Cervical Cancer, the Patients View. London: King's Fund Publishing Office; 1988. [Google Scholar]
  25. Padgett DK, Yedidia MJ, Kerner J, Mandelblatt J. The emotional consequences of false positive mammography: African-American women's reactions in their own words. Women Health. 2001;33:1–14. doi: 10.1300/J013v33n03_01. [DOI] [PubMed] [Google Scholar]
  26. Brodersen J. Measuring psychosocial consequences of false-positive screening results – breast cancer as an example. Department of General Practice, Institute of Public Health, Faculty of Health Sciences, University of Copenhagen: Månedsskrift for Praktisk Lægegerning, Copenhagen; 2006. ISBN: 87-88638-36-7. [Google Scholar]
  27. Brodersen J, Thorsen H, Cockburn J. The adequacy of measurement of short and long-term consequences of false-positive screening mammography. J Med Screen. 2004;11:39–44. doi: 10.1258/096914104772950745. [DOI] [PubMed] [Google Scholar]
  28. Brodersen J, Thorsen H, Cockburn J. Validity of short-term consequences of cancer prevention and screening activities? J Clin Oncol. 2005;23:244–245. doi: 10.1200/JCO.2005.05.900. [DOI] [PubMed] [Google Scholar]
  29. Cullen J, Schwartz MD, Lawrence WF, Selby JV, Mandelblatt JS. Short-term impact of cancer prevention and screening activities on quality of life. J Clin Oncol. 2004;22:943–952. doi: 10.1200/JCO.2004.05.191. [DOI] [PubMed] [Google Scholar]
  30. Hobart JC, Williams LS, Moran K, Thompson AJ. Quality of life measurement after stroke: uses and abuses of the SF-36. Stroke. 2002;33:1348–1356. doi: 10.1161/01.STR.0000015030.59594.B3. [DOI] [PubMed] [Google Scholar]
  31. Cortina JM. What is coefficient alpha? An examination of theory and applications. J Appl Psychol. 1993;78:98–104. doi: 10.1037/0021-9010.78.1.98. [DOI] [Google Scholar]
  32. Tennant A, McKenna SP, Hagell P. Application of Rasch Analysis in the Development and Application of Quality of Life Instruments. Value Health. 2004;7:S22–S26. doi: 10.1111/j.1524-4733.2004.7s106.x. [DOI] [PubMed] [Google Scholar]
  33. Rosenbaum PR. Criterion-related construct validity. Psychometrika. 1989;54:625–33. doi: 10.1007/BF02296400. [DOI] [Google Scholar]
  34. Rasch G. An Informal Report on a Theory of Objectivity in Comparisons. In: Van der Kamp LJTh, Vlek CAJ, editor. An Informal Report on a Theory of Objectivity in Comparisons. Leyden: University of Leyden; 1967. pp. 1–19. [Google Scholar]
  35. Andersen EB. Sufficient Statistics and Latent Trait Models. Psychometrika. 1977;42:69–81. doi: 10.1007/BF02293746. [DOI] [Google Scholar]
  36. Bartholomew DJ. The Statistical Approach to Social measurement. San Diego: Academic Press; 1996. [Google Scholar]
  37. Raczek AE, Ware JE, Bjorner JB, Gandek B, Haley SM, Aaronson NK, Apolone G, Bech P, Brazier JE, Bullinger M, Sullivan M. Comparison of Rasch and summated rating scales constructed from SF-36 physical functioning items in seven countries: results from the IQOLA Project. International Quality of Life Assessment. J Clin Epidemiol. 1998;51:1203–1214. doi: 10.1016/S0895-4356(98)00112-7. [DOI] [PubMed] [Google Scholar]
  38. Gilbert C, Brown MCJ, Cappelleri J, Parpia T, McKenna SP. Establishing a minimally important difference in 6-minute walk distance and SF-36 among patients with pulmonary arterial hypertension. Chest. 2005;128:365S–a. [Google Scholar]

Articles from Health and Quality of Life Outcomes are provided here courtesy of BMC

RESOURCES