Skip to main content
The Journal of Infectious Diseases logoLink to The Journal of Infectious Diseases
letter
. 2021 Nov 23;225(7):1301–1302. doi: 10.1093/infdis/jiab571

Human Cytomegalovirus and Risk of Incident Cardiovascular Disease in UK Biobank

Tom A Yates 1,, Gareth J Griffith 2,3, Tim T Morris 2,3
PMCID: PMC8974855  PMID: 35024849

To The Editor—Hamilton et al’s [1] analysis of the association between cytomegalovirus (CMV) infection and incident cardiovascular disease (CVD) in 8531 UK Biobank (UKBB) participants is stronger than most observational studies of the CMV-CVD association. There was careful adjustment for age, sex, socioeconomic position, and cardiovascular risk factors. Neither inflammatory markers nor prior CVD were included as covariates, which would have been inappropriate. However, although the authors acknowledge that sections of the population are underrepresented in the UKBB, the potential impact of this on their main results is not discussed.

Nonrandom sample selection can lead to spurious in-sample associations, a form of collider bias. A schematic example is presented in Figure 1A, in which individuals with high CVD risk or high CMV exposure are less likely to be sampled. This induces a negative correlation between CVD and CMV in the sample that does not hold in the broader population. To evaluate the extent to which collider bias might explain Hamilton et al’s [1] null findings, we compared CVD risk and CMV exposure in UKBB participants to that in more representative population samples.

Figure 1.

Figure 1.

(A) Schematic demonstrating an in-sample negative correlation (filled circles and dashed line) that does not exist in the underlying population (all circles) may be generated by selectively including individuals with lower levels of both exposure and outcome. (B) Simulated data showing how in-sample associations (dashed line) may not reflect population level associations (line) if selected individuals (crosses) are a specific subset of the population (triangles).

UKBB is a highly selected sample. Of 9 million 40- to 69-year-olds invited to participate, only 5.5% enrolled [2]. The sample is enriched for individuals of higher socioeconomic position and white ethnicities (94.6% vs 91.3%) [2]. UKBB participants are less likely to be obese, drink less alcohol, and are less likely to smoke compared with more population representative respondents to the Health Survey for England (HSE) [2]. All-cause mortality rates among UKBB participants aged 70–74 years are 46% and 56% lower in men and women, respectively, than in the general population of England and Wales [2].

UKBB participants have lower self-reported CVD prevalence at enrollment [2], lower exposure to CVD risk factors [2, 3], and CVD mortality rates that are 2.9-fold lower than individuals of a comparable age in the HSE [3]. Previous analyses have shown that these selection effects can attenuate risk factor-CVD associations. For example, for hypertension, the hazard ratio for CVD mortality is 1.9 in UKBB and 2.6 in HSE [3].

Because CMV is more prevalent in those of lower socioeconomic position [4, 5], it is plausible that UKBB participants have lower CMV exposure than the general population. Cytomegalovirus seroprevalence appears lower in UKBB than in participants from a 2002 serosurvey using blood samples from individuals accessing National Health Service (NHS) care in England and Wales [6]. Age-specific CMV seroprevalence in the serosurvey was only presented graphically, preventing quantification of these differences.

Comparing the age-specific CMV seroprevalence reported by Hamilton et al [1] with that among pregnant women in the Born in Bradford (BiB) birth cohort [7] suggests markedly lower CMV prevalence in UKBB. Cytomegalovirus seropositivity is more prevalent in older individuals, which is partly a cohort effect, with most incident infections occurring in individuals under the age of 40 years [6]. Seroprevalence in 40- to 49-year-olds in UKBB is 48.0%, similar to that among pregnant white British women in BiB (48.6%), despite the latter having a mean age of 26.6 years [1, 7]. Cytomegalovirus is also more prevalent in people of non-white ethnicity [4, 5, 7]. Seroprevalence among United Kingdom-born pregnant women of South Asian ethnicity in BiB was 89.3% (mean age 27.4 years). Note that CMV seroprevalence in UKBB is similar in men (56.0%) and women (59.3%) [1].

It appears that UKBB participants have lower CVD and CMV prevalence than the general population. The potential impact of selection effects into UKBB on observed associations can be simulated. For example, selecting 5% of individuals with lower risk of both characteristics can produce a null in-sample association if the 2 characteristics have a Pearson’s correlation coefficient of only 0.2 in the broader population (Figure 1B). Here, both characteristics are treated as continuous variables, eg, CVD incidence and lifetime exposure to CMV. The code to reproduce this analysis under varying selection mechanisms is available at https://github.com/timtmorris/collider-bias.

Collider bias is a common and underappreciated problem in health research. Its potential to impact analyses in UKBB and other unrepresentative samples is well described [3, 8, 9]. This bias may be particularly consequential when investigating the impact of ubiquitous exposures on common diseases. Here, modest differences in attributable risk imply a substantial burden of disease.

The best solution to this problem would be to invest in making participation in research accessible to a broader section of the community. This includes access to healthcare services that routinely contribute data to, eg, Hospital Episodes Statistics. Where this has not been done, researchers should conduct their analyses in other less selected datasets or, where data exist on the determinants of inclusion, weight the sample and adjust for selection. This is an area of active methodological research [10].

Notes

Financial support. T. A. Y. is funded via a National Institute for Health Research (NIHR) Academic Clinical Fellowship (ACF-2018-21-007) and acknowledges support from the NIHR Imperial Biomedical Research Centre. The Medical Research Council (MRC) and the University of Bristol support the MRC Integrative Epidemiology Unit (MC_UU_00011/1, MC_UU_00011/3).

Potential conflicts of interest. T. A. Y. is currently applying for funding to undertake research on the indirect consequences of cytomegalovirus infection. All authors have submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Conflicts that the editors consider relevant to the content of the manuscript have been disclosed.

References

  • 1. Hamilton EM, Allen NE, Mentzer AJ, Littlejohns TJ.. Human cytomegalovirus and risk of incident cardiovascular disease in UK Biobank. J Infect Dis 2022; 225:1179–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Fry A, Littlejohns TJ, Sudlow C, et al. . Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am J Epidemiol 2017; 186:1026–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Batty GD, Gale CR, Kivimäki M, Deary IJ, Bell S.. Comparison of risk factor associations in UK Biobank against representative, general population based studies with conventional response rates: prospective cohort study and individual participant meta-analysis. BMJ 2020; 368:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Cannon MJ, Schmid DS, Hyde TB.. Review of cytomegalovirus seroprevalence and demographic characteristics associated with infection. Rev Med Virol 2010; 20:202–13. [DOI] [PubMed] [Google Scholar]
  • 5. Winter JR, Taylor GS, Thomas OG, Jackson C, Lewis JEA, Stagg HR.. Factors associated with cytomegalovirus serostatus in young people in England: a cross-sectional study. BMC Infect Dis 2020; 20:875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Vyse AJ, Hesketh LM, Pebody RG.. The burden of infection with cytomegalovirus in England and Wales: how many women are infected in pregnancy? Epidemiol Infect 2009; 137:526–33. [DOI] [PubMed] [Google Scholar]
  • 7. Pembrey L, Raynor P, Griffiths P, Chaytor S, Wright J, Hall AJ.. Seroprevalence of cytomegalovirus, Epstein Barr virus and varicella zoster virus among pregnant women in Bradford: a cohort study. PLoS One 2013; 8:e81881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Munafò MR, Tilling K, Taylor AE, Evans DM, Smith GD.. Collider scope: when selection bias can substantially influence observed associations. Int J Epidemiol 2018; 47:226–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Griffith GJ, Morris TT, Tudball MJ, et al. . Collider bias undermines our understanding of COVID-19 disease risk and severity. Nat Commun 2020; 11:5749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Griffith GJ, Hemani G, Herbert A, et al. . We should be cautious about associations of patient characteristics with COVID-19 outcomes that are identified in hospitalised patients. Available at: https://www.hdruk.org/news/we-should-be-cautious-about-associations-of-patient-characteristics-with-covid-19-outcomes-that-are-identified-in-hospitalised-patients/. Accessed 7 October 2021.

Articles from The Journal of Infectious Diseases are provided here courtesy of Oxford University Press

RESOURCES