Abstract
Age-related male Y and female X chromosome mosaicism is commonly observed in large population-based studies. To investigate the frequency of male X chromosome mosaicism, we scanned for deviations in chromosome X genotyping array intensity data in a population-based survey of 196,219 UK Biobank men. We detected 12 (0.006%) men with mosaic chromosome X gains ≥ 2 Mb and found no evidence for mosaic chromosome X loss, a level of detection substantially lower than for autosomes or other sex chromosomes. The rarity of chromosome X mosaicism in males relative to females reflects the importance of chromosome X gene dosage for leukocyte function.
Subject terms: Computational biology and bioinformatics, Genetics
Introduction
Genetic mosaicism is the presence of clonal populations of cells harboring post-zygotic mutations. The size of mosaic mutations ranges from single point mutations to large somatic copy number (or neutral) alterations (SCNAs) spanning an entire chromosome1–7. Genetic mosaicism is observed in normal tissue, with increasing levels detected as the sensitivity of detection approaches improves8–10. Recent evidence suggests SCNAs may be associated with elevated risk for chronic diseases like cancer11–14. Population-based studies have characterized age-related mosaic SCNAs in DNA of circulating leukocytes from apparently healthy individuals using genotyping array-based detection approaches3–7. Autosomal mosaicism is observed in approximately 5% of adults6,7, mosaic Y loss (mLOY) is estimated to affect over 15% of elderly men11–13, and female X chromosome mosaicism is observed at 4 times the rate of the autosomes15. Reports of female X chromosome mosaicism indicate events more often involve the entire chromosome and favor the inactivated X chromosome. This is likely due to the ability of female leukocytes to tolerate loss or gain of a Barr body15. Female tumor genomes also have elevated rates of mutation (2–4 × per Mb) on the inactivated X chromosome16.
The frequency of male X chromosome mosaicism has not been well-characterized in large populations. In females, who are diploid for the X chromosome, the loss or gain of an inactivated copy of the X chromosome can be tolerated. In hemizygous males, changes in X chromosome dosage, particularly losses, may be deleterious for cellular function and survival.
Results
We detected a lower than expected total of 12 (0.006%) mosaic gains, including one on the q arm of the X chromosome (Xq) and 11 spanning the entire X chromosome (Fig. 1, Supplementary Fig. S1). Most detected gains were at low cellular fractions, Log2 R Ratio (LRR)-estimated proportions of 11–18% (Supplementary Table S1). Only two detected mosaic chromosome X gains had median LRR ≥ 0.25, suggesting that it is rare to observe ≥ 20% of leukocytes affected by mosaic X chromosome gains (Supplementary Table S1). B allele frequency (BAF) plots indicated no heterozygosity apart from pseudoautosomal regions and the X-transposed region (Xq21) (Supplementary Figure S1), suggesting mitotic nondisjunction as the most likely origin; although revertant constitutional XXY isodisomy could be possible. We did not detect evidence for mosaic chromosome X loss in any men. We detected 24 (0.01%) men with constitutional XXY (Supplementary Table S2, Supplementary Figs. S2–S3), lower than the estimated 0.15% in the UK population17 likely due to the highly selected and generally healthy population participating in the UK Biobank18. We observed 3 cases with heterozygosity spanning the entire X chromosome suggesting paternal meiosis I errors or maternal meiosis I errors in the absence of recombination as the molecular origin of heterodisomy. An additional 10 cases of detected constitutional XXY had partial heterozygous BAF plots suggesting nondisjunction errors during maternal meiosis I or II when recombination previously occurred resulting in partial heterodisomy. The remaining 11 cases demonstrated no evidence for X chromosome heterozygosity indicating isodisomy because of female meiotic II errors in the absence of recombination or early post-zygotic mitotic errors. All XXY men had normal Y chromosome dosage, except for one individual (K2) who displayed LRR evidence for a Y chromosome gain (Supplementary Fig. S3).
Discussion
The rarity of X mosaicism in UK Biobank males (0.006%) likely reflects the deleterious impact on cell survival or clonal fitness of losing key cellular genes on a hemizygous chromosome. It is unlikely that the lack of evidence for detectable male chromosome X mosaicism, particularly loss of the entire chromosome, is primarily due to the analytical approach. We note that detection of mosaic events on the male X chromosome from array intensity data requires a specialized detection approach. Autosomal and female X chromosome mosaic events are detected from array intensity data using either segmentation of BAF data of heterozygous variants3–6 or a combination of BAF and genotype phase data7,19. The male X chromosome is hemizygous and does not have BAF values for heterozygous variants. Instead, variant LRR data is used for investigating sex chromosome (X and Y chromosome) mosaicism in men, as exemplified by several studies of mosaic chromosome Y loss11–14,20. Our detection of male chromosome X mosaicism is based on LRR data, with an additional step required for correction of the proportion of males and females present in the UK Biobank genotyping cluster files. Our LRR detection approach effectively centers median chromosome X LRR values around the expected value of zero and suggests that if males harbor X chromosome mosaicism, the fraction of abnormal cells is low and below the limit of detection of existing genotyping array-based approaches (i.e., < 10% cellular fraction). It is plausible that male X mosaicism is present in circulating leukocytes below levels of current detection, but this would involve a low fraction of circulating cells suggesting a challenge to leukocyte fitness.
The disparity between high rates of chromosome Y mosaicism, specifically mLOY, and the relative rarity of X mosaicism in men likely reflects the biological importance of gene function and dosage on these sex chromosomes. The X chromosome contains more than 1000 genes, many of which are established as essential in cellular function21,22, while the male specific region of the Y chromosome only encodes 27 proteins23. The lack of evidence for clonal fitness for mosaic loss of X chromosomes in men is striking and underscores the importance of single copy genes on the X chromosome. This is consistent with X chromosome evolutionary history in humans and other mammals, wherein purifying (negative) selection has accelerated the removal of deleterious alleles on the X chromosome relative to the autosomes24. Genomic investigations characterizing tumor DNA also demonstrate an absence of X nullisomy in cancer genomes; although chromosome Y loss is common25–27. The Y chromosome contains substantially fewer genes, many of which have roles in sex-determination and spermatogenesis21–23, but their loss can be tolerated with aging. Evidence from our large population-based investigation of men suggests chromosome X mosaicism is rare in blood; therefore, making it unlikely that male X mosaicism could have large attributable fractions for risk of common cancers or chronic diseases. Our findings underscore the importance of gene dosage in sex chromosomes, particularly in men.
Conclusions
Our analysis identified low frequencies of X chromosome mosaicism in circulating leukocytes of men. While cases of Klinefelter syndrome demonstrate that male cells can tolerate extra copies of the X chromosome, albeit with known phenotypic consequences such as elevated breast cancer rates28–30, our findings suggest clonal expansion of leukocytes harboring a change in X chromosome dosage is rare in the general male population and, when present, is a mosaic gain in usually less than 20% of leukocytes. These observational findings in UK Biobank men are in agreement with the biological hypothesis that X inactivation ensures proper X chromosome dosage in females as well as in cases of Klinefelter syndrome for normal cellular function31,32.
Materials and methods
We characterized the frequency of chromosome X mosaicism in males scanning for large-scale chromosome X mosaicism (≥ 2 Mb) in previously genotyped DNA samples from 196,219 male UK Biobank participants. The UK Biobank is a population-based cohort study representing individuals in the UK's National Health Service, who reside near a UK Biobank assessment center. UK Biobank participants provided baseline information on demographic, lifestyle, and other health-related factors, as well as biological samples and physical measures. The National Information Governance Board for Health and Social Care and the National Health Service North West Multicentre Research Ethics Committee approved the UK Biobank study and all participants provided written informed consent at enrollment. The study was conducted in accordance with recognized ethical guidelines (e.g., U.S. Common Rule). DNA samples were blood-derived with age at sample collection ranging from 37 to 73 years of age (mean = 56.76, median = 58). Male UK Biobank participants were previously genotyped using either Affymetrix UK BiLEVE or UK Biobank Axiom arrays18,33. All genomic and covariate data used in this publication is available through application for access through the UK Biobank (https://www.ukbiobank.ac.uk/register-apply/).
Log2 R Ratio (LRR) was used to detect possible mosaic gains and losses of the male X chromosome. LRR values are the log2 of the ratio of observed SNP intensity value to expected intensity value. Deviations in LRR values greater than expected baseline LRR suggest copy number gain and less than expected baseline LRR suggest copy number loss. We used two median LRR thresholds for detecting potential mosaic events that reflect different leukocyte proportions affected by X mosaicism: moderate: |LRR| ≥ 0.15 and high: |LRR| ≥ 0.25. As the X chromosome in males is hemizygous, it was not possible to investigate potential copy neutral mosaic events. We calculated median and standard deviation of LRR across all probes on the X chromosome and filtered out samples with high variability in LRR values (standard deviation of LRR ≥ 0.30). Our final analytic sample size consisted of 196,219 men.
As raw UK Biobank LRR values for males were calculated by combining intensity data together for both males and females (Supplementary Fig. S4), detection of male X mosaicism required generation of a male-specific adjusted LRR so baselines for males would be centered around 0 (Fig. 1) for correct copy number calling. We calculated male-specific LRR values for each genotyping probe on the X chromosome by taking the median of LRR values for each probe across all male samples. A baseline-corrected LRR was then generated by subtracting this male- specific baseline LRR from each male sample LRR in the genotyping cluster group. The male X chromosome baseline-corrected LRR values for each male participant were then segmented for mosaic events using the circular binary segmentation (CBS) algorithm in the R package, DNAcopy34 by applying the following workflow: (1) perform outlier detection and smoothing of the LRR data prior to analysis by circular binary segmentation, (2) segment baseline corrected LRR data into regions of estimated equal copy number using circular binary segmentation (CBS), (3) run the analysis with and without the “undo splits” option to remove change-points that are likely due to local trends (< 3 standard deviations apart), (4) filter to select for events with median LRR ≥ 0.15 and ≥ 0.25 and the number of probes ≥ 200 for possible gain as well as median of segmentation region ≤ − 0.15 and ≤ − 0.25 and the number of probes ≥ 200 for possible loss, (5) plot all detected potential gains and losses for manual review by a trained reviewer, and (6) check chromosome Y probes for all cases of potential gain of the X chromosome to ensure each case is karyotypically male.
Supplementary Information
Acknowledgements
Special thanks to Derek W. Brown for critical input on the manuscript. This research was supported by the Intramural Research Program of the National Cancer Institute. The opinions expressed by the authors are their own and this material should not be interpreted as representing the official viewpoint of the U.S. Department of Health and Human Services, the National Institutes of Health or the National Cancer Institute.
Author contributions
W.Z. analyzed the data, interpreted the primary results, drafted the manuscript. S.L. analyzed the data, drafted the manuscript. S.M.K. analyzed the data, drafted the manuscript. M.Y. interpreted the primary results, drafted the manuscript. S.J.C. interpreted the primary results, drafted the manuscript. M.J.M. supervised the study, interpreted the primary results, drafted the manuscript.
Data availability
All genomic and covariate data used in this publication is available through application for access through the UK Biobank (https://www.ukbiobank.ac.uk/register-apply/).
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-020-80948-0.
References
- 1.Jaiswal S, et al. Age-related clonal hematopoiesis associated with adverse outcomes. N. Engl. J. Med. 2014;371:2488–2498. doi: 10.1056/NEJMoa1408617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Genovese G, et al. Clonal hematopoiesis and blood-cancer risk inferred from blood DNA sequence. N. Engl. J. Med. 2014;371:2477–2487. doi: 10.1056/NEJMoa1409405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Jacobs KB, et al. Detectable clonal mosaicism and its relationship to aging and cancer. Nat. Genet. 2012;44:651–658. doi: 10.1038/ng.2270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Laurie CC, et al. Detectable clonal mosaicism from birth to old age and its relationship to cancer. Nat. Genet. 2012;44:642–650. doi: 10.1038/ng.2271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Forsberg LA, et al. Age-related somatic structural changes in the nuclear genome of human blood cells. Am. J. Hum. Genet. 2012;90:217–228. doi: 10.1016/j.ajhg.2011.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Machiela MJ, et al. Characterization of large structural genetic mosaicism in human autosomes. Am. J. Hum. Genet. 2015;96:487–497. doi: 10.1016/j.ajhg.2015.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Loh PR, et al. Insights into clonal haematopoiesis from 8342 mosaic chromosomal alterations. Nature. 2018;559:350–355. doi: 10.1038/s41586-018-0321-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Martincorena I, et al. Somatic mutant clones colonize the human esophagus with age. Science. 2018;362:911–917. doi: 10.1126/science.aau3879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Martincorena I, et al. Tumor evolution. High burden and pervasive positive selection of somatic mutations in normal human skin. Science. 2015;348:880–886. doi: 10.1126/science.aaa6806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Yizhak K, et al. RNA sequence analysis reveals macroscopic somatic clonal expansion across normal tissues. Science. 2019 doi: 10.1126/science.aaw0726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Zhou W, et al. Mosaic loss of chromosome Y is associated with common variation near TCL1A. Nat. Genet. 2016;48:563–568. doi: 10.1038/ng.3545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Forsberg LA, et al. Mosaic loss of chromosome Y in peripheral blood is associated with shorter survival and higher risk of cancer. Nat. Genet. 2014;46:624–628. doi: 10.1038/ng.2966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Loftfield E, et al. Predictors of mosaic chromosome Y loss and associations with mortality in the UK Biobank. Sci. Rep. 2018;8:12316. doi: 10.1038/s41598-018-30759-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Loftfield E, et al. Mosaic Y loss is moderately associated with solid tumor risk. Cancer Res. 2019;79:461–466. doi: 10.1158/0008-5472.Can-18-2566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Machiela MJ, et al. Female chromosome X mosaicism is age-related and preferentially affects the inactivated X chromosome. Nat. Commun. 2016;7:11843. doi: 10.1038/ncomms11843. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Jäger N, et al. Hypermutation of the inactive X chromosome is a frequent event in cancer. Cell. 2013;155:567–581. doi: 10.1016/j.cell.2013.09.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Service, N. H. Klinefelter Syndrome. https://www.nhs.uk/conditions/klinefelters-syndrome/ (2019).
- 18.Fry A, et al. Comparison of sociodemographic and health-related characteristics of UK biobank participants with those of the general population. Am. J. Epidemiol. 2017;186:1026–1034. doi: 10.1093/aje/kwx246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Vattathil S, Scheet P. Extensive hidden genomic mosaicism revealed in normal tissue. Am. J. Hum. Genet. 2016;98:571–578. doi: 10.1016/j.ajhg.2016.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Wright DJ, et al. Genetic variants associated with mosaic Y chromosome loss highlight cell cycle genes and overlap with cancer susceptibility. Nat. Genet. 2017;49:674–679. doi: 10.1038/ng.3821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ross MT, et al. The DNA sequence of the human X chromosome. Nature. 2005;434:325–337. doi: 10.1038/nature03440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Lahn BT, Page DC. Four evolutionary strata on the human X chromosome. Science. 1999;286:964–967. doi: 10.1126/science.286.5441.964. [DOI] [PubMed] [Google Scholar]
- 23.Skaletsky H, et al. The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes. Nature. 2003;423:825–837. doi: 10.1038/nature01722. [DOI] [PubMed] [Google Scholar]
- 24.Veeramah KR, Gutenkunst RN, Woerner AE, Watkins JC, Hammer MF. Evidence for increased levels of positive and negative selection on the X chromosome versus autosomes in humans. Mol. Biol. Evol. 2014;31:2267–2282. doi: 10.1093/molbev/msu166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sauter G, et al. Y chromosome loss detected by FISH in bladder cancer. Cancer Genet. Cytogenet. 1995;82:163–169. doi: 10.1016/0165-4608(95)00030-s. [DOI] [PubMed] [Google Scholar]
- 26.Yamada K, Kasama M, Kondo T, Shinoura N, Yoshioka M. Chromosome studies in 70 brain tumors with special attention to sex chromosome loss and single autosomal trisomy. Cancer Genet. Cytogenet. 1994;73:46–52. doi: 10.1016/0165-4608(94)90180-5. [DOI] [PubMed] [Google Scholar]
- 27.Wolman SR, Sanford J, Ratner S, Dawson PJ. Breast cancer in males: DNA content and sex chromosome constitution. Mod. Pathol. 1995;8:239–243. [PubMed] [Google Scholar]
- 28.Evans DB, Crichlow RW. Carcinoma of the male breast and Klinefelter's syndrome: Is there an association? CA Cancer J. Clin. 1987;37:246–251. doi: 10.3322/canjclin.37.4.246. [DOI] [PubMed] [Google Scholar]
- 29.Brinton LA. Breast cancer risk among patients with Klinefelter syndrome. Acta Paediatr. 2011;100:814–818. doi: 10.1111/j.1651-2227.2010.02131.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Hultborn R, et al. Prevalence of Klinefelter's syndrome in male breast cancer patients. Anticancer Res. 1997;17:4293–4297. [PubMed] [Google Scholar]
- 31.Lyon MF. Gene action in the X-chromosome of the mouse (Mus musculus L.) Nature. 1961;190:372–373. doi: 10.1038/190372a0. [DOI] [PubMed] [Google Scholar]
- 32.Kamischke A, Baumgardt A, Horst J, Nieschlag E. Clinical and diagnostic features of patients with suspected Klinefelter syndrome. J. Androl. 2003;24:41–48. doi: 10.1002/j.1939-4640.2003.tb03129.x. [DOI] [PubMed] [Google Scholar]
- 33.Sudlow C, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12:e1001779. doi: 10.1371/journal.pmed.1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Olshen AB, Venkatraman ES, Lucito R, Wigler M. Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics. 2004;5:557–572. doi: 10.1093/biostatistics/kxh008. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All genomic and covariate data used in this publication is available through application for access through the UK Biobank (https://www.ukbiobank.ac.uk/register-apply/).