Abstract
In a recent issue of Cancer Cell, Carrot-Zhang et al. identified ancestry-specific molecular variants and expression changes among patients from The Cancer Genome Atlas (TCGA).1 Their study findings and limitations highlight the critical need to diversify populations represented in cancer genomics research.
In a recent issue of Cancer Cell, Carrot-Zhang et al. identified ancestry-specific molecular variants and expression changes among patients from The Cancer Genome Atlas (TCGA).1 Their study findings and limitations highlight the critical need to diversify populations represented in cancer genomics research.
Main Text
Ever since the “Philadelphia chromosome” was characterized as a leukemia-defining translocation, cancer has been considered a genetic disease. Molecular alterations can now inform diagnosis, treatment targets, and predictive and prognostic biomarkers across cancer types. Efforts such as The International Hapmap Project, the 1000 Genomes Project, and the Trans-Omics for Precision Medicine (TOPMed) Program have all emphasized the need to include participants from diverse international backgrounds, yet the vast trove of genomic and transcriptomic data in cancer genomics research continues to overwhelmingly represent patients of European ancestry. Of the major genomic datasets in oncology that offer publicly available ancestry information, The Cancer Genome Atlas (TCGA) includes 83% European ancestry patients compared to 6% each of African and East Asian patients,2 while the Pan-Cancer Analysis of Whole Genomes (PCAWG) includes 77% European ancestry patients compared to 16% East Asian and 5% African patients.3
Prior work on genomic associations between ancestry and cancer has focused largely on the direct relationship between genomic alterations and disparities in outcomes. For example, patients of African ancestry with cancer have worse overall survival in spite of similar or decreased incidence rates relative to patients of European ancestry.4 In breast cancer, while socioeconomic and cultural factors explain some of this disparity, women of African ancestry also suffer higher rates of the more aggressive triple-negative or HER2-positive subtypes, with at least 40% of this subtype difference estimated as genetically driven.5 However, it remains unknown why breast cancer subtypes show ancestry-specific differences as well as what biological factors drive evolving tumors toward a given subtype.6
Carrot-Zhang and colleagues are among only a handful of groups asking how genetic ancestry may influence cancer development. In their recently published article, they examined associations between ancestry (defined via population-specific genomic markers) and somatic (cancer-only) alterations, DNA methylation sites, messenger RNA (mRNA) expression, and microRNA (miRNA) expression across the 10,678 patients and 33 cancer types represented in TCGA. Their study expanded on prior work to include epigenetic and post-transcriptional regulation as well.1 In addition, they performed integrated analyses of these data types to study ancestry-specific expression quantitative trait loci (eQTLs) and cancer immunogenicity. They now demonstrate that across cancer genomics research, ancestry must be accounted for in experimental design.
Ignoring differential distributions of cancer-specific subtypes among patients of differing ancestral populations (for example, the higher prevalence of triple-negative subtype among African ancestry patients with breast cancer) introduces confounding, thus imperiling the translational significance of research findings. The Cancer Genetic Ancestry Atlas developed by Yuan et al. previously associated PIK3CA mutations with European ancestry and TP53 mutations with African ancestry.2 These relationships are now argued to be subtype specific, a pattern validated by studying breast tumors from patients in Nigeria as well as those from TCGA European and African ancestry patients.5,7 The interaction between TP53 mutations in patients of African ancestry and subtypes also associates with higher risk of recurrence.5
Also importantly, with their work on methylation and microRNAs, Carrot-Zhang and colleagues raise the larger question of how genes literally interact with geography and environment to influence cancer development. The group consistently saw more significant methylation sites and greater consistency in methylation when examining ancestry within specific cancer types compared to a pan-cancer analysis. Similarly, the largest effect sizes for ancestry-specific differences in miRNA expression were within specific cancer types. DNA methylation markers in cancer have been previously shown to perform well in differentiating geographic origins.8 Alterations in miRNAs expression, among many other functions, correlate with environmental exposures, illuminating a potential mechanism bridging ancestral genes and clinical cancer development that must be considered in further detail.9
The limitation consistently highlighted by Carrot-Zhang et al. across these interesting findings is power. To find genomic variants specific to African ancestry, the authors estimate 50% power to detect an association with a given cancer for a minimum odds ratio of 1.5, assuming a mutation frequency of 0.3. These standards will only detect the most obvious ancestry-specific associations. If we were to design an experiment to seek variants with ancestry-specific cancer associations at 80% power and a type-I error of 0.05, we would ideally need more than 15,000 patients of African ancestry to participate in research. The sum total of patients of African ancestry in TCGA is 651. That estimate does not even fully account for well-known analytic issues in genomics such as shorter linkage disequilibrium in patients of African ancestry (necessitating whole-genome sequencing to find disease-associated variants) and lack of full representation of African-specific variants in the reference human genome.10 Given this paucity of data on minority populations in the U.S., efforts such as the Polyethnic 1000 from the New York Genome Center highlight how to move toward improving annotated data on non-European ancestry cancer genomes.
In the United States, we are now being asked to face the inequity in our healthcare system head-on, driven by the concurrent pandemics of systemic racism and COVID-19. Vulnerable populations face significant potential harm both from the growing global burden of cancer and the fact that the majority of individuals currently diagnosed with cancer will not benefit from either precision oncology-based preventative risk assessment or targeted therapies. Scientific collaborations and mutually beneficial international partnerships at intersection of patient care and research can be leveraged to actively engage diverse populations of patients in genetics research across racial, geographic, and economic barriers. Now more than at any time in history, we have the responsibility and capacity not only to ask how we can best serve every cancer patient in the world using genomic research, but also to use science and technology to promote equity.
Acknowledgments
Declarations of Interests
P.S.R. declares no competing interests. O.I.O. is a co-founder and shareholder at CancerIQ. She also holds an advisory role and is a shareholder at Tempus, receives research support from Roche/Genentech and Color Genomics (institutional), and declares affiliations with Bio Ventures for Global Health.
References
- 1.Carrot-Zhang J., Chambwe N., Damrauer J.S., Knijnenburg T.A., Robertson A.G., Yau C., Zhou W., Berger A.C., Huang K.L., Newberg J.Y., Cancer Genome Atlas Analysis Network Comprehensive analysis of genetic ancestry and its molecular correlates in cancer. Cancer Cell. 2020;37:639–654.e6. doi: 10.1016/j.ccell.2020.04.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Yuan J., Hu Z., Mahal B.A., Zhao S.D., Kensler K.H., Pi J., Hu X., Zhang Y., Wang Y., Jiang J. Integrated analysis of genetic ancestry and genomic alterations across cancers. Cancer Cell. 2018;34:549–560.e9. doi: 10.1016/j.ccell.2018.08.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium Pan-cancer analysis of whole genomes. Nature. 2020;578:82–93. doi: 10.1038/s41586-020-1969-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.DeSantis C.E., Siegel R.L., Sauer A.G., Miller K.D., Fedewa S.A., Alcaraz K.I., Jemal A. Cancer statistics for African Americans, 2016: Progress and opportunities in reducing racial disparities. CA Cancer J. Clin. 2016;66:290–308. doi: 10.3322/caac.21340. [DOI] [PubMed] [Google Scholar]
- 5.Huo D., Hu H., Rhie S.K., Gamazon E.R., Cherniack A.D., Liu J., Yoshimatsu T.F., Pitt J.J., Hoadley K.A., Troester M. Comparison of Breast Cancer Molecular Features and Survival by African and European Ancestry in The Cancer Genome Atlas. JAMA Oncol. 2017;3:1654–1662. doi: 10.1001/jamaoncol.2017.0595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Dietze E.C., Sistrunk C., Miranda-Carboni G., O’Regan R., Seewaldt V.L. Triple-negative breast cancer in African-American women: disparities versus biology. Nat. Rev. Cancer. 2015;15:248–254. doi: 10.1038/nrc3896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Pitt J.J., Riester M., Zheng Y., Yoshimatsu T.F., Sanni A., Oluwasola O., Veloso A., Labrot E., Wang S., Odetunde A. Characterization of Nigerian breast cancer reveals prevalent homologous recombination deficiency and aggressive molecular features. Nat. Commun. 2018;9:4181. doi: 10.1038/s41467-018-06616-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hao X., Luo H., Krawczyk M., Wei W., Wang W., Wang J., Flagg K., Hou J., Zhang H., Yi S. DNA methylation markers for diagnosis and prognosis of common cancers. Proc. Natl. Acad. Sci. USA. 2017;114:7414–7419. doi: 10.1073/pnas.1703577114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Vrijens K., Bollati V., Nawrot T.S. MicroRNAs as potential signatures of environmental exposure or effect: a systematic review. Environ. Health Perspect. 2015;123:399–411. doi: 10.1289/ehp.1408459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sherman R.M., Forman J., Antonescu V., Puiu D., Daya M., Rafaels N., Boorgula M.P., Chavan S., Vergara C., Ortega V.E. Assembly of a pan-genome from deep sequencing of 910 humans of African descent. Nat. Genet. 2019;51:30–35. doi: 10.1038/s41588-018-0273-y. [DOI] [PMC free article] [PubMed] [Google Scholar]