Abstract
Objective:
Zidovudine (ZDV) has been extensively used in pregnant women to prevent vertical transmission of human immunodeficiency virus (HIV), but few studies have evaluated potential mutagenic effects of ZDV during fetal development.
Design:
Our study investigated clonal hematopoiesis in HIV-exposed uninfected (HEU) newborns, 94 of whom were ZDV-exposed and 91 antiretroviral therapy (ART)-unexposed and matched for potential confounding factors.
Methods:
Utilizing high depth sequencing and genotyping arrays, we comprehensively examined blood samples collected during the first week after birth for potential clonal hematopoiesis (CH) associated with fetal ZDV exposure, including clonal single nucleotide variants (SNVs), small insertions and deletions (indels), and large structural copy number or copy neutral alterations.
Results:
We observed no statistically significant difference in the number of SNVs and indels per person in ZDV exposed children (adjusted ratio (95% confidence interval) for expected number of mutations=0.79 [0.50, 1.22], P=0.3), and no difference in the number of large structural alterations. Mutations in common CH driver genes were not found in the study population. Mutational signature analyses on SNVs detected no novel signatures unique to the ZDV-exposed children and the mutational profiles were similar between the two groups.
Conclusions:
Our results suggest that CH at levels detectable in our study is not strongly influenced by in utero ZDV exposure; however, additional follow-up studies are needed to further evaluate the safety and potential long-term impacts of in utero ZDV exposure in HEU children as well as better investigate genomic aberrations occurring late in pregnancy.
Keywords: Zidovudine, clonal hematopoiesis, genotoxicity, HIV-exposed uninfected, nucleoside reverse transcriptase inhibitor
INTRODUCTION
Zidovudine (ZDV) was the first approved antiretroviral medication used for preventing the transmission of human immunodeficiency virus (HIV) from mother to child [1]. The availability of ZDV, especially as part of combination antiretroviral therapy (ART), has dramatically reduced vertical transmission of HIV [2]. However, the long-term safety of in utero exposure to antiretroviral medications in HIV-exposed uninfected (HEU) children remains a concern.
ZDV is a nucleoside reverse transcriptase inhibitor (NRTI) which competitively inhibits HIV reverse transcriptase and is incorporated into viral DNA, resulting in DNA chain termination [3]. NRTIs can also become incorporated into human nuclear and mitochondrial DNA [4]. Previous studies have shown potential impact of in utero exposure to ZDV and other NRTIs. For instance, Chenadec et al. observed transiently lower hemoglobin which disappeared within 3 months and reduction in neutrophil, lymphocyte, and platelet counts which persisted from birth to at least 18 months in the French Perinatal Study [5]. The European Collaborative Study reported that reduced neutrophil count could be observed in kids exposed to antiretroviral therapy even after 8 years [6]. Finally, Pacheco et al. demonstrated that changes in platelet and lymphocyte counts in children exposed to antiretroviral therapy persisted 2 years after birth [7]. In vitro studies also suggest ZDV could have genotoxic effects [8,9]. Furthermore, evidence from animal studies suggests ZDV could have transplacental effects and cause genetic abnormalities and genotoxicity [10–12]. Finally, aneuploidy rate in cord blood cells as well as expression of genes related to DNA repair have been shown to be altered in children with exposure to ZDV [13,14].
These hematologic and genotoxic effects have raised concerns regarding the long-term risk of cancer in HEU children exposed in utero to ZDV. The International Agency for Research on Cancer has classified ZDV as a probable carcinogen for humans (class 2A agent) [15]. As the population of surviving HEU children expands and ages, knowledge of potential genotoxic and carcinogenic effects of early-life exposure to ZDV is essential for weighing potential risks and benefits of in utero antiretroviral regimens that include ZDV or other NRTIs.
Clonal hematopoiesis (CH) refers to clonal expansion of a subset of leukocytes carrying somatic mutations [16]. These mutations range in size from a single nucleotide variant (SNV) to large somatic copy number alterations (SCNAs) at the chromosomal scale. The detection of CH relies on the size of event: SNVs require next-generation sequencing whereas SCNAs are better detected by virtual karyotyping methods using single nucleotide polymorphism (SNP) genotyping arrays. Besides being a characteristic of myelodysplastic syndromes as well as leukemia [17,18], CH has been found to be associated with cardiovascular disease [19], Alzheimer disease [20], and cancer [21–23]. Previous studies on CH have also found associations with chemotherapy [24], cigarette smoking [16,25] and air pollution [26] suggesting that CH can serve as a biomarker reflecting environmental exposure to a carcinogen or mutagen. However, no study has characterized potential adverse effects of ZDV exposure on fetal CH using currently available genomic technologies.
In this study, we investigate CH as a marker of genomic damage detected in peripheral blood mononuclear cell samples obtained from ZDV-exposed and, for comparison, ART-unexposed neonates during the first week after birth. We utilize high-depth DNA sequencing and SNP genotype array-based methods to comprehensively identify clonal SNVs, small insertions and deletions (indels), and large acquired SCNAs.
METHODS
Study participants
We started with 200 HEU children born to women living with HIV from two studies. One hundred newborns who were exposed to ZDV in utero for the longest duration (most of them in combination with other antiretrovirals) and born between 1995 and 2017 were selected from the Surveillance Monitoring for ART Toxicities (SMARTT) study of the Pediatric HIV/AIDS Cohort Study (PHACS) network. SMARTT enrolled children with detailed information available on in utero antiretroviral medication exposure from 22 US sites since 1995 [27,28]. The comparison group was comprised of ART-unexposed children who were frequency-matched to the exposure group by age, gender, and ethnicity mostly from the Women and Infants Transmission Study (WITS) (one ART-unexposed child from SMARTT). WITS enrolled newborns born during 1990–1994 and their mothers living with HIV from 6 US sites [29]. The current study shared the same ZDV-exposed and most ART-unexposed children with our prior study, and details of participant selection have been described previously [30]. We retrieved information from PHACS and WITS on gestational age, birthweight, maternal demographic and health information including HIV RNA levels and CD4 cell count before delivery, and antiretroviral use during pregnancy. After DNA extraction, 6 children from WITS and 4 from SMARTT were excluded due to insufficient DNA yield. Further examination of genomic data revealed that 1 child from WITS and 2 childern from SMARTT had evidence for contamination and were removed. An additional 2 children from WITS were excluded due to gender discordance, leaving 94 ZDV-exposed and 91 ART-unexposed HEU children included in the current analysis. Institutional Review Board approval was obtained at each participating center and all participating mothers and/or caregivers provided written informed consent.
Laboratory methods
Exome and untranslated region (UTR) sequencing
Whole exome sequencing (including neighboring UTRs) for the HEU newborns was performed at the Cancer Genomics Research Laboratory of the National Cancer Institute, as previously described [31]. In brief, 1.1 μg of genomic DNA was extracted from peripheral blood mononuclear cell samples utilizing magnetic-bead based DNA extraction and purification chemistry on the QIAsymphony SP according to the manufacturer’s validated protocols (QIAGEN, Germany). Exome+UTR capture was performed using SeqCap EZ Human Exome Library v3.0, (Roche NimbleGen, Inc., Madison, WI, USA). The captured DNA was coded with unique molecular index then underwent paired-end sequencing on an Illumina HiSeq following Illumina-provided protocols for 2×100-bp paired-end sequencing. Exome sequencing was conducted to high depth to achieve a targeted minimum threshold of 80% of coding sequence bases with coverage of over 250-fold, based on the University of California Santa Cruz (UCSC) hg19 “known gene” transcripts database (http://genome.ucsc.edu/)[32]. The exome data analyzed are now archived in a controlled-access dbGaP repository under the following accession number (phs002061).
Variant alignment, calling, and annotation
Details of variant alignment and calling methods have been published previously [31]. Briefly, sequencing reads were trimmed using the Trimmomatic tool [33] that reported the longest high-quality stretch of each read. Reads were then aligned to the reference genome (hg19) using the Novoalign software v3.00.05 (http://www.novocraft.com). To refine alignments, the following reads were removed from further analysis: 1) duplicate reads (using the MarkDuplicates module of the Picard software (v1.126) (http://picard.sourceforge.net/)), 2) sequence pairs not in complementary directions when mapped to the reference genome, and 3) sequence reads not reflecting the expected fragment length (300 ± 100 bp). A local realignment around known and novel sites of insertion and deletion mutations was performed using the RealignerTargetCreator and IndelRealigner modules from the Genome Analysis Toolkit (GATK v3.1) [34]. BAM file level recalibration was also performed using BaseRecalibrator module from GATK. Variants were identified by HaplotypeCaller and UnifiedGenotyper modules from GATK as well as FreeBayes [35]. All called variants were annotated by snpEff with a focus on predicted medium and high impact mutations in subset analyses [36].
Identification of somatic SNVs and small indels from sequence variant calls
Variant calls were further filtered by the following criteria: 1) must be within 5 bp of designed exome probe capture region, 2) must reside within coding regions of the genome, and 3) must be called by HaplotypeCaller, UnifiedGenotyper, and FreeBayes. We defined SNVs and small indels as variants which 1) had two or more reads for the alternative allele to remove potential sequencing noise or artifacts, 2) had a minor allele frequency ≤ 0.1% in Exome Aggregation Consortium (ExAC) [37], NHLBI GO Exome Sequencing Project (ESP) [38], 1000 Genomes Project phase 3 [39] and gnomAD [40] databases which suggests the called variants were not a common germline variant, 3) had a minor allele frequency < 5% in the current sequenced study population to exclude potential sequencing artifacts, and 4) were not in highly repetitive regions (which often result in poorly mapped reads and false variant calls). The highly repetitive regions include a) loci which failed Hardy-Weinberg equilibrium in the 1000 Genomes Project [39], b) regions with unusually high read depth reported previously [41], c) low complexity regions identified by mdust [42], and d) highly duplicated regions [43]. Finally, we created a theoretical distribution of variant allele frequencies (VAFs) by symmetrically mirroring the observed distribution of variants between VAF of 0.50 and 0.98 for VAFs < 0.5. A variant with VAF < the 0.25% quantile of this theoretical distribution was classified as a detectable somatic SNV or indel. VAF is defined as the percentage of reads containing a specific alternative allele divided by the overall coverage for that locus.
Characterization of somatic SNV mutational signatures
Somatic SNV calls from each ZDV exposure group were aggregated and used as input for SigProfilerMatrixGenerator [44] to identify potential de novo and known single base substitution (SBS) mutational signatures (COSMIC Mutational Signatures v3) [45]. Results from both the de novo and decomposed mutational signatures analyses were visualized using tidysig [46] and compared between ZDV-exposed and ART-unexposed children. We analyzed all 96 trinucleotide mutational contexts as well as the 6 single nucleotide SNV somatic substitutions by principal component analysis to explore potential clustering of samples by ZDV exposure group. Binomial tests of each mutational context and Chi-squared tests for overall association between ZDV exposure and mutational context were employed to identify potential differences in mutational signature. Mutational signatures were further confirmed with SigneR [47].
SNP array genotyping and somatic copy number alteration detection
Among the 185 individuals, 184 had enough DNA for genotyping, and 5 samples failed QC leaving 179 genotyped. Genome‐wide genotyping was independently performed using Illumina Global Screening Arrays (version 1). We estimated log2 R ratio (LRR) and B allele frequency (BAF) values from raw array intensity data to detect somatic copy number change or copy neutral alterations sized > 100kb as described previously [48]. The LRR and BAF values were re-calculated after quantile normalization [49], then analyzed utilizing the MoChA [50] framework to systematically detect deviations in LRR and BAF values which indicate the presence of somatic copy number or copy neutral alterations. All resulting somatic calls were manually inspected to ensure high true positive rates.
Statistical analysis
Fisher’s exact tests using the mid-p method [51] and Wilcoxon rank sum tests were performed to compare characteristics of ZDV-exposed and ART-unexposed children. Significance level for multiple comparison was determined by Bonferroni correction. We employed negative binomial regression to evaluate associations between ZDV exposure and number of detectable clonal SNVs, in univariable models and multivariable models adjusting for infant’s sex, race/ethnicity, gestational age, and maternal use of alcohol, marijuana, and tobacco. As only 2 SCNA events were identified, we did not conduct statistical comparisons. All statistical analyses and visualizations were performed using R v4.0.2 [52].
RESULTS
Characteristics of the 185 participants are described in Table 1. No statistically significant differences were observed in infants’ sex, race/ethnicity, and mothers’ use of tobacco, alcohol, marijuana, or illicit drugs during pregnancy between the two groups. ZDV-exposed children were younger than ART-unexposed children (median 38.3 versus 39 weeks, P=0.021) and were slightly lighter in birth weight (median 3 versus 3.13 kg, P=0.045). Mothers of ZDV-exposed children were older at delivery (median 28.6 versus 26 years, P<0.001) and had lower CD4+ T-cell count (median 502 versus 577.5 cells/ul, P=0.015) and HIV viral load (median 48 versus 9684 copies/ml, P<0.001).
Table 1.
Study participant characteristics of the newborn children in ZDV-exposed and ART-unexposed groups used for genomic characterization in our study of CH.
| Variable | ZDV-exposed, N=94 | ART-unexposed, N=91 | P |
|---|---|---|---|
| Birth weight (kg), median (range) | 3 (1.74, 4.38) | 3.13 (1.91, 4.60) | 0.045 |
| Gestational age (weeks), median (range) | 38.3 (32.6, 42.1) | 39.0 (3.20, 43.0) | 0.021 |
| Female, n (%) | 49 (52.1) | 46 (50.6) | 0.83 |
| Group, n (%) | <0.001 | ||
| Public WITS | 0 (0) | 90 (98.9) | |
| SMARTT w/o WITS | 93 (98.9) | 1 (11) | |
| SMARTT WITS | 1 (11) | 0 (0) | |
| Year of birth, n (%) | <0.001 | ||
| 1990–1994 | 0 (0) | 88 (96.7) | |
| 1995–2017 | 94 (100) | 3 (3.3) | |
| Mothers age at delivery (years), median (range) | 28.58 (16.92, 44.97) | 26 (15, 40) | <0.001 |
| Participant’s race/ethnicity, n (%) | 0.36 | ||
| White | 6 (6.4) | 13 (14.3) | |
| African-American | 31 (33.0) | 40 (44.0) | |
| Hispanic | 55 (58.5) | 32 (35.2) | |
| Other | 2 (2.1) | 6 (6.6) | |
| Days from sample draw to birth, median (range) | 1 (0, 7) | 1 (0, 7) | 0.072 |
| Last CD4+ cell count during pregnancy (cells/μl), median (range) | 502 (34, 1198) | 577.5 (74, 2330) | 0.015 |
| Last HIV RNA level during pregnancy (copies/ml), median (range) | 48 (0, 64361) | 9684 (0, 672005) | <0.001 |
| <400 | 82 (88.2) | 10 (11.0) | <0.001 |
| ≥400 | 11 (11.8) | 81 (89.0) | |
| Mother ever used tobacco, n (%) | 26 (27.7) | 32 (35.2) | 0.28 |
| Mother ever used alcohol, n (%) | 25 (26.6) | 32 (35.2) | 0.21 |
| Mothers ever used marijuana/hashish, n (%) | 13 (13.8) | 8 (8.8) | 0.29 |
| NNRTIs use during pregnancy, n (%) | 18 (19.2) | 0 (0) | <0.001 |
| PIs use during pregnancy, n (%) | 70 (74.5) | 0 (0) | <0.001 |
Footnote:
NNRTI: Non-Nucleoside Reverse Transcriptase Inhibitor; PI: Protease Inhibitor.
Clonal somatic single nucleotide variants and small indels
We identified a total of 80 clonal somatic SNVs and 2 indels in ZDV-exposed and 93 somatic SNVs and 9 indels in ART-unexposed children. Mean mutation burden per person was 0.01 (range 0 to 0.11) in ZDV-exposed and 0.02 (range 0 to 0.16) mutations/Mb in ART-unexposed infants. The range of VAF for detectable clonal SNVs and indels ranged from 2.5% to 30.3% (Figure S1). The overall mean number of detected somatic SNVs or indels per participant was 0.94 with the mean count among ZDV-exposed children of 0.84 and among ART-unexposed children of 1.03 (Figure 1, P=0.3). In univariable negative binomial regression analyses, we did not observe significant associations between the number of clonal SNVs and indels with ZDV exposure (P=0.36) or with sex, race/ethnicity, substance use, or mother’s last measured viral load or CD4+ T-cell count (Table S1). In a multivariable model adjusting for infant’s sex, race/ethnicity, gestational age, and maternal use of alcohol, marijuana, and tobacco, the somatic SNV or indel count appeared lower in ZDV-exposed than ART-unexposed children but the difference was non-significant (adjusted ratio for mutation counts = 0.79 [95% CI 0.50 to 1.22], P=0.3, Table 2). We also investigated somatic SNVs and indels with potential high or medium impact predicted by snpEff (Tables S2). Among the top 50 mutated genes with detected mutations in our sample set, 34 genes had at least one medium or high impact somatic SNVs or indels detected; 29 and 11 of these top genes had medium or high impact mutations in ZDV-exposed and ART-unexposed infants, respectively.
Figure 1.
Number of somatic SNVs and indels combined in peripheral blood mononuclear cell derived DNA. Dotted lines indicate median for each group.
Table 2.
Multivariable associations of the number of whole-exome detected clonal SNVs and indels with zidovudine exposure and other descriptive characteristics in both ZDV-exposed and ART-unexposed newborn children.
| Characteristic | Adjusted ratio for expected mutation counts [95%CI] | P |
|---|---|---|
| Zidovudine exposure | 0.79 [0.50, 1.22] | 0.30 |
| Gestational age (week) | 0.99 [0.88, 1.12] | 0.87 |
| Infant’s Male sex | 0.78 [0.50, 1.23] | 0.29 |
| Infant Non-African American race | 1.08 [0.68, 1.72] | 0.74 |
| Maternal Tobacco Use | 0.96 [0.58, 1.56] | 0.85 |
| Maternal Marijuana Use | 1.36 [0.68, 2.74] | 0.38 |
| Maternal Alcohol Use | 0.85 [0.51, 1.42] | 0.51 |
We curated a list of 37 CH driver genes (Table S3) commonly observed to be somatically mutated in studies of clonal hematopoiesis [53–55]. We did not detect the presence of any clonal SNV or indel in these genes.
To compare potential mutational processes and characteristic signatures between ZDV- exposed and ART-unexposed children, we analyzed SBS mutational signatures (Figure 2). Results were highly similar with no visible overall differences in mutational context by fetal ZDV exposure status. We likewise did not observe differences based on principal component analysis (Figure S2). In addition, we did not identify enrichment based on 96 SBS groups (Table S4) or mutational contexts (Table S5, Figure 2) adjusted by Bonferroni corrected thresholds. Finally, we further performed mutational signature analyses to identify the contribution of known signatures to the somatic SNVs detected in our study. We identified SBS signatures 1 and 5 (two clock-like mutational signatures [56]) as the predominant signatures in both groups. We also found SBS signature 15 which is one of the mutation signatures associated with defects in DNA mismatch repair (MSI). However, we did not observe any mutational signatures exclusively in the ZDV-exposed individuals and found no evidence suggesting a novel signature associated with ZDV exposure.
Figure 2.
Mutation count by base context for somatic SNVs detected by deep coverage whole exome sequencing. Top panel: 94 ZDV-exposed infants. Lower panel: 91 ART-unexposed infants. Footnote: *: Base mutations are displayed in the reference>alternate format (e.g., C>A represents a C to A mutation). The X axis shows base context in which the base pair before and after the mutation are displayed. The Y axis represents the total number of mutations detected from each base context.
Detected structural somatic copy number and copy neutral alterations
We detected 2 clonal SCNAs in the peripheral blood mononuclear cell-derived DNA in ART-unexposed and none in ZDV-exposed children. The first child had a partial duplication of chromosome 6 (Figure 3a) that impacted a small fraction of sampled leukocytes (approximately 5%). And the second child was a female with a total loss of one copy of the X chromosome that impacted approximately 62% of sampled leukocytes (Figure 3b).
Figure 3.
Log intensity ratio (LRR) and B-allele frequency (BAF) of detected SCNA by SNP arrays. (a) Detected mosaic SCNA on chromosome 6 from one participant. (b) Detected SCNA on chromosome X from another participant. Footnote: *: Black dots denote LRR of each SNP with scale on the left (Y1 axis), and red dots denote BAF of each SNP with scale on the right (Y2 axis). Three red bands represent BAF of SNPs with homozygous major alleles (bottom), heterozygous alleles (middle), and homozygous minor alleles (top). (a) Vertical grey lines and the horizontal blue line indicate the region with detected SCNA event (from 9,661,265 to 39,016,096). The two panels to the right are the probability density of BAF (red) and LRR (black) in normal (norm) and SCNA (Seg1) regions. The two black lines are the modes of two potentially overlapping BAF distributions, and the green shaded area shows the 95% confidence interval for the predicted mode. Based on the mode of BAF, about 5% of the cells carry this SCNA. (b) Both LRR and BAF suggested the event covered the whole chromosome. The decreased intensity in LRR suggested copy number loss, and the modes of BAF indicated that 62% of sampled cells carried this event.
Discussion
We employed both deep whole-exome sequencing and genome-wide genotyping arrays to investigate CH as a potential result of in utero ZDV exposure in 94 newborn HEU children and 91 matched ART-unexposed newborns. With high depth, broad coverage, and integration of unique molecular index, we were capable of detecting clonal mutations of low frequency across the whole exome. We did not observe statistically significant differences in the number of somatic SNV and indels among children who were exposed in utero to ZDV. Further analyses found that neither the mutational spectrum nor the underlying mutational signatures differed between the ZDV-exposed and ART-unexposed children. Our genome-wide scan for SCNAs identified two individuals with SCNAs in the ART-unexposed group. Cumulatively, these results suggest in utero exposure to ZDV has limited impact on the types and frequencies of clonal somatic mutations in peripheral leukocytes detectable by our approaches.
The nonsignificant ratio reduction in somatic SNV and indels in ZDV-exposed children compared to ART-unexposed controls (mean count of 0.84 vs 1.03, P=0.3) is intriguing, but we did not find evidence from mutational context or mutation signature analyses for any ZDV-specific biologic mechanisms driving this slight reduction among ZDV-exposed children. We also did not observe any instance of CH within commonly observed CH driver genes. Interestingly, the genes observed to be somatically mutated in our investigation often contained a large proportion of medium or high impact mutations, suggesting these mutations may have deleterious impacts on leukocyte function.
We detected 2 ART-unexposed children with mosaic aneuploidies but found no evidence for aneuploidies among the ZDV-exposed children. These findings differ from those reported by André-Schmutz et al. [14] and Vivanti et al. [13] in which an increase in aneuploidy was observed in cord blood at birth in children exposed in utero to ART containing ZDV. The observed differences could be attributable to different laboratory methods: André-Schmutz et al. and Vivanti et al. used cytogenic approaches to assess cell karyotypes for aneuploidy, whereas our analysis of genotyping array intensity data is equivalent to creating a “virtual karyotype” [57]. Moreover, virtual karyotyping allows for an assessment of all circulating peripheral leukocytes, while the cytogenic karyotyping analyses by André-Schmutz et al. and Vivanti et al focused exclusively on CD3+ T cells. Although virtual karyotyping is not sensitive enough to capture all genomic aberrations down to single cell level, the MoChA algorithm is sensitive to detect clonal events carried by clones which disproportionately expand relative to non-mutated cells to a detectable cellular fraction of 1% when the event spans beyond 100Mb. For events as small as 1Mb, the minimum detectable clonal fraction is around 10%, and there was no significant difference in the detection of gain or loss except for potential false positive from constitutional gain events which is removed in the pre-processing step of MoChA [50]. The survival and proliferation of such clones provide preliminary indication that these genomic aberrations might confer some functional advantage. The absence of detectable mosaic aneuploidy among ZDV-exposed children in our investigation cannot entirely rule out the presence of elevated aneuploidy in children exposed in utero to ZDV, but suggests that even if elevated rates of aneuploidy exist, they have not resulted in disproportionate growth of aberrant clones at the time point we sampled for DNA collection.
Our study used both deep whole exome sequencing and SNP genotyping arrays to detect CH in blood-derived DNA collected from newborns during the first week after birth. The use of high-depth sequencing allowed for us to detect CH clones with SNV or indel VAFs ranging from 2.5% to 30.3%. Likewise, intensity data from genotyping arrays allowed us to generate virtual karyotypes for detecting mutated cell fractions for SCNAs ranging from approximately 5% to 95%. While our genomic approaches were able to thoroughly characterize CH in ZDV-exposed and ART-unexposed HEU children reaching the frequencies described above, our methods were unable to detect SNVs or indels above the VAF threshold which is indistinguishable from germline mutations or low-frequency clones below our limits of detection (e.g., mutations in single cells). All of these somatic mutations, while not studied by CH investigations, could have associations with in utero ZDV exposure with potential relevance for future disease risk (i.e., elevated hematologic cancer risk), but we have limited ability to examine this hypothesis in the current study. Furthermore, our study was a snapshot at birth which might not be the best time point to evaluate the impact of mosaic events ocurred in late pregnancy. Besides, selective forces during infancy, adolescence and adulthood may promote clonal expansion of low-frequency clones currently below our window of detection to become detectable CH later in life. Additionally, our study only investigated somatic mutations in blood-derived DNA. Mutations could be present in other tissues and have relevance for increased disease risk such as cancer or cardiovascular diseases in relation to in utero ZDV exposure. Future studies should utilize single cell sequencing approaches to determine excess frequency of mutational load and evaluate specific components and combinations of ART in greater detail. Finally, with an overall clonal mutation rate of 0.94/person during gestation and approximately balanced design of 91 and 94 individuals, the power of our study to detect a 2-fold increase in mutation rate was excellent (94%, with two-sided significance level of 0.05). However, the power to detect smaller differences was lower. For instance, if the actual increase in mutation rate is 1.6 fold, our study’s power was only 65%.
The current study focused on the comparison between ZDV-exposed and ART-unexposed individuals. A particular challenge of investigating potential genotoxic impacts of in utero ART exposure is the varied content of drugs that comprise the treatment. Different combinations of antiretroviral drugs might have varied impact on the development of somatic mutations and clonal selection of mutated cells needed for the detection of CH. While modern ART regimens often replace ZDV with newer generation of NRTIs, they still share similar mechanisms of action as ZDV. Therefore, our study on ZDV could have implications for the safety of more recent NRTIs included in the ART regimens. Consistent with our findings which suggest that ZDV exposure did not resulted in detectable abnormal clonality in blood cells, the French Perinatal Study also failed to detect changes in risk of blood cancers from ages of 0–15 [58].
In conclusion, we did not find statistically significant evidence for the contribution of in utero ZDV exposure to CH in HEU children. Future larger long-term follow-up studies are warranted to better assess genotoxicity and to ensure the long-term safety of in utero NRTI exposure.
Supplementary Material
Acknowledgements
We thank the children and families for their participation in WITS and PHACS, and the individuals and institutions involved in the conduct of WITS and PHACS. Data management services were provided by Frontier Science and Technology Research Foundation (PI: Suzanne Siminski), and regulatory services and logistical support were provided by Westat, Inc (PI: Julie Davidson). The following institutions, clinical site investigators and staff participated in conducting PHACS SMARTT in 2017, in alphabetical order: Ann & Robert H. Lurie Children’s Hospital of Chicago: Ellen Chadwick, Margaret Ann Sanders, Lynn Heald, Ruthellen Williams; Baylor College of Medicine: William Shearer, Mary Paul, Norma Cooper, Lynnette Harris; Bronx Lebanon Hospital Center: Murli Purswani, Emma Stuard, Mahboobullah Mirza Baig, Alma Villegas; Children’s Diagnostic & Treatment Center: Ana Puga, Dia Cooley, Patricia A. Garvie, James Blood; New York University School of Medicine: William Borkowsky, Sandra Deygoo, Marsha Vasserman; Rutgers - New Jersey Medical School: Arry Dieudonne, Linda Bettica, Juliette Johnson; St. Jude Children’s Research Hospital: Katherine Knapp, Kim Allison, Megan Wilkins, Jamie Russell-Bell; San Juan Hospital/Department of Pediatrics: Nicolas Rosario, Lourdes Angeli-Nieves, Vivian Olivera; SUNY Downstate Medical Center: Stephan Kohlhoff, Ava Dennie, Ady Ben-Israel, Jean Kaye; Tulane University School of Medicine: Russell Van Dyke, Karen Craig, Patricia Sirois; University of Alabama, Birmingham: Marilyn Crain, Paige Hickman, Dan Marullo; University of California, San Diego: Stephen A. Spector, Kim Norris, Sharon Nichols; University of Colorado, Denver: Elizabeth McFarland, Emily Barr, Christine Kwon, Carrie Chambers; University of Florida, Center for HIV/AIDS Research, Education and Service: Mobeen Rathore, Kristi Stowers, Saniyyah Mahmoudi, Nizar Maraqa, Laurie Kirkland; University of Illinois, Chicago: Karen Hayani, Lourdes Richardson, Renee Smith, Alina Miller; University of Miami: Gwendolyn Scott, Sady Dominguez, Jenniffer Jimenez, Anai Cuadra; Keck Medicine of the University of Southern California: Toni Frederick, Mariam Davtyan, Guadalupe Morales-Avendano, Janielle Jackson-Alvarez; University of Puerto Rico School of Medicine, Medical Science Campus: Zoe M. Rodriguez, Ibet Heyer, Nydia Scalley Trifilio.
Conflicts of interest and Source of Funding:
This work was supported by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health. The Pediatric HIV/AIDS Cohort Study (PHACS) was supported by the Eunice Kennedy Shriver National Institute Of Child Health & Human Development (NICHD) with co-funding from the National Institute Of Dental & Craniofacial Research (NIDCR), the National Institute Of Allergy And Infectious Diseases (NIAID), the National Institute Of Neurological Disorders And Stroke (NINDS), the National Institute On Deafness And Other Communication Disorders (NIDCD), the National Institute Of Mental Health (NIMH), the National Institute On Drug Abuse (NIDA), the National Institute On Alcohol Abuse And Alcoholism (NIAAA), the National Cancer Institute (NCI), the Office of AIDS Research (OAR), and the National Heart, Lung, and Blood Institute (NHLBI) through cooperative agreements with the Harvard T.H. Chan School of Public Health (HD052102) and the Tulane University School of Medicine (HD052104). This work was also supported by the Intramural Research Program of the National Cancer Institute (ZIA CP010115-10).
Footnotes
Disclaimer
The conclusions and opinions expressed in this article are those of the authors and do not necessarily reflect those of the National Institutes of Health or U.S. Department of Health and Human Services.
REFERENCE
- 1.Connor EM, Sperling RS, Gelber R, Kiselev P, Scott G, O’Sullivan MJ, et al. Reduction of maternal-infant transmission of human immunodeficiency virus type 1 with zidovudine treatment. Pediatric AIDS Clinical Trials Group Protocol 076 Study Group. N Engl J Med 1994; 331:1173–1180. [DOI] [PubMed] [Google Scholar]
- 2.Little KM, Taylor AW, Borkowf CB, Mendoza MCB, Lampe MA, Weidle PJ, et al. Perinatal Antiretroviral Exposure and Prevented Mother-to-child HIV Infections in the Era of Antiretroviral Prophylaxis in the United States, 1994–2010. Pediatr Infect Dis J 2017; 36:66–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Balzarini J, Naesens L, De Clercq E. New antivirals - mechanism of action and resistance development. Curr Opin Microbiol 1998; 1:535–546. [DOI] [PubMed] [Google Scholar]
- 4.Poirier MC, Olivero OA, Walker DM, Walker VE. Perinatal genotoxicity and carcinogenicity of anti-retroviral nucleoside analog drugs. Toxicol Appl Pharmacol 2004; 199:151–161. [DOI] [PubMed] [Google Scholar]
- 5.Le Chenadec J, Mayaux M-J, Guihenneuc-Jouyaux C, Blanche S, Enquête Périnatale Française Study Group. Perinatal antiretroviral treatment and hematopoiesis in HIV-uninfected infants. AIDS 2003; 17:2053–2061. [DOI] [PubMed] [Google Scholar]
- 6.European Collaborative Study. Levels and patterns of neutrophil cell counts over the first 8 years of life in children of HIV-1-infected mothers. AIDS 2004; 18:2009–2017. [DOI] [PubMed] [Google Scholar]
- 7.Pacheco SE, McIntosh K, Lu M, Mofenson LM, Diaz C, Foca M, et al. Effect of perinatal antiretroviral drug exposure on hematologic values in HIV-uninfected children: An analysis of the women and infants transmission study. J Infect Dis 2006; 194:1089–1097. [DOI] [PubMed] [Google Scholar]
- 8.Sussman HE, Olivero OA, Meng Q, Pietras SM, Poirier MC, O’Neill JP, et al. Genotoxicity of 3’-azido-3’-deoxythymidine in the human lymphoblastoid cell line, TK6: relationships between DNA incorporation, mutant frequency, and spectrum of deletion mutations in HPRT. Mutat Res 1999; 429:249–259. [DOI] [PubMed] [Google Scholar]
- 9.Meng Q, Walker DM, Olivero OA, Shi X, Antiochos BB, Poirier MC, et al. Zidovudine-didanosine coexposure potentiates DNA incorporation of zidovudine and mutagenesis in human cells. Proc Natl Acad Sci USA 2000; 97:12667–12671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Olivero OA, Anderson LM, Diwan BA, Haines DC, Harbaugh SW, Moskal TJ, et al. Transplacental effects of 3’-azido-2’,3’-dideoxythymidine (AZT): tumorigenicity in mice and genotoxicity in mice and monkeys. J Natl Cancer Inst 1997; 89:1602–1608. [DOI] [PubMed] [Google Scholar]
- 11.Walker DM, Malarkey DE, Seilkop SK, Ruecker FA, Funk KA, Wolfe MJ, et al. Transplacental carcinogenicity of 3’-azido-3’-deoxythymidine in B6C3F1 mice and F344 rats. Environ Mol Mutagen 2007; 48:283–298. [DOI] [PubMed] [Google Scholar]
- 12.Olivero OA, Torres LR, Gorjifard S, Momot D, Marrogi E, Divi RL, et al. Perinatal exposure of patas monkeys to antiretroviral nucleoside reverse-transcriptase inhibitors induces genotoxicity persistent for up to 3 years of age. J Infect Dis 2013; 208:244–248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Vivanti A, Soheili TS, Cuccuini W, Luce S, Mandelbrot L, Lechenadec J, et al. Comparing genotoxic signatures in cord blood cells from neonates exposed in utero to zidovudine or tenofovir. AIDS 2015; 29:1319–1324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.André-Schmutz I, Dal-Cortivo L, Six E, Kaltenbach S, Cocchiarella F, Le Chenadec J, et al. Genotoxic signature in cord blood cells of newborns exposed in utero to a Zidovudine-based antiretroviral combination. J Infect Dis 2013; 208:235–243. [DOI] [PubMed] [Google Scholar]
- 15.IARC. Some Antiviral and Antineoplastic Drugs, and Other Pharmaceutical Agents. International Agency for Research on Cancer; 2000. [Google Scholar]
- 16.Zink F, Stacey SN, Norddahl GL, Frigge ML, Magnusson OT, Jonsdottir I, et al. Clonal hematopoiesis, with and without candidate driver mutations, is common in the elderly. Blood 2017; 130:742–752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Desai P, Roboz GJ. Clonal Hematopoiesis and therapy related MDS/AML. Best Pract Res Clin Haematol 2019; 32:13–23. [DOI] [PubMed] [Google Scholar]
- 18.Desai P, Hassane D, Roboz GJ. Clonal Hematopoiesis and risk of Acute Myeloid Leukemia. Best Pract Res Clin Haematol 2019; 32:177–185. [DOI] [PubMed] [Google Scholar]
- 19.Jaiswal S, Natarajan P, Silver AJ, Gibson CJ, Bick AG, Shvartz E, et al. Clonal Hematopoiesis and Risk of Atherosclerotic Cardiovascular Disease. N Engl J Med 2017; 377:111–121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dumanski JP, Lambert J-C, Rasi C, Giedraitis V, Davies H, Grenier-Boley B, et al. Mosaic Loss of Chromosome Y in Blood Is Associated with Alzheimer Disease. Am J Hum Genet 2016; 98:1208–1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Loftfield E, Zhou W, Yeager M, Chanock SJ, Freedman ND, Machiela MJ. Mosaic Y Loss Is Moderately Associated with Solid Tumor Risk. Cancer Res 2019; 79:461–466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Genovese G, Kähler AK, Handsaker RE, Lindberg J, Rose SA, Bakhoum SF, et al. Clonal hematopoiesis and blood-cancer risk inferred from blood DNA sequence. N Engl J Med 2014; 371:2477–2487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Coombs CC, Zehir A, Devlin SM, Kishtagari A, Syed A, Jonsson P, et al. Therapy-Related Clonal Hematopoiesis in Patients with Non-hematologic Cancers Is Common and Associated with Adverse Clinical Outcomes. Cell Stem Cell 2017; 21:374–382.e4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Hsu JI, Dayaram T, Tovy A, De Braekeleer E, Jeong M, Wang F, et al. PPM1D Mutations Drive Clonal Hematopoiesis in Response to Cytotoxic Chemotherapy. Cell Stem Cell 2018; 23:700–713.e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Dumanski JP, Rasi C, Lönn M, Davies H, Ingelsson M, Giedraitis V, et al. Mutagenesis. Smoking is associated with mosaic loss of chromosome Y. Science 2015; 347:81–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wong JYY, Margolis HG, Machiela M, Zhou W, Odden MC, Psaty BM, et al. Outdoor air pollution and mosaic loss of chromosome Y in older men from the Cardiovascular Health Study. Environ Int 2018; 116:239–247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Williams PL, Seage GR, Van Dyke RB, Siberry GK, Griner R, Tassiopoulos K, et al. A trigger-based design for evaluating the safety of in utero antiretroviral exposure in uninfected children of human immunodeficiency virus-infected mothers. Am J Epidemiol 2012; 175:950–961. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Williams PL, Crain MJ, Yildirim C, Hazra R, Van Dyke RB, Rich K, et al. Congenital anomalies and in utero antiretroviral exposure in human immunodeficiency virus-exposed uninfected infants. JAMA Pediatr 2015; 169:48–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Rodriguez EM, Mendez H, Rich K, Sheon A, Fox H, Green K, et al. Maternal drug use in perinatal HIV studies. The Women and Infants Transmission Study. Ann N Y Acad Sci 1993; 693:245–248. [DOI] [PubMed] [Google Scholar]
- 30.Wang Y, Brummel SS, Beilstein-Wedel E, Dagnall CL, Hazra R, Kacanek D, et al. Association between zidovudine-containing antiretroviral therapy exposure in utero and leukocyte telomere length at birth. AIDS 2019; 33:2091–2096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Shi J, Yang XR, Ballew B, Rotunno M, Calista D, Fargnoli MC, et al. Rare missense variants in POT1 predispose to familial cutaneous malignant melanoma. Nat Genet 2014; 46:482–486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Haeussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, et al. The UCSC Genome Browser database: 2019 update. Nucleic Acids Res 2019; 47:D853–D858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 2014; 30:2114–2120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 2011; 43:491–498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Garrison E, Marth G. Haplotype-based variant detection from short-read sequencing. arXiv:12073907 [q-bio] Published Online First: 20 July 2012.http://arxiv.org/abs/1207.3907 (accessed 4 Mar2020). [Google Scholar]
- 36.Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin) 2012; 6:80–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 2016; 536:285–291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Auer PL, Reiner AP, Wang G, Kang HM, Abecasis GR, Altshuler D, et al. Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project. Am J Hum Genet 2016; 99:791–801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.1000 Genomes Project Consortium, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, et al. A global reference for human genetic variation. Nature 2015; 526:68–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. Variation across 141,456 human exomes and genomes reveals the spectrum of loss-of-function intolerance across human protein-coding genes. bioRxiv 2019; :531210. [Google Scholar]
- 41.Genovese G, Handsaker RE, Li H, Kenny EE, McCarroll SA. Mapping the Human Reference Genome’s Missing Sequence by Three-Way Admixture in Latino Genomes. Am J Hum Genet 2013; 93:411–421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Li H Toward better understanding of artifacts in variant calling from high-coverage samples. Bioinformatics 2014; 30:2843–2851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Bailey JA, Gu Z, Clark RA, Reinert K, Samonte RV, Schwartz S, et al. Recent segmental duplications in the human genome. Science 2002; 297:1003–1007. [DOI] [PubMed] [Google Scholar]
- 44.Bergstrom EN, Huang MN, Mahto U, Barnes M, Stratton MR, Rozen SG, et al. SigProfilerMatrixGenerator: a tool for visualizing and exploring patterns of small mutational events. BMC Genomics 2019; 20:685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Tate JG, Bamford S, Jubb HC, Sondka Z, Beare DM, Bindal N, et al. COSMIC: the Catalogue Of Somatic Mutations In Cancer. Nucleic Acids Res 2019; 47:D941–D947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Dawson ET. edawson/tidysig.; 2020. https://github.com/edawson/tidysig (accessed 2 Jun2020). [Google Scholar]
- 47.Rosales R, Drummond R, Valieris R, Silva IT da. signeR: Empirical Bayesian approach to mutational signature discovery. Bioconductor version: Release (3.11); 2020. doi: 10.18129/B9.bioc.signeR [DOI] [PubMed] [Google Scholar]
- 48.Peiffer DA, Le JM, Steemers FJ, Chang W, Jenniges T, Garcia F, et al. High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping. Genome Res 2006; 16:1136–1148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Jacobs KB, Yeager M, Zhou W, Wacholder S, Wang Z, Rodriguez-Santiago B, et al. Detectable clonal mosaicism and its relationship to aging and cancer. Nat Genet 2012; 44:651–658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Loh P-R, Genovese G, Handsaker RE, Finucane HK, Reshef YA, Palamara PF, et al. Insights into clonal haematopoiesis from 8,342 mosaic chromosomal alterations. Nature 2018; 559:350–355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Hwang JTG, Yang M-C. AN OPTIMALITY THEORY FOR MID p–VALUES IN 2 × 2 CONTINGENCY TABLES. Statistica Sinica 2001; 11:807–826. [Google Scholar]
- 52.R Core Team. R: A language and environment for statistical computing. Vienna, Austria:; 2014. http://www.R-project.org/ [Google Scholar]
- 53.Dorsheimer L, Assmus B, Rasper T, Ortmann CA, Ecke A, Abou-El-Ardat K, et al. Association of Mutations Contributing to Clonal Hematopoiesis With Prognosis in Chronic Ischemic Heart Failure. JAMA Cardiol 2019; 4:25–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Haybar H, Shahrabi S, Ghanavat M, Khodadi E. Clonal hematopoiesis: Genes and underlying mechanisms in cardiovascular disease development. J Cell Physiol 2019; 234:8396–8401. [DOI] [PubMed] [Google Scholar]
- 55.Fuster JJ, Walsh K. Somatic Mutations and Clonal Hematopoiesis: Unexpected Potential New Drivers of Age-Related Cardiovascular Disease. Circ Res 2018; 122:523–532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Alexandrov LB, Jones PH, Wedge DC, Sale JE, Campbell PJ, Nik-Zainal S, et al. Clock-like mutational processes in human somatic cells. Nat Genet 2015; 47:1402–1407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Hagenkord JM, Parwani AV, Lyons-Weiler MA, Alvarez K, Amato R, Gatalica Z, et al. Virtual karyotyping with SNP microarrays reduces uncertainty in the diagnosis of renal epithelial tumors. Diagn Pathol 2008; 3:44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Hleyhel M, Goujon S, Sibiude J, Tubiana R, Dollfus C, Faye A, et al. Risk of cancer in children exposed to antiretroviral nucleoside analogues in utero: The french experience. Environmental and Molecular Mutagenesis 2019; 60:404–409. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.



