Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2022 Jun 18;122:444–448. doi: 10.1016/j.ijid.2022.06.023

SARS-CoV-2 intra-host evolution during prolonged infection in an immunocompromised patient

Erika Giorgia Quaranta 1,, Alice Fusaro 1, Edoardo Giussani 1, Valeria D'Amico 1, Maria Varotto 1, Matteo Pagliari 1, Maria Teresa Giordani 2, Maira Zoppelletto 3, Francesca Merola 3, Antonio Antico 3, Paola Stefanelli 4, Calogero Terregino 1, Isabella Monne 1
PMCID: PMC9212919  PMID: 35724829

Abstract

Objectives: Intra-host SARS-CoV-2 evolution during chronic infection in immunocompromised hosts has been suggested as being the possible trigger of the emergence of new variants. Methods: Using a deep sequencing approach, we investigated the SARS-CoV-2 intra-host genetic evolution in a patient with HIV over a period of 109 days. Results: Sequencing of nasopharyngeal swabs at three time points demonstrated dynamic changes in the viral population, with the emergence of 26 amino acid mutations and two deletions, 57% of them in the Spike protein. Such a combination of mutations has never been observed in other SARS-CoV-2 lineages detected so far. Conclusion: Our data confirm that persistent infection in certain immunocompromised individuals for a long time may favor the dangerous emergence of new SARS-CoV-2 variants with immune evasion properties.

Keywords: SARS-CoV-2, Immunocompromised patient, Intra-host evolution, Prolonged infection, Sequencing

Introduction

During the SARS-CoV-2 pandemic, several cases of prolonged infections were reported in immunosuppressed patients (Dolan et al., 2021). Most of these cases revealed an intra-host viral evolution, which allows the virus to accumulate mutations faster than during normal inter-host transmission (Avanzato et al., 2020; Leung et al., 2022). During these prolonged infections, SARS-CoV-2 can acquire mutations at key epitopes in the Spike (S) protein, potentially affecting virus replication, infectivity, and antigenicity, which are common to the variants of concern (VOCs) (European Centre for Disease Prevention and Control, 2022). Although the mechanisms underlying the emergence of new variants are still widely discussed, accelerated viral evolution in immunocompromised patients seems to be involved in the genesis of VOCs, as suggested for the Alpha variant (Hill et al., 2022.). This may also have been the case for the more recent Omicron variant, whose excess number of mutations in the S gene suggests it may have originated in a patient with immunocompromised conditions chronically infected with SARS-CoV-2 (Ma et al., 2022).

In this study, we performed an in-depth analysis of the complete genome of SARS-CoV-2 collected at three different time points from a patient with HIV over a period of 109 days.

Methods

Case description

A patient was diagnosed HIV-1 positive with AIDS and wasting syndrome in October 2006. Antiretroviral therapy (ART) was immediately started with azidothymidine (AZT) + 3TC + nevirapine when the clinical picture was characterized by a CD4 count of 52 cells/ml and an HIV-1 load (VL) of 83,157 cv/ml. In March 2007, the CD4 count was 184 cells/ml, and VL was 141 cv/ml, but after a few months, the patient interrupted the therapy, and the CD4 count dropped to 5 cells/ml, with a VL greater than 500,000 on September 12, 2007. The patient restarted and interrupted AZT + Abacavir + Lopinavir treatment several times from 2007 to 2020. On November 5, 2020, the patient tested positive for SARS-CoV-2 by real-time reverse transcription–PCR (rRT-PCR) (Allplex™ SARS-CoV-2 Assay, Seegene Inc., Seoul, Korea) and the nasopharyngeal swabs collected on 15, 22, and 30 November 2020 still confirmed the presence of SARS-CoV-2 infection. During this time, the patient was not under any ART therapy, and clinical data regarding this period are lacking. On June 5, 2021 (time point one), the patient went to the hospital with a mild fever and watery diarrhea that had persisted for a few weeks, with a weightloss of 20 kg. Once again, the patient tested positive by rRT-PCR for SARS-CoV-2, and blood tests demonstrated a very low CD4 count (3 cells/ml) and a very high HIV-1 load of 558,000 cv/ml. ART was immediately restarted with Dolutegravir + Abacavir + Lamivudine, and soon after, the patient was discharged on a domestic isolation regimen. The patient received two doses of SARS-CoV-2 mRNA (Pfizer) Comirnaty* 195 FL multidose vaccine, the first on June 3, 2021, and the second on July 9, 2021. Testing for SARS-CoV-2 was performed on June 28, 2021 (time point two), and on September 22, 2021 (time point three), with the patient still resulting positive.

The patient was treated with monoclonal antibodies (Bamlanizumab 700 mg + Etesivimab 1400 mg ev) on September 30, 2021.

RNA extraction, next-generation sequencing, and bioinformatics analysis

RNA purification was performed on the three nasopharyngeal swabs using the QIAamp Viral RNA kit (Qiagen) according to the manufacturer's protocol. The RNA from each sample was reverse transcribed using the LunaScript RT SuperMix (New England Biolabs), and 5 μL of cDNA was amplified with the ARTIC primer set (ARTIC nCoV-2019 V3 Panel, Integrated DNA Technologies) and the Q5 Hot Start High Fidelity DNA Polymerase (New England Biolabs). The following conditions were used: 98°C for 30 seconds, 95°C for 15 seconds, and 63°C for 5 minutes (35-40 cycles), with a 4°C holding step. Amplicons were purified using 0.8x AMPure XP beads (Beckman Coulter) and quantified using the Qubit dsDNA HS Assay kit (ThermoFisher Scientific). Libraries were prepared with the Illumina DNA Prep kit (Illumina) according to the manufacturer's instructions, and quality was checked using the Bioanalyzer High Sensitivity DNA kit (Agilent Technologies). Libraries were sequenced on Illumina platforms in 150 paired-end mode. Read quality was assessed using FastQC (Andrews, 2010). Cutadapt (Martin, 2011) was employed to perform quality filtering and adapter trimming. Reads were aligned against the Wuhan-Hu-1 SARS-CoV-2 genome (GenBank accession number NC_045512.2) using BWA-MEM (Li, 2013). iVar (Grubaugh et al., 2019) was used to remove PCR primers. GATK4 (Van der Auwera et al., 2020) was used to correct potential errors, improve alignment and recalibrate base quality. Variants call was performed with LoFreq* (Wilm et al., 2012). Consensus sequences were produced with a script developed in-house. Single nucleotide polymorphisms and indels with a frequency >50% were reported in the consensus, whereas positions with a coverage <10 were masked with the letter N, which indicates any base according to the International Union of Pure and Applied Chemistry annotation.

Results

SARS-CoV-2 sequencing was performed on nasopharyngeal swabs collected from the patient on time points one (June 5, 2021), two (June 28, 2021), and three (September 22, 2021). Unfortunately, they were the only samples available for sequencing at the time of writing. The raw data are available on NCBI Sequence Read Archive with accession number PRJNA837407 and consensus sequences were deposited in the Global Initiative on Sharing All Influenza Data under the following ID: EPI_ISL_2927997 (time point one), EPI_ISL_3006795 (time point two) and EPI_ISL_4968925 (time point three). All the samples produced high-quality data, yielding the near-complete genome. Specifically, we obtained 1,784,831 SARS-CoV-2 reads for time point one, 1,608,528 for time point two, and 3,223,925 for time point three, which allowed us to produce respectively 99.73%, 99.46%, and 99.67% of the genome, with an average coverage of 7,806X for time point one, 6,599X for time point two and 14,140X for time point three (Figure 1 A). These sequences formed a monophyletic group within the AH.3 lineage (Pangolin v4.0.6, pangolin-data version v1.8, O'Toole et al., 2021), a lineage that has been circulating in a few European countries since the second half of October 2020 (Figure 1C). They showed 24-26 amino acid substitutions compared with the Wuhan-Hu-1 reference sequence (GenBank accession number NC_045512.2), 10 of which are distinctive of the AH.3 lineage (Figure 1B). Over 109 days, 35 nucleotide positions underwent mutations, 26 resulting in amino acid changes, and two different deletions appeared (Figure 2 ). Among protein-coding changes, eight were observed in Open Reading Frame 1ab (ORF1ab), 16 in S protein (14 mutations and two deletions), one in ORF3a, one in Membrane (M) protein, and two in Nucleocapsid (N) protein. Similarly to what is reported in other studies (Leung et al., 2022; Weigang et al., 2021), several amino acid changes arising from such intra-host viral evolution have also been observed in VOCs and/or have been associated with effects on viral fitness. The S13I mutation in the S protein, for example, is typical of the Epsilon variant and is responsible for a marked reduction in the binding of N-terminal domain-directed neutralizing antibodies in vitro (McCallum et al., 2021). Similarly, del144, peculiar to the Alpha variant, is believed to ease antibody resistance (Wang et al., 2021). Although del242/246 has neither been associated with any VOC nor with other particular biological effects, it should be highlighted that deletions in this region of the S protein have been associated with the alteration of the epitope structure and consequently a reduced immunoreactivity (Klinakis et al., 2021). Remarkably, at residue E484, we observed the appearance of E484A mutation at time point two and E484K at time point three, although E484A continued to be present at a lower frequency (20%) (Figure 1D). E484 is a crucial residue, as substitutions at this position have been noticed in many rapidly spreading VOCs (Laurini et al., 2021) and are associated with reduced sensitivity to neutralization by convalescent human sera (Liu et al., 2021). In addition, it is worth mentioning that the E484A mutation, typical of the Omicron variant, detected in the sample collected on June 28, 2021 (time point two), had never been observed in Italy. Two other mutations in the receptor-binding domain, F490L, and S494P, are respectively linked to a reduced neutralization sensitivity and an increased binding to angiotensin-converting enzyme 2, thereby increasing overall viral virulence (Chakraborty, 2021; Li et al., 2020). Focusing on the 37 nucleotide positions where the three consensus sequences differed (35 mutations and two deletions), we found that 95% of the observed mutations/deletions were present as minority variants even in samples where they were not detected at the consensus level (Figure 2). For example, mutations F490L, S494P, and Q677K in the S protein were present with a frequency of 89.70-89.79% at time point one, then dropped to a frequency of 16.43-18.51% at time point two and returned to be the prevalent mutation (with a frequency of 63.57-89.02%) at time point three (Figure 1D). This observation suggests that a mixed population was present in the patient as early as time point one. Unfortunately, the lack of sequences from previous samples makes it impossible to determine whether these multiple variants may have resulted from pre-existing co-infection or intra-host virus evolution of a single virus strain.

Figure 1.

Figure 1

SARS-CoV-2 mutation accumulation during chronic infection in a patient with HIV over 109 days. (A) Genome coverage profiles for the three nasopharyngeal swabs analyzed in this study. Y-axis is in logarithmic scale. (B) Schematic representation of amino acid mutations found in patient swabs compared with the Wuhan-Hu-1 reference sequence (GenBank accession number NC_045512.2). The heatmap summarizes the variant frequencies; bordered squares represent consensus mutations (frequency >50%), and the mutations written in blue are those typical of the AH.3 lineage. (C) Phylogenetic analysis of the sequences obtained from patient swabs at the three time points. The sequences were aligned to a set of representative SARS-CoV-2 genome sequences belonging to AH.3 lineage and the main lineages and VOCs identified so far. The maximum-likelihood phylogenetic tree was constructed with IQ-TREE (GTR+F+R2) and rooted on the Wuhan-Hu-1 reference genome. Ultra-fast bootstrap supports are indicated above the nodes. (D) Histograms representing the amino acid frequency at the positions that differ between the three time points. Amino acids are indicated above the columns for each position. VOCs = variants of concern.

Figure 2.

Figure 2

Genome variants identified in nasopharyngeal swabs collected from the patient at three time points in comparison to the Wuhan-Hu-1 reference sequence (GenBank accession number NC_045512.2) in the nucleotide positions that differed between the three consensus sequences. When present, amino acid mutations are indicated. The rectangle marks two different mutations present at the same nucleotide position (26,767; Membrane protein).

Conclusion

Through the ultra-deep sequencing of the SARS-CoV-2 population at multiple time points, we demonstrated that a highly divergent variant could rapidly emerge in an HIV-positive patient with SARS-CoV-2 persistent infection. In less than four months, the virus accumulated 26 amino acid mutations and two deletions, 57% of which were located in the S protein. Many of these mutations are involved in SARS-CoV-2 ability to evade recognition by the immune system and are indeed found in the major VOCs. The depletion of CD4 unfitted the patient for clearing SARS-CoV-2. This case, added to the ones previously reported (Avanzato et al., 2020; Leung et al., 2022), highlights the crucial role of SARS-CoV-2-positive patients with HIV and low CD4 count as possible sources of new variants.

Acknowledgments

Declaration of Competing Interest

The authors have no competing interests to declare.

Funding

This study was partially funded by the Italian Ministry of Health through the project CCM 2020 - Title: “Caratterizzazione molecolare del virus pandemico SARS-CoV-2 in Italia” and the Ricerca Corrente Istituto Zooprofilattico Sperimentale delle Venezie 10/19. Part of the genomic monitoring which led to these results has also been supported by the Resolution of the Regional Council of Veneto n. 805 on June 22, 2021.

Acknowledgments

The authors thank all researchers who have shared genome data openly through the Global Initiative on Sharing All Influenza Data (GISAID). The authors would like to thank the technical and scientific representatives of the Ministry of Health: Dr. Stefania D'Amato and Dr. Michela Sabbatucci. We extend our sincere appreciation to all the operators of the Unità Locale Socio Sanitaria 7 Pedemontana P.O. Bassano del Grappa engaged in the sampling and initial processing of specimens. We also thank Ms. Francesca Ellero from the Istituto Zooprofilattico Sperimentale delle Venezie for editing the manuscript.

Ethical approval

Informed consent was obtained from the patient.

References

  1. Andrews S. FastQC. A quality control tool for high throughput sequencing data, 2010,http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed 9 May 2022).
  2. Avanzato VA, Matson MJ, Seifert SN, Pryce R, Williamson BN, Anzick SL, Barbian K, Judson SD, Fischer ER, Martens C, Bowden TA, de Wit E, Riedo FX, Munster VJ. Case study: prolonged infectious SARS-CoV-2 shedding from an asymptomatic immunocompromised individual with cancer. Cell. 2020;183:1901–1912. doi: 10.1016/j.cell.2020.10.049. e9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Chakraborty S. Evolutionary and structural analysis elucidates mutations on SARS-CoV2 spike protein with altered human ACE2 binding affinity. Biochem Biophys Res Commun. 2021;534:374–380. doi: 10.1016/j.bbrc.2020.11.075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Dolan SA, Mulcahy Levy J, Moss A, Pearce K, Butler M, Jung S, Dominguez SR, Mwangi E, Maloney K, Rao S. SARS-CoV-2 persistence in immunocompromised children. Pediatr Blood Cancer. 2021;68:e29277. doi: 10.1002/pbc.29277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. European Centre for Disease Prevention and Control, 2022. SARS-CoV-2 variants of concern, 2022, https://www.ecdc.europa.eu/en/covid-19/variants-concern (accessed 9 March 2022).
  6. Grubaugh ND, Gangavarapu K, Quick J, Matteson NL, Goes De Jesus J, Main BJ, Tan AL, Paul LM, Brackney DE, Grewal S, Gurfield N, Ka K, Rompay V, Isern S, Michael SF, Coffey LL, Loman NJ, Andersen KG. An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar. Genome Biol. 2019;20:8. doi: 10.1186/s13059-018-1618-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Klinakis A, Cournia Z, Rampias T. N-terminal domain mutations of the spike protein are structurally implicated in epitope recognition in emerging SARS-CoV-2 strains. Comput Struct Biotechnol J. 2021;19:5556–5567. doi: 10.1016/j.csbj.2021.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Laurini E, Marson D, Aulic S, Fermeglia A, Pricl S. Molecular rationale for SARS-CoV-2 spike circulating mutations able to escape bamlanivimab and etesevimab monoclonal antibodies. Sci Rep. 2021;11:20274. doi: 10.1038/s41598-021-99827-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Leung WF, Chorlton S, Tyson J, Al-Rawahi GN, Jassem AN, Prystajecky N, Masud S, Deans GD, Chapman MG, Mirzanejad Y, Murray MCM, Wong PHP. COVID-19 in an immunocompromised host: persistent shedding of viable SARS-CoV-2 and emergence of multiple mutations: a case report. Int J Infect Dis. 2022;114:178–182. doi: 10.1016/j.ijid.2021.10.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Hill V, Plessis L du, Peacock TP, Aggarwal D, Colquhoun R, Carabelli AM, Ellaby N, Gallagher E, Groves N, Jackson B, McCrone J, O'Toole Á, Price A, Sanderson T, Scher E, Southgate J, Volz E, The COVID-19 genomics UK (COG-UK) consortium, Barclay W, Barrett J, Chand M, Connor T, Goodfellow I, Gupta R, Harrison E, Loman N, Myers R, Robertson D, Pybus O, Rambaut A. The origins and molecular evolution of SARS-CoV-2 lineage B.1.1.7 in the UK. bioRxiv, 8 March 2022. https://www.biorxiv.org/content/10.1101/2022.03.08.481609v1 (accessed 9 May 2022).
  11. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM, 2013, https://arxiv.org/abs/1303.3997v2 (accessed 9 May 2022).
  12. Li Q, Wu J, Nie J, Zhang L, Hao H, Liu S, Zhao C, Zhang Q, Liu H, Nie L, Qin H, Wang M, Lu Q, Li X, Sun Q, Liu J, Zhang L, Li X, Huang W, Wang Y. The impact of mutations in SARS-CoV-2 spike on viral infectivity and antigenicity. Cell. 2020;182:1284–1294. doi: 10.1016/j.cell.2020.07.012. e9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Liu Z, VanBlargan LA, Bloyet LM, Rothlauf PW, Chen RE, Stumpf S, Zhao H, Errico JM, Theel ES, Liebeskind MJ, Alford B, Buchser WJ, Ellebedy AH, Fremont DH, Diamond MS, Whelan SPJ. Identification of SARS-CoV-2 spike mutations that attenuate monoclonal and serum antibody neutralization. Cell Host Microbe. 2021;29:477–488. doi: 10.1016/j.chom.2021.01.014. e4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Ma W, Yang J, Fu H, Su C, Yu C, Wang Q, de Vasconcelos ATR, Bazykin GA, Bao Y, Li M. Genomic perspectives on the emerging SARS-CoV-2 omicron variant. Genomics Proteomics Bioinformatics. 2022 doi: 10.1016/j.gpb.2022.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10–12. [Google Scholar]
  16. McCallum M, Bassi J, de Marco A, Chen A, Walls AC, di Iulio J, Tortorici MA, Navarro MJ, Silacci-Fregni C, Saliba C, Sprouse KR, Agostini M, Pinto D, Culap K, Bianchi S, Jaconi S, Cameroni E, Bowen JE, Tilles SW.…Veesler D. SARS-CoV-2 immune evasion by the B.1.427/B.1.429 variant of concern. Science. 2021;373:648–654. doi: 10.1126/science.abi7994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. O'Toole Á, Scher E, Underwood A, Jackson B, Hill V, McCrone JT, Colquhoun R, Ruis C, Abu-Dahab K, Taylor B, Yeats C, Du Plessis L, Maloney D, Medd N, Attwood SW, Aanensen DM, Holmes EC, Pybus OG, Rambaut A. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 2021;7:veab064. doi: 10.1093/ve/veab064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Van der Auwera GA, O'Connor BD. O'Reilly Media; 2020. Genomics in the Cloud: using Docker, GATK, and WDL in Terra. [Google Scholar]
  19. Wang R, Zhang Q, Ge J, Ren W, Zhang R, Lan J, Ju B, Su B, Yu F, Chen P, Liao H, Feng Y, Li X, Shi X, Zhang Z, Zhang F, Ding Q, Zhang T, Wang X, Zhang L. SARS-CoV-2 variants resist antibody neutralization and broaden host ACE2 usage. bioRxiv. 2021 03.09.434497. [Google Scholar]
  20. Weigang S, Fuchs J, Zimmer G, Schnepf D, Kern L, Beer J, Luxenburger H, Ankerhold J, Falcone V, Kemming J, Hofmann M, Thimme R, Neumann-Haefelin C, Ulferts S, Grosse R, Hornuss D, Tanriver Y, Rieg S, Wagner D.…Kochs G. Within-host evolution of SARS-CoV-2 in an immunosuppressed COVID-19 patient as a source of immune escape variants. Nat Commun. 2021;12:6405. doi: 10.1038/s41467-021-26602-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Wilm A, Aw PPK, Bertrand D, Yeo GHT, Ong SH, Wong CH, Khor CC, Petric R, Hibberd ML, Nagarajan N. LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets. Nucleic Acids Res. 2012;40:11189–11201. doi: 10.1093/nar/gks918. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from International Journal of Infectious Diseases are provided here courtesy of Elsevier

RESOURCES