Abstract
Introduction
In Cameroon, a manifold diversity of HIV strains exists with CRF02_AG and unique recombinant forms (URFs) being the predominant strains. In recent years, a steady increase in URFs and clade F2 viruses has been monitored through partial genome sequencing. There is an information gap in the characterization of emerging URFs along the full genome, which is needed to address the challenges URFs pose towards diagnosis, treatment and HIV‐1 vaccine design.
Method
Eighteen Cameroonian URFs from samples collected between the years 2000 and 2015 were studied using a newly developed near full genome sequencing (NFGS) protocol based on variable nested RT‐PCRs with a versatile primer set. Near full genomes were characterized for recombination patterns and sequence signatures with possible impact on antiretroviral treatment or Env‐directed immune responses. Third‐generation sequencing (3GS) of near full or half genomes (HGs) gave insight into intra‐patient URF diversity.
Results
The characterized URFs were composed of a broad variety of subtypes and recombinants including A, F, G, CRF01_AE, CRF02_AG and CRF22_01A1. Phylogenetic analysis unveiled dominant CRF02_AG and F2 recombination patterns. 3GS indicated a high intra‐patient URF diversity with up to four distinct viral sub‐populations present in plasma at the same time. URF pol genomic analysis revealed a number of accessory drug resistance mutations (DRMs) in the ART‐naïve participants. Genotypic env analysis suggests CCR5 usage in 14/18 samples and identified deviations at residues, critical for gp120/gp41 interphase and CD4 binding site broadly neutralizing antibodies in more than half of the studied URFs. V1V2 sites of immune pressure in the human RV144 vaccine study varied in more than a third of URFs.
Conclusions
This study identified novel mosaic patterns in URFs in Cameroon. In line with the regional predominance of CRF_02AG and the increased prevalence of clade F2, prominent CRF_02AG and F2 background patterns were observed underlying the URFs. In the context of the novel mosaic genomes, the impact of the identified accessory DRMs and Env epitope variations on treatment and immune control remains elusive. The evolving diversity of HIV‐1 URFs in Cameroon requires continuous monitoring to respond to the increasing challenges for diagnosis, antiretroviral treatment and prevention.
Keywords: unique recombinant forms, near full genome sequencing, third‐generation sequencing, intra‐patient viral diversity, Env epitopes and drug resistance mutations
Abbreviation
- 3GS
third‐generation sequencing
- cDNA
complementary deoxyribonucleic acid
- CRF
circulating recombinant form
- GTR
generalized time reversible
- HG
half genome
- MDC
medical diagnostic center
- NFG
near full genome
- NFGS
near full genome sequencing
- NYUSoM
New York University School of Medicine
- PCR
polymerase chain reaction
- RNA
ribonucleic acid
- RT
reverse transcriptase
- SGA
single genome amplification
- URF
unique recombinant form
1. Introduction
Based on the latest HIV molecular surveillance report, recombinant forms account for almost every fourth HIV infection globally (22.8%) 1. Circulating recombinant form (CRF) 02_AG (CRF02_AG) is the most predominant recombinant form and the fourth most predominant lineage in the world (7.7%). Also, there has been a consistent and ongoing rise in the percentage of unique recombinant forms (URFs) globally, which currently constitute the fifth most prevalent group of HIV viruses (6.1%) 1. Comparing the periods 1990 to 1999 and 2010 to 2015, URFs almost doubled (1.7x) globally and increased more than four‐fold (4.6x) in West Africa 1. The Congo basin, located in West‐Central Africa between the Sanaga River in Cameroon and the Congo River in the former Belgian Congo, is hypothesized to be the geographical origin of HIV‐1. Cross‐species transmission events from chimpanzees and other monkeys to hunters and butchers are supposed to be initiators of the HIV epidemic in humans 2, 3, 4, 5. Cameroon is the likely country of origin of HIV‐1 groups M, N, O and P 6, 7, 8. Possibly as a consequence, it has one of the most genetically diverse HIV epidemics in the world 9, 10, 11. The predominant lineage is the recombinant CRF02_AG, which accounts for more than half of infections 12, 13. Also, circulating virus lineages include virtually every known HIV‐1 group M (HIV‐1 M) pure subtype, many CRFs, and a variety of URFs composed of pure, CRF and/or non‐classifiable sequences making this country a unique source of rare and emerging HIV‐1 M viral variants 5, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18. The range of subtypes and the extent of viral diversity within a geographical region considerably impact HIV diagnosis, treatment and prevention 19. It is, therefore, crucial to monitor and genetically characterize HIV globally, specifically at HIV diversity hot spot regions like Cameroon.
The characterization of circulating HIV‐1 M strains in Africa is incomplete, particularly in West‐Central Africa, where the diversity of subtypes is exceptionally high. A crucial genetic feature of HIV is its recombination‐prone nature leading to the emergence of intra‐ and inter‐subtype recombinants in dually infected individuals 20, 21. Currently, there are at least 98 CRFs 22, and numerous URFs identified, composed of pure and other recombinant subtypes including under‐sampled parental lineages that cannot be reasonably classified within the established HIV‐1 M subtypes 23, 24, 25. Recombination‐prone sites have been identified 26, 27; however, recombination events are scattered along the whole HIV genome 28. Most genotypic HIV‐1 M data is determined through partial genomic sequencing of one or a few limited regions. Partial genome sequencing bears the risk of missing recombination sites and parental subtypes, which results in an inaccurate or obscured determination of HIV‐1 M genetic diversity.
As part of our ongoing molecular surveillance efforts in Cameroon, our laboratories have in recent years identified and characterized two CRFs (CRF_36cpx and CRF_37cpx) 29, 30, and we have monitored a steady increase in URFs 12, 13, 17, 21. Through the application of a robust and versatile near full genome (NFG) amplification and sequencing technique 31, the current study characterized 18 near‐full genomes of HIV‐1 M recombinants. Deep sequencing revealed considerable intra‐patient URF diversity and evolution. Near full genome sequencing (NFGS) enabled the mapping of sites targeted by antiretroviral therapy or neutralizing antibodies, for example, the pol and env genomic regions, for resistance‐conferring mutations. The study highlights the broad utility of URF NFGS for HIV surveillance, diagnosis and personalized therapy.
2. Methods
2.1. Ethical clearance
This study was approved by the Institutional Review Boards of the Cameroon Ministry of Public Health and the New York University School of Medicine (NYUSoM), New York, USA. Before sample collection, informed consent was obtained from the study participants, who were all part of a cohort of HIV positive individuals; this cohort is monitored at the Medical Diagnostic Center (MDC), Yaoundé, Cameroon in collaboration with NYUSoM, New York, USA.
2.2. Study samples
Whole blood samples were collected between the years 2000 and 2015. They were shipped from the MDC under regulatory guidelines of transfer on biological samples to NYUSoM, where plasma was separated from peripheral blood mononuclear cells through Ficoll gradient centrifugation (Histopaque, Sigma‐Aldrich, St. Louis, MO, USA), and stored at −80°C. Eighteen URFs were selected based on previously published work from our group using partial genome sequencing (total of 509 participants) 11, 12, 13, 17, 32, 33. Inclusion criteria were genetic evidence of URF infection and available plasma volumes ≥500 μL. RNA extraction, cDNA synthesis and NFGS using Sanger sequencing or third‐generation sequencing (3GS) were performed as described in detail in Banin et al. 31. Briefly, a flexible one, two, or multiple amplicon strategy enabled NFGS for all study samples. Amplifications and sequencing were performed using a newly composed set of primers binding to semi‐conserved regions within the HIV‐1 genome of group M viruses. Single genome amplifications (SGA) on endpoint‐diluted cDNA were performed for six near full or half genomes (HGs) to preclude that the identified URFs were the result of amplification artefacts based on template switching when mixtures of genetically diverse templates were present 31.
2.3. Phylogenetic analysis
Generated sequences were assembled using DNA Star SeqMan Pro (where applicable) and aligned using CLUSTAL Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/) with reference sequences from all known HIV‐1 M (sub‐)subtypes and CRFs from the Los Alamos National Library (LANL) HIV sequence database 34. Maximum likelihood phylogenetic trees were generated using MEGA version 5.2 software package with 1000 bootstrap replicates, and pairwise evolutionary distances were estimated using the maximum composite likelihood substitution model 35. Intra‐patient diversity was studied based on 3GS data as follows. After removal of short and intermediate 3GS reads (Table 5 in Banin et al. 31), the long reads were aligned in MEGA5.2 and phylogenetically analysed for cluster formation. For confirmation, multiple reads per cluster were analysed using highlighter plots and Simplot for consistent breakpoint patterns within each cluster and differing breakpoint patterns between clusters. 3GS (sub)populations were averaged to consensus (con) sequences using Consensus Maker 34 or SeqMan Pro.
2.4. Highlighter
The highlighter tool of the LANL database 34 was used to analyse the site‐specific genetic diversity between subjects and between different reads per subject along the entire study sequence. It was also used to detect/exclude cross‐contamination between samples.
2.5. Recombination analysis
Each query sequence was plotted against all known pure HIV‐1 M subtypes and CRF reference sequences in Similarity plots (Simplot software package 3.5.2, window size 200, step size 20). Both similarity plots and the recombinant identification program (RIP; LANL database) were used to identify the contributing parental subtypes of the recombinants (Figure S1). A more focused recombination breakpoint analysis was done by bootscanning (Simplot, window size 200, step size 20, 250 bootstrap replicates). Bootstrap support of 70% between a subtype reference strain and a query sequence was used as criteria to assign a subtype to a breakpoint region. If bootstrap values remained below 70% for all pure and CRF reference subtypes, the region was considered unidentified.
Recombinant fragments were phylogenetically studied using RAxML version 8 36 implemented in CIPRES 37. An un‐rooted maximum likelihood phylogenetic tree was constructed from an alignment (created with MUSCLE, implemented in MEGA version 5.2) combining URF recombinant fragments of sizes >900 bp and full genome reference sequences with 1000 bootstrap replicates. RAxML is based on generalized time reversible (GTR) nucleotide substitution models (GTR‐GAMMA used in this study) and enables accurate inference of phylogenies from alignments containing sequences of different sizes, i.e. it tolerates large amounts of missing data 36, 38. It was suited to perform phylogenetic subtype determination/confirmation for several recombinant fragments >900 bp in size (as determined using similarity plots and bootscanning) in a single maximum likelihood phylogenetic tree.
2.6. Drug resistance mutation and co‐receptor tropism analysis
NFGS were tested for canonical HIV‐1 drug resistance mutations (DRM) using the Stanford HIV drug resistance database (https://hivdb.stanford.edu/) with consideration of recent literature and updates of the World Health Organization (WHO) and the International Antiviral Society (IAS)‐USA 39, 40, 41, 42. The analysis included the pol genomic regions encoding for protease (PR), reverse transcriptase (RT) and integrase (IN). Co‐receptor tropism analysis was performed using the Geno2pheno co‐receptor 2.5 tool (http://coreceptor.geno2pheno.org/) with a 5% false positive rate, as recommended for the prediction of CRF02_AG strains 43, and the PhenoSeq tool for (sub‐)subtypes A/A1/A2/CRF02_AG (http://tools.burnet.edu.au/phenoseq/) 44.
2.7. Env epitope and N‐glycosylation analysis
Env amino acid sequences were screened for the presence of key residues for broadly neutralizing antibodies and for sites of immune pressure identified in the partially protective human vaccine study RV144 45, 46. Env amino acid sequences from the study subjects and reference sequences of subtype B (HXB2), CRF02_AG (0014BBY) and F2 (CM53657) were aligned using Clustal Omega. Potential N‐linked glycosylation sites were determined using the N‐Glycosite tool from the LANL database 34 with sites highlighted in red (Figure S8). Key N‐glycosylation sites, residues that are critical for broadly neutralizing antibodies and sites of immune pressure based on the RV144 vaccine study, were labelled according to Rolland et al. 46, deCamp et al. 47 and Courtney et al. 21.
2.8. Data storage and documentation
NFG sequences are available from GenBank with accession numbers MK086109‐MK086126. Whole sets of raw deep sequencing reads have been deposited in the Sequence Read Archive with accession numbers SRR9074509‐SRR9074519. All other data are available from the corresponding author upon reasonable request.
3. Results
3.1. URFs in Cameroon are composed of diverse subtypes with scattered recombination breakpoints
Plasma samples from 18 Cameroonian study participants at ART‐naïve time points were selected for the current study (Table 1). All study participants were infected with HIV‐1 M URFs based on partial genome sequencing 11, 12, 17, 32, 33; no further biological selection criteria were applied. The study included participants from both genders, between 20 and 70 years of age, and at different stages of HIV‐1 infection. The samples were collected between the years 2000 and 2015 with plasma viral loads between 300 and 950,000 copies/mL. CD4 counts were in the range between 170 and 530 cells/μL. The clinical and demographic data of the participants and a summary of the NFGS approaches are shown in Table 1.
Table 1.
# | Sample ID | Sex | Age | Sampling date | Diagnosis date | CD4 counts (cells/μL) | Viral load copies/mL | Antiretroviral treatment | # Amplicons for NFGS | Genotype (NFGS) |
---|---|---|---|---|---|---|---|---|---|---|
1 | LB016‐1 | F | 39 | January 2010 | November 2007 | 201 | 9870 | Naïve | 1 | 02AG/A1 |
2 | LB069‐1 | F | 36 | May 2010 | July 2008 | 266 | 25,618 | Naïve | 1 | 22_01A1/F2 |
3 | LB082‐1 | M | 43 | June 2010 | 2006 | 226 | 953,760 | Naïve | 2 | A1/F2 |
4 | LB089‐1 | M | 30 | July 2010 | June 2010 | 371 | 20,148 | Naïve | 1 | 02AG/F2 |
5 | LB095‐1 | M | 21 | July 2010 | July 2010 | 443 | 78,218 | Naïve | 1 | 02AG/01AE |
6 | LB104‐1 | F | 41 | October 2010 | October 2010 | 407 | 47,366 | Naïve | 2 | 02AG/F2/22_01A1 |
7 | MDC 131‐1 | F | 48 | June 2011 | NA | 529 | 4140 | Naïve | 2 | 02AG/F2 |
8 | MDC 179‐2 | M | 39 | June 2012 | July 2011 | 306 | 3620 | Naïve | 2 | 02AG/F2 |
9 | BDHS024‐2 | F | 27 | NA | NA | NA | 4198 | Naïve | 2 | 02AG/F2 |
10 | BDHS33 | F | 20 | NA | November 2006 | NA | 28,480 | NA | 3 | 02AG/F2 |
11 | NYU119‐3 | NA | NA | NA | NA | NA | NA | NA | 2 | 02AG/F2 |
12 | NYU124‐2 | F | 35 | September 2001 | January 2001 | 495 | 5688 | Naïve | 2 | 02AG/F2 |
13 | NYU129‐5 | F | 44 | August 2002 | January 2000 | 426 | 478,774 | Naive | 1 | 02AG/A1 |
14 | NYU1122‐1 | M | 70 | Jun 2000 | June 2000 | NA | 306 | Naïve | 1 | 02AG/A1 |
15 | NYU1999‐1 | F | 23 | July 2000 | July 2000 | NA | 14,558 | Naïve | 2 | 02AG/F2 |
16 | NYU2140‐1 | F | 50 | July 2000 | July 2000 | NA | 13,564 | Naïve | 2 | 02AG/F2 |
17 | NYU6556‐3 | M | 34 | February 2008 | February 2001 | 174 | 17,742 | Naïve | 2 | 02AG/G |
18 | NYU6541‐6 | F | 31 | November 2004 | July 2002 | 169 | 65,554 | Naïve | 1 | 02AG/F2 |
NA, not available; NFGS, near full‐genome sequencing; MDS, medical diagnostic center.
Sanger NFGS was obtained for all 18 URF samples (Figure 1, Figures S1, S2 and Table 1), complemented by selective 3GS, SGA and cloning experiments (see also Banin et al. 31). The URF near full genomes were composed of the pure (sub‐)subtypes A (A1), F (F2), and G as well as the CRFs CRF01_AE, CRF02_AG and CRF22_01A1. The recombinant breakpoints were scattered along the full genome; among the 18 URF near full genomes, no recurrent breakpoint positions became evident. However, comparing the URFs according to functional genomic units, it was apparent that CRF02_AG was dominant in gag, accessory genes (vif, vpr, vpu), and env, whereas F2 had high abundance in pol (Figure 1). Though small in size, notably more than 70% (13/18) of the studied URFs exhibited undefined regions, which could not be attributed to any pure subtype or currently known CRF (a common nonexclusive selection of reference strains is shown in Figure S1). The most complex mosaic pattern was found for LB104‐1, being composed of F2, CRF_02AG, CRF_2201A1 and undefined regions. In the highly conserved gag and pol regions, repeated analyses yielded low genetic distances (<0.2%) between a few samples. Follow‐up studies with longitudinal samples will determine whether genetic linkages within the cohort or impurities during sample processing accounted for this observation (Figure 1, grey highlighted regions).
3.2. Common phylogenetic background patterns in Cameroonian URFs
Phylogenetic breakpoint analysis revealed a heavy presence of mosaic CRF02_AG and F2 segments in the studied URFs (Figure 1, Figures S2, S3, Table S1). Interestingly, and following the segregated breakpoint analysis, two common clustering patterns of the URFs were identified when phylogenetic analyses were performed using the near full genomes, i.e. URFs either clustered with CRF02_AG or F2 clades (Figure 2). The CRF02_AG clustering was predominant and applied to fourteen URFs (NYU6556‐3, NYU1999‐1, NYU2140‐1, NYU119‐3, BDHS33, MDC131‐3, NYU124‐2, BDHS24‐2, LB089‐1, LB095‐1, LB016‐1, NYU1122‐1, MDC179‐2 and NYU129‐5) (Figure 2, red). Nested within the F2 branch, four URFs (LB069‐1, LB104‐1, LB082‐1 and NYU6541‐1) were identified (Figure 2, cyan). Of note, seven more URFs were composed by more than a third of their near full genomes of F2 sequences (Figure 1, Figures S2, S3). F2 regions were most abundant in pol, followed by gag and the border region between the end of env and nef. F2 most frequently recombined with CRF02_AG, A1, CRF22_01A1 and undefined regions. Overall, dominant CRF02_AG and F2 framework regions appeared as a common pattern in all of the studied URFs from Cameroon.
3.3. High degree of intra‐patient URF diversity determined by 3GS
Twelve samples were subjected to 3GS analyses using the portable Minion technology suited for long read lengths. 3GS enabled to determine the extent of intra‐patient URF diversity and evolution. According to the differential polymerase chain reaction (PCR) amplification efficiencies of the near full genomes, HGs or shorter constructs per subject (Table 1) 31, we performed 3GS with a selection of three near full genomes, six HGs, and three vif/gp120 products. Based on the limited number of 3GS amplification rounds (15×) (to minimize amplification artefacts), the long read lengths, and the intrinsic limitations in yield using the recently introduced portable 3GS technology, the depth of 3GS remained low to moderate with mostly few thousand HIV‐specific reads per sample (Figure S4) 31. After removal of singular reads and possible contaminants, the long reads were subjected to phylogenetic analyses including maximum likelihood trees, highlighter and breakpoint analysis. Comparing the outcomes of 3GS with single‐genome amplification and bulk sequencing confirmed genetic similarities and comparable breakpoint patterns thus, the suitability of our 3GS approach 31. Three participants were identified with a remarkable diversity of viral populations simultaneously present in plasma, that is, BDHS33, NYU2140‐1 and NYU124‐2 (Figure 3, Figures S5, S6). The other studied 3GS samples revealed homogeneous viral populations with consistent breakpoint patterns, as shown representatively for four participants (Figure S7). A high intra‐patient diversity was observed for subject BDHS33, studied over the second HG (Figure 3A,B,C,D,E,F). Three viral sub‐populations were identified, which genetically differed by 6% to 17%. According to the 3GS results, each sub‐population made up > 2% of the entire viral population (Figure 3D,E,F), which minimized the likelihood that in vitro recombination was the sole cause for the presence of the detected recombinants 31. All three sub‐populations were identified as recombinant viruses composed of CRF02_AG, F2, and undefined regions. While two URF sub‐populations were genetically unrelated (clusters 1 and 2), the third URF sub‐population (cluster 3) was composed of sequences of the other two sub‐populations (parts 1 and 2 respectively; Figure 3A,B,C) suggesting the co‐presence of a secondary URF together with its parental URF lineages. Besides BDHS33, subjects NYU2140‐1 and NYU124‐2 exhibited high intra‐sample HIV‐1 diversities with four unique viral sub‐populations (clusters a to d) (Figures S5, S6). In each sample, either one (cluster b to 75%) or two populations (cluster b to 45% and cluster c to 31%) predominated respectively. Bulk amplification with Sanger sequencing selectively detected one of the 3GS sub‐populations with >6% prevalence, which was the major population for BDHS33 (cluster 1, 52%), but not for NYU124‐2 (cluster a, 19%) and NYU2140‐1 (cluster c, 7%).
3.4. DRM, co‐receptor tropism and Env epitope analysis in URFs from Cameroon
To assess whether the URFs potentially inherited impaired sensitivities to antiretroviral treatment, the entire pol regions were studied for mutations conferring resistance to protease inhibitors (PI), reverse transcriptase inhibitors (RTI), and integrase inhibitors (INSTI) (Table 2). The bulk sequences of the 18 participants were subjected to a genotypic DRM analysis using the Stanford and WHO HIV drug resistance databases. Five out of 18 subjects exhibited accessory DRMs including mutations V32E (PI), T97A and Q146H (both INSTI). T97A was most frequently observed (4×). No canonical RTI or major PI or INSTI resistance mutation was detected. Genotypic co‐receptor analysis, using Geno2pheno and the settings recommended for CRF02_AG infections 43 predicted exclusive CCR5 usage for all studied URFs, whereas PhenoSeq, using settings optimized for subtype A and CRF02_AG, predicted CXCR4 tropism for 4/18 URF viruses (Table 2). To analyse the URFs’ immunological features and putative resistance mutations against broadly neutralizing antibodies (bnAbs), envelope protein (Env) amino acid alignments of functional URF bulk sequences were studied (Figure S8). As expected, the URFs exhibited highly diverse variable regions V1‐V5, in contrast to the constant regions, which were mostly consistent in length and amino acid composition. Potential N‐glycosylation sites with relevance for bnAbs were highly conserved for most sites (≤1 out of 17 mutated); however, frequent resistance mutations were observed at sites N234 (8/17) and N637 (3/17). Of interest, both sites are relevant for gp120/gp41 interphase bnAbs, such as 8ANC195 and PGT151 48, 49. Also, sites of immune pressure identified in the human RV144 vaccine trial 45, 46 (4 to 5 deviating residues per site) and CD4 binding site (CD4bs) residues capable of conferring resistance to CD4bs bnAbs 47 (up to twelve deviating residues per site) showed greater divergence (Figure S8).
Table 2.
No. | Sample ID | PI Resistance Mutations | RTI Resistance Mutations | INSTI Resistance Mutations | Co‐receptor tropism | ||||
---|---|---|---|---|---|---|---|---|---|
Major | Accessory | NRTI | NNRTI | Major | Accessory | G2P (5% FPR) | PhenoSeq | ||
1 | LB016‐1 | None | None | None | None | None | None | R5 | R5 |
2 | LB069‐1 | None | None | None | None | None | None | R5 | R5 |
3 | LB082‐1 | None | None | None | None | None | None | R5 | R5 |
4 | LB089‐1 | None | None | None | None | None | None | R5 | X4 |
5 | LB095‐1 | None | None | None | None | None | None | R5 | R5 |
6 | LB104‐1 | None | None | None | None | None | T97A | R5 | X4 |
7 | MDC131‐3 | None | None | None | None | None | None | R5 | R5 |
8 | MDC179‐2 | None | None | None | None | None | None | R5 | X4 |
9 | BDHS24‐2 | None | V32E | NA | NA | None | T97A | R5 | R5 |
10 | BDHS33 | None | None | None | None | None | None | R5 | X4 |
11 | NYU119‐3 | None | None | None | None | None | None | R5 | R5 |
12 | NYU124‐2 | None | None | None | None | None | None | R5 | R5 |
13 | NYU129‐5 | None | None | None | None | None | T97A | R5 | X4 |
14 | NYU1122‐1 | None | None | None | None | None | Q146H | R5 | R5 |
15 | NYU1999 | None | None | None | None | None | None | R5 | R5 |
16 | NYU2140‐1 | None | None | None | None | None | None | R5 | R5 |
17 | NYU6556‐3 | None | None | None | None | None | None | R5 | R5 |
18 | NYU6541‐6 | None | None | None | None | None | T97A | R5 | R5 |
The presence of drug resistance mutations, according to the Stanford and WHO drug resistance database and X4 tropism are highlighted in bold. FPR, false positive rate; G2P, Geno2Pheno; INSTI, integrase strand transfer inhibitor; NA, Not applicable due to a frameshift in the respective genomic region; NNRTI, non‐NRTI; NRTI, nucleoside reverse transcriptase inhibitor; PI, protease inhibitor; R5, predicted CCR5 tropism; RTI, reverse transcriptase inhibitor; URF, unique recombinant form.; X4, predicted CXCR4 tropism.
4. Discussion
Eighteen URFs that were identified between the years 2000 and 2015 in the HIV recombinant hotspot Cameroon were characterized using a newly established NFG amplification and sequencing protocol 31. The study provided insight into the evolving HIV‐1 diversity, the genomic complexity of URF infections, the structural dynamics of emerging recombinant forms, and possible implications/caveats for antiretroviral treatment and vaccine research.
The URFs exhibited divergent recombination patterns with mosaic pieces of diverse subtypes and CRFs scattered along the whole genomes. It underlined the necessity of performing NFGS in order to achieve accurate HIV surveillance, subtyping and genotypic analyses. The URFs were composed of pure (sub‐)subtypes A (A1), F (F2) and G, and recombinant forms CRF02_AG, CRF01_AE and CRF22_01A1, thus, URFs appeared to have taken up key circulating strains in Cameroon 12, 13, 17, 50. Notably, 16/18 URFs contained CRF02_AG sequences and 13/18 URFs contained F2 sequences. For all URFs, the frameworks were exclusively built of CRF02_AG or F2 sequences. While CRF02_AG has been well documented to be the predominant lineage and the major fundament of recombinant strains in Cameroon 11, 12, 18, 24, 51, F2 is less known, and the growing influence of F2 in the URF genetic pool only recently became evident 12, 13. Within the last two decades, F2 infections have steadily increased in Cameroon and spread beyond the Cameroonian borders in West and Central Africa 12, 13, 17, 52, 53, 54, 55, 56, which likely promoted the involvement of F2 sequences in emerging recombinant forms. A CRF is declared when a characteristic recombination pattern (of a former URF) recurs in at least three epidemiologically unlinked individuals. While CRFs composed of CRF02_AG and F2 have not been described so far, the current and recent studies mentioned above suggest their likely presence and a timely detection when a higher number of URF samples is analysed.
Although bulk sequencing covered a large part of the relevant information in terms of strain monitoring, deep sequencing provided more profound insight into strain diversity and presence of subdominant subtypes/recombinants within an individual. Whereas bulk sequencing identifies one (or a mix of few variants) in a sample, which is dominant in terms of relative abundance or amplification yield, deep sequencing can detect several co‐existing variants. It enables the identification of otherwise undetectable variants/recombinants and may reveal the presence of (recombination events with) additional subtypes. For example, Figure 3, Figures S5, and S6 showed that multiple strains were present in three individuals at the studied time points, which bulk sequencing could not identify. Furthermore, bulk sequencing almost exclusively detected CRF02_AG in the 3’ HG of #NYU124_2 viruses (Figure 1), which was comparable to cluster a of the deep sequencing results (Figure S6). In contrast, deep sequencing revealed the presence of three additional viral populations, of which two were dominated by F2 sequences (along the studied 3’ half of the genome; clusters b and d in Figure S6).
Irrespective of sequencing method, a one amplicon strategy, as successfully applied for 7/18 samples, was the preferable NFGS approach since the assembly of multiple amplicons bears the potential risk of obtaining chimeras composed of different variants circulating in the same subject. The likeliness to obtain chimeras increases with the number of composite amplicons. The considerable size of the overlapping regions in our multiple amplicon approaches (e.g. >900 bp in two‐amplicon approaches) helped to avoid artificial chimeras. Assembly inaccuracies cannot be entirely excluded when using multiple partial amplicons (≥2). Yet, deep sequencing in combination with assembly algorithms exist for the correct assembly of large composite genomes, thus, to minimize assembly artefacts 57, 58.
The detection of numerous undefined regions in 2/3 of the studied URFs implies that the genetic pool of HIV is still insufficiently explored. Plausible reasons are evolutionary leaps of the circulating strains or the presence of unidentified strains/clades including traces of ancient viruses 23. Since the undefined regions were rather small in size (<500 bp), a lack of strong phylogenetic signals might have contributed that some fragments remained undefined. Besides HIV's high intrinsic evolutionary rate due to its error‐prone replication process, intra‐ or inter‐subtype recombination expedites its rapid evolution. Recombination results from super‐ or dual infection of an individual with two or more genetically distinct viruses 21, 59. The detected high intra‐patient diversity in three out of 11 successfully determined 3GS samples indicated that the diversification of viral strains through recombination was actively ongoing. Of interest, one participant exhibited three URF populations including a secondary URF and both of its parental URF lineages that most likely served as genetic templates after the occurrence of dual infection. Although a thorough confirmation of each 3GS sub‐population through extensive SGA analyses was beyond the capacity and scope of the current study, the SGA validation of our 3GS approach and the exclusion of minority variants with less than 2% prevalence makes it appear likely that in vitro recombination was not the only source for the detected secondary URF 31, 60, 61. Further studies are needed to retrace the events of in vivo recombination fully and to assess the clinical and functional consequences of high URF diversity within each individual and the entire population.
Subtype‐specific differences in replicative fitness have been described, and in the case of URFs, replication kinetics were associated with subtype characteristics of the pol segment 11. Specifically, in vitro replication was enhanced in URFs with pol regions of clades CRF01_AE, CRF22_01A1 or D when compared with CRF02_AG. Of interest, our study showed overall predominant CRF02_AG sequences; however, not for the pol regions where a strong influence of other clades, specifically F2, became evident. Recent recombinant pattern analysis revealed that, on a global scale, subtype F fragments most frequently contributed to recombinants in gag, protease, reverse transcriptase, integrase and gp120 genomic regions, confirming the tendency of subtype F to recombine in pol 62. The focus on Cameroon, the predominance of sub‐subtype F2 over F1, and the limited number of 18 NFGS in our study may be responsible for the subtle differences in F fragment distribution compared with the global study. Overall, these data suggest that the generation of new URFs and onward transmission may be partly driven by replication characteristics of the parental strain pol regions.
The pol region is also the primary target for antiretroviral therapy, and sequence evolution in pol is dependent on drug selection pressure. Controversially discussed, polymorphisms known to confer drug resistance may generally occur more frequently in individuals/populations with the higher genetic diversity of HIV strains 63, 64. Our study included ART‐naïve individuals with no indications of treatment history, in whom no signs of major canonical DRMs were detected. However, 5/18 individuals were identified to carry accessory DRMs, which, in the context of the mosaic genetic composition, might confer new ways of drug resistance. Notably, while the emerging subtype F2 strains are largely understudied, the genetically closely related F1 viruses were shown to exhibit reduced sensitivities to reverse transcriptase and protease inhibitors 65, 66.
More generally, higher pre‐treatment diversity has been associated with higher viral load, less effective viral control after treatment interruption, accelerated disease progression 67, 68, 69, and impact on drug efficacy while under treatment 63, 70, 71. Presumably, an expedited process of escape is provided by an outgrowth of resistant variants or recombination events out of a larger pool of available strains. The presence of multiple viral sub‐populations in 3/11 study samples that were subjected to 3GS suggests that such processes are of relevance in our study population. More precise deep sequencing as obtained with the portable MinIon 3GS technology, with the given limitations in depth and accuracy 31, 72, 73, will be of importance to define the clinical consequences of high intra‐patient URF diversity.
With regards to the antibody‐exposed Env regions of the studied URF viruses, sites of immune pressure, as identified in the human RV144 vaccine trial 45, 46 varied in half of the studied URFs. Also, essential residues for binding/neutralization of gp120/gp41 interphase as well as conformational CD4 binding site bnAbs were divergent in more than half of the studied strains 47, 48, 49. It indicates that differential quaternary Env arrangements and adaptations may have evolved/been necessary upon the generation of novel URFs. Longitudinal follow‐up of individuals with URF infections including clinical and immunological monitoring will be paramount to comprehensively assess and counteract the challenges URFs pose for human healthcare.
5. Conclusions
The current study illustrated the genetic complexity and structural dynamics in URFs from Cameroon. Periodic monitoring of emerging strains in Cameroon and other regions in West‐Central Africa is necessary to detect evolutionary trends that might appear in other regions of the world at a later time, as monitored in the past for the migration of major HIV‐1 M subtypes. The complications of diverse HIV infections may, among other factors, contribute to regional public healthcare issues as evident in the most recent 90‐90‐90 target report from UNAIDS; West and Central Africa had one of the most pronounced treatment target gaps in the world, i.e. percentages below 50% in diagnosed, treated and virally suppressed people living with HIV. Ongoing full genome monitoring of increased numbers of URF infections will reveal, whether the dominant CRF02_AG and F2 framework patterns will continue to persist or further evolve, and how the challenges that emerging URFs pose for HIV diagnosis, antiretroviral treatment and vaccine development can be addressed.
Competing interests
The authors have no competing interests to declare.
Authors’ contributions
RD designed the study. ANB, MTu, JSB, PZ, IOO and XW performed experiments. ANB, MTu, MTo, AKJ and RD performed experimental and statistical data analysis. AJN, DM, JN, MKG, AH and CF supervised experimental and statistical procedures. ANB and RD wrote the manuscript. All authors reviewed the manuscript.
Supporting information
Acknowledgements
The authors thank the individuals who consented to participate and donated their blood samples for this study. Our appreciation is extended to the Cameroon Ministry of Public Health for supporting this study. The authors thank Dr. Phillipe N. Nyambi for the support he gave to the students and several co‐authors of the study and contributing to the design and initiation of the project. The authors also thank Caroline Kakam, Bladine Asaah and Flavia Camacho for their assistance in sample collection and manuscript preparation.
Funding
This study was supported by the National Institutes of Health Grants AI083142 (MTu, AJN, RD) and TW009604 (ANB, MTu, JSB, AJN, RD). MTo is supported by the Sub‐Saharan African Network for TB/HIV Research Excellence (SANTHE), a DELTAS Africa Initiative (grant # DEL‐15‐006). The DELTAS Africa Initiative is an independent funding scheme of the African Academy of Sciences (AAS)'s Alliance for Accelerating Excellence in Science in Africa (AESA) and supported by the New Partnership for Africa's Development Planning and Coordinating Agency (NEPAD Agency) with funding from the Welcome Trust (grant # 107,752/Z/15/Z) and the UK Government.
Banin, A. N. , Tuen, M. , Bimela, J. S. , Tongo, M. , Zappile, P. , Khodadadi‐Jamayran, A. , Nanfack, A. J. , Okonko, I. O. , Meli, J. , Wang, X. , Mbanya, D. , Ngogang, J. , Gorny, M. K. , Heguy, A. , Fokunang, C. and Duerr, R. Near full genome characterization of HIV‐1 unique recombinant forms in Cameroon reveals dominant CRF02_AG and F2 recombination patterns. J Int AIDS Soc. 2019; 22(7):e25362
References
- 1. Hemelaar J, Elangovan R, Yun J, Dickson‐Tetteh L, Fleminger I, Kirtley S, et al. Global and regional molecular epidemiology of HIV‐1, 1990‐2015: a systematic review, global survey, and trend analysis. Lancet Infect Dis. 2019;19(2):143–55. [DOI] [PubMed] [Google Scholar]
- 2. Keele BF, Van Heuverswyn F, Li Y, Bailes E, Takehisa J, Santiago ML, et al. Chimpanzee reservoirs of pandemic and nonpandemic HIV‐1. Science. 2006;313(5786):523–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Sharp PM, Hahn BH. Origins of HIV and the AIDS pandemic. Cold Spring Harb Perspect Med. 2011;1(1):a006841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. McNeil D Jr. Chimp to man to history: the path of AIDS. The New York Times. 2011 October 17, 2011. [cited 2018 Nov 19]. Available from: https://www.nytimes.com/2011/10/18/health/18aids.html
- 5. Tongo M, Martin DP, Zembe L, Mpoudi‐Ngole E, Williamson C, Burgers WA. Characterization of HIV‐1 gag and nef in Cameroon: further evidence of extreme diversity at the origin of the HIV‐1 group M epidemic. Virol J. 2013;10:29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Tebit DM, Arts EJ. Tracking a century of global expansion and evolution of HIV to drive understanding and to combat disease. Lancet Infect Dis. 2011;11(1):45–56. [DOI] [PubMed] [Google Scholar]
- 7. Faria NR, Rambaut A, Suchard MA, Baele G, Bedford T, Ward MJ, et al. HIV epidemiology. The early spread and epidemic ignition of HIV‐1 in human populations. Science. 2014;346(6205):56–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Peeters M, Jung M, Ayouba A. The origin and molecular epidemiology of HIV. Expert Rev Anti Infect Ther. 2013;11(9):885–96. [DOI] [PubMed] [Google Scholar]
- 9. Vergne L, Bourgeois A, Mpoudi‐Ngole E, Mougnutou R, Mbuagbaw J, Liegeois F, et al. Biological and genetic characteristics of HIV infections in Cameroon reveals dual group M and O infections and a correlation between SI‐inducing phenotype of the predominant CRF02_AG variant and disease stage. Virology. 2003;310(2):254–66. [DOI] [PubMed] [Google Scholar]
- 10. Brennan CA, Bodelle P, Coffey R, Devare SG, Golden A, Hackett J Jr, et al. The prevalence of diverse HIV‐1 strains was stable in Cameroonian blood donors from 1996 to 2004. J Acquir Immune Defic Syndr. 2008;49(4):432–9. [DOI] [PubMed] [Google Scholar]
- 11. Ragupathy V, Zhao J, Wood O, Tang S, Lee S, Nyambi P, et al. Identification of new, emerging HIV‐1 unique recombinant forms and drug resistant viruses circulating in Cameroon. Virol J. 2011;8:185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Courtney CR, Agyingi L, Fokou A, Christie S, Asaah B, Meli J, et al. Monitoring HIV‐1 group M subtypes in Yaounde, Cameroon reveals broad genetic diversity and a novel CRF02_AG/F2 infection. AIDS Res Hum Retroviruses. 2016;32(4):381–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Nanfack AJ, Redd AD, Bimela JS, Ncham G, Achem E, Banin AN, et al. Multimethod longitudinal HIV drug resistance analysis in antiretroviral‐therapy‐naive patients. J Clin Microbiol. 2017;55(9):2785–800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Nkengasong JN, Janssens W, Heyndrickx L, Fransen K, Ndumbe PM, Motte J, et al. Genotypic subtypes of HIV‐1 in Cameroon. AIDS. 1994;8(10):1405–12. [DOI] [PubMed] [Google Scholar]
- 15. Fonjungo PN, Mpoudi EN, Torimiro JN, Alemnji GA, Eno LT, Lyonga EJ, et al. Human immunodefiency virus type 1 group M protease in Cameroon: genetic diversity and protease inhibitor mutattional features. J Clin Microbiol. 2001;40:837–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Torimiro JN, D'Arrigo R, Takou D, Nanfack A, Pizzi D, Ngong I, et al. Human immunodeficiency virus type 1 intersubtype recombinants predominate in the AIDS epidemic in Cameroon. New Microbiol. 2009;32(4):325–31. [PubMed] [Google Scholar]
- 17. Agyingi L, Mayr LM, Kinge T, Orock GE, Ngai J, Asaah B, et al. The evolution of HIV‐1 group M genetic variability in Southern Cameroon is characterized by several emerging recombinant forms of CRF02_AG and viruses with drug resistance mutations. J Med Virol. 2014;86(3):385–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Carr JK, Wolfe ND, Torimiro JN, Tamoufe U, Mpoudi‐Ngole E, Eyzaguirre L, et al. HIV‐1 recombinants with multiple parental strains in low‐prevalence, remote regions of Cameroon: evolutionary relics? Retrovirology. 2010;7:39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Hemelaar J, Gouws E, Ghys PD, Osmanov S; W‐UNfH, Characterisation . Global trends in molecular epidemiology of HIV‐1 during 2000‐2007. AIDS. 2011;25(5):679–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Konings FA, Haman GR, Xue Y, Urbanski MM, Hertzmark K, Nanfack A, et al. Genetic analysis of HIV‐1 strains in rural eastern Cameroon indicates the evolution of second‐generation recombinants to circulating recombinant forms. J Acquir Immune Defic Syndr. 2006;42(3):331–41. [DOI] [PubMed] [Google Scholar]
- 21. Courtney CR, Mayr L, Nanfack AJ, Banin AN, Tuen M, Pan R, et al. Contrasting antibody responses to intrasubtype superinfection with CRF02_AG. PLoS One. 2017;12(3):e0173705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. HIV Circulating Recombinant Forms (CRFs) , HIV Sequence Database, Los Alamos National Laboratory, NM, USA [cited 2019 Apr 30]. Available from: https://www.hiv.lanl.gov/content/sequence/HIV/CRFs/CRFs.html
- 23. Tongo M, Dorfman JR, Abrahams MR, Mpoudi‐Ngole E, Burgers WA, Martin DP. Near full‐length HIV type 1M genomic sequences from Cameroon: evidence of early diverging under‐sampled lineages in the country. Evol Med Public Health. 2015;2015(1):254–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Tongo M, Dorfman JR, Martin DP. High degree of HIV‐1 group M (HIV‐1M) genetic diversity within Circulating recombinant forms: insight into the early events of HIV‐1M evolution. J Virol. 2015;90(5):2221–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Rodgers MA, Wilkinson E, Vallari A, McArthur C, Sthreshley L, Brennan CA, et al. Sensitive next‐generation sequencing method reveals deep genetic diversity of HIV‐1 in the democratic republic of the Congo. J Virol. 2017;91(6):e01841–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Fan J, Negroni M, Robertson DL. The distribution of HIV‐1 recombination breakpoints. Infect Genet Evol. 2007;7(6):717–23. [DOI] [PubMed] [Google Scholar]
- 27. Smyth RP, Schlub TE, Grimm AJ, Waugh C, Ellenberg P, Chopra A, et al. Identifying recombination hot spots in the HIV‐1 genome. J Virol. 2014;88(5):2891–902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Archer J, Pinney JW, Fan J, Simon‐Loriere E, Arts EJ, Negroni M, et al. Identifying the important HIV‐1 recombination breakpoints. PLoS Comput Biol. 2008;4(9):e1000178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Powell RL, Zhao J, Konings FA, Tang S, Ewane L, Burda S, et al. Circulating recombinant form (CRF) 37_cpx: an old strain in Cameroon composed of diverse, genetically distant lineages of subtypes A and G. AIDS Res Hum Retroviruses. 2007;23(7):923–33. [DOI] [PubMed] [Google Scholar]
- 30. Powell RL, Zhao J, Konings FA, Tang S, Nanfack A, Burda S, et al. Identification of a novel circulating recombinant form (CRF) 36_cpx in Cameroon that combines two CRFs (01_AE and 02_AG) with ancestral lineages of subtypes A and G. AIDS Res Hum Retroviruses. 2007;23(8):1008–19. [DOI] [PubMed] [Google Scholar]
- 31. Banin AN, Tuen M, Bimela JS, Tongo M, Zappile P, Khodadadi‐Jamayran A, et al. Development of a versatile, near full genome amplification and sequencing approach for a broad variety of HIV‐1 group M variants. Viruses. 2019;11(4):317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Powell R, Barengolts D, Mayr L, Nyambi P. The evolution of HIV‐1 diversity in rural cameroon and its implications in vaccine design and trials. Viruses. 2010;2(2):639–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Zhong P, Burda S, Urbanski M, Kenfack H, Tongo M, Heyndrickx L, et al. HIV type 1 group M clades infecting subjects from rural villages in equatorial rain forests of Cameroon. J Acquir Immune Defic Syndr. 2002;31(5):495–505. [DOI] [PubMed] [Google Scholar]
- 34. HIV Sequence Database , Los Alamos National Laboratory, NM, USA [cited 2019 Apr 30]. Available from: https://www.hiv.lanl.gov/content/sequence/HIV/mainpage.html
- 35. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post‐analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Miller M, Pfeiffer W, Schwartz T. Creating the CIPRES Science Gateway for inference of large phylogenetic trees. 2010 Gateway Computing Environments Workshop (GCE); New Orleans, LA: IEEE; 2010. p. 1–8.
- 38. Stamatakis A, Alachiotis N. Time and memory efficient likelihood‐based tree searches on phylogenomic alignments with missing data. Bioinformatics. 2010;26(12):i132–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Bennett DE, Camacho RJ, Otelea D, Kuritzkes DR, Fleury H, Kiuchi M, et al. Drug resistance mutations for surveillance of transmitted HIV‐1 drug‐resistance: 2009 update. PLoS One. 2009;4(3):e4724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. WHO . HIV drug resistance report 2017. Geneva: 2017. [cited 2019 Apr 30]. Available from: https://apps.who.int/iris/bitstream/handle/10665/255896/9789241512831-eng.pdf [Google Scholar]
- 41. Gunthard HF, Calvez V, Paredes R, Pillay D, Shafer RW, Wensing AM, et al. Human immunodeficiency virus drug resistance: 2018 recommendations of the international antiviral society‐USA panel. Clin Infect Dis. 2019;68(2):177–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Wensing AM, Calvez V, Gunthard HF, Johnson VA, Paredes R, Pillay D, et al. 2017 update of the drug resistance mutations in HIV‐1. Top Antivir Med. 2017;24(4):132–3. [PMC free article] [PubMed] [Google Scholar]
- 43. Soulie C, Fofana DB, Boukli N, Sayon S, Lambert‐Niclot S, Wirden M, et al. Performance of genotypic algorithms for predicting tropism of HIV‐1CRF02_AG subtype. J Clin Virol. 2016;76:51–4. [DOI] [PubMed] [Google Scholar]
- 44. Cashin K, Gray LR, Harvey KL, Perez‐Bercoff D, Lee GQ, Sterjovski J, et al. Reliable genotypic tropism tests for the major HIV‐1 subtypes. Sci Rep. 2015;5:8543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Rerks‐Ngarm S, Pitisuttithum P, Nitayaphan S, Kaewkungwal J, Chiu J, Paris R, et al. Vaccination with ALVAC and AIDSVAX to prevent HIV‐1 infection in Thailand. N Engl J Med. 2009;361(23):2209–20. [DOI] [PubMed] [Google Scholar]
- 46. Rolland M, Edlefsen PT, Larsen BB, Tovanabutra S, Sanders‐Buell E, Hertz T, et al. Increased HIV‐1 vaccine efficacy against viruses with genetic signatures in Env V2. Nature. 2012;490(7420):417–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. deCamp A, Hraber P, Bailer RT, Seaman MS, Ochsenbauer C, Kappes J, et al. Global panel of HIV‐1 Env reference strains for standardized assessments of vaccine‐elicited neutralizing antibodies. J Virol. 2014;88(5):2489–507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Falkowska E, Le KM, Ramos A, Doores KJ, Lee JH, Blattner C, et al. Broadly neutralizing HIV antibodies define a glycan‐dependent epitope on the prefusion conformation of gp41 on cleaved envelope trimers. Immunity. 2014;40(5):657–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Scheid JF, Mouquet H, Ueberheide B, Diskin R, Klein F, Oliveira TY, et al. Sequence and structural convergence of broad and potent HIV antibodies that mimic CD4 binding. Science. 2011;333(6049):1633–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Ndembi N, Abraha A, Pilch H, Ichimura H, Mbanya D, Kaptue L, et al. Molecular characterization of human immunodeficiency virus type 1 (HIV‐1) and HIV‐2 in Yaounde, Cameroon: evidence of major drug resistance mutations in newly diagnosed patients infected with subtypes other than subtype B. J Clin Microbiol. 2008;46(1):177–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Zhao J, Tang S, Ragupathy V, Gaddam D, Wang X, Zhang P, et al. CRF22_01A1 is involved in the emergence of new HIV‐1 recombinants in Cameroon. J Acquir Immune Defic Syndr. 2012;60(4):344–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Aghokeng AF, Vergne L, Mpoudi‐Ngole E, Mbangue M, Deoudje N, Mokondji E, et al. Evaluation of transmitted HIV drug resistance among recently‐infected antenatal clinic attendees in four Central African countries. Antivir Ther. 2009;14(3):401–11. [DOI] [PubMed] [Google Scholar]
- 53. Burda ST, Viswanath R, Zhao J, Kinge T, Anyangwe C, Tinyami ET, et al. HIV‐1 reverse transcriptase drug‐resistance mutations in chronically infected individuals receiving or naive to HAART in Cameroon. J Med Virol. 2010;82(2):187–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Ceccarelli L, Salpini R, Moudourou S, Cento V, Santoro MM, Fokam J, et al. Characterization of drug resistance mutations in naive and ART‐treated patients infected with HIV‐1 in Yaounde. Cameroon. J Med Virol. 2012;84(5):721–7. [DOI] [PubMed] [Google Scholar]
- 55. Koizumi Y, Ndembi N, Miyashita M, Lwembe R, Kageyama S, Mbanya D, et al. Emergence of antiretroviral therapy resistance‐associated primary mutations among drug‐naive HIV‐1‐infected individuals in rural western Cameroon. J Acquir Immune Defic Syndr. 2006;43(1):15–22. [DOI] [PubMed] [Google Scholar]
- 56. Ndongmo CB, Pieniazek D, Holberg‐Petersen M, Holm‐Hansen C, Zekeng L, Jeansson SL, et al. HIV genetic diversity in Cameroon: possible public health importance. AIDS Res Hum Retroviruses. 2006;22(8):812–6. [DOI] [PubMed] [Google Scholar]
- 57. Ye C, Hill CM, Wu S, Ruan J, Ma ZS. DBG2OLC: efficient assembly of large genomes using long erroneous reads of the third generation sequencing technologies. Scie Rep. 2016;6:31900. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Ekblom R, Wolf JB. A field guide to whole‐genome sequencing, assembly and annotation. Evol Appl. 2014;7(9):1026–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Redd AD, Quinn TC, Tobian AA. Frequency and implications of HIV superinfection. Lancet Infect Dis. 2013;13(7):622–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Aralaguppe SG, Siddik AB, Manickam A, Ambikan AT, Kumar MM, Fernandes SJ, et al. Multiplexed next‐generation sequencing and de novo assembly to obtain near full‐length HIV‐1 genome from plasma virus. J Virol Methods. 2016;236:98–104. [DOI] [PubMed] [Google Scholar]
- 61. Jordan MR, Kearney M, Palmer S, Shao W, Maldarelli F, Coakley EP, et al. Comparison of standard PCR/cloning to single genome sequencing for analysis of HIV‐1 populations. J Virol Methods. 2010;168(1–2):114–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Tongo M, de Oliveira T, Martin DP. Patterns of genomic site inheritance in HIV‐1M inter‐subtype recombinants delineate the most likely genomic sites of subtype‐specific adaptation. Virus Evol. 2018;4(1):vey015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Megens S, Laethem KV. HIV‐1 genetic variation and drug resistance development. Expert Rev Anti Infect Ther. 2013;11(11):1159–78. [DOI] [PubMed] [Google Scholar]
- 64. Palumbo PJ, Wilson EA, Piwowar‐Manning E, McCauley M, Gamble T, Kumwenda N, et al. Association of HIV diversity and virologic outcomes in early antiretroviral treatment: HPTN 052. PLoS One. 2017;12(5):e0177281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Apetrei C, Descamps D, Collin G, Loussert‐Ajaka I, Damond F, Duca M, et al. Human immunodeficiency virus type 1 subtype F reverse transcriptase sequence and drug susceptibility. J Virol. 1998;72(5):3534–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Poveda E, de Mendoza C, Parkin N, Choe S, Garcia‐Gasco P, Corral A, et al. Evidence for different susceptibility to tipranavir and darunavir in patients infected with distinct HIV‐1 subtypes. AIDS. 2008;22(5):611–6. [DOI] [PubMed] [Google Scholar]
- 67. Joos B, Trkola A, Fischer M, Kuster H, Rusert P, Leemann C, et al. Low human immunodeficiency virus envelope diversity correlates with low in vitro replication capacity and predicts spontaneous control of plasma viremia after treatment interruptions. J Virol. 2005;79(14):9026–37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Mani I, Gilbert P, Sankale JL, Eisen G, Mboup S, Kanki PJ. Intrapatient diversity and its correlation with viral setpoint in human immunodeficiency virus type 1 CRF02_A/G‐IbNG infection. J Virol. 2002;76(21):10745–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Sagar M, Lavreys L, Baeten JM, Richardson BA, Mandaliya K, Chohan BH, et al. Infection with multiple human immunodeficiency virus type 1 variants is associated with faster disease progression. J Virol. 2003;77(23):12921–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Butler IF, Pandrea I, Marx PA, Apetrei C. HIV genetic diversity: biological and public health consequences. Curr HIV Res. 2007;5(1):23–45. [DOI] [PubMed] [Google Scholar]
- 71. Hemelaar J. Implications of HIV diversity for the HIV‐1 pandemic. J Infect. 2013;66(5):391–400. [DOI] [PubMed] [Google Scholar]
- 72. Brown BL, Watson M, Minot SS, Rivera MC, Franklin RB. MinION nanopore sequencing of environmental metagenomes: a synthetic approach. Gigascience. 2017;6(3):1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Jain M, Koren S, Miga KH, Quick J, Rand AC, Sasani TA, et al. Nanopore sequencing and assembly of a human genome with ultra‐long reads. Nat Biotechnol. 2018;36(4):338–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.