Skip to main content
AIDS Research and Human Retroviruses logoLink to AIDS Research and Human Retroviruses
. 2020 Jul 2;36(7):574–582. doi: 10.1089/aid.2020.0031

New Genomes from the Congo Basin Expand History of CRF01_AE Origin and Dissemination

Dennis Maletich Junqueira 1,2,3, Eduan Wilkinson 2,3, Ana Vallari 4, Xianding Deng 5,6, Asmeeta Achari 5,6, Guixia Yu 5,6, Carole McArthur 7, Lazare Kaptue 8, Dora Mbanya 9,10, Charles Chiu 5,6,11, Gavin A Cloherty 4, Tulio de Oliveira 2,3,12, Mary A Rodgers 4,
PMCID: PMC7398440  PMID: 32281388

Abstract

Although the first HIV circulating recombinant form (CRF01_AE) is the predominant strain in many Asian countries, it is uncommonly found in the Congo Basin from where it first originated. To fill the gap in the evolutionary history of this important strain, we sequenced near complete genomes from HIV samples with subgenomic CRF01_AE regions collected in Cameroon and the Democratic Republic of the Congo from 2001 to 2006. HIV genomes were generated from N = 13 plasma specimens by next-generation sequencing of metagenomic libraries prepared with spiked primers targeting HIV, followed by Sanger gap-filling. Genome sequences were aligned to reference strains, including Asian and African CRF01_AE sequences, and evaluated by phylogenetic and recombinant analysis to identify four CRF01_AE strains from Cameroon. We also identified two CRF02, one CRF27, and six unique recombinant form genomes (01|A1|G, 01|02|F|U, F|G|01, A1|D|01, F|G|01, and A1|G|01). Phylogenetic analysis, including the four new African CRF01_AE genomes, placed these samples as a bridge between basal Central African Republic CRF01_AE strains and all Asian, European, and American CRF01_AE strains. Molecular dating confirmed previous estimates indicating that the most recent common CRF01_AE ancestor emerged in the early 1970s (1968–1970) and spread beyond Africa around 1980 to Asia. The new sequences and analysis presented in this study expand the molecular history of the CRF01_AE clade, and are illustrated in an interactive Next Strain phylogenetic tree, map, and timeline at (https://nextstrain.org/community/EduanWilkinson/hiv-1_crf01).

Keywords: HIV, CRF, surveillance, Congo, phylogenetics, molecular clock analysis

Introduction

CRF01_AE was the first circulating recombinant form (CRF) to be identified in the HIV type 1 (HIV-1) epidemic.1,2 The genome structure of this virus appeared to be a mosaic strain containing a mixture of subtype A and portions of sequences that did not cluster with other known subtypes of HIV-1 at the time. The non-A regions (portions of vif, vpr, env, nef, and 3′ long terminal repeat) were classified as subtype E, although a complete subtype E genome has not been identified to date. The earliest isolates of CRF01_AE were collected in 1990 in the Central African Republic (CAR).2–4 Molecular characterization of the envelope region from strains circulating in Africa in the 1990s indicated that genotype E accounted for 31% of 29 randomly selected samples collected in CAR,4 whereas subtype E was absent from studies conducted in Uganda, Zaire (now Democratic Republic of the Congo [DRC]), Malawi, and Tanzania.5–8

In contrast, HIV isolates collected from Thailand in the 1990s were dominated by subtype E envelope sequences.9 Once this “subtype” was officially recognized as CRF01_AE in the late 1990s, many molecular epidemiological surveys confirmed that CRF01_AE has continued to predominate the epidemics in Southeast Asia over the past three decades (reaching up to almost 82% among HIV-1 infections in the region).10 The spread of CRF01_AE has been tracked from Thailand to Vietnam, China, and ultimately a global dissemination.11–14 However, CRF01_AE has not extensively spread within Central Africa, with CRF01_AE infection having a prevalence of <1% in nearby Cameroon,15 <4% in DRC,16,17 <3% in Chad,18,19 <1% in the Republic of the Congo,20–22 and 6% in CAR23 in recent studies. However, the extent to which CRF01_AE may still be circulating within recombinant strains cannot be evaluated with subgenomic sequences reported in most of these studies.

Both subgenomic sequences and complete genomes have been used to estimate when CRF01_AE first emerged in Africa and became a key Asian strain. Bayesian analysis with subgenomic sequences estimated the origins of CRF01_AE in Central Africa to the 1960s by several models.14 More recently, analyses with four complete genome sequences have estimated that CRF01_AE first emerged in CAR in the 1970s and was introduced in Asia in the early 1980s (Ref.12,13). The evolutionary path of CRF01_AE between CAR and Asia has not been well characterized due to the limited number of only four CRF01_AE complete genome sequences ever produced from Africa, which could be a reflection of its low prevalence.

To address this gap, near-complete genome sequences were generated from N = 13 specimens collected in Cameroon and DRC carrying HIV strains identified as CRF01_AE by next-generation sequencing (NGS) and subgenomic Sanger sequencing. Phylogenetic analysis with references from CAR and Asia indicates that four new Cameroonian CRF01_AE strains sequenced herein provide a new molecular link between the two previously well-characterized major clades.

Materials and Methods

Study subjects

Thirteen plasma specimens were collected between 2001 and 2006 from HIV-1-positive individuals participating in viral diversity studies in Cameroon and the DRC (Table 1). The specimens came from blood donors or participants seeking voluntary testing as previously described.16,24 The DRC study was approved by the University of Missouri-Kansas City Research Board. The Cameroon study was approved by the Cameroon National Ethical Review Board, the Faculty of Medicine and Biomedical Science Institutional Review Boards (IRB), and the Ministry of Health of Cameroon.

Table 1.

Sample Origin and Recombination Analysis Summary of Thirteen HIV-1 Sequences Isolated in Africa

Sample ID Year Country Viral load (log copies/mL) REGA
jpHMM Simplot BLAST
RAxML Final
Classification Bootstrap strength Classification Identity
260-27 2005 CM 5.32 NA NA A1/G/01/U 01/G/A NA 90% CRF01_AE URF_A1/G/01
890-05 2006 CM 5.05 NA NA A1/G A1/G CRF02_AG 93% CRF02_AG CRF02_AG
A1188 2002 CM 4.25 NA NA CRF01/A1/G CRF01/A1/G CRF01_AE 92% CRF01_AE URF_01/A1/G
A1152 2002 CM 4.22 CRF02_AG 100 A1/G A1/G CRF02_AG 93% CRF02_AG CRF02_AG
A1549 2002 CM 4.11 NA NA A1/G/F2/01 CRF01/A1/F/G/U URF_0222 89% CRF01_AE URF_01/02/F/U
CG-0422-02V 2002 DRC NA NA NA A1/D/01 A1/D/01 NA 90% D URF_A1/D/01
CG-0427a-02V 2002 DRC NA NA NA F1/G/01/U F/G/01 F1 90% F1 URF_F/G/01
CG-0070-01 2001 DRC NA NA NA F1/G/01/U F/G/01 F1 89% F1 URF_F/G/01
CG-0077-01 2003 DRC NA CRF27_cpx 100 G/H/C/J/01/A1/B G/H/J/C/01/A1 CRF27_cpx 90% CRF27_cpx CRF27_cpx
U7957 2003 CM NA CRF01_AE 100 CRF01_AE CRF01_AEa CRF01_AE 92% CRF01_AE CRF01_AE
U8216 2003 CM NA CRF01_AE 100 CRF01_AE CRF01_AEa CRF01_AE 93% CRF01_AE CRF01_AE
234-40 2005 CM 4.14 CRF01_AE 100 CRF01_AE CRF01_AEa CRF01_AE 92% CRF01_AE CRF01_AE
1002-28 2006 CM 4.81 CRF01_AE 100 CRF01_AE CRF01_AEa CRF01_AE 92% CRF01_AE CRF01_AE
a

Reference sequences for the Simplot analyses are described in Supplementary Table S2.

01, CRF01_AE; BLAST, Basic Local Alignment Search Tool; CM, Cameroon; CRF, circulating recombinant form; DRC, Democratic Republic of the Congo; jpHMM, jumping profile Hidden Markov Model; NA, not assigned; RAxML, Randomized Axelerated Maximum Likelihood; REGA, software from Rega Institute in Central Belgium; URF, unique recombinant form.

Amplification and sequencing

All near full-length nucleotide sequences of HIV-1 (HXB2: 94-9720) were amplified and sequenced from plasma viral RNA by a metagenomic sequencing with spiked primer enrichment (MSSPE) NGS approach.25 Gaps were filled in by supplemental Sanger sequencing.16 Briefly, nucleic acid was extracted from plasma according to the manufacturers' instructions using (i) the EZ1 Advanced XL system and EZ1 Virus Mini Kit (Qiagen, Hilden, Germany) for NGS or (ii) the RNA extraction protocol on an m2000sp system (Abbott Molecular, Des Plaines, IL) for Sanger sequencing. For NGS, random hexamer (RH) primers (Life Technologies) were spiked with HIV-specific primers.25 These primers were designed from 3,571 reference genomes, which cover genotypes M, N, O, and P of HIV-1 (Supplementary Table S1). A total of 798 primers of 13 nucleotides (464 forward and 334 reverse) were ordered and synthesized by Integrated DNA Technologies, Inc. (IDT, Coralville, IA). The HIV RNA extract was mixed with spiked primer (4 μM) plus RH (0.4 μM) in a 10:1 ratio and heated to 65°C for 5 min. The reverse transcription master mix (10 mL SuperScript III buffer, 5 mL dNTP of 12.5 mM, 2.5 mL DTT of 0.1 M, and 1 mL SuperScript III enzyme) was added to each sample and incubated at 25°C for 5 min, followed by 42°C for 30 min and 94°C for 2 min. Generated complementary DNA went through metagenomic library preparation using Illumina Nextera XT protocol to be analyzed on a HiSeq instrument (Illumina). Genome consensus sequences were generated using CLC Bio and Sanger sequences covering gaps were merged into the consensus using Sequencher v5.4.1 (Genecodes) as previously described.16 All open reading frames were annotated with SeqBuilder (DNAStar Laservene, v15) software. The Genbank accession numbers for the 13 HIV-1 genomes are MN116200-MN116212.

Recombination analysis and sequence data

Recombination structures of all thirteen genomes were determined by five different detection methods: (i) REGA subtyping tool v3.0 (Ref.26); (ii) jumping profile Hidden Markov Model (jpHMM)-HIV27; (iii) Simplot v3.5.1 (Ref.28) under the Kimura 2-parameter model and a sliding window of 200 nucleotides increased by increments of 20 nucleotides; (iv) nucleotide Basic Local Alignment Search Tool (BLAST) to search for the most similar sequences; and (v) through phylogenetic reconstruction against a comprehensive dataset of 500 HIV-1 whole-genome sequences16 or through. The whole-genome phylogenetic reconstruction was performed in Randomized Axelerated Maximum Likelihood (RAxML)29 incorporating the best-fitting model of nucleotide substitution as determined in important quartets (IQ)-Tree30 and 1,000 bootstrap replicates31 that were used to infer transfer support for branches in the phylogeny.32 The final genome classification was defined based on the results of all five methods.

Phylogenetic and phylogeographic reconstruction

Isolates identified as CRF01_AE in this study were analyzed against all whole-genome CRF01_AE sequences in the Los Alamos National Laboratory (LANL) HIV-1 sequence database (n = 382; accessed October 2018). First, we re-subtyped these reference sequences with REGA v3.0 (Ref.26) and jpHMM27 to insure the correct subtype assignment. Sequences presenting classification discrepancies from the Genbank information were excluded from the dataset (n = 11).

The final dataset was submitted to a Nextstrain build.33 Nextstrain consists of a suite of tools that take raw sequences (in fasta format) and associated metadata (e.g., time, country, publications, authors, and/or journals) as input. In short, Nextstrain performs a sequence alignment of the input data in multiple sequence alignment based on Fast Fourier transform (MAFFT),34 which in turn is used to infer a maximum likelihood. For the purpose of this study, the maximum likelihood (ML) tree topology was inferred in RAxML using the general time-reversible (GTR)+G+I model of nucleotide substitution.35 The resulting tree topology is then transformed into a dated phylogeny where the branches correspond to units of real calendar time using a least squared dating approach.36 In turn, the dated phylogeny is used to perform ancestral state reconstruction to infer the probable location of internal nodes in the phylogeny using a marginal likelihood approach. Finally, the time-scaled tree and the spatial reconstruction are then visualized together in a web browser with auspice.33

Phylodynamic reconstruction

Demographic history of the CRF01_AE epidemic was examined under a Bayesian Markov Chain Monte Carlo approach as implemented in BEAST v.1.8.4 (Ref.37). First, a phylogenetic tree of the CRF01_AE whole-genome sequences (n = 4) was aligned with 371 whole-genome CRF01_AE reference sequences from LANL in MAFFT v7.0 (Ref.34). The subsequent alignment was used to construct an ML tree topology in RAxML v8.0 under the GTR+G+I model of nucleotide substitution and 1,000 bootstrap replicates. The resulting phylogeny was manually inspected in FigTree38 and highly monophyletic clades—defined as containing three or more sequences from the same country and with good branch support (>0.95)—were pruned with the ape package in R39 to only one sequence, preferably keeping the oldest isolate in the clade. The pruned phylogeny (N = 179) was tested for good clock-like signal in Tempest v1.5.1 using the sampling dates and root-to-tip divergence.40 An additional 61 sequences were removed as outliers resulting in a final R2 of 0.903 and a slope rate of ∼3.22 × 10−3 mutations per site per year, and with an x-intercept around 1969.7 (Supplementary Fig. S1).

This dataset was submitted to a Bayesian analysis incorporating the best-fitted nucleotide substitution model (GTR +4Γ+I+F) as determined in IQ-Tree41 with an uncorrelated lognormal relaxed molecular clock42 and various coalescent tree priors (e.g., skyride, skygrid, and skyline, constant and exponential growth). Bayes factor comparison between various coalescent tree priors consistently favored the use of the skyride coalescent tree prior.43 The molecular clock was calibrated using the sampling dates of the sequences under a uniform-distributed prior for the evolutionary rate according to previous simulations. After 5 × 108 steps, convergence was checked using Tracer v1.6 discarding the first 50% of values for burn-in.44 Following the discarding of the burn-in, the remaining posterior parameters and trees were used to reconstruct the demographic growth of the global HIV-1 CRF01_AE linage through time.

Results

Sequence generation

A set of N = 13 specimens with subgenomic sequences classified as CRF01 was identified from previous viral diversity studies conducted in Cameroon and DRC for complete genome sequencing. A range of 4.11–5.32 log copies/ml HIV RNA was detected among samples with previous viral load results available (N = 7). NGS of all 13 samples using the MSSPE method yielded complete or near-complete genomes of at least 8,800 nucleotides in length. Full coverage was achieved for N = 8 samples and >90% coverage for N = 5 (A1549, 1002-28, 234-40, A1152, and A1188), which were filled in further by Sanger sequencing. Average read depths for each genome ranged from 14 to 18,899 × and the number of reads mapping to the consensus ranged from 1,060 to 1,744,672.

Recombination analysis

Genome-wide recombination analyses of whole-genome sequences were performed to determine the viral subtype and to detect evidence of intersubtype recombination. Despite minor differences, all five methods we employed to characterize the genome profiles of the whole-genome sequences broadly led to similar results (Table 1). Based on consensus agreement between the five different methods, we identified four CRF01_AE isolates (U7957, U8216, 234-40, and 1002-28) and six unique recombinant forms (260-27, A1188, A1549, CG-0422-02V, CG-0427a-02V, and CG-0070-01) that included portions of genomic regions assigned to CRF01_AE (Fig. 1). Of the remaining three whole-genome sequences, two were classified as CRF02_AG (890-05 and A1152) and one as belonging to CRF27_cpx (CG-0077-01).

FIG. 1.

FIG. 1.

Schematic representation of jpHMM recombination results for thirteen HIV samples collected between 2001 and 2006 from HIV-1-positive individuals in Cameroon and the DRC samples isolated in the Congo Basin, Africa. Viral subtypes were denoted by different colors according to the legend. The gray N/A portions correspond to unclassifiable regions. CRF, circulating recombinant form; DRC, Democratic Republic of the Congo; jpHMM, jumping profile Hidden Markov Model; LTR, long terminal repeat; N/A, not assigned.

Phylogenetic and phylogeographic reconstruction

Only CRF01_AE sequences were submitted to further phylogenetic analysis. The four CRF01_AE strains isolated in this study were aligned to 371 CRF01_AE sequences from LANL, after a filtering step to achieve a more equitable temporal and geographic distribution as implemented in Nextstrain (final alignment; n = 358). Our phylogenetic and evolutionary analysis of near full-length CRF01_AE sequences in Nextstrain indicates that HIV-1 CRF01_AE arose in Africa in the early 1970s (1968–1970) (Figs. 24). As expected, our isolates grouped at the base of the tree with four other samples from Central Africa (three from the CAR and one from Cameroon) (Fig. 2). This pattern suggests that Cameroon has a secondary role in the African CRF01_AE epidemic and was seeded by Central African viruses around 1974 (1973–1975). The grouping of the four sequences in the base of the tree suggests that the current epidemic in Africa was directly derived from the primary CRF01_AE strains. The estimated evolutionary rate for this dataset was 1.5 × 10−3 nucleotide substitutions/site/year.

FIG. 2.

FIG. 2.

Nextstrain maximum-likelihood phylogenetic analysis of CRF01_AE whole-genome sequences. Tree branches are colored according to sampling origin. The sequences described in this study are located at the base of the tree identified by a red contour.

FIG. 4.

FIG. 4.

Nextstrain geographical view of CRF01_AE temporal dynamics. The geographical locations of isolates at each snapshot in time on the Bayesian skygrid are depicted with links between related strains indicated by a line. Lines between countries represent putative HIV transitions between regions and circle diameters are proportional to the square root of the number of branches that maintain the same location state at each time-point. Estimates of past population dynamics for CRF01_AE are plotted on the maps from A to F.

After almost 10 years of localized transmissions, the first divergence from the African strains was estimated to have occurred in the early 1980s (1980–1981) by the introduction of the virus in Thailand. It seems that only one or a few strains were imported from Africa and reached Asia. Thai sequences were found to be dispersed throughout the tree, structuring the basal positions to several clusters (Fig. 2). This pattern suggests that this lineage is likely to be the ancestor of all CRF01_AE strains in Asia (Fig. 4). Specifically, Thai sequences seeded the epidemics in China, Myanmar, the Philippines, Indonesia, the United States, The United Kingdom, and Sweden (Fig. 4). The epidemic in China that started in 1984 was driven by multiple independent lineages, most of them imported from Thailand. Thai sequences fall within the base of four major Chinese clusters and point to independent introductions of this strain into China. The Chinese epidemic was also seeded by strains from Vietnam in the mid-1990s (Fig. 2). In addition, Chinese sequences were also found dispersed throughout the tree lying next to Thai sequences indicating extensive interaction between the two epidemics. The evolutionary history of the CRF01_AE strains as they exited Africa and were introduced to Asia, Europe, and the Americas is illustrated in an interactive phylogenetic tree, map, and timeline at https://nextstrain.org/community/EduanWilkinson/hiv-1_crf01.

Phylodynamics of CRF01_AE

We further investigated the past population dynamics of CRF01_AE using 179 sequences under a Bayesian skyride model (Fig. 3). Consistent with the ML results, we estimated the time of the most recent common ancestor of CRF01_AE to be around 1971 (95% highest posterior density (HPD): 1970–1973) (Supplementary Fig. S2). The estimate of the effective number of HIV-1 infections through time shows an exponential increase in the viral population between 1971 and 1980, period in which the virus was still circulating in Africa (Fig. 4). The introduction of CRF01_AE in Asian countries is followed by a short decrease in the CRF01_AE population probably due to the unsustained number of dead-end infections (Figs. 3 and 4). After 1985, the number of new CRF01_AE infections exponentially increases, reaching a plateau after the 2000s. More recently, we observed a decrease in these estimates; however, the 95% Bayesian confidence interval prevents us to confirming this hypothesis.

FIG. 3.

FIG. 3.

Temporal dynamics of CRF01_AE spread from Africa. Bayesian skygrid estimates of past population dynamics for CRF01_AE were plotted. The left y-axis represents the effective number of infections (Ne) and the x-axis the time.

Discussion

The four African CRF01_AE genomes presented in this study provide further resolution for the epidemiology of the CRF01_AE clade and provide a new genetic link between the African and Asian CRF01_AE clades of the HIV-1 pandemic. First, phylogenetic analysis with the new Cameroonian CRF01_AE strains confirmed the African origin of CRF01_AE.12–14 Likewise, molecular dating with these strains was consistent with previous estimates of emergence of CRF01_AE in Africa in the early 1970s and in Asia in the early 1980s (Refs.12,13). CRF01_AE appears to have originated in CAR where it spread to Cameroon and other countries in central Africa, although limited sampling from African countries prevents further resolution of the spread of CRF01_AE in the region. After ∼10 years of localized transmission in central Africa, CRF01_AE spread to Thailand and onward to other Asian countries (Fig. 4). These results suggest a scenario of multiple introductions into epidemiologically linked, high-risk groups, primarily heterosexual, people who inject drugs and man who have sex with man related clusters.

The four CRF01_AE genomes from Cameroon were identified among a larger set of N = 13 genomes that were previously classified as CRF01_AE based on sequencing of smaller subgenomic regions. However, the detection of recombination in the majority of these strains (N = 9) highlights the limitations of subgenomic sequences for classifying strains. Surprisingly, three recombinant genomes were identified among these that were classified as CRF02_AG or CRF27_cpx (Table 1). This finding is likely explained by similarities in the regions found in both CRF01_AE and CRF01_AG, as well as the presence of CRF01_AE in the complex recombinant pattern of CRF27_cpx.4,45,46 These unexpected classifications confirm that full-length sequences provide the most accurate classification. Furthermore, the identification of mostly recombinant genomes in this dataset indicates that reliance upon subgenomic sequences alone can lead to underestimation of the amount of HIV recombination in a population. This is especially relevant for estimating the true prevalence of CRF01_AE, which could be skewed by subgenomic sequences from unrecognized recombinant genomes. Although CRF01_AE is rare in the Congo Basin, it is likely that its prevalence is actually much lower than has been estimated due to subgenomic sequence limitations. Interestingly, CRF01_AE has recently been found at dramatically higher rates in Kenya in 2008 and 2015 (14%–25%),47,48 although with subgenomic sequence classifications. Whether these strains represent a resurgence of CRF01_AE or a CRF01_AE-containing recombinant in East Africa remains to be seen.

The phylodynamic reconstruction with a skyride tree prior and the addition of more African sequences led to a more refined demographic reconstruction of this HIV-1 clade. The reconstruction suggests a small decrease in the effective population size in the early 1980s with another dip in the mid 1990s. The first followed the transmission from Africa to Asia and may relate to this migration event, while the second dip in the effective population size may very well be due to behavioral changes in Thailand at the outbreak of the epidemic in that country.49

We anticipate that further sampling and sequencing of global CRF01_AE strains will provide additional resolution for the molecular and geographical epidemiology presented herein. To accommodate these future updates, the interactive CRF01_AE Next Strain phylogenetic tree, map, and timeline provide an ongoing resource for tracking this HIV clade. As more strains are sequenced, they can easily be merged with the CRF01_AE Next Strain dataset to enable comprehensive monitoring of this clade in real time.

Sequence Data

The Genbank accession numbers for the 13 HIV-1 genomes are MN116200–MN116212.

Supplementary Material

Supplemental data
Supp_Table2.pdf (17.2KB, pdf)
Supplemental data
Supp_Table1.docx (47.4KB, docx)
Supplemental data
Supp_Fig1.pdf (109.2KB, pdf)
Supplemental data
Supp_Fig2.pdf (193.5KB, pdf)

Author Disclosure Statement

No competing financial interests exist.

Funding Information

This study was funded by Abbott Laboratories, IH grant R33-AI129455 (CYC) from the National Institute of Allergy and Infectious Diseases, NIH grant R01-HL105704 (CYC) from the National Heart, Lung, and Blood Institute, and the Charles and Helen Schwab Foundation (CYC).

Supplementary Material

Supplementary Figure S1

Supplementary Figure S2

Supplementary Table S1

Supplementary Table S2

References

  • 1. Carr JK, Salminen MO, Koch C, et al. : Full-length sequence and mosaic structure of a human immunodeficiency virus type 1 isolate from Thailand. J Virol 1996;70:5935–5943 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Gao F, Robertson DL, Morrison SG, et al. : The heterosexual human immunodeficiency virus type 1 epidemic in Thailand is caused by an intersubtype (A/E) recombinant of African origin. J Virol 1996;70:7013–7029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Anderson JP, Rodrigo AG, Learn GH, et al. : Testing the hypothesis of a recombinant origin of human immunodeficiency virus type 1 subtype E. J Virol 2000;74:10752–10765 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Murphy E, Korber B, Georges-Courbot MC, et al. : Diversity of V3 region sequences of human immunodeficiency viruses type 1 from the central African Republic. AIDS Res Hum Retroviruses 1993;9:997–1006 [DOI] [PubMed] [Google Scholar]
  • 5. Albert J, Franzen L, Jansson M, et al. : Ugandan HIV-1 V3 loop sequences closely related to the U.S./European consensus. Virology 1992;190:674–681 [DOI] [PubMed] [Google Scholar]
  • 6. Oram JD, Downing RG, Roff M, et al. : Sequence analysis of the V3 loop regions of the env genes of Ugandan human immunodeficiency proviruses. AIDS Res Hum Retroviruses 1991;7:605–614 [DOI] [PubMed] [Google Scholar]
  • 7. Orloff GM, Kalish ML, Chiphangwi J, et al. : V3 loops of HIV-1 specimens from pregnant women in Malawi uniformly lack a potential N-linked glycosylation site. AIDS Res Hum Retroviruses 1993;9:705–706 [DOI] [PubMed] [Google Scholar]
  • 8. Zwart G, Wolfs TF, Bookelman R, et al. : Greater diversity of the HIV-1 V3 neutralization domain in Tanzania compared with The Netherlands: Serological and genetic analysis. AIDS 1993;7:467–474 [DOI] [PubMed] [Google Scholar]
  • 9. Ichimura H, Kliks SC, Visrutaratna S, Ou CY, Kalish ML, Levy JA: Biological, serological, and genetic characterization of HIV-1 subtype E isolates from northern Thailand. AIDS Res Hum Retroviruses 1994;10:263–269 [DOI] [PubMed] [Google Scholar]
  • 10. Hemelaar J, Elangovan R, Yun J, et al. : Global and regional molecular epidemiology of HIV-1, 1990–2015: A systematic review, global survey, and trend analysis. Lancet Infect Dis 2019;19:143–155 [DOI] [PubMed] [Google Scholar]
  • 11. Angelis K, Albert J, Mamais I, et al. : Global dispersal pattern of HIV type 1 subtype CRF01_AE: A genetic trace of human mobility related to heterosexual sexual activities centralized in Southeast Asia. J Infect Dis 2015;211:1735–1744 [DOI] [PubMed] [Google Scholar]
  • 12. Feng Y, He X, Hsi JH, et al. : The rapidly expanding CRF01_AE epidemic in China is driven by multiple lineages of HIV-1 viruses introduced in the 1990s. AIDS 2013;27:1793–1802 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Li X, Liu H, Liu L, et al. : Tracing the epidemic history of HIV-1 CRF01_AE clusters using near-complete genome sequences. Sci Rep 2017;7:4024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Liao H, Tee KK, Hase S, et al. : Phylodynamic analysis of the dissemination of HIV-1 CRF01_AE in Vietnam. Virology 2009;391:51–56 [DOI] [PubMed] [Google Scholar]
  • 15. Rodgers MA, Vallari AS, Harris B, et al. : Identification of rare HIV-1 Group N, HBV AE, and HTLV-3 strains in rural South Cameroon. Virology 2017;504:141–151 [DOI] [PubMed] [Google Scholar]
  • 16. Rodgers MA, Wilkinson E, Vallari A, et al. : Sensitive next-generation sequencing method reveals deep genetic diversity of HIV-1 in the Democratic Republic of the Congo. J Virol 2017;91:e01841-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Vidal N, Mulanga C, Bazepeo SE, et al. : HIV type 1 pol gene diversity and antiretroviral drug resistance mutations in the Democratic Republic of Congo (DRC). AIDS Res Hum Retroviruses 2006;22:202–206 [DOI] [PubMed] [Google Scholar]
  • 18. Adawaye C, Fokam J, Kamangu E, et al. : Virological response, HIV-1 drug resistance mutations and genetic diversity among patients on first-line antiretroviral therapy in N'Djamena, Chad: Findings from a cross-sectional study. BMC Res Notes 2017;10:589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Vidal N, Koyalta D, Richard V, et al. : High genetic diversity of HIV-1 strains in Chad, West Central Africa. J Acquir Immune Defic Syndr 2003;33:239–246 [DOI] [PubMed] [Google Scholar]
  • 20. Niama FR, Toure-Kane C, Vidal N, et al. : HIV-1 subtypes and recombinants in the Republic of Congo. Infect Genet Evol 2006;6:337–343 [DOI] [PubMed] [Google Scholar]
  • 21. Niama FR, Vidal N, Diop-Ndiaye H, et al. : HIV-1 genetic diversity and primary drug resistance mutations before large-scale access to antiretroviral therapy, Republic of Congo. BMC Res Notes 2017;10:243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Pircher M, Diafouka M, Papuchon J, et al. : Molecular characterization of HIV type 1 in Brazzaville, Republic of Congo, and first data on resistance to antiretroviral drugs. AIDS Res Hum Retroviruses 2012;28:1798–1802 [DOI] [PubMed] [Google Scholar]
  • 23. Marechal V, Jauvin V, Selekon B, et al. : Increasing HIV type 1 polymorphic diversity but no resistance to antiretroviral drugs in untreated patients from Central African Republic: A 2005 study. AIDS Res Hum Retroviruses 2006;22:1036–1044 [DOI] [PubMed] [Google Scholar]
  • 24. Brennan CA, Bodelle P, Coffey R, et al. : The prevalence of diverse HIV-1 strains was stable in Cameroonian blood donors from 1996 to 2004. J Acquir Immune Defic Syndr 2008;49:432–439 [DOI] [PubMed] [Google Scholar]
  • 25. Deng X, Achari A, Federman S, et al. : Metagenomic sequencing with spiked primer enrichment (MSSPE) for viral diagnostics and genomic surveillance. Nat Microbiol 2019;5:443–454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Pineda-Pena AC, Faria NR, Imbrechts S, et al. : Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: Performance evaluation of the new REGA version 3 and seven other tools. Infect Genet Evol 2013;19:337–348 [DOI] [PubMed] [Google Scholar]
  • 27. Schultz AK, Zhang M, Bulla I, et al. : jpHMM: Improving the reliability of recombination prediction in HIV-1. Nucleic Acids Res 2009;37:W647–W651 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Salminen MO, Carr JK, Burke DS, McCutchan FE: Identification of breakpoints in intergenotypic recombinants of HIV type 1 by bootscanning. AIDS Res Hum Retroviruses 1995;11:1423–1425 [DOI] [PubMed] [Google Scholar]
  • 29. Stamatakis A, Ludwig T, Meier H: RAxML-III: A fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 2005;21:456–463 [DOI] [PubMed] [Google Scholar]
  • 30. Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS: ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat Methods 2017;14:587–589 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Felsenstein J. Confidence limits on phylogenies: An approach using the bootstrap. Evolution 1985;39:783–791 [DOI] [PubMed] [Google Scholar]
  • 32. Lemoine F, Domelevo Entfellner JB, Wilkinson E, et al. : Renewing Felsenstein's phylogenetic bootstrap in the era of big data. Nature 2018;556:452–456 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Hadfield J, Megill C, Bell SM, et al. : Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics 2018;34:4121–4123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Katoh K, Standley DM: MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol Biol Evol 2013;30:772–780 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Lanave C, Preparata G, Saccone C, Serio G: A new method for calculating evolutionary substitution rates. J Mol Evol 1984;20:86–93 [DOI] [PubMed] [Google Scholar]
  • 36. To TH, Jung M, Lycett S, Gascuel O: Fast dating using least-squares criteria and algorithms. Syst Biol 2016;65:82–97 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Suchard MA, Lemey P, Baele G, Ayres DL, Drummond AJ, Rambaut A: Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol 2018;4:vey016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Rambaut A. FigTree. Available from http://tree.bio.ed.ac.uk/software/figtree/ (2018)
  • 39. Paradis E, Claude J, Strimmer K: APE: Analyses of phylogenetics and evolution in R language. Bioinformatics 2004;20:289–290 [DOI] [PubMed] [Google Scholar]
  • 40. Rambaut A, Lam TT, Max Carvalho L, Pybus OG: Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol 2016;2:vew007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ: IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 2015;32:268–274 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Drummond AJ, Ho SY, Phillips MJ, Rambaut A: Relaxed phylogenetics and dating with confidence. PLoS Biol 2006;4:e88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Pybus OG, Rambaut A, Harvey PH: An integrated framework for the inference of viral population history from reconstructed genealogies. Genetics 2000;155:1429–1437 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA: Posterior summarisation in Bayesian phylogenetics using Tracer 1.7. Syst Biol 2018;67:901–904 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Vidal N, Frange P, Chaix ML, et al. : Characterization of an old complex circulating recombinant form, CRF27_cpx, originating from the Democratic Republic of Congo (DRC) and circulating in France. AIDS Res Hum Retroviruses 2008;24:315–321 [DOI] [PubMed] [Google Scholar]
  • 46. Howard TM, Rasheed S: Genomic structure and nucleotide sequence analysis of a new HIV type 1 subtype A strain from Nigeria. AIDS Res Hum Retroviruses 1996;12:1413–1425 [DOI] [PubMed] [Google Scholar]
  • 47. Khoja S, Ojwang P, Khan S, Okinda N, Harania R, Ali S: Genetic analysis of HIV-1 subtypes in Nairobi, Kenya. PLoS One 2008;3:e3191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Rhee SY, Jordan MR, Raizes E, et al. : HIV-1 drug resistance mutations: Potential applications for point-of-care genotypic resistance testing. PLoS One 2015;10:e0145772. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Siraprapasiri T, Ongwangdee S, Benjarattanaporn P, Peerapatanapokin W, Sharma M: The impact of Thailand's public health response to the HIV epidemic 1984–2015: Understanding the ingredients of success. J Virus Erad 2016;2(Suppl. 4):7–14 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental data
Supp_Table2.pdf (17.2KB, pdf)
Supplemental data
Supp_Table1.docx (47.4KB, docx)
Supplemental data
Supp_Fig1.pdf (109.2KB, pdf)
Supplemental data
Supp_Fig2.pdf (193.5KB, pdf)

Articles from AIDS Research and Human Retroviruses are provided here courtesy of Mary Ann Liebert, Inc.

RESOURCES