Abstract
Background
HIV-1 subtype B and subtype F are prevalent in the AIDS epidemic of Brazil. Recombinations between these subtypes have generated at least four BF circulating recombinant forms (CRFs). CRF28_BF and CRF29_BF are among the first two BF recombinants being identified in Brazil and they contributed significantly to the epidemic. However, the evolution and demographic histories of the CRFs are unclear.
Methodology/Principal Findings
A collection of gag and pol sequences sampled within Brazil was screened for CRF28_BF-like and CRF29_BF-like recombination patterns. A Bayesian coalescent framework was employed to delineate the phylogenetic, divergence time and population dynamics of the virus having CRF28_BF-like and CRF29_BF-like genotype. These recombinants were phylogenetically related to each other and formed a well-supported monophyletic clade dated to 1988–1989. The effective number of infections by these recombinants grew exponentially over a five-year period after their emergence, but then decreased toward the present following a logistic model of population growth. The demographic pattern of both recombinants closely resembles those previously reported for CRF31_BC.
Conclusions
We revealed that HIV-1 recombinants of the CRF28_BF/CRF29_BF clade are still circulating in the Brazilian population. These recombinants did not exhibit a strong founder effect and showed a decreasing prevalence in the AIDS epidemic of Brazil. Our data suggested that multiple URFs may also play a role in shaping the epidemic of recombinant BF HIV-1 in the region.
Introduction
HIV-1 is classified into Group M, O, N and P. Group M HIV-1 dominates the AIDS pandemic with at least nine subtypes and multiple intersubtype recombinants currently identified [1]. The intersubtype recombinants are the result of recombination among the subtypes of HIV-1. Furthermore, these intersubtype recombinants can also recombine with HIV-1 of the same or different subtypes or with other recombinants to generate more complex recombinants. They emerge in almost every region of the world where more than one HIV-1 subtype is present. Currently, 48 circulating recombinant forms (CRFs) and a large number of unique recombinant forms (URFs) have been identified. These CRFs and URFs accounted for almost 18% of new infections in 2004 [2] and they continue to play an increasingly important role in shaping the AIDS pandemic.
It is reported that there are more than 545,000 AIDS cases in Brazil [3]. The epidemic is particularly severe in the southeastern part of the country, where large urban centers such as Rio de Janeiro and Sao Paulo are situated. At the end of 2008, the southeastern states (Espirito Santo, Minas Gerais, Rio de Janeiro and Sao Paulo) reported at least 323,000 diagnosed AIDS cases (59.3% of all cases in the country) with approximately 19.2 AIDS cases per 100,000 people [3]. The epidemic in the southeastern region of the country is complicated by multiple subtypes and recombinants of HIV-1 circulating in the population. For instance in the city of Sao Paulo, the major subtypes are subtype B (79%–88%) and subtype F (4%–11%) [4], [5]. Such a dense transmission network, with a high HIV-1 incidence rate and multiple subtypes present in the region, provides a perfect breeding ground for new HIV-1 recombinants. Indeed, HIV-1 recombinants between subtype B and subtype F have been frequently identified since the introduction of subtype F in Brazil [6], [7], [8], [9], [10], [11], [12]. Studies have shown that up to 9% of the HIV-1 isolated in Sao Paulo have a mosaic B/F pol genome [4], [5]; in particular, the CRF28_BF and CRF29_BF recombinants were identified in the region in 1999 [13], [14].
CRF28_BF and CRF29_BF were identified in patients at the Counseling and Testing Centers in Santos, which is a port city located 80 km from the city of Sao Paulo [14]. CRF29_BF were also identified in patients from Sao Paulo and Rio de Janeiro [6], [13]. These CRFs are genetically distinct from CRF12_BF, which was first detected and subsequently found to be widely circulating in Argentina [15], [16], [17]. The majority of the CRF28_BF and CRF29_BF genome belongs to subtype B and is closely related to the subtype B HIV-1 circulating in Brazil (Figure 1A) [14]. The subtype F regions are phylogenetically linked to the Brazilian subtype F1. CRF29_BF and CRF28_BF share similar recombination breakpoints in the gag and pol genes at nucleotide (nt) positions 1,322 and 2,571 (HXB2 numbering). CRF29_BF has an additional subtype F region spanning from nt 3,682 to 5,462, creating additional breakpoints in the pol and vif genes. Owing to the common recombination breakpoints that the recombinants share, it is possible that a common ancestor of the BF recombinants emerged early in the epidemic and has evolved through subsequent recombination events.
Although several studies have been carried out to characterize the epidemiology and phylogenetics of subtype B and subtype F HIV-1, and BF URFs in Brazil [4], [5], [6], [7], [8], [9], [10], [11], [13], [18], little is known about the evolutionary and demographic histories, and the epidemic potential of the CRF28_BF and CRF29_BF. Here, we reconstructed the evolutionary history of the CRF28_BF and CRF29_BF by coalescent inference to delineate their origins and population dynamics. Our data showed that since the emergence of the CRF28_BF and CRF29_BF in the late 1980s, both CRFs underwent rapid population growth; however, the growth rate has slowed in recent years. Both CRFs did not exhibit a strong founder effect, which suggested that multiple URFs may also play a role in shaping the recombinant BF HIV-1 epidemic in the region. Our study provides important insights into the contributions of HIV-1 recombinants to the AIDS epidemic in South America.
Methods
Study population and data preparation
Two regions of the HIV-1 genome encompassing the recombination breakpoints in pol (nt position 2,572) and gag (nt position 1,323) of CRF28_BF and CRF29_BF were analyzed (Figure 1A) [13], [14]. For the pol region, we studied a set of 111 HIV-1 sequences obtained from patients attending HIV/AIDS treatment centers in Sao Paulo [13]. An 830-bp-long pol region covering part of the protease and reverse transcriptase genes (nt positions 2,265–3,094) was sequenced as previously described [13]. Sequences were analyzed together with 433 subtype B, 61 recombinant BF, three CRF28_BF and four CRF29_BF sequences isolated in Brazil with known sampling dates retrieved from the Los Alamos HIV Sequence Database. This resulted in a dataset consisting of 612 pol sequences collected between 1989 and 2007. Similarly, we obtained 184 HIV-1 gag sequences isolated in Brazil with known sampling dates from the Los Alamos HIV Sequence Database, covering parts of the matrix and capsid genes (nt positions 1,087–1,560). This dataset consisted of 127 subtype B, 50 recombinant BF, three CRF28_BF and four CRF29_BF sequences collected between 1989 and 2006. The nucleotide sequences were aligned using Clustal X [19], and alignment gaps were excluded from the analyses.
Identification of the CRF28/29_BF-like HIV-1
The workflow for identifying recombination in HIV-1 was described previously [20]. The sequences in the pol and gag datasets were first analyzed using the Recombination Identification Program (RIP) 3.0 of the Los Alamos HIV Sequence Database to identify subtype B and CRF28/29_BF-like sequences. In the analysis, each individual query sequence was compared with subtype B and subtype F1 consensuses using a window size of 200 bp. Any query sequence that had a value of 0.9 or higher in the s-distance with the subtype B consensus, free of recombination breakpoints, was classified as subtype B. The sequences having a recombination pattern identical to those of CRF28_BF and CRF29_BF were regarded as CRF28/29_BF-like. The recombinant sequences were compared with the subtype B, subtype F1 and subtype K consensus sequences obtained from RIP in a bootscanning analysis using SimPlot to confirm their genetic identity [21]. Based on the analyses, we then compiled a refined dataset consisting of pol sequences of a selection of subtype B and all of the identified CRF28/29_BF-like, three CRF28_BF and four CRF29_BF sequences with a compromise between wide sampling date and a realistic size of the dataset. A refined dataset with the gag sequence was also constructed using the same approach.
Bayesian evolutionary analysis by sampling phylogenetic trees of subtype B and CRF28/29_BF-like HIV-1
We performed phylogenetic analysis on the two refined datasets of pol and gag sequences using a Bayesian approach with the SRD06 nucleotide substitution model [22]. The purpose of this phylogenetic analysis was to identify the overall topology and confirm the relationship between the subtype B and the CRF28/29_BF-like HIV-1. A Bayesian Markov Chain Monte Carlo (MCMC) analysis, as implemented in the program BEAST v1.5.4 [23], was carried out for 10 million generations to produce 10,000 trees for each dataset. The maximum clade credibility phylogenetic tree was determined in TreeAnnotator v1.5.4 after excluding an initial 10% of the sample.
Estimation of nucleotide substitution rate, demographic history and time of divergence
We used an MCMC method, as implemented in the program BEAST v1.5.4 [23], to estimate the rate of nucleotide substitution. The refined pol and gag datasets were analyzed with the codon-based SRD06 nucleotide substitution model [22]. Three clock models were used in the analyses: the strict clock model assumes a single evolutionary rate for all the lineages; and the uncorrelated exponential relaxed clock and the uncorrelated lognormal relaxed clock allow evolutionary rates to vary among lineages within exponential and lognormal distributions, respectively [24]. These three clock models were statistically compared for each dataset using a Bayes Factor test to find the best fit [25]. We used the best fit clock model to estimate a posterior distribution for the rate of nucleotide substitution and then used as an empirical prior distribution in the coalescent analyses that follow.
The CRF28/29_BF-like sequences were analyzed using a nonparametric model (Bayesian skyline plot) to estimate the change in the effective population size through time and to infer demographic information within the lineage [26]. Four demographic models were compared to select the model that best described the epidemiological history of the CRF28/29_BF-like HIV-1. The models tested in this study were as follows: constant population size, exponential growth, logistic growth and expansion growth [27]. The Bayes Factor test was used to evaluate the demographic models for each dataset [25]. We performed three independent Bayesian MCMC runs for 10 million generations to produce 10,000 trees for each analysis. The convergence of parameters was assessed through the effective sampling size, with all parameters for each run having values >100 indicating a sufficient level of sampling. The mean time to the most recent common ancestor (TMRCA) and the maximum clade credibility of the phylogenetic trees were then calculated after the removal of 10% of the samples following visual inspection in the programs Tracer v1.5 and FigTree v1.3.1.
Recombination between CRF28/29_BF-like recombinants and the recombination breakpoints in the gag and pol sequences may cause biases in the estimation of nucleotide substitution rate and TMRCA [28], [29]. To test the possible effects of recombination on the current analysis, we employed an unlinked multilocus model in BEAST, which allows independent coalescent pathways for different gene loci [30]. In this model, the linked recombinant pol gene was divided into two unlinked loci of subtype F (nt positions 2,265–2,572) and subtype B (nt positions 2,573–3,094) sequences. The nucleotide substitution rates, demographic histories and times of divergence of the two unlinked loci were estimated in the Bayesian coalescent framework. Similar approach was applied to the subtype B loci (nt positions 1,087–1,323) and subtype F loci (nt positions 1,324–1,560) of the recombinant gag gene.
Results
Identification of CRF28/29_BF-like recombination breakpoints in HIV-1
We analyzed the 612 sequences of the pol dataset using RIP to identify recombination breakpoints similar to those of CRF28_BF and CRF29_BF. Of the 433 subtype B pol sequences in the dataset, we found four sequences retrieved from the HIV Sequence Database that had recombination breakpoints similar to those of CRF28/29_BF (Figure 1B). The accession numbers of these four sequences are AF112891, AF112898, AF112953 and AY213549. In addition, we identified CRF28/29_BF-like recombination breakpoints in the pol gene of two sequences from the 111 patient samples sequenced in this study. Among the 61 recombinant BF sequences from the HIV Sequence Database, 21 of them possessed CRF28/29_BF-like recombination breakpoints. Based on the recombination breakpoint analysis, we compiled a refined pol dataset with 102 sequences consisting of 66 Brazilian subtype B, 27 CRF28/29_BF-like, 3 CRF28_BF and 4 CRF29_BF sequences sampled between 1989 and 2007.
The 184 sequences of the gag dataset were analyzed using a similar approach. We found that there were three sequences (U86560, AY071973 and AY071985) in the 127 subtype B sequences that had CRF28/29_BF-like recombination breakpoints (Figure 1B). We also identified 32 CRF28/29_BF-like recombinant sequences from the 50 BF recombinant sequences. We then compiled a refined dataset of 90 gag sequences containing 48 Brazilian subtype B, 35 CRF28/29_BF-like, three CRF28_BF and four CRF29_BF sequences sampled between 1989 and 2006.
Bayesian phylogeny of Brazilian subtype B and CRF28/29_BF-like HIV-1
A total of 102 pol sequences, consisting of 66 subtype B and 27 CRF28/29_BF-like HIV-1, were aligned with the reference sequences of CRF28_BF and CRF29_BF. A Bayesian analysis was used to infer the phylogenetic relationships of the isolates. The resulting tree showed that the CRF28/29_BF-like sequences form a well-supported monophyletic group (posterior probability = 1) distinct from the subtype B sequences (Figure 2). All of the sequences in the CRF28/29_BF-like lineage have recombination breakpoints similar to those of CRF28_BF and CRF29_BF (Figure 1B). Remarkably, the CRF28_BF and the CRF29_BF references did not form a distinct group, and both references were scattered in different branches of the CRF28/29_BF-like lineage, suggesting that the two strains share a common ancestry.
The Bayesian analysis of the refined dataset of gag sequences, which consisted of 48 subtype B, 35 CRF28/29_BF-like, three CRF28_BF and four CRF29_BF sequences, revealed a similar topology to the pol phylogenetic tree (Figure 2). The CRF28/29_BF-like sequences formed a distinct lineage with high support, and all of the sequences within had a similar recombination profile as the reference CRF28_BF and CRF29_BF (Figure 1B). The phylogenies inferred from the gag and pol datasets indicated that CRF28_BF and CRF29_BF are closely related and have co-evolved since their emergence.
Estimation of the evolution rate of the Brazilian CRF28/29_BF-like HIV-1
Bayesian MCMC analyses under a skyline tree prior were used to estimate the rate of evolution of the Brazilian CRF28/29_BF-like HIV-1. We first evaluated the marginal likelihood of the three clock models employed in this study to determine the best clock model for estimating the evolution rate of the pol gene. Analysis of the Bayes Factor showed that the model using a lognormal relaxed clock fit the dataset better than the models assuming an exponential relaxed clock or a strict clock (Table S1). Under the lognormal relaxed clock model, the estimated mean rate of evolution of the pol gene was 2.60×10−3 substitutions per nucleotide site per year (Table 1).
Table 1. Population dynamics estimates for the CRF28/29_BF epidemic.a .
Gene | μ, site−1 year−1 | Origin of the lineage | Number of effective infections | r, year−1 | λ, month |
pol | |||||
Linked locus [nt 2265–3094] | 2.60×10−3 (1.72×10−3–3.46×10−3) | 1989 (1987–1993) | 2537 (405–15320) | 1.18 (0.64–1.38) | 7.05 (6.00–12.96) |
Unlinked F locus [nt 2265–2572] | 2.69×10−3 (1.78×10−3–3.54×10−3) | 1991 (1988–1994) | 3176 (651–18028) | 1.25 (0.78–1.55) | 6.68 (5.37–10.66) |
Unlinked B locus [nt 2573–3094] | 2.55×10−3 (1.91×10−3–3.33×10−3) | 1989 (1986–1992) | 2415 (367–13994) | 1.13 (0.61–1.26) | 7.36 (6.65–13.64) |
gag | |||||
Linked locus [nt 1087–1560] | 1.94×10−3 (0.91×10−3–3.06×10−3) | 1988 (1984–1992) | 1778 (191–10212) | 1.20 (0.59–1.47) | 6.93 (5.66–14.10) |
Unlinked B locus [nt 1087–1323] | 2.02×10−3 (0.82×10−3–3.24×10−3) | 1989 (1986–1994) | 2092 (322–12273) | 1.27 (0.67–1.61) | 6.55 (5.17–12.42) |
Unlinked F locus [nt 1324–1560] | 1.95×10−3 (0.85×10−3–3.17×10−3) | 1988 (1983–1991) | 1619 (152–9867) | 1.15 (0.55–1.42) | 7.24 (5.86–15.12) |
Numbers in parentheses are the range of the 95% upper and lower high posterior density. μ, rate of nucleotide substitution; r, median rate of exponential growth; λ, epidemic doubling time.
Similarly, we determined the best clock model for estimating the evolution rate of the gag gene. The Bayes Factor test of the clock model showed that the lognormal relaxed clock most appropriately described the data, indicating that the substitution rate varies among branches (Table S1). The evolution rate assuming a lognormal relaxed clock was 1.94×10−3 substitutions per nucleotide site per year (Table 1). In comparison, previous estimates of the evolution rate of HIV-1 pol have ranged from 1.5×10−3 to 2.6×10−3 substitutions per nucleotide site per year [31], [32], [33], [34]. Our estimate is consistent with the order of magnitude of 10−3 expected for the HIV-1 pol gene.
To evaluate the possible effect of the recombination breakpoints in the pol and gag genes on the estimation of nucleotide substitution rate, we applied a multilocus model that assumes different genealogies for each of the unlinked subtype B and subtype F loci [30]. Bayesian MCMC analyses revealed that the estimated substitute rate for the unlinked loci were not significantly different from those of the linked loci. The rates of the unlinked pol loci ranged from 2.55×10−3 to 2.69×10−3 substitutions per nucleotide site per year and those of the unlinked gag loci were 1.95×10−3 and 2.02×10−3 substitutions per nucleotide site per year. Thus the presence of recombination breakpoint in the dataset does not have a significant effect on the evolution rate estimations. Based on the above investigation, the rates of nucleotide substitution for the pol (2.60×10−3) and gag (1.94×10−3) genes were used as empirical prior distributions in the subsequent coalescent analyses.
Demographic history and epidemic parameter of the Brazilian CRF28/29_BF-like HIV-1
The nonparametric Bayesian skyline method with a lognormal relaxed clock model was used to simultaneously estimate the demographic history and TMRCA of the CRF28/29_BF-like strains. The empirical prior of 2.60×10−3 substitutions per nucleotide site per year was used to analyze the pol gene. The Bayesian skyline plot of the epidemic history of the CRF28/29_BF-like strains, with appropriate confidence intervals, is shown in Figure 3. The change in the effective number of infections over time appears to follow the model of logistic population growth. A similar observation was inferred from the nonparametric reconstruction of the epidemic history of the CRF28/29_BF-like strains using the gag gene and an empirical prior of 1.94×10−3 substitutions per nucleotide site per year (Figure 3). We used a Bayes Factor test to determine the likelihoods of the various population growth models. Among the four models tested, the logistic growth model best fit the demographic information contained in both tree topologies thus confirming the epidemic history of the CRF28/29_BF-like strains is best described by the logistic growth model (Table S2).
The evolutionary and demographic parameters associated with the CRF28/29_BF epidemic are shown in Table 1. Our estimates suggest that the CRF28/29_BF lineage originated in the late 1980s. The TMRCA of the CRF28/29_BF-like HIV-1 determined from the pol genes was 1989 and that determined from the gag genes was 1988. Soon after the emergence of the CRF28/29_BF-like HIV-1, the recombinant virus grew at an exponential rate (Figure 3). The estimated growth rate inferred from the pol gene was 1.18 per year, with an epidemic doubling time of seven months. The number of effective infections in 2007 was 2,537. Similar parameters were estimated from the gag gene of the CRF28/29_BF-like strains. The inferred growth rate was 1.20 per year, which corresponds to a doubling time of approximately seven months. The number of effective infections in 2006 was 1,778. We then applied the aforementioned nonparametric Bayesian skyline method to the unlinked loci of pol and gag genes to rule out the possibility that recombination breakpoint may have an effect on estimating the evolutionary and demographic parameters. The resulting estimates for the TMRCA and demographic parameters are listed in Table 1. The TMRCA, growth rate and doubling time estimates are highly consistent among the different linked and unlinked loci of the pol or gag gene. This confirms previous studies [30], [35] showing that quantitative effect of recombination on evolutionary analyses of HIV-1 may be less severe than initially proposed [28], [29].
Discussion
In this study, we investigated the evolutionary and demographic histories of the epidemic caused by CRF28_BF and CRF29_BF HIV-1 using a phylogenetic approach and a coalescent framework. Recent reports indicated that BF recombinants have an increased prevalence in the Brazilian population [4], [13], [14], [36], but little is known about the origins of these recombinants. Two of these recombinants, CRF28_BF and CRF29_BF, were detected in 1999 and are prevalent in the southeastern region of Brazil. The southeastern region of the country has been affected by HIV/AIDS very significantly in particular, the state of Sao Paulo, which has the highest number of reported AIDS cases (37% of total cases reported) [37]. Our data showed that the mean TMRCA of CRF28_BF and CRF29_BF was 1988–1989, which was about 10 years after the mean estimated onset date of the subtype F epidemic in Brazil (1976–1981) [10], [18], [38]. In addition, the estimated TMRCA of the CRF28/29_BF-like HIV-1 is consistent with the epidemiological data, which suggested that BF recombinants have been circulating in the Brazilian population since the 1990s. The HIV-1 BF recombinant was initially described in Rio de Janeiro in 1992 [7], [8]. Shortly thereafter, CRF28_BF and CRF29_BF-related recombinants were detected in Rio de Janeiro in 1996 [9]. Multiple cases of viruses sharing the recombination breakpoints of CRF28_BF and CRF29_BF were identified during the period of 1999–2002 [14]. In particular, one of the CRF29_BF samples was isolated in 2002 from a 5-year-old girl who acquired the virus from her mother [4], [14], which suggests that the CRF29_BF was circulating in the population on or before 1997.
Bayesian skyline plots showed that the CRF28/29_BF epidemic follows a model of logistic population growth. Under the logistic model, the effective number of infections grows exponentially from the time of emergence and then decreases in growth toward the present. Soon after the emergence of the CRF28/29_BF-like HIV-1 in 1988–1989, the recombinant virus experienced exponential growth over 5 to 6 years, followed by a decline in its growth rate since 1995. This pattern of population growth is similar to the patterns described for the epidemics of subtype B in the U.S. and the U.K. [31], [39], subtype B in Southeast Asia [40], and subtypes B, C and F in Brazil [11], [18], [41], [42]. In contrast to the aforementioned epidemics, the CRF28/29_BF epidemic has a higher estimated initial growth rate than the initial growth rates of the subtype B epidemic in the U.S. (0.83 year−1) and the U.K (0.8 year−1), as well as the epidemics of the subtypes B, C and F in Brazil, which ranged from 0.17–0.81 year−1. The more rapid initial growth rate of the CRF28/29_BF-like strain indicates that both CRFs harbor a higher transmissibility than their parental subtypes. However, CRF28/29_BF-like strain did not exhibit a strong founder effect and have contributed only to a limited degree to the current epidemic in southeastern Brazil, which is dominated by subtype B. The strong founder effect of CRF07_BC and CRF08_BC in southern China, where both CRFs dominate the epidemics in the region, was probably due to the rapid initial dissemination of HIV-1 in illegal blood donation networks. The absence of such a highly connected transmission network may explain why CRF28_BF and CRF29_BF failed to prevail in the Brazilian AIDS epidemic.
In addition to the BF recombinants, CRF31_BC is circulating in Brazil, mainly in the southern part of the country. Interestingly, several demographic characteristics of CRF28/29_BF and CRF31_BC are remarkably similar. Both epidemics follow a logistic growth model with similar growth rates and doubling times. The growth rates of CRF28_BF and CRF29_BF (1.18–1.20 year−1) are very similar to that of CRF31_BC (1.26–1.27 year−1), and both have doubling times equivalent to 6–7 months [41]. Both epidemics emerged in the late 1980s, caused an exponentially increasing number of infections during the late 1980s and early 1990s, and resulted in similar numbers of effective infections. The average effective number of infections (i.e., infections that contribute to onward transmission) of CRF28/29_BF is 2,158, which is approximately 6.4% of the infected population. This value is similar to the value of CRF31_BC, where the effective number of infections accounted for about 6.8% of the infected population in Brazil in 2006. Thus, it appears that CRF28/29_BF and CRF31_BC may have similar fitness and transmissibility. Moreover, as with the case of CRF28/29_BF, CRF31_BC also did not dominate the AIDS epidemic in Brazil and has shown a decline in its growth rate in recent years [41].
High-frequency genetic recombination is a hallmark of HIV-1 replication [43], [44], [45], which may lead to a loss of phylogenetic correlation between different loci resulting in overestimations of tree length, evolution rate and TMRCA [28], [29]. Therefore in the current studies, the recombinant locus (CRF28/29_BF-like) and the parental locus (subtype B) were not grouped together in estimating phylodynamic parameters because the two loci have different genealogy and such analysis suffers from the abovementioned biases in estimates on linkage disequilibrium. In contrast, analyzing the phylodynamic relationship of recombinants within the CRF28/29_BF-like clade showed that recombination between the CRFs has no significant effect on the estimated evolution rate, TMRCA and demographic parameters (Table 1). Indeed, previous reports studying the population genetics of CRF07_BC, CRF08_BC and group O HIV-1 have demonstrated similar observations [30], [35]. Moreover, our current investigation together with two recent studies (CRF12_BF in Argentina [34], [46] and CRF38_BF in Uruguay [34]) demonstrated the feasibility of using the recombinant region of CRF genome both for identifying intersubtype recombinant and for phylodynamic analysis of the recombinant. Such framework can simplify and facilitate phylogenetic and population dynamic studies of CRFs.
Comparing the epidemic history of CRF28/29_BF to the overall epidemiology of AIDS in Brazil, we observed that the exponential growth phase of the BF epidemic coincided with a decline in the reported number of new AIDS cases in Brazil (which reflect epidemics of HIV transmissions occurring several years earlier) (Figure 4). The stabilization of the epidemic in the 1990s may be partly due to the government reform in the mid-1990s, which contributed significantly to the improvement of the healthcare system of the country. This included the availability of HAART in 1996 for people living with HIV and AIDS, which has significantly reduced the number of AIDS cases diagnosed. However, relying on the reported number of new AIDS cases alone and the absence of a comprehensive epidemiological dataset on HIV cases may limit the interpretation of the current investigation and our understanding of the true picture of the HIV/AIDS epidemic in Brazil. This limitation highlights the urgent need for more virological surveillance in the country.
Nonetheless, we observed that the CRF28/29_BF-like strains seemingly did not exert a strong effect on the overall epidemic in Brazil, but appeared to represent a major portion of the BF recombinant population. Of the 67 pol and 53 gag recombinants of the BF sequences we analyzed, 27 and 35 of them, respectively, were related to CRF28/29_BF. This observation, though, has a caveat: the BF sequences analyzed in this study correspond to data up to 2007, and recent studies have shown that new BF URFs are playing an increasing role in the AIDS epidemic in Brazil [4], [5], [6], [10], [11], [13], [18]. Therefore, with respect to the BF HIV-1 epidemic, although it was largely dominated by the CRF28/29_BF-like strains, it is now likely to be characterized by new CRFs such as CRF39_BF and CRF40_BF, and a variety of recently emerged URFs that do not share a common ancestor [4], [5], [6], [13], [47], [48], [49], [50]. Interestingly, a recent study on the molecular epidemiology of CRF12_BF in Argentina from 1986 to 2008 supports this prediction. The study identified a decreasing prevalence of CRF12_BF and high diversity of BF URFs in Argentina [46]. Therefore, the most recent epidemiological picture of the BF recombinant in Brazil is likely to change and will be revealed when more contemporary BF recombinants are characterized.
In conclusion, we showed that CRF28_BF and CRF29_BF constitute a monophyletic lineage and evolved under similar evolutionary and epidemiological parameters. Both CRFs emerged in the population in the late 1980s, and the rate of spreading of this lineage slowed in the mid-1990s. The diminishing prevalence of CRF28_BF and CRF29_BF combines with high recombination rate between subtype B and subtype F HIV-1 [51]; it is reasonable to assume that new CRFs and URFs will continue to emerge in the Brazilian population. The increasing prevalence of new CRFs and URFs poses a problem in controlling AIDS epidemics given the potential differences among resistance profiles, responses to antiviral interventions and other biological properties of the variants. This highlights the need for functional characterizations of the replication capacity and antiviral susceptibility of HIV-1 recombinants. In addition, more studies detailing the parameters shaping the epidemic and the evolution of HIV-1 recombinants would provide valuable insights into the transmission of HIV-1 in South America and would aid the design of region-specific vaccines.
Supporting Information
Acknowledgments
We thank the patients for providing samples for this study.
Footnotes
Competing Interests: The authors have declared that no competing interests exist.
Funding: This work was supported by internal funds of Aaron Diamond AIDS Research Center and National Institutes of Health grant DA026293. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Robertson DL, Anderson JP, Bradac JA, Carr JK, Foley B, et al. HIV-1 nomenclature proposal. Science. 2000;288:55–56. doi: 10.1126/science.288.5463.55d. [DOI] [PubMed] [Google Scholar]
- 2.Hemelaar J, Gouws E, Ghys PD, Osmanov S. Global and regional distribution of HIV-1 genetic subtypes and recombinants in 2004. AIDS. 2006;20:W13–23. doi: 10.1097/01.aids.0000247564.73009.bc. [DOI] [PubMed] [Google Scholar]
- 3.Brazilian Ministry of Health. 2009. AIDS Epidemiological Bulletin. Year VI No. 1 - July to December 2008/January to June 2009. http://www.aids.gov.br/
- 4.Sa Filho DJ, Sanabani S, Diaz RS, Munerato P, Brunstein A, et al. Analysis of full-length human immunodeficiency virus type 1 genome reveals a variable spectrum of subtypes B and f recombinants in Sao Paulo, Brazil. AIDS Res Hum Retroviruses. 2005;21:145–151. doi: 10.1089/aid.2005.21.145. [DOI] [PubMed] [Google Scholar]
- 5.Brigido LF, Franco HM, Custodio RM, Oliveira CA, JL PF, et al. Molecular characteristics of HIV type 1 circulating in Sao Paulo, Brazil. AIDS Res Hum Retroviruses. 2005;21:673–682. doi: 10.1089/aid.2005.21.673. [DOI] [PubMed] [Google Scholar]
- 6.Thomson MM, Sierra M, Tanuri A, May S, Casado G, et al. Analysis of near full-length genome sequences of HIV type 1 BF intersubtype recombinant viruses from Brazil reveals their independent origins and their lack of relationship to CRF12_BF. AIDS Res Hum Retroviruses. 2004;20:1126–1133. doi: 10.1089/aid.2004.20.1126. [DOI] [PubMed] [Google Scholar]
- 7.Morgado MG, Sabino EC, Shpaer EG, Bongertz V, Brigido L, et al. V3 region polymorphisms in HIV-1 from Brazil: prevalence of subtype B strains divergent from North American/European prototype and detection of subtype F. AIDS Res Hum Retroviruses. 1994;10:569–576. doi: 10.1089/aid.1994.10.569. [DOI] [PubMed] [Google Scholar]
- 8.Sabino EC, Shpaer EG, Morgado MG, Korber BT, Diaz RS, et al. Identification of human immunodeficiency virus type 1 envelope genes recombinant between subtypes B and F in two epidemiologically linked individuals from Brazil. J Virol. 1994;68:6340–6346. doi: 10.1128/jvi.68.10.6340-6346.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Tanuri A, Swanson P, Devare S, Berro OJ, Savedra A, et al. HIV-1 subtypes among blood donors from Rio de Janeiro, Brazil. J Acquir Immune Defic Syndr Hum Retrovirol. 1999;20:60–66. doi: 10.1097/00042560-199901010-00009. [DOI] [PubMed] [Google Scholar]
- 10.Bello G, Eyer-Silva WA, Couto-Fernandez JC, Guimaraes ML, Chequer-Fernandez SL, et al. Demographic history of HIV-1 subtypes B and F in Brazil. Infect Genet Evol. 2007;7:263–270. doi: 10.1016/j.meegid.2006.11.002. [DOI] [PubMed] [Google Scholar]
- 11.Bello G, Guimaraes ML, Morgado MG. Evolutionary history of HIV-1 subtype B and F infections in Brazil. Aids. 2006;20:763–768. doi: 10.1097/01.aids.0000216377.84313.52. [DOI] [PubMed] [Google Scholar]
- 12.Sanabani SS, Pastena ER, Neto WK, Martinez VP, Sabino EC. Characterization and frequency of a newly identified HIV-1 BF1 intersubtype circulating recombinant form in Sao Paulo, Brazil. Virol J. 2010;7:74. doi: 10.1186/1743-422X-7-74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sanabani S, Kleine Neto W, Kalmar EM, Diaz RS, Janini LM, et al. Analysis of the near full length genomes of HIV-1 subtypes B, F and BF recombinant from a cohort of 14 patients in Sao Paulo, Brazil. Infect Genet Evol. 2006;6:368–377. doi: 10.1016/j.meegid.2006.01.003. [DOI] [PubMed] [Google Scholar]
- 14.De Sa Filho DJ, Sucupira MC, Caseiro MM, Sabino EC, Diaz RS, et al. Identification of two HIV type 1 circulating recombinant forms in Brazil. AIDS Res Hum Retroviruses. 2006;22:1–13. doi: 10.1089/aid.2006.22.1. [DOI] [PubMed] [Google Scholar]
- 15.Carr JK, Avila M, Gomez Carrillo M, Salomon H, Hierholzer J, et al. Diverse BF recombinants have spread widely since the introduction of HIV-1 into South America. Aids. 2001;15:F41–47. doi: 10.1097/00002030-200110190-00002. [DOI] [PubMed] [Google Scholar]
- 16.Thomson MM, Delgado E, Herrero I, Villahermosa ML, Vazquez-de Parga E, et al. Diversity of mosaic structures and common ancestry of human immunodeficiency virus type 1 BF intersubtype recombinant viruses from Argentina revealed by analysis of near full-length genome sequences. J Gen Virol. 2002;83:107–119. doi: 10.1099/0022-1317-83-1-107. [DOI] [PubMed] [Google Scholar]
- 17.Thomson MM, Villahermosa ML, Vazquez-de-Parga E, Cuevas MT, Delgado E, et al. Widespread circulation of a B/F intersubtype recombinant form among HIV-1-infected individuals in Buenos Aires, Argentina. Aids. 2000;14:897–899. doi: 10.1097/00002030-200005050-00020. [DOI] [PubMed] [Google Scholar]
- 18.Leal E, Martins LO, Janini LM, Diaz RS. Evolutionary Dynamics of HIV-1 BF and CB Recombinants and Its Parental Counterparts in South America. Retrovirology: Research and Treatment. 2008;1:1–14. [Google Scholar]
- 19.Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997;25:4876–4882. doi: 10.1093/nar/25.24.4876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Moore MD, Chin MPS, Hu WS. HIV-1 recombination: an experimental assay and a phylogenetic approach. In: Prasad V, Kalpana G, editors. HIV Protocols. 2nd ed. New York: Humana Press, Inc; 2009. pp. 87–105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Lole KS, Bollinger RC, Paranjape RS, Gadkari D, Kulkarni SS, et al. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. J Virol. 1999;73:152–160. doi: 10.1128/jvi.73.1.152-160.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Shapiro B, Rambaut A, Drummond AJ. Choosing appropriate substitution models for the phylogenetic analysis of protein-coding sequences. Mol Biol Evol. 2006;23:7–9. doi: 10.1093/molbev/msj021. [DOI] [PubMed] [Google Scholar]
- 23.Drummond AJ, Rambaut A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007;7:214. doi: 10.1186/1471-2148-7-214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Drummond AJ, Ho SY, Phillips MJ, Rambaut A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006;4:e88. doi: 10.1371/journal.pbio.0040088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Suchard MA, Weiss RE, Sinsheimer JS. Bayesian selection of continuous-time Markov chain evolutionary models. Mol Biol Evol. 2001;18:1001–1013. doi: 10.1093/oxfordjournals.molbev.a003872. [DOI] [PubMed] [Google Scholar]
- 26.Drummond AJ, Rambaut A, Shapiro B, Pybus OG. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol. 2005;22:1185–1192. doi: 10.1093/molbev/msi103. [DOI] [PubMed] [Google Scholar]
- 27.Felsenstein J. Maximum-likelihood estimation of evolutionary trees from continuous characters. Am J Hum Genet. 1973;25:471–492. [PMC free article] [PubMed] [Google Scholar]
- 28.Schierup MH, Hein J. Consequences of recombination on traditional phylogenetic analysis. Genetics. 2000;156:879–891. doi: 10.1093/genetics/156.2.879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Worobey M. A novel approach to detecting and measuring recombination: new insights into evolution in viruses, bacteria, and mitochondria. Mol Biol Evol. 2001;18:1425–1434. doi: 10.1093/oxfordjournals.molbev.a003928. [DOI] [PubMed] [Google Scholar]
- 30.Lemey P, Pybus OG, Rambaut A, Drummond AJ, Robertson DL, et al. The molecular population genetics of HIV-1 group O. Genetics. 2004;167:1059–1068. doi: 10.1534/genetics.104.026666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hue S, Pillay D, Clewley JP, Pybus OG. Genetic analysis reveals the complex structure of HIV-1 transmission within defined risk groups. Proc Natl Acad Sci U S A. 2005;102:4425–4429. doi: 10.1073/pnas.0407534102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Bello G, Passaes CP, Guimaraes ML, Lorete RS, Matos Almeida SE, et al. Origin and evolutionary history of HIV-1 subtype C in Brazil. Aids. 2008;22:1993–2000. doi: 10.1097/QAD.0b013e328315e0aa. [DOI] [PubMed] [Google Scholar]
- 33.Passaes CP, Bello G, Lorete RS, Matos Almeida SE, Junqueira DM, et al. Genetic characterization of HIV-1 BC recombinants and evolutionary history of the CRF31_BC in Southern Brazil. Infect Genet Evol. 2009;9:474–482. doi: 10.1016/j.meegid.2009.01.008. [DOI] [PubMed] [Google Scholar]
- 34.Bello G, Aulicino PC, Ruchansky D, Guimaraes ML, Lopez-Galindez C, et al. Phylodynamics of HIV-1 circulating recombinant forms 12_BF and 38_BF in Argentina and Uruguay. Retrovirology. 2010;7:22. doi: 10.1186/1742-4690-7-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Tee KK, Pybus OG, Li XJ, Han X, Shang H, et al. Temporal and spatial dynamics of human immunodeficiency virus type 1 circulating recombinant forms 08_BC and 07_BC in Asia. J Virol. 2008;82:9206–9215. doi: 10.1128/JVI.00399-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Sa-Filho D, Kallas EG, Sanabani S, Sabino E, Sucupira MC, et al. Characterization of the full-length human immunodeficiency virus-1 genome from recently infected subjects in Brazil. AIDS Res Hum Retroviruses. 2007;23:1087–1094. doi: 10.1089/aid.2006.0173. [DOI] [PubMed] [Google Scholar]
- 37.Brazilian Ministry of Health. AIDS Epidemiological Bulletin. 2009. Year VI No. 1 - July to December 2008/January to June 2009. http://www.aids.gov.br/
- 38.Aulicino PC, Bello G, Rocco C, Romero H, Mangano A, et al. Description of the first full-length HIV type 1 subtype F1 strain in Argentina: implications for the origin and dispersion of this subtype in South America. AIDS Res Hum Retroviruses. 2007;23:1176–1182. doi: 10.1089/aid.2007.0038. [DOI] [PubMed] [Google Scholar]
- 39.Robbins KE, Lemey P, Pybus OG, Jaffe HW, Youngpairoj AS, et al. U.S. Human immunodeficiency virus type 1 epidemic: date of origin, population history, and characterization of early strains. J Virol. 2003;77:6359–6366. doi: 10.1128/JVI.77.11.6359-6366.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Deng W, Fu P, Bao L, Vidal N, He Q, et al. Molecular epidemiological tracing of HIV-1 outbreaks in Hainan island of southern China. Aids. 2009;23:977–985. doi: 10.1097/QAD.0b013e328329217d. [DOI] [PubMed] [Google Scholar]
- 41.Bello G, Guimaraes ML, Passaes CP, Matos Almeida SE, Veloso VG, et al. Short communication: Evidences of recent decline in the expansion rate of the HIV type 1 subtype C and CRF31_BC epidemics in southern Brazil. AIDS Res Hum Retroviruses. 2009;25:1065–1069. doi: 10.1089/aid.2009.0106. [DOI] [PubMed] [Google Scholar]
- 42.Walker PR, Pybus OG, Rambaut A, Holmes EC. Comparative population dynamics of HIV-1 subtypes B and C: subtype-specific differences in patterns of epidemic growth. Infect Genet Evol. 2005;5:199–208. doi: 10.1016/j.meegid.2004.06.011. [DOI] [PubMed] [Google Scholar]
- 43.Chin MP, Chen J, Nikolaitchik OA, Hu WS. Molecular determinants of HIV-1 intersubtype recombination potential. Virology. 2007;363:437–446. doi: 10.1016/j.virol.2007.01.034. [DOI] [PubMed] [Google Scholar]
- 44.Chin MP, Rhodes TD, Chen J, Fu W, Hu WS. Identification of a major restriction in HIV-1 intersubtype recombination. Proc Natl Acad Sci U S A. 2005;102:9002–9007. doi: 10.1073/pnas.0502522102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Rhodes T, Wargo H, Hu WS. High rates of human immunodeficiency virus type 1 recombination: near-random segregation of markers one kilobase apart in one round of viral replication. J Virol. 2003;77:11193–11200. doi: 10.1128/JVI.77.20.11193-11200.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Aulicino PC, Bello G, Guimaraes ML, Ruchansky D, Rocco C, et al. Longitudinal analysis of HIV-1 BF1 recombinant strains in vertically infected children from Argentina reveals a decrease in CRF12_BF pol gene mosaic patterns and high diversity of BF unique recombinant forms. Infect Genet Evol. 2010 doi: 10.1016/j.meegid.2010.11.008. [DOI] [PubMed] [Google Scholar]
- 47.Brennan CA, Brites C, Bodelle P, Golden A, Hackett J, Jr, et al. HIV-1 strains identified in Brazilian blood donors: significant prevalence of B/F1 recombinants. AIDS Res Hum Retroviruses. 2007;23:1434–1441. doi: 10.1089/aid.2007.0121. [DOI] [PubMed] [Google Scholar]
- 48.Guimaraes ML, dos Santos Moreira A, Loureiro R, Galvao-Castro B, Morgado MG. High frequency of recombinant genomes in HIV type 1 samples from Brazilian southeastern and southern regions. AIDS Res Hum Retroviruses. 2002;18:1261–1269. doi: 10.1089/088922202320886307. [DOI] [PubMed] [Google Scholar]
- 49.Guimaraes ML, Eyer-Silva WA, Couto-Fernandez JC, Morgado MG. Identification of two new CRF_BF in Rio de Janeiro State, Brazil. Aids. 2008;22:433–435. doi: 10.1097/QAD.0b013e3282f47ad0. [DOI] [PubMed] [Google Scholar]
- 50.Ramos A, Tanuri A, Schechter M, Rayfield MA, Hu DJ, et al. Dual and recombinant infections: an integral part of the HIV-1 epidemic in Brazil. Emerg Infect Dis. 1999;5:65–74. doi: 10.3201/eid0501.990108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Galli A, Kearney M, Nikolaitchik OA, Yu S, Chin MP, et al. Patterns of Human Immunodeficiency Virus type 1 recombination ex vivo provide evidence for coadaptation of distant sites, resulting in purifying selection for intersubtype recombinants during replication. J Virol. 2010;84:7651–7661. doi: 10.1128/JVI.00276-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.