Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2016 Sep 22;6:33971. doi: 10.1038/srep33971

Multiple introductions and onward transmission of non-pandemic HIV-1 subtype B strains in North America and Europe.

Marina Cabello 1, Hector Romero 2, Gonzalo Bello 1,a
PMCID: PMC5032033  PMID: 27653834

Abstract

Most HIV-1 subtype B infections in North America and Europe seem to have resulted from the expansion of a single pandemic lineage (BPANDEMIC) disseminated from the United States (US). Some non-pandemic subtype B strains of Caribbean origin (BCAR) may have also reached North America and Europe, but their epidemiological relevance in those regions remains largely unknown. Here we analyze a total of 20,045 HIV-1 subtype B pol sequences from the US, Canada, and Europe, to estimate the prevalence and to reconstruct the spatiotemporal dynamics of dissemination of HIV-1 BCAR strains in those regions. We find that BCAR strains were probably disseminated from the Caribbean into North America and Europe at multiple times since the early 1970s onwards. The BCAR strains reached the US, Canada and at least 16 different European countries, where they account for a very low fraction (<5%) of subtype B infections, with exception of the Czech Republic (7.7%). We also find evidence of the onward transmission of BCAR clades in the US, Canada, the Czech Republic, Germany, Italy, Spain and the UK, as well as short-distance spreading of BCAR lineages between neighboring European countries from Central and Western Europe, and long-distance dissemination between the US and Europe.


By the end of 2012, the total number of people living with the Human Immunodeficiency Virus Type 1 (HIV-1) reached an estimated 1.3 million [980.000–1.900.000] in North America and 860.000 [800.000–930.000] in Western/Central Europe1. Unprotected sex between men who have sex with men (MSM) is the main driver of HIV transmission in the US, Canada and Europe, followed by heterosexual contact and injecting drug use (IDU)2,3,4. The proportion of newly diagnosed HIV cases attributed to MSM increased in the US, Canada and Europe over the last 10 years, while the corresponding proportions attributed to heterosexual contact and IDU remained stable or decreased in the same period2,3,4.

Subtype B dominates the HIV-1 epidemic in North America and in Western and Central Europe5; although each country has a unique HIV molecular pattern and an increasing prevalence of non-B subtypes have been observed in the United States (US)6, Canada7, and Europe8,9,10,11 in the last decade. The HIV-1 subtype B epidemic in North America and Europe probably started via a single viral introduction from Haiti into the US around the late 1960s, followed by rapid viral dissemination within the US and from the US to other American countries and Europe, leading to the origin of a pandemic subtype B clade (BPANDEMIC)12. Networks of MSM and IDUs probably played a crucial role in the early spread of the BPANDEMIC lineage within North America and from North America to Europe13,14,15. The HIV-1 subtype B epidemic in most European countries originated from multiple sources16,17, revealing the importance of human mobility across international borders in the spread of this clade. Current subtype B transmissions in Europe, however, seem to be predominantly occurring between patients from the same country18.

The study of Gilbert et al.12 also pointed to the existence of non-pandemic subtype B lineages that, in sharp contrast to the BPANDEMIC clade, remained mostly restricted to the Caribbean region (BCAR clades). More recent studies conducted by our group demonstrate that several non-pandemic BCAR lineages have been disseminated out of Haiti since the early 1970s and reach a high prevalence (40–95%) in several countries throughout the Caribbean region including Haiti, Dominican Republic, Jamaica, Trinidad and Tobago, other Lesser Antilles, Suriname and French Guiana19,20,21. The non-pandemic BCAR lineages did not remained restricted to the Caribbean region, but also reached most countries from Latin America20 and despite the overall low prevalence (<10%), established secondary outbreaks of small size in Argentina, Brazil, Mexico, Panama and Venezuela20,21. North America and Europe have maintained a high level of population movement with several Caribbean countries22,23,24, but the epidemiological relevance of non-pandemic BCAR strains in those regions has not been explored.

The objective of this study was to estimate the prevalence of the HIV-1 BPANDEMIC and BCAR clades in North America and Europe and to reconstruct the spatiotemporal dynamics of dissemination of non-pandemic subtype B lineages detected in those regions. For this, we used a comprehensive dataset of HIV-1 subtype B pol sequences (n = 20,045) isolated from the US, Canada, and 26 European countries between 1982 and 2012. North American and European subtype B sequences were combined with reference sequences representative of the BPANDEMIC and the BCAR clades and then analyzed using Maximum Likelihood and Bayesian phylogeographic approaches.

Materials and Methods

HIV-1 subtype B pol sequence dataset

We downloaded all HIV-1 subtype B pol sequences from North America (n = 9,670) and Europe (n = 10,885) that covered the entire protease and partial reverse transcriptase (PR/RT) regions (nucleotides 2253–3260 relative to HXB2 clone) and were available at the Los Alamos HIV Database (http://www.hiv.lanl.gov) by July 2014. Only one sequence per subject was selected and those sequences containing frameshift mutations (1.4%) or with incorrect subtype B assignment (1.1%) according to the REGA HIV subtyping tool v.225 were removed. This resulted in a final data set of 20,045 subtype B pol sequences isolated from North America (Canada and the US) and from Central (Austria, Germany, Poland, Slovakia and Switzerland), Northern (Denmark, Finland, Norway and Sweden), Southern (Italy, Portugal and Spain), Southeastern (Albania, Cyprus, Greece, Montenegro, Romania, Serbia and Slovenia) and Western (Belgium, Ireland, Luxembourg, Netherlands, France and the United Kingdom) Europe, between 1982 and 2012 (Table S1).

Subtype B Clade assignment

HIV-1 subtype B pol sequences from North America and Europe were aligned with subtype B pol sequences representative of the BPANDEMIC and the BCAR clades described previously19,21, using the Clustal W program26. Codons associated with major antiretroviral drug resistance in PR (n = 12) and RT (n = 21) were excluded, resulting in a final alignment of 909 nucleotides. Sequences were initially classified as BCAR or BPANDEMIC by using an evolutionary placement algorithm (EPA)27 available in RAxML28 for the rapid assignment of query sequences to edges of a reference phylogenetic tree under a maximum-likelihood (ML) model. Sequences classified within the BCAR clade were again combined with the BPANDEMIC and the BCAR reference sequences and subjected to new rounds of ML phylogenetic analysis with the PhyML program29 using an online web server30, for confirmation of clade assignment. Trees were inferred under the GTR+I+Γ nucleotide substitution model, selected using the jModeltest program31, and the SPR branch-swapping algorithm of heuristic tree search. The reliability of tree topology was estimated with the approximate likelihood-ratio test (aLRT)32 based on the Shimodaira-Hasegawa(SH)-like procedure. Clusters were classified as medium (SH-aLRT = 0.85–0.94) and well (SH-aLRT ≥ 0.95) supported, considering that the false positive error rate should not exceed 10% and 1% respectively33. Potential epidemiologic North American and European BCAR clades were defined as those strongly supported monophyletic clusters exclusively composed by sequences from those geographic regions. The ML trees were visualized using the FigTree v1.4.0 program34.

Analysis of the spatiotemporal dispersion pattern

The evolutionary rate, the age of the most recent common ancestor (TMRCA) and the spatial diffusion pattern of non-pandemic HIV-1 subtype B clades circulating in North America, Europe and the Caribbean were jointly estimated using the Bayesian Markov Chain Monte Carlo (MCMC) approach as implemented in BEAST v1.835,36 with BEAGLE to improve run-time37. Analyses were performed using the GTR+I+Г4 nucleotide substitution model, a relaxed uncorrelated lognormal molecular clock model38, and a Bayesian Skyline coalescent tree prior39. The regression analysis of root-to-tip genetic distance against sampling time performed by using the program TempEst40, revealed that the HIV-1 subtype B dataset here compiled does not contain sufficient temporal signal for reliable time-scale estimation (X-intercept [TMRCA] = 1908). Thus, we specified a uniform prior distribution on the substitution rate (2.0–3.0 × 10−3 subst./site/year) that encompass mean values previously estimated for the subtype B pol gene16,21,41,42. Migration events throughout the phylogenetic history and the most relevant migration pathways were reconstructed using a reversible discrete phylogeography model and the Bayesian stochastic search variable selection (BSSVS) approach43, with a CTMC rate reference prior44. Discrete locations were assigned according to the sampling country of the sequences, except for European sequences that were grouped according to the sampling region (Central, Northern, Southern, Southeastern and Western, see Table S1) to reduce the complexity of the migration matrix. Three MCMC chains were run for 5 × 108 generations and then combined using LogCombiner v1.8 while excluding the initial states (10–20%) of each run, needed to reach convergence. Convergence and uncertainty of parameter estimates were assessed by calculating the Effective Sample Size (ESS) and the 95% Highest Probability Density (HPD) values, respectively with Tracer v1.645. The maximum clade credibility (MCC) tree was summarized with TreeAnnotator v1.8 and visualized with FigTree v1.4.0. Assuming that the posterior probability (PP) support closely reflected the probability for a split to be correct (under the true evolutionary model and correct priors distributions)46, clades were defined as moderate (PP = 0.85–0.94) or strongly (PP ≥ 0.95) supported. The cross-platform SPREAD application47 was used to summarize the migratory events and to identify statistically well-supported migration routes (those with Bayes factor [BF] > 3).

Results

Detection of HIV-1 BCAR strains in North America and Europe

A total of 20,045 HIV-1 subtype B pol sequences from different countries from North America and Europe were combined with a set of BPANDEMIC and BCAR reference sequences19 and classified by using two sequential ML-based frameworks. This procedure classified a total of 274 (2.9%) and 189 (1.8%) HIV-1 subtype B pol sequences from North America and Europe within the BCAR clades, respectively. The final ML phylogenetic trees clearly showed that the BCAR sequences from North America and Europe were dispersed among the BCAR reference sequences of Caribbean origin, while all BPANDEMIC reference sequences branched in a medium supported (SH-aLRT = 0.85–0.90) clade nested within basal BCAR reference sequences (Fig. 1). These analyses confirmed the circulation of BCAR sequences in Canada, the US and 16 out of 26 European countries here analyzed (Fig. 2). Most European countries with detection of BCAR strains encompass larger number of sequences (n > 250) than European countries with no evidence of circulation of BCAR strains (n < 100) (Table S1). The BCAR strains reached a low prevalence (<5%) among subtype B-infected individuals from all North American and European countries analyzed, with the only exception of the Czech Republic (CZ) where the BCAR strains represent 7.7% of the sequences included (Fig. 2 and Table S1).

Figure 1.

Figure 1

ML phylogenetic trees of HIV-1 BCAR pol sequences identified in North America (a) and Europe (b). North American (n = 274) and European (n = 189) BCAR sequences were combined with representative sequences of the BPANDEMIC (US = 165, France = 135) and the BCAR (Caribbean = 200) clades. Branches are colored according to the geographic origin/clade classification of each sequence as indicated at the legend (upper right). Colored boxes show the positions of strongly supported (SH-aLRT ≥ 0.95) North American and European BCAR clades. Major (n ≥ 10) BCAR clades detected in North America (US-I and CA-I) and Europe (CZ-I) are indicated. SH-aLRT supports for subtype B and BPANDEMIC clades are shown. Trees were rooted using HIV-1 subtype D reference sequences. The branch lengths are drawn to scale with the bar at the bottom indicating nucleotide substitutions per site.

Figure 2. Estimated proportion of BCAR and BPANDEMIC clades among HIV-1 subtype B infected individuals from different North American and European countries.

Figure 2

Maps were created with Adobe Illustrator CC from templates obtained from d-maps.com (North America: http://d-maps.com/pays.php?num_pay=119&lang=es; and Europe: http://d-maps.com/pays.php?num_pay=192&lang=es).

The ML phylogenetic analyses also revealed that many BCAR strains from North America and Europe appear as sporadic lineages intermixed among Caribbean sequences, whereas other branched in strongly supported (aLRT ≥ 0.95) clades exclusively composed by sequences from those regions (Fig. 1). Most North American and European BCAR clades were of small size (n < 10) and comprise sequences from a single country (country-specific clades) (Fig. S1 and Table S2). A few country-specific clades of large size (n = 11–48), however, were detected in Canada, the US, and the CZ (Fig. 1). We also detected a few international clades that encompass sequences from at least two different countries (Canada/US, Germany/Netherlands, CZ/Portugal/UK, Belgium/CZ/Norway, Germany/Netherlands/Switzerland/UK and CZ/Germany/Spain/Switzerland) (Fig. S1). In most countries analyzed the BCAR sequences were mostly classified as sporadic lineages (57–67%), with exception of Canada, Germany and the CZ, where most BCAR sequences (60–97%) were classified within country-specific/international clades (Table S2).

Origin of HIV-1 BCAR strains introduced in North America and Europe

HIV-1 BCAR pol sequences with known sampling date from North America (n = 216) and Europe (n = 126) here identified, were next combined with BCAR pol sequences from the most widely sampled (n > 10) Caribbean islands (n = 258)19,21 and with subtype D pol sequences (the closest relative to subtype B) from the Democratic Republic of Congo (the most probable epicenter of ancestral subtype B)12 (n = 10) that served as outgroup (Table S3). HIV-1 sequences were classified into 10 discrete geographic locations and subjected to Bayesian phylogeographic analyses. When all BCAR sequences were combined in a single dataset, the MCMC chains fail to converge and many parameters showed low (<100) ESS values despite very long runs (1 × 109 generations). Furthermore, the most probable ancestral root location for the subtype B epidemic was traced to the US (posterior state probability [PSP] > 0.90), which is inconsistent with the seminal work of Gilbert and colleagues (2007) that clearly traced the origin of American subtype B epidemic to Haiti. Sampling bias can confuse the accurate estimation of the spatial root of an epidemic48 and our convenience sampling was clearly biased to the US (n = 164) (Table S3). Moreover, the BCAR US sequences were widely scattered among BCAR sequences of Hispaniola (probably due to many independent viral introductions into the US), so the BCAR genetic diversity in both locations was comparable and the simple overrepresentation of the US may bias the ancestral root location to this country.

In order to reduce the impact of sampling bias on the accuracy of phylogeographic reconstructions, sequences from the US were subdivided into three subsets (n = 54–55) at random, except for those sequences belonging to the small country-specific clades (n ≤ 5) that were placed in the same subset (Table S4) in order to recover their TMRCA. Each subset was then combined with sequences from the other locations and independently analyzed. The overall pattern of spatial and temporal dissemination of BCAR strains reconstructed from all subsets was roughly similar (Fig. 3). All three analyses pointed to the island of Hispaniola as the most probable root location of the HIV-1 subtype B ancestor (PSP = 0.76–0.99), and traced the median TMRCA of subtype B at around the middle 1960s (Table 1), fully consistent with previous estimates12,49,50. Phylogeographic reconstructions also suggest that Hispaniola was the major hub of international dissemination of BCAR strains, sending viruses to Canada, the US and all European regions at multiple times from the middle 1970 s onwards (Fig. 4A). Additional viral migrations from Jamaica to Canada/US/UK and from Trinidad and Tobago to the UK were also detected (Fig. 4A). The BF tests for significant nonzero rates support epidemiological linkage between: Hispaniola/Canada, Hispaniola/US, Hispaniola/Central Europe, Hispaniola/Southern Europe, Hispaniola/Western Europe and Jamaica/Western Europe (UK) (Fig. 4B).

Figure 3. Time-scaled Bayesian MCMC tree of HIV-1 BCAR pol sequences from North America and Europe.

Figure 3

North American (n = 216) and European (n = 126) BCAR sequences with known sampling date were combined with BCAR sequences from the Caribbean (n = 258), BPANDEMIC sequences (n = 50) the US and France and subtype D sequences (n = 10) from the DRC. Branches of trees reconstructed from subsets 1 (a), 2 (b), and 3 (c) are colored according to the most probable location state (see Table S3) of their descendent nodes as indicated in the legend (bottom left). Colored boxes indicate the positions of major (n ≥ 10) BCAR clades detected outside the Caribbean. Branch lengths are depicted in units of time (years). The trees were automatically rooted under the assumption of a relaxed molecular clock. CA: Canada, CD: Democratic Republic of Congo, CEU: Central Europe, HISP: Hispaniola, JM: Jamaica, NEU: Northern Europe, SEEU: Southeastern Europe, SEU: Southern Europe, TT: Trinidad and Tobago, US: United States of America, WEU: Western Europe.

Table 1. Bayesian estimates of the TMRCA of HIV-1 BCAR clades from North America and Europe.

Clade Countries Subset 1 TMRCA Subset 2 TMRCA Subset 3 TMRCA
Subtypes B/D 1956 (1947–1964) 1956 (1948–1964) 1957 (1948–1964)
Subtype B 1967 (1963–1971) 1967 (1964–1970) 1968 (1964–1971)
BCAR-NA/EU-I DE/FR/IT/LU/SE/US/UK 1974 (1971–1977) 1974 (1970–1976) 1974 (1971–1977)
BCAR-NA/EU-II BE/CA/CZ/NO/US 1978 (1974–1982)
BCAR-NA/EU-III US/ES 1980 (1976–1983)
BCAR-EU-I CH/CZ/DE 1981 (1977–1985) 1981 (1977–1985) 1981 (1977–1986)
BCAR-EU-II CH/CZ/DE 1979 (1976–1983) 1979 (1975–1983) 1980 (1976–1983)
BCAR-EU-III CH/DE/UK 1992 (1986–1996) 1992 (1987–1996) 1992 (1987–1996)
BCAR-EU-IV CZ/UK 1997 (1994–2000) 1997 (1994–2000) 1997 (1994–2000)
BCAR-EU-V IT/RO 1980 (1976–1983) 1979 (1976–1983) 1980 (1976–1983)
BCAR-EU-VI DE/NL 1982 (1977–1987) 1983 (1978–1989) 1982 (1977–1987)
BCAR-CZ-I CZ 1996 (1992–1999) 1996 (1992–1999) 1996 (1992–1999)
BCAR-US-II US 1994 (1991–1997)
BCAR-US-III US 1987 (1982–1994)
BCAR-US-IV US 1989 (1983–1993)
BCAR-US-V US 2005 (2004–2006)
BCAR-US-VI US 1994 (1988–2000)
BCAR-US-VII US 1994 (1991–1996)
BCAR-US-VIII US 1985 (1982–1989)
BCAR-US-IX US 1991 (1985–1996)
BCAR-US-X US 1996 (1995–1997)
BCAR-US-XII US 1995 (1991–1999)
BCAR-US-XIII US 1995 (1994–1997)
BCAR-US-XIV US 2004 (2002–2005)
BCAR-US-XV US 2005 (2002–2006)
BCAR-US-XIX US 1982 (1978–1987)
BCAR-CA-I CA 1979 (1975–1982) 1978 (1975–1982) 1979 (1975–1983)
BCAR-CA-II CA 1996 (1988–2002) 1996 (1988–2002) 1996 (1988–2002)
BCAR-CA-III CA 1983 (1979–1987) 1983 (1979–1987) 1984 (1979–1988)
BCAR-CA-IV CA 1998 (1994–2001) 1998 (1994–2001) 1998 (1994–2001)
BCAR-CA-V CA 1987 (1981–1993) 1987 (1981–1992) 1987 (1982–1993)
BCAR-CA-VI CA 1998 (1993–2002) 1998 (1993–2002) 1998 (1993–2002)
BCAR-CA-VII CA 1998 (1997–1998) 1998 (1996–1998) 1998 (1997–1998)
BCAR-DE-II DE 1988 (1982–1993) 1987(1982–1994) 1988 (1982–1993)
BCAR-DE-III DE 1987 (1981–1994) 1987 (1980–1993) 1988 (1982–1994)
BCAR-ES-I ES 2006 (2004–2008) 2006 (2004–2008) 2006 (2004–2008)
BCAR-IT-II IT 1989 (1983–1993) 2006 (2004–2008) 2006 (2004–2008)
BCAR-UK-I UK 1993 (1990–1996) 1993 (1990–1996) 1994 (1990–1996)
BCAR-UK-II UK 2004 (2002–2004) 2004 (2002–2004) 2004 (2002–2004)
BCAR-UK-III UK 1994 (1988–1999) 1994 (1988–1998) 1994 (1988–1999)

Figure 4. Spatiotemporal dynamics of dissemination of HIV-1 BCAR clades from the Caribbean to North America and Europe.

Figure 4

(a) Lines between locations represent branches in the Bayesian MCC trees along which viral migration events occurred. The line’s color informs the estimated time interval of the viral migrations as indicated in the legend (bottom left). Only the earliest transitions between each location pair were represented. (b) Most significant (Bayes factor rates > 3) epidemiological links of the dissemination process of BCAR clades. CEU: Central Europe, NEU: Northern Europe, WEU: Western Europe. UK: United Kingdom. Maps were created with Adobe Illustrator CC from templates obtained from d-maps.com (Caribbean: http://d-maps.com/pays.php?num_pay=118&lang=es; North America: http://d-maps.com/pays.php?num_pay=119&lang=es; and Europe: http://d-maps.com/pays.php?num_pay=192&lang=es).

Onward transmission of HIV-1 BCAR strains in North America and Europe

The Bayesian analysis confirms that some BCAR strains seeded secondary outbreaks in North America and Europe (Fig. 3). The overall distribution of BCAR sequences across sporadic lineages and country-specific/international clades was comparable to that inferred from the ML trees (Table S2). Nearly all highly supported North American and European clades previously identified also displayed a very high support in Bayesian trees (PP = 1) (Table S5). Bayesian analyses also recovered three medium/highly supported (PP = 0.90–0.99) European BCAR lineages (BCAR-EU-I, BCAR-EU-V and BCAR-EU-VI) that were observed as low/medium supported (aLRT = 0.82–0.89) clades in the ML trees (Table S5). Finally, the combination of North American and European sequences in the same dataset allowed the identification of three medium/highly supported (PP = 0.88-1) intercontinental BCAR clades (Fig. 3 and Table S5). The major intercontinental clade (BCAR-NA/EU-I) comprises sequences from the US and six European countries, the second one (BCAR-NA/EU-II) comprises sequences from the US, Canada, Belgium, the CZ and Norway, and the last one (BCAR-NA/EU-III) comprises sequences from the US and Spain.

A great proportion (40%) of North American and European BCAR clades probably arose between the late 1970 s and the late 1980 s, others (46%) arose during the 1990 s, and a minor fraction (14%) arose during the 2000 s (Table 1). Most (82%) North American and European BCAR clades seem to have remained confined to one single country. A few European BCAR clades were disseminated across neighboring countries such the CZ, Germany and Switzerland (BCAR-EU-I and BCAR-EU-II) or Germany and Netherlands (BCAR-EU-VI) and others were spread over longer distances, like those disseminated from Central Europe to UK (BCAR-NA/EU-III and BCAR-NA/EU-IV), and from Italy to Romania (BCAR-EU-V) (Fig. 4A). The most notable examples of long-distance dissemination, however, were the three intercontinental BCAR clades that probably arose between the early 1970 s and the early 1980 s and were spread between North America and Europe.

In order to gain a better understanding of their dissemination dynamics, the three intercontinental BCAR clades were combined in a single dataset with BCAR strains from the Caribbean (n = 258), a representative sub-set of BPANDEMIC sequences from the US and France (n = 50) and subtype D sequences from the DRC (n = 10). The Bayesian analysis clearly reconstructed long distance migrations of clades BCAR-NA/EU-I, BCAR-NA/EU-II and BCAR-NA/EU-III from the US to Europe at multiple times, supported by high BF rates (Fig. 5). The TMRCA of the BPANDEMIC clade was estimated at 1969 (1965–1973), consistent with a previous study12, of the BCAR-NA/EU-I clade at 1972 (1968–1976), of the BCAR-NA/EU-II clade at 1977 (1972–1982), and of the of the BCAR-NA/EU-III clade at 1978 (1973–1982). This phylogeographic analysis further suggests that clades BCAR-NA/EU-I, BCAR-NA/EU-II and BCAR-NA/EU-III may have evolved from a single ancestor traced to the US at 1970 (1966–1974), but this result should be interpreted with caution because the support for that large intercontinental BCAR clade was low (PP = 0.59).

Figure 5. Spatiotemporal dynamics of dissemination of HIV-1 BCAR clades from North America to Europe.

Figure 5

(a) Time-scaled Bayesian MCMC tree of HIV-1 BCAR pol sequences from intercontinental BCAR clades (BCAR-NA/EU-I, BCAR-NA/EU-II and BCAR-NA/EU-III) combined with BCAR sequences from the Caribbean (n = 258), BPANDEMIC sequences from the US and France (n = 50) and subtype D sequences (n = 10) from the DRC. Branches are colored according to the most probable location state (see Table S3) of their descendent nodes as indicated in the legend (upper left). Colored boxes indicate the positions of intercontinental BCAR clades and the BPANDEMIC clade. Branch lengths are depicted in units of time (years). The tree was automatically rooted under the assumption of a relaxed molecular clock. See the legend of Fig. 3 for location code. (b) Lines between locations represent branches in the Bayesian MCC tree along which viral migration events occurred. Dispersion pathways of different intercontinental BCAR clades are represented with different colors (see the legend at bottom left). Epidemiological links supported by Bayes factor rates >3 and <3 are displayed with continuous and discontinuous lines, respectively. See the legend of Fig. 4 for location code. Maps were created with Adobe Illustrator CC from templates obtained from d-maps.com (Caribbean: http://d-maps.com/pays.php?num_pay=118&lang=es; North America: http://d-maps.com/pays.php?num_pay=119&lang=es; and Europe: http://d-maps.com/pays.php?num_pay=192&lang=es).

Discussion

This study confirms that the HIV-1 subtype B epidemic in North America and Europe is mostly driven by the dissemination of the BPANDEMIC lineage, but also demonstrates several independent introductions of non-pandemic BCAR strains of Caribbean origin into those regions. The dissemination of BCAR strains from the Caribbean into North America and Europe should be expected considering the high human mobility between those regions. The discovery of the Americas in 15th century was the starting point for several European countries to create colonies in the Caribbean region that persisted for several centuries and many of them continue to have government ties with European countries (such as the UK, France and the Netherlands) and the US at the 21st century. The linguistic and socioeconomic links created during the colonial period certainly facilitated: (1) a large flow of labor migrants from Caribbean countries towards North America and Europe, particularly since 197022,23,24, and (2) a sharp increase in the number of visitors mainly from the US, Germany, the UK, France and Canada towards the Caribbean, particularly since the 1960 s when regular international airplane flights made vacations to the Caribbean more affordable23,51.

Our results indicate that the island of Hispaniola was probably the major source of BCAR lineages disseminated into those regions. Jamaica can be viewed as a secondary hub sending BCAR strains to the US, Canada and the UK, whereas Trinidad and Tobago seems to have played a minor role in long-distance dispersion of BCAR strains. It could be argued that these estimates were biased by the sampling scheme used here since, after down-sampling the US, most BCAR sequences were from Hispaniola. If we considered the relative contribution of each location to the total number of BCAR-infected individuals, however, our estimates are quite robust to sampling bias since all locations were overrepresented with the only exception of Hispaniola. This Caribbean island hosts about 70% of the total number of BCAR-infected individuals, but only comprises 27% of BCAR sequences in our subsets. Our convenience sampling, however, probably failed to recover some important dispersal routes between Caribbean countries with high prevalence of BCAR lineages and strong connection with North America/Europe, like many Lesser Antilles islands, Guyana, French Guyana and Suriname19,20, that were not represented in our subsets.

Some viral migration routes here recovered are fully consistent with the notion that postcolonial ties are an important driving force in the international dissemination of BCAR strains. The spread of these strains out of Jamaica, for example, was traced to those countries (the US, the UK and Canada) where most Jamaicans immigrants reside and from where most tourists visiting Jamaica originate22,23,24. Other factors apart from the historical, linguistic and socio-economic links with the Caribbean region, however, appear to be needed to explain the overall prevalence and distribution of BCAR strains across different European countries. We observed a very low prevalence (<1%) of BCAR strains among subtype B-infected individuals from some countries (France and Spain) that host large numbers of Caribbean immigrants and contribute many tourists to the Caribbean region, and a relative high prevalence of BCAR strains (2–8%) in other countries (the CZ, Greece, Luxembourg and Norway) located outside the main migration corridors from/to the Caribbean22,23,24.

Stochastic events may have influenced the BCAR circulation in some countries like the CZ, where most (73%) BCAR sequences branched in a country-specific sub-clade that was nested among basal BCAR sequences from Germany and Switzerland. This supports that the high prevalence of BCAR sequences observed in the CZ probably resulted from the local expansion of a single founder strain introduced from a neighboring European country. Such a founder BCAR strain probably gained access to some highly interconnected network of IDU or MSM from the CZ, although we have no epidemiological information about the BCAR-infected patients to test this hypothesis. Other country-specific BCAR clades were also detected in the US, Canada, Germany, Italy, Spain and the UK, supporting local dissemination of non-pandemic subtype B lineages in a number of North American and European countries. In addition, we also detected short-distance spreading of BCAR lineages between neighboring European countries from Central and Western Europe, and long-distance dissemination between the US and Europe.

While stochastic founder events may help to explain the dissemination of BCAR strains in countries that are not part of the main migration corridors with the Caribbean, cultural factors could be a major obstacle for the onward transmission of BCAR strains primarily introduced into countries strongly connected to the Caribbean. Migrant travelers might form a bridge population for HIV transmission between their country of origin and their country of residence. It has been estimated, for example, that about 6% of the Surinamese and Antillean migrant population act as a potential bridge population for HIV transmission in The Netherlands52. Molecular epidemiological data shows, however, that HIV transmission occurs mostly within migrant communities, whereas transmission between Surinamese, Antillean, and Dutch individuals living in The Netherlands are rare53. Consequently, the BCAR strains potentially introduced into The Netherlands from Suriname and the Netherlands Antilles should not be expected to fuel large outbreaks among indigenous Dutch population.

Although the time-scale here reconstructed was largely determined by the informative prior distribution specified on substitution rate, the TMRCA estimated for subtype B/D, subtype B and the BPANDEMIC clades were fully consistent with those obtained in previous studies12,49,50, thus indicating that our informative prior produced reliable time-scale estimates. According to our estimations, the TMRCA of most (77%) international BCAR clades detected in North America and Europe was traced to between the early 1970 s and the early 1980 s, whereas the onset date of a significant fraction (40%) of country-specific BCAR clades detected in those regions was traced to between the late 1970 s and the late 1980 s. This demonstrates that despite their low prevalence, the BCAR strains were introduced and circulate in North America and Europe since the early stages of the AIDS epidemic. In sharp contrast to the BPANDEMIC clade, however, all BCAR introductions in North America and Europe seem to have resulted in dead-end infections or in outbreaks of small size, as previously demonstrated for most Latin American countries20.

One interesting question is why BCAR strains introduced into North America and Europe fail to establish large secondary outbreaks. One hypothesis is that the BPANDEMIC clade displays a higher transmissibility than BCAR clades, but some evidences argue against it. First, analysis of partial genome regions revealed a paucity of amino acid substitutions mapping onto the branch leading to the BPANDEMIC ancestor12, suggesting that this clade probably possessed no selective advantage over BCAR strains. Second, in several Caribbean countries where both pandemic and non-pandemic lineages co-circulate, the BCAR clades reached a much higher prevalence than the BPANDEMIC one19,20. Another hypothesis is that the epidemic outcome of different subtype B strains was mostly shaped by ecological factors, like the chance of colonization of different populations. It is possible that the BPANDEMIC ancestor was the only subtype B Caribbean strain to gain access to the globally interconnected populations of MSM and IDU from the US, Europe and Latin America at the early stages of the epidemic15, thus resulting in a much more efficient dissemination at both local and global level.

In conclusion, this study demonstrates that several non-pandemic HIV-1 BCAR strains have been introduced from the Caribbean into North America and Europe since the early 1970 s onwards. Some BCAR strains were spread locally in the US, Canada and several European countries. We further detected short-distance spreading of BCAR lineages between neighboring European countries and long-distance disseminations between the US and Europe. Despite their early and frequent introduction, the BCAR strains only comprise a very low fraction of all HIV-1 subtype B infections from North America (3%) and Europe (1%). The epidemiological characterization of the transmission networks that sustain the dissemination of the BCAR clades in North America and Europe will be of paramount importance to determine why no large BCAR outbreaks have been established in those regions.

Additional Information

How to cite this article: Cabello, M. et al. Multiple introductions and onward transmission of non-pandemic HIV-1 subtype B strains in North America and Europe. Sci. Rep. 6, 33971; doi: 10.1038/srep33971 (2016).

Supplementary Material

Supplementary Information
srep33971-s1.pdf (1.8MB, pdf)

Acknowledgments

This work was supported by Public Health Service grant 490378/2013-7 from the CNPq (Brazil) - MEC (Uruguay).

Footnotes

Author Contributions The study was conceived and designed by G.B. and H.R. M.C. performed the data acquisition and determination of HIV subtype, H.R. performed phylogenetic inference and G.B. performed the phylogeographic inference. G.B. wrote the manuscript and M.C. produced all figures. All of the authors reviewed the final manuscript.

References

  1. UNAIDS. Report on the global AIDS epidemic, 2013. Available at http://www.unaids.org/en/media/unaids/contentassets/documents/epidemiology/2013/gr2013/UNAIDS_Global_Report_2013_en.pdf. (Accessed 10 August 2016).
  2. Centers for Disease Control and Prevention (CDC). HIV Surveillance Report, 2014. Available at: http://www.cdc.gov/hiv/library/reports/surveillance/. (Accessed 10 August 2016).
  3. Public Health Agency of Canada. Summary: Estimates of HIV incidence, prevalence and proportion undiagnosed in Canada, 2014. Available at: http://healthycanadians.gc.ca/publications/diseases-conditions-maladies-affections/hiv-aids-estimates-2014-vih-sida-estimations/alt/hiv-aids-estimates-2014-vih-sida-estimations-eng.pdf. (Accessed 10 August 2016).
  4. Pharris A. et al. Trends in HIV surveillance data in the EU/EEA, 2005 to 2014: new HIV diagnoses still increasing in men who have sex with men. Euro surveillance: European communicable disease bulletin . 20, (2015). [DOI] [PubMed] [Google Scholar]
  5. Hemelaar J., Gouws E., Ghys P. D. & Osmanov S. Global trends in molecular epidemiology of HIV-1 during 2000-2007. AIDS . 25, 679–689 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Pyne M. T., Hackett J. Jr., Holzmayer V. & Hillyard D. R. Large-scale analysis of the prevalence and geographic distribution of HIV-1 non-B variants in the United States. J Clin Microbiol . 51, 2662–2669 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Siemieniuk R. A., Beckthold B. & Gill M. J. Increasing HIV subtype diversity and its clinical implications in a sentinel North American population. The Canadian journal of infectious diseases & medical microbiology . 24, 69–73 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Stanojevic M. et al. HIV1 molecular epidemiology in the Balkans: a melting pot for high genetic diversity. AIDS Rev. 14, 28–36 (2012). [PubMed] [Google Scholar]
  9. Abecasis A. B. et al. HIV-1 subtype distribution and its demographic determinants in newly diagnosed patients in Europe suggest highly compartmentalized epidemics. Retrovirology . 10, 7 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Beloukas A. et al. Molecular epidemiology of HIV-1 infection in Europe: An overview. Infect Genet Evol. In press (2016). [DOI] [PubMed] [Google Scholar]
  11. Thomson M. M. & Najera R. Increasing HIV-1 genetic diversity in Europe. J Infect Dis . 196, 1120–1124 (2007). [DOI] [PubMed] [Google Scholar]
  12. Gilbert M. T. et al. The emergence of HIV/AIDS in the Americas and beyond. Proc Natl Acad Sci USA. 104, 18566–18570 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Lukashov V. V., Kuiken C. L., Vlahov D., Coutinho R. A. & Goudsmit J. Evidence for HIV type 1 strains of U.S. intravenous drug users as founders of AIDS epidemic among intravenous drug users in northern Europe. AIDS Res Hum Retroviruses . 12, 1179–1183 (1996). [DOI] [PubMed] [Google Scholar]
  14. Kuiken C., Thakallapalli R., Esklid A. & de Ronde A. Genetic analysis reveals epidemiologic patterns in the spread of human immunodeficiency virus. Am J Epidemiol. 152, 814–822 (2000). [DOI] [PubMed] [Google Scholar]
  15. Junqueira D. M. & Almeida S. E. HIV-1 subtype B: Traces of a pandemic. Virology . 495, 173–184 (2016). [DOI] [PubMed] [Google Scholar]
  16. Hue S., Pillay D., Clewley J. P. & Pybus O. G. Genetic analysis reveals the complex structure of HIV-1 transmission within defined risk groups. Proc Natl Acad Sci USA 102, 4425–4429 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Paraskevis D. et al. Tracing the HIV-1 subtype B mobility in Europe: a phylogeographic approach. Retrovirology . 6, 49 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Frentz D. et al. Limited cross-border infections in patients newly diagnosed with HIV in Europe. Retrovirology . 10, 36 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Cabello M., Mendoza Y. & Bello G. Spatiotemporal dynamics of dissemination of non-pandemic HIV-1 subtype B clades in the Caribbean region. PLoS One . 9, e106045 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Cabello M., Junqueira D. M. & Bello G. Dissemination of nonpandemic Caribbean HIV-1 subtype B clades in Latin America. AIDS . 29, 483–492 (2015). [DOI] [PubMed] [Google Scholar]
  21. Mendoza Y. et al. Human Immunodeficiency Virus Type 1 (HIV-1) Subtype B Epidemic in Panama Is Mainly Driven by Dissemination of Country-Specific Clades. PLoS One 9(4), e95360 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Nurse K. Diaspora, Migration and Development in the Caribbean. Policy Paper. Canadian Foundation for the Americas (FOCAL). (2004). Available at http://focal.ca/pdf/migration_Nurse_diaspora migration development Caribbean_September 2004_FPP-04-6.pdf. (Accessed 10 August 2016).
  23. Borland R. et al. HIV/AIDS and mobile populations in the Caribbean: a baseline assessment. International Organization for Migration (IOM). (2004) Available at https://publications.iom.int/books/hivaids-and-mobile-populations-caribbean-baseline-assessment. (Accessed 10 August 2016).
  24. Cordova Alcaraz R. Migratory routes and dynamics between Latin American and Caribbean (LAC) countries and between LAC and the European Union. International Organization for Migration (IOM). (2012) Available at https://publications.iom.int/system/files/pdf/migration_routes_digital.pdf. (Accessed 10 August 2016).
  25. de Oliveira T. et al. An automated genotyping system for analysis of HIV-1 and other microbial sequences. Bioinformatics . 21, 3797–3800 (2005). [DOI] [PubMed] [Google Scholar]
  26. Thompson J. D., Gibson T. J., Plewniak F., Jeanmougin F. & Higgins D. G. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25, 4876–4882 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Berger S. A., Krompass D. & Stamatakis A. Performance, accuracy, and Web server for evolutionary placement of short sequence reads under maximum likelihood. Syst Biol . 60, 291–302 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics . 30, 1312–1313 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Guindon S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol . 59, 307–321 (2010). [DOI] [PubMed] [Google Scholar]
  30. Guindon S., Lethiec F., Duroux P. & Gascuel O. PHYML Online--a web server for fast maximum likelihood-based phylogenetic inference. Nucleic Acids Res . 33, W557–W559 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Posada D. jModelTest: phylogenetic model averaging. Mol Biol Evol. 25, 1253–1256 (2008). [DOI] [PubMed] [Google Scholar]
  32. Anisimova M. & Gascuel O. Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative. Syst Biol. 55, 539–552 (2006). [DOI] [PubMed] [Google Scholar]
  33. Anisimova M., Gil M., Dufayard J. F., Dessimoz C. & Gascuel O. Survey of branch support methods demonstrates accuracy, power, and robustness of fast likelihood-based approximation schemes. Syst Biol . 60, 685–699 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Rambaut A. FigTree v1.4: Tree Figure Drawing Tool. Available at http://tree.bio.ed.ac.uk/software/figtree/. (Accessed 10 August 2016)
  35. Drummond A. J., Nicholls G. K., Rodrigo A. G. & Solomon W. Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data. Genetics. 161, 1307–1320 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Drummond A. J. & Rambaut A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol . 7, 214 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Suchard M. A. & Rambaut A. Many-core algorithms for statistical phylogenetics. Bioinformatics . 25, 1370–1376 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Drummond A. J., Ho S. Y., Phillips M. J. & Rambaut A. Relaxed phylogenetics and dating with confidence. PLoS Biol . 4, e88 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Drummond A. J., Rambaut A., Shapiro B. & Pybus O. G. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol . 22, 1185–1192 (2005). [DOI] [PubMed] [Google Scholar]
  40. Rambaut A., Lam T. T., Carvalho L. M. & Pybus O. G. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evolution 2, vew007 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Zehender G. et al. Population dynamics of HIV-1 subtype B in a cohort of men-having-sex-with-men in Rome, Italy. J Acquir Immune Defic Syndr . 55, 156–160 (2010). [DOI] [PubMed] [Google Scholar]
  42. Chen J. H. et al. Phylodynamics of HIV-1 subtype B among the men-having-sex-with-men (MSM) population in Hong Kong. PLoS One. 6, e25286 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Lemey P., Rambaut A., Drummond A. J. & Suchard M. A. Bayesian phylogeography finds its roots. PLoS Comput Biol . 5, e1000520 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Ferreira M. A. R. & M.A. S. Bayesian analysis of elapsed times in continuous-time Markov chains. Canadian Journal of Statistics . 26, 355–368 (2008). [Google Scholar]
  45. Rambaut A. & Drummond A. Tracer v1.6. Available at http://tree.bio.ed.ac.uk/software/tracer/. (Accessed 10 August 2016).
  46. Huelsenbeck J. P., Larget B., Miller R. E. & Ronquist F. Potential applications and pitfalls of Bayesian inference of phylogeny. Syst Biol . 51, 673–688 (2002). [DOI] [PubMed] [Google Scholar]
  47. Bielejec F., Rambaut A., Suchard M. A. & Lemey P. SPREAD: spatial phylogenetic reconstruction of evolutionary dynamics. Bioinformatics . 27, 2910–2912 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Faria N. R. et al. Phylogeographical footprint of colonial history in the global dispersal of human immunodeficiency virus type 2 group A. J Gen Virol . 93, 889–899 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Junqueira D. M. et al. Reviewing the history of HIV-1: spread of subtype B in the Americas. PLoS One . 6, e27489 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Faria N. R. et al. HIV epidemiology. The early spread and epidemic ignition of HIV-1 in human populations. Science . 346, 56–61 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Caribbean Tourism Organization (CTO). Caribbean Tourism Review 2014. Available at http://www.onecaribbean.org/statistics/annual-reviews-prospects/. (Accessed 10 August 2016).
  52. Kramer M. A. et al. Migrants travelling to their country of origin: a bridge population for HIV transmission? Sexually transmitted infections . 84, 554–555 (2008). [DOI] [PubMed] [Google Scholar]
  53. Kramer M. A. et al. HIV transmission patterns among The Netherlands, Suriname, and The Netherlands Antilles: a molecular epidemiological study. AIDS Res Hum Retroviruses . 27, 123–130 (2011). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information
srep33971-s1.pdf (1.8MB, pdf)

Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES