Skip to main content
Evolution, Medicine, and Public Health logoLink to Evolution, Medicine, and Public Health
. 2013 Sep 13;2013(1):209–224. doi: 10.1093/emph/eot019

How effectively can HIV phylogenies be used to measure heritability?

George Shirreff 1,2,*, Samuel Alizon 3, Anne Cori 1, Huldrych F Günthard 4, Oliver Laeyendecker 5,6, Ard van Sighem 7, Daniela Bezemer 7, Christophe Fraser 1
PMCID: PMC3850537  PMID: 24481201

Abstract

Background and objectives: The severity of HIV-1 infection, measured by set-point viral load (SPVL), is highly variable between individuals. Its heritability between infections quantifies the control the pathogen genotype has over disease severity. Heritability estimates vary widely between studies, but differences in methods make comparison difficult. Phylogenetic comparative analysis offers measures of phylogenetic signal, but it is unclear how to interpret them in terms of the fraction of variance in SPVL controlled by the virus genotype.

Methodology: We present computational methods which link statistics summarizing phylogenetic signal to heritability, h2 in order to test for and quantify it. We re-analyse data from Switzerland and Uganda, and apply it to new data from the Netherlands. We systematically compare established and new (e.g. phylogenetic pairs, PP) phylogenetic signal statistics.

Results: Heritability estimates varied by method and dataset. Several methods were consistently able to detect simulated heritability above Inline graphic, but none below. Pagel’s λ was the most robust and sensitive. The PP method found no heritability in the Netherlands data, whereas Pagel’s λ found significant heritability only in a narrow subdivision (P =0.038). Heritability was estimated at h2=0.52 (95% confidence interval 0.00–0.63).

Conclusions and implications: This standardized measure, h2, allows comparability of heritability between cohorts. We confirm high heritability in Swiss data, but neither in Ugandan data nor in the Netherlands, where it is barely significant or undetectable. Existing phylogenetic methods are ill-suited for detecting heritability below Inline graphic, which may nonetheless be biologically important.

Keywords: HIV-1, heritability, phylogenetic comparative analysis, set-point viral load, phylogenetic signal, virulence

BACKGROUND AND OBJECTIVES

HIV has a high mutation rate [1, 2] and daily turnover [3, 4] and therefore adapts rapidly under local selective pressure from the immune system [5, 6, 7] or antiretroviral drugs [8, 9]. Increasingly there is interest in the transmission of escape [10, 11] or drug resistance mutations [12], which may enable viral adaptation to the host population. The rate at which a trait evolves in response to natural selection is determined by its heritability.

Recent work has suggested that virulence may also evolve at the population level [13] by natural selection towards a level optimal for transmission [14, 15]. Variation in virulence has a large impact on mortality and morbidity, so its evolutionary potential may present challenges or provide opportunities in public health interventions [16]. For example, vaccines which reduce growth rate or toxicity are predicted to reduce the costs of virulence, to the vaccinated host and the pathogen. This potentially raises the optimal virulence, resulting in poorer outcomes for the unvaccinated individuals [17].

Virulence in HIV is well approximated by set-point viral load (SPVL), which refers to the density of virions in the blood during asymptomatic infection. SPVL is an early prognostic indicator for AIDS, as it varies by orders of magnitude between individuals, with high values having faster CD4 cell decline, progressing more rapidly to AIDS and death [18, 19, 20]. However, it is relatively stable within the individual [21] meaning that it can be measured at a wide range of time points in an individuals’ infection [22].

Many host factors influence SPVL (Human Leukocyte Antigen (HLA) type (reviewed in [23])), sex [24], ethnicity [25], age [26], co-infections [27, 28]). Specific human genetic markers have been identified to which 13% of SPVL variation can be attributed, with a further 9% explained by age, sex and population structure [29]. Recently, several studies have indicated that viral factors play a substantial role in SPVL variation by measuring its heritability between infections (reviewed in [30]). Most of these quantify the similarity in SPVL within transmission pairs, which are sexual couples in which one has infected the other [31–34].

The phenotype of any organism is controlled partly by its genome, and partly by its environment. Throughout this work we define heritability, h2, in the broad sense as the proportion of total phenotypic variance (Inline graphic) ascribed to genetic variance (Inline graphic) [equation (1), [35]]. In the environmental component of variance, Inline graphic, we conceptually include all host genetic and non-genetic effects, as well as interactions between host and virus genotype.

graphic file with name eot019m1.jpg (1)

Alizon et al. [36] used a phylogenetic comparative approach to identify phylogenetic signal as a measure of heritability, without requiring behavioural data. Phylogenetic signal is the extent to which individuals with similar traits can be observed to cluster together on the phylogeny. This approach has the advantage that any sample of well-characterized patients could be analysed in this way. However, the authors did not account for cofactors such as age and co-infections, which influence the SPVL and may cluster together on the phylogeny. It is also uncertain exactly how the quantity measured by the two methods used (Pagel’s λ and Blomberg’s K) should be interpreted in the context of heritability. We herein propose a new approach which also uses only the phylogenetic relationships to determine genetic proximity of viruses, but additionally links the results to true heritability, and in some cases allows inclusion of the effect of cofactors on SPVL.

The aims of this study were to evaluate the phylogenetic approach for estimating heritability, to compare the efficacy of the various statistics available for quantifying heritability on simulated and real data, to use these methods to confirm the presence of heritability in previously analysed data, and to measure heritability in a dataset from the Netherlands which has not been previously analysed for this purpose.

METHODOLOGY

Data

Data from Rakai, Uganda

The study population was enrolled in the Rakai Community Cohort Study in the rural Rakai District of south-western Uganda. The study methods for this cohort have been outlined elsewhere [34, 37, 38]. We used those individuals who were sampled at a single time point in April 1995 (n =332). Characteristics of the cohort are shown in Supplementary Table S1. Serum samples were used to measure SPVL, with most individuals providing a single measurement.

The phylogenetic analysis was conducted in RAxML [39] using the General Time Reversible substitution model [40], the Γ model of rate heterogeneity with four distinct rate categories and a proportion of invariant sites [41] (GTR+Γ+I), which had the best score when the alignment was analysed with ModelTest [42]. The alignment was also analysed using the Recombination Analysis Tool (RAT [43]), which found no apparent recombination within the studied genes (data not shown).

A Simian Immunodeficiency Virus (SIV) outgroup was included (accession number: AB177846.1). The phylogeny was constructed from either the gp41 (env) or p24 (gag) sequences (Supplementary Figs S1 and S2), which were analysed separately because they have different substitution rates [44]. We also performed the phylogenetic and heritability analysis on the subtype A and D sequences separately, due to apparent imbalance in the joint trees.

Data from Switzerland

The Swiss data were taken from the Swiss HIV Cohort Study and its integrated genotypic drug resistance database which has been described elsewhere [45, 46], and the specific selection has been analysed in previous work [36, 47, 48]. The phylogeny, constructed from the reverse transcriptase and protease components of the pol gene (Supplementary Fig. S3), had been inferred in this previous work [36] using PhyML [49] from subtype B infected individuals for whom at least three viral load measurements were available after primary infection and before anti-retroviral therapy (n =661). Characteristics of the cohort are shown in Supplementary Table S2.

Note that PhyML was used for the Swiss data, compared with RAxML in the other two datasets. However, we expect the methods to produce very similar results as both use a hill-climbing algorithm, likelihood scores are well matched when using the same nucleotide substitution model, and there is no evidence for systematic differences in the results [50].

We analysed four subdivisions of the data as in previous work [36]. The entire dataset (‘All’) fit the Liberal definition for viral load variability, meaning that at least three consecutive viral load measurements within the asymptomatic window (6–36 months after first positive viral RNA) remained within a one-log band of one another. The ‘Strict’ subdivision included only those individuals for whom the measurements in the asymptomatic window all sat within the one-log band. Excluding all but men who have sex with men (MSM) led to the further subdivisions, ‘MSM’ and ‘MSM Strict’.

Data from Netherlands

The study cohort was provided by the ATHENA national observational cohort (seroconverters from 1996 or onwards) and the Amsterdam Cohort Studies (seroconverters before 1996) [51]. We included only individuals infected with subtype B between 1985 and 2008, for whom appropriate genetic data and SPVL were available [52] (n =416).

We define SPVL as the mean of all log10 viral load measurements taken 6–24 months after the midpoint between the last negative and the first positive diagnosis. The phylogeny was reconstructed using a sequence of length 2064 containing the same elements of the pol gene as for the Swiss data.

We excluded codons strongly associated with drug resistance mutations (for details, see Supplementary Fig. S4). Four subtype C individuals from the same cohort were used as the outgroup. The phylogenetic analysis was performed using ModelTest [42], RAxML [39] and RAT [43] in the same way as the Rakai cohort, which also identified the same nucleotide substitution model as appropriate, and no recombination was detected (data not shown).

As with the Swiss data, individuals were categorized as ‘All’, ‘Strict’, ‘MSM’ and ‘MSM Strict’. Additionally, two further categories, ‘MSM NL’ and ‘MSM Strict NL’, were created from the ‘MSM’ groups, which excluded individuals not originating in the Netherlands in order to further reduce confounding factors.

Trees for simulations were read and manipulated using the ape package [53] in R [54], which was also used to plot the trees.

Methods for calculating heritability statistics

A phylogeny, reconstructed from genetic data, is an approximation of the transmission network. Phylogenetic signal is a measure of how well trait values at the tree tips match their relative positions on the phylogeny, and several established methods are available to quantify this signal in terms of a single statistic: the Mantel test [55]; Blomberg’s independent contrasts, which give us the Blomberg’s K and PICv (variance of phylogenetic independent contrasts) statistics [56]; Pagel’s λ transformation [57]; and the Abouheif–Moran (AM) tests [58], of which there are five variants (‘oriAbouheif’, ‘sumDD’, ‘nNodes’, ‘patristic’, ‘Abouheif’, with the latter used by default). We also developed two new methods which allow control of cofactors, the phylogenetic pairs (PP) method and the hierarchical clustering (HC) method, which are described briefly here.

The PP method identifies pairs of individuals on the tree which are each other’s closest neighbour, and these are assumed to be transmission pairs. Analysis of variance (ANOVA) identifies the degree to which the transmission partner explains an individual’s SPVL. Crucially, the ANOVA approach allows for the inclusion of cofactors. These are age, sex and genital ulcer disease in the Rakai dataset (Supplementary Table S1); age, sex and risk group in the Swiss dataset (Supplementary Table S2); and age, sex, risk group, region of origin and the type of assay used to measure viral load in the Netherlands dataset (Supplementary Table S3). This method also ignores individuals who are not part of a phylogenetic pair.

The HC method is similar but considers larger clusters of individuals identified on the phylogeny by a threshold branch length, and examines the amount of variance in SPVL explained by the cluster. Because there is no intuitive ideal cluster size, proportion included or number of clusters to use, the method integrates over the range of clustering distances.

All established and new methods are described in detail in the Supplementary data.

Randomization test

The significance of a test statistic can be measured as in [56] by comparing the statistic derived from the data with its distribution under no heritability, which is the null hypothesis. Randomizing the tips of the tree by randomly reallocating the tips scrambles any heritability signal. This is performed 1000 times, and the analysis is repeated for each. The proportion of randomized datasets that give a statistic higher than the true value is the one-tailed P-value for presence of heritability. When the randomization test was performed for the PP and HC statistics the cofactors remained with their corresponding SPVL data.

Method of simulating SPVL data on a known phylogeny

This method uses a simple algorithm to simulate evolution of a continuous trait on a known phylogeny for a given heritability. This method is similar to that used in previous work [36] and is a variation on the Ornstein–Uhlenbeck process [59, 60], which allows Brownian motion to occur while constraining the distribution of the population.

During the simulation, each node on the tree is assigned a trait value in log10 SPVL, beginning from the root, which is assigned the mean of the true SPVL data. Each daughter node is given a SPVL value depending on that of its parent, and on the SPVL distribution in the whole population. The higher the heritability, the more the value depends on the parent.

The SPVL at each subsequent daughter node, VD [equation (2)], is derived from the parent node, VP, the value h2, and the random variable M which is normally distributed according to the mean and variance of the population log10 SPVL (distribution 3).

graphic file with name eot019m2.jpg (2)
graphic file with name eot019m3.jpg (3)

The value h2 is therefore the regression slope between the trait values at a parent node, VP, and a daughter node, VD (or an index and secondary case). This has been demonstrated to be equal to the broad sense heritability, h2 as defined in equation (1) [61], and its further implications are discussed in the Supplementary data. The result is a set of data at the tips of the tree, simulated with known heritability h2, which is used for analysis. We have also explored an alternative method of simulation which allows for multiple transmission between nodes on the tree, which is also described in the Supplementary data.

Multiple hypothesis testing to estimating true heritability and confidence intervals

For each value of h2 between 0 and 1, in increments of 0.01, 100 simulations are performed and the relevant phylogenetic comparative statistics is calculated. A hypothesis test for the particular h2 is then performed with these 100 values. They are compared with the statistic calculated from the true data, and the proportion which is lower than the true statistic becomes the probability P that the data are consistent with that value of h2. The values of h2 which produce P-values closest to 0.025, 0.975 and 0.500 become the lower and upper 95% confidence bounds, and the median estimate of h2, respectively.

A visual distribution of h2 is estimated using approximate Bayesian computation (ABC), of which details are given in the Supplementary data.

Estimate power to detect an effect

To examine the ability of each statistic to detect non-null heritability, we simulated 100 datasets for each value of a range (0–1) of h2 values in increments of 0.05. To visualize the relationship between h2 and the statistic of interest, we calculated the mean and standard deviation of the phylogenetic comparative statistics calculated from these 100 simulations. In order to estimate the power of each statistic to detect a significant effect at each value of h2, we performed a randomization test on each simulated dataset, as described above (see ‘Randomization test’), but with 100 randomizations. For each simulated dataset, the proportion of randomization tests which find significant (Inline graphic) heritability represents the power to detect an effect at that value of h2.

All methods are available in R code on request from the corresponding author.

RESULTS

Testing significance in previously analysed data

The significance of heritability from the randomization test for the Rakai and Swiss data is shown in Tables 1 and 2, respectively.

Table 1.

The statistics (Z), P-values from a randomization test, medians and confidence intervals of h2 from MHT on the Rakai data

graphic file with name eot019t1.jpg

This table included from analysis of the gp41 and p24 genes in all the available Rakai data, or subtypes A or D separately. P-values showing borderline significance (P <0.1) are in blue, and formal significance (P <0.05) is in red, and confidence intervals in which the lower limit is above zero are also in red. n.d., not done.

Table 2.

The statistics (Z), P-values from a randomization test, medians and confidence intervals of h2 from MHT on the Swiss data

graphic file with name eot019t2.jpg

Individuals were subdivided according to variability of viral load (Strict), risk category (MSM) or both. P-values showing borderline significance (P <0.1) are in blue, and formal significance (P <0.05) is in red, and confidence intervals in which the lower limit is above zero are also in red. n.d., not done.

A consistent signal was not found in the Rakai data (Table 1), but in the Swiss data it was more so (Table 2), and was higher for the smaller subdivisions. We tested the possibility that high significance in the smallest group was an artefact of small tree size, but this was not the case (Supplementary Table S5). We also tested the robustness of the results to phylogenetic uncertainty and found that small levels of perturbation (3%) have little effect on the results, but as the level of perturbation increases, the signal weakens accordingly (Supplementary Table S6).

Estimates of heritability in previously analysed data

Tables 1 and 2 show the medians and confidence intervals of h2 as estimated by multiple hypothesis testing (MHT). In the Swiss data, the confidence intervals exclude zero in several cases. In all of these instances, the randomization test was also significant. However, a significant randomization test did not always correspond to confidence intervals which excluded zero, because these are one- and two-tailed tests, respectively.

The distribution of true heritability was calculated by ABC using both PP (red) and Pagel’s λ (blue) for each subdivision in the Swiss and Rakai datasets (Figs 1 and 2). In the Swiss figure, the MSM Strict subdivision exhibits the distribution of h2 most removed from zero, which supports the results from the randomization test and lower confidence bounds (Table 2). The method using Pagel’s λ exhibits higher heritability in the larger subdivisions.

Figure 1.

Figure 1.

Heritability estimated by ABC in all subdivisions of the Swiss data. The subdivision is written above the plot. Results from the PP method are in red, and from Pagel’s λ in blue, with the overlap in purple

Figure 2.

Figure 2.

Heritability estimated by ABC in all subdivisions of the Rakai data. The subdivision is written above the plot. Results from the PP method are in red, and from Pagel’s λ in blue, with the overlap in purple

Similarly in the Rakai cohort, the most significant result from the randomization test (Table 1) is the p24 subdivision analysed under Pagel’s λ, and this is reflected in the results from ABC, in which the distribution of estimated h2 is highly positive. Interestingly, the PP method finds that the most significant distribution visually is instead from the gp41 subdivision which is also confirmed by the randomization test.

We also tested an alternative method of simulation which accounted for differences in branch length and allowed multiple generations between two adjacent nodes. We applied this to the Rakai p24 subtype A data (Supplementary Table S7), and the alternative gave higher estimates of h2 but wider confidence intervals.

Heritability was also measured in the Rakai and Swiss data using a phylogenetic mixed model, which assumes that the trait is determined by independent viral and host effects [62]. These results are given in Supplementary Table S4.

Testing the sensitivity of each statistic

The sensitivity of each phylogenetic comparative statistic to simulated heritability was explored by visualizing the relationship between them, and measuring the power of the statistic to successfully detect an effect of that size, in the entire Swiss dataset and the MSM Strict subdivision (Figs 3 and 4, respectively). None of the statistics had substantial power to detect heritability below h2=0.4 on any subdivision, but most had the power to detect an effect above that level. The Mantel test performed poorly throughout. Blomberg’s K performed well in the MSM Strict subdivision, but was insensitive when applied to the entire Swiss dataset, with only values above h2=0.6 being consistently detected. In the entire Swiss dataset, the AM statistic appeared the most sensitive, but in the MSM Strict subdivision its performance matched that of Pagel’s λ.

Figure 3.

Figure 3.

Comparison of the sensitivity of various statistics to heritability on the entire Swiss phylogeny. (Left side) The relationship between heritability and six different statistics under comparison. The circles and bars represent the mean and standard deviation of the sample. (Right side) The power of each statistic to detect heritability at 5% significance. The bars represent the standard deviation of the proportion

Figure 4.

Figure 4.

Comparison of the sensitivity of various statistics on the Swiss MSM Strict phylogeny. Left and right side as in Figure 3

We also tested the power of variants of the HC and AM tests compared with their default methods (‘ward’ and ‘Abouheif’, respectively) (Supplementary Figs S5 and S6). The default methods performed equally well or better than any of the variations.

Application to data from the Netherlands

Supplementary Figure S4 shows the shape of the phylogeny inferred from the Netherlands data. These data have not been previously analysed for heritability, and so to reduce the problem of multiple testing only two methods, Pagel’s λ and PP, were used to detect and measure heritability in each subdivision of the data.

Significant heritability was found only in one subdivision (MSM from the Netherlands with Strict viral loads) and using one statistic (Pagel’s λ), which gave a heritability estimate of h2=0.52 (Table 3). No effect was found when the PP method was used. None of the confidence intervals on h2 excluded 0. The estimated distributions of h2 are shown in Fig. 5.

Table 3.

The statistics (Z), P-values from a randomization test, medians and confidence intervals of h2 from MHT on the Netherlands data, when analyzed with PP and Pagel's λ

graphic file with name eot019t3.jpg

Individuals were subdivided according to variability of viral load (Strict), risk category (MSM) or whether they were from the Netherlands (NL). P-values showing borderline significance (P <0.1) are in blue, and formal significance (P <0.05) is in red, and confidence intervals in which the lower limit is above zero are also in red.

Figure 5.

Figure 5.

Heritability estimated by ABC in all subdivisions of the Netherlands data. The subdivision is written above the plot. Results from the PP method are in red, and from Pagel’s λ in blue, with the overlap in purple

DISCUSSION

In this study, we developed new methods for detecting and measuring heritability of SPVL using phylogenetic relationships and compared them with established methods on real and simulated data.

Which cohorts exhibit heritability?

Heritability was detected consistently in the MSM Strict subdivision of the Swiss cohort (Table 2), supporting the previous study of these data which found significant and high heritability in that subdivision [36]. Note that in that study, the level of significance was detected by randomizing the PICv rather than the K statistic, hence the strong result for PICv in the MSM Strict subdivision. We also found that several statistics uncovered significant heritability in all of the subdivisions of the Swiss data. In many such cases, the confidence intervals of h2 also excluded zero, and the estimates for h2 were high (0.44–0.68).

In spite of significant heritability being found in previous work on the Rakai data [34], we did not consistently find heritability in this cohort. Although they are not strictly comparable, the results from the previous studies of the Rakai and Swiss data suggest that heritability is higher in the latter cohort, and this may underlie the differences in our results. However, the best estimate for h2 from the Rakai data was high (0.51) (Table 1).

Six different subdivisions of the Netherlands data were analysed using PP and Pagel’s λ, and only a single positive result was found with one statistic (λ) in one subdivision, suggesting that heritability is close to the detection borderline. It was estimated at h2=0.52 (Table 3), which falls high within the range of previous estimates [30].

It is interesting that the Swiss cohort shows more apparent heritability than the Netherlands in spite of a similar transmission routes and genetic backgrounds. One potential explanation would be differences in coverage. The prevalence in Switzerland is higher (0.4% compared with 0.2%) [63], but so is the sample size (661 compared with 416) which suggests similar levels of coverage, which are reportedly high in both cohorts [47, 64]. In the Swiss dataset, the mean SPVL of the Strict group is significantly higher than that of the rest (P =0.0004), and the same is true of the MSM over non-MSM (P =0.003) [36]. The same is not true of these two categories in the Netherlands cohort (data not shown), which suggests that viral virulence genotypes are less structured in this cohort and may explain lower heritability.

The exclusion of non-Strict individuals generally increased the level of significance, but this was dependent on the cohort and the method. In the Swiss cohort, the exclusion increased significance when using the PP and HC methods, but not otherwise (Table 2). In the Netherlands cohort, the exclusion increased significance markedly when using Pagel’s λ, but not as much using PP (Table 3). Understanding why the Strict population exhibits higher heritability than whole sample may be an important step in both estimating and understanding the mechanisms behind heritability. Simple models fitted to longitudinal viral load data within patients suggest that fluctuations do not just represent random noise [65]. Fluctuating viral load has been associated with untreated sexually transmitted co-infections [66] and this or other host-mediated effect may mask the effect of the viral genotype and justify the exclusion of individuals with highly variable viral loads.

The same gene pol was used to analyse the Swiss and Netherlands datasets, whereas the available genes from the Rakai cohort were env and gag. It has been shown that env and pol produce similar trees [67], but there are topological differences, and our work also demonstrates different results between gag and env (Fig. 2). The ABC method of estimation by simulation makes estimation of h2 robust to differences in gene usage and we do not expect systematic differences between the cohorts. There are also differences in transmission routes between the cohorts, and a higher diversity of HLA types in African than European populations [68]. Although these differences inhibit direct comparability between the European and Ugandan datasets, they add to the general nature of this work.

Which method is best for detecting heritability?

The principle test of these statistics is the detection of an effect in real data. Pagel’s λ detected heritability in every subdivision of the Swiss data (Table 2), and also produced the strongest result of any statistic applied to the Rakai dataset (Table 1). The PP and HC statistics performed well on the Swiss data, particularly on the Strict subdivisions. The AM statistics were less successful at detecting an effect in the Swiss data. In the Rakai data, the AM ‘patristic’ variant found a significant result in two subdivisions. However, in simulation studies it performed very poorly (Supplementary Fig. S6).

Testing the detection power by simulation relies on a simple model of trait evolution, but has the advantage that heritability is known. It revealed that the PP, HC, Pagel’s λ and AM statistics were comparably sensitive, detecting an effect at greater than approximately 0.4 heritability in the Swiss data, with AM slightly more sensitive (Figs 3 and 4). The K statistic was as sensitive as other statistics when applied to the MSM Strict subdivision, but in the entire Swiss dataset its sensitivity was low.

Simulations suggest that none of the methods can detect heritability lower than approximately h2=0.4, and this threshold is higher in some phylogenies. This threshold is confirmed by the finding that estimates of h2 are always above 0.4 when heritability was found to be significant. Most studies have estimated lower heritability than this [30], and previous modelling work has suggested that such low heritability is enough to produce a substantial rate of evolution [15]. This suggests that phylogenetic methods are not adequate to exclude the possibility of relevant heritability in HIV virulence in these datasets.

Interestingly, another study which took an analytical and computational approach to comparing between Blomberg’s K, the AM and the Mantel test found that K had a higher power to detect an effect than the AM statistic [69]. They also argue that these tests should all give the same significance as they are based on the cross-product of a phylogenetic similarity and trait similarity matrix. In contrast, we found marked differences between their performances, with the AM, K and Mantel statistic having decreasing power to detect an effect. They found that the sensitivity of these methods was dependent on the shape of the phylogeny, so differences in the source of trees (simulated versus inferred) are a possible source of discrepancy in the respective studies. This is beyond the scope of this study but deserves to be the subject of future work.

The PP, along with the HC methods, is able to account for cofactors of SPVL: ignoring these may lead to overestimates or underestimates of heritability if they associate or dissociate on the tree, respectively. Indeed, an increase in signal when cofactors were taken into account was seen in previous analysis of transmission pairs in the Rakai cohort [34]. However, in the absence of a method to include the effect of cofactors in simulations, this aspect of the PP or HC methods cannot be harnessed to measure h2. One possible method would be to calculate the fixed effects of the cofactors using a linear regression, simulate only the residuals, and subsequently add the fixed effects, but this requires treating residuals as data, which is inappropriate in the likely event that there is correlation between the effects of cofactors and the SPVL [70]. The PP method produces a dataset of couples, which is analogous to other couples studies [31–34]. The lack of sensitivity of the PP method and the other phylogenetic methods suggests that the phylogeny cannot (yet) tell us everything that the epidemiology does about the transmission network.

The λ statistic has the advantage that it incorporates both topology and branch lengths, and analyses the entire sample. It is notable, therefore, that the PP method is relatively successful in spite of its analysing only a subset of individuals who form apparent transmission pairs (60%), and in particular ignores deep relationships within the phylogeny. This suggests that most signal lies in the recent phylogenetic relationships. However, in unpublished work, Hodcroft et al. found SPVL heritability using pedigree analysis on UK data [71]. In contrast to our work, they found that collapsing poorly supported nodes in the tree and thereby ignoring some of the shallow relationships in the phylogeny had a negligible effect on their results for some datasets. The AM method is also successful, which (with the exception of the ‘patristic’ variant) ignores branch lengths, indicating that topology may be more important. Rigorously identifying which clades or levels of the phylogeny are responsible for heritability would be an interesting direction for future research. This may differ for phylogenies which are less star-like than HIV. The use of longer sequences and better sampled datasets is likely to result in better detection and estimation of heritability, as poorly resolved trees scramble the heritability signal. However, the detection threshold is unlikely to change even with improved sampling, as it was based on simulations which were blind to uncertainty in the tree.

It is noteworthy that the PP and Pagel’s λ each have their strengths in estimating the distributions of h2 in the different Rakai subdivisions. The PP method identifies a more strongly positive distribution in subdivision gp41, which Pagel’s λ does not, but the latter detects a more positive distribution in p24 and subtype A gp41. This suggests that an approach which combines these methods may be appropriate.

CONCLUSIONS AND IMPLICATIONS

In this study, we compare several phylogenetic comparative methods to detect heritability, h2. Many methods detect heritability successfully in real and simulated data, but sensitivity drops off below h2=0.4. We recommend the PP method and Pagel’s λ for use in detecting and estimating heritability, the former for its consideration of co-factors, and the latter for its marginally higher level of sensitivity.

Estimates of heritability were consistent with previous studies on the Rakai and Swiss data, and confirm that heritability can be very high, which has clinical and evolutionary implications. When applied to the Netherlands data, heritability was found only in the most homogeneous subdivision, MSM who originate in the Netherlands with Strict viral loads. Differences in heritability between cohorts, subdivisions and methods for estimation carry implications for the biology of heritability, which offer interesting avenues for future modelling work. Experimental and epidemiological research are also required to directly identify viral factors which contribute to variance in SPVL, as well as exploring the impact of treatment on virulence evolution.

SUPPLEMENTARY DATA

Supplementary data are available at EMPH online.

Supplementary Data

ACKNOWLEDGEMENTS

For advice, discussions and technical help the authors would like to thank Luuk Gras, Frank de Wolf, Neil Ferguson, Andrew Rambaut, Simon Cauchemez, Lulla Opatowski, Oliver Rattman, Thibaut Jombart, Deirdre Hollingsworth, Federico Calboli, Jeff Eaton, Matthew Hartfield, Melissa Ward, Emma Hodcroft, Andrew Leigh Brown, Joshua Herbeck, Tanja Stadler, Roland Regoes and Sebastian Bonhoeffer. They also thank anonymous reviewers for their helpful comments which have helped strengthen this work.

Swiss HIV Cohort Study

The SHCS was supported by the Swiss National Science Foundation (SNF grant #33CS30–134277). Further support was provided by, SNF grant #324730–130865 (to H.F.G.) and the SHCS projects #470, 528, 569, 683, the SHCS Research Foundation, the Jacob-Stiftung, Zürich (to H.F.G.), by an unrestricted research grant of the Union Bank of Switzerland, in the name of an anonymous donor to HFG, and an unrestricted research from Gilead, Switzerland (to H.F.G.), and by the University of Zurichs Clinical research Priority Program (CRPP) ‘Viral infectious diseases: Zurich Primary HIV Infection Study’ (to H.F.G.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

The authors thank the patients who participate in the SHCS; the physicians and study nurses for excellent patient care; the resistance laboratories for high quality genotypic drug resistance testing; SmartGene, Zug, Switzerland for technical support; Brigitte Remy, Martin Rickenbach, MD, F. Schöni-Affolter and Yannick Vallet from the SHCS Data Center in Lausanne for the data management and Danièle Perraudin and Mirijam Minichiello for administrative assistance. The members of the Swiss HIV Cohort Study are Aubert V, Barth J, Battegay M, Bernasconi E, Böni J, Bucher HC, Burton-Jeangros C, Calmy A, Cavassini M, Egger M, Elzi L, Fehr J, Fellay J, Francioli P, Furrer H (Chairman of the Clinical and Laboratory Committee), Fux CA, Gorgievski M, Günthard H (President of the SHCS), Haerry D (deputy of ‘Positive Council’), Hasse B, Hirsch HH, Hirschel B, Hösli I, Kahlert C, Kaiser L, Keiser O, Kind C, Klimkait T, Kovari H, Ledergerber B, Martinetti G, Martinez de Tejada B, Metzner K, Mller N, Nadal D, Pantaleo G, Rauch A (Chairman of the Scientific Board), Regenass S, Rickenbach M (Head of Data Center), Rudin C (Chairman of the Mother and Child Substudy), Schmid P, Schultze D, Schöni-Affolter F, Schüpbach J, Speck R, Taffé P, Tarr P, Telenti A, Trkola A, Vernazza P, Weber R, Yerly S.

Rakai Health Sciences Programme

The study was funded in part by the Department of Intramural Research, National Institute of Allergy and Infectious Diseases [NIAID], National Institutes of Health [NIH] (to O.L.). The authors thank the participants and the study team of the Rakai Community Cohort Study, which was supported by (i) the Bill and Melinda Gates Foundation (22006.03), (ii) the National Institutes of Health (NIH), Division of Allergy and Infectious Diseases (U1AI51171 and 1UO1AI075115-O1A1), (iii) the Department of the Army, US Army Medical Research and Materiel Command Cooperative Agreement (DAMD17-98-2-8007), and (v) the Henry M. Jackson Foundation (5D43TW00010). They also thank the participants of the Rakai Health Sciences Program and the physicians, nurses, data managers and laboratory specialists who provide such excellent care.

The members of the Rakai Health Sciences Program are: Sewankambo N, Serwadda D, Kigozi G, Nalugoda F, Wabwire-Mangen F, Musoke R, Kagaayi J, Ssekubugu R, Lutalo T, Kiddugavu M, Makumbi F, Ssempijja V, Galiwango RM, Nantume B, Kiwanuka N, Boaz I, Kong X, Quinn TC, Reynolds SJ, Wawer MJ, Gray RH.

ATHENA National Observational Cohort, The Netherlands

This is a collaboration between the following treatment centres and individuals in the Netherlands. Director: Prof. Dr F. de Wolf. Data analysis group: Dr D. O. Bezemer, Dr L. A. J. Gras, Dr A. M. Kesselring, Dr A. I. van Sighem, Dr Ir. C. Smit. Data collection: Dr S. Zaheri. Participating centres (site coordinating physicians): Academisch Medisch Centrum bij de Universiteit van Amsterdam, Amsterdam: Prof. Dr J. M. Prins, Prof. Dr T. W. Kuijpers, Dr H. J. Scherpbier, Dr J. T. M. van der Meer, Dr F. W. M. N. Wit, Dr M. H. Godfried, Prof. Dr P. Reiss, Prof. Dr T. van der Poll, Dr F. J. B. Nellen, Prof. Dr J. M. A. Lange, Dr S. E. Geerlings, Dr M. van Vugt, Dr D. Pajkrt, Dr J. C. Bos, Dr M. van der Valk, Dr M. L. Grijsen, Dr W. J. Wiersinga. Academisch Ziekenhuis Maastricht, Maastricht: Dr S. Lowe, Dr G. Schreij, Dr A. Oude Lashof, Dr D. Posthouwer. Catharina-ziekenhuis, Eindhoven: Dr M. J. H. Pronk, Dr B. Bravenboer. Erasmus Medisch Centrum, Rotterdam: Dr M. E. van der Ende, Dr T. E. M. S. de Vries-Sluijs, Dr C. A. M. Schurink, Dr J. L. Nouwen, Dr M. H. Nispen tot Pannerden, Dr A. Verbon, Dr B. J. A. Rijnders, Dr E. C. M. van Gorp, Dr R. J. Hassing, Dr A. W. M. Smeulders. Erasmus Medisch Centrum, Sophia, Rotterdam: Dr N. G. Hartwig, Dr G. J. A. Driessen. Flevoziekenhuis. Almere: Dr J. Branger. HagaZiekenhuis, Den Haag: Dr E. F. Schippers, Dr C. van Nieuwkoop, Dr E. P. van Elzakker. Isala Klinieken, Zwolle: Dr P. H. P. Groeneveld, Dr M. A. Alleman, Dr J. W. Bouwhuis. Kennemer Gasthuis: Dr R. Soetekouw, Prof. Dr R. W. ten Kate. Leids Universitair Medisch Centrum, Leiden: Dr F. P. Kroon, Prof. Dr J. T. van Dissel, Dr S. M. Arend, Dr M. G. J. de Boer, Dr H. Jolink, Dr H. J. M. ter Vollaard, Dr M. P. Bauer. Maasstadziekenhuis, Rotterdam: Dr J. G. den Hollander, Dr K. Pogany. Medisch Centrum Alkmaar, Alkmaar: Dr G. van Twillert, Dr W. Kortmann. Medisch Centrum Haaglanden, Den Haag: Dr E. M. S. Leyten, Dr L. B. S. Gelinck. Medisch Spectrum Twente, Enschede: Dr G. J. Kootstra, Dr C. E. Delsing. Onze Lieve Vrouwe Gasthuis, Amsterdam: Prof. Dr K. Brinkman, Dr W. L. Blok, Dr P. H. J. Frissen, Dr W. E. M. Schouten, Dr G. E. L. van den Berk. Sint Elisabeth Ziekenhuis, Tilburg: Dr M. E. E. van Kasteren. Sint Lucas Andreas Ziekenhuis, Amsterdam: Dr J. Veenstra, Dr K. D. Lettinga. Slotervaartziekenhuis, Amsterdam: Dr J. W. Mulder, Dr S. M. E. Vrouenraets, Dr F. N. Lauw. Stichting Medisch Centrum Jan van Goyen, Amsterdam: Dr A. van Eeden, Dr D. W. M. Verhagen. Universitair Medisch Centrum Groningen, Groningen: Dr H. G. Sprenger, Dr R. Doedens, Dr E. H. Scholvinck, Dr S. van Assen, Dr W. F. W. Bierman. Universitair Medisch Centrum Sint Radboud, Nijmegen: Dr P. P. Koopmans, Dr M. Keuter, Dr A. J. A. M. van der Ven, Dr H. J. M. ter Hofstede, Dr A. S. M. Dofferhoff, Dr A. Warris, Dr R. van Crevel. Universitair Medisch Centrum Utrecht, Utrecht: Prof. Dr A. I. M. Hoepelman, Dr T. Mudrikova, Dr M. M. E. Schneider, Dr P. M. Ellerbroek, Dr J. J. Oosterheert, Dr J. E. Arends, Dr M. W. M. Wassenberg, Dr R. E. Barth. Vrije Universiteit Amsterdam, Amsterdam: Dr M. A. van Agtmael, Dr R. M. Perenboom, Dr F. A. P. Claessen, Dr M. Bomers, Dr E. J. G. Peters. Wilhelmina Kinderziekenhuis, Utrecht: Dr S. P. M. Geelen, Dr T. F. W. Wolfs, Dr L. J. Bont. Ziekenhuis Rijnstate, Arnhem: Dr C. Richter, Dr J. P. van der Berg, Dr E. H. Gisolf. Admiraal De Ruyter Ziekenhuis, Vlissingen: Dr M. van den Berge, Dr A. Stegeman. Medisch Centrum Leeuwarden, Leeuwarden: Dr M. G. A. van Vonderen, Dr D. P. F. van Houte. Medisch Centrum Zuiderzee, Lelystad: Dr S. Weijer, Dr R. el Moussaoui. Sint Elisabeth Hospitaal, Willemstad – Curaçao: Dr C. Winkel, Dr F. Muskiet, Dr Durand, Dr R. Voigt.

FUNDING

The authors were funded by the European Research Council (ERC), Imperial College London, the Medical Research Council (MRC) Centre for Outbreak Research and Modelling, le Centre National de la Recherche Scientifique (CNRS), l'Institute National de la Santé et de la Recherche Médicale (INSERM), l'Institut de Recherche pour le Développement (IRD), Division of Intramural Research of the National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), the Dutch Ministry of Health, and the Royal Society.

Conflict of interest: None declared.

REFERENCES

  • 1.Ji JP, Loeb LA. Fidelity of HIV-1 reverse-transcriptase copying a hypervariable region of the HIV-1 Env gene. Virology. 1994;199:323–30. doi: 10.1006/viro.1994.1130. [DOI] [PubMed] [Google Scholar]
  • 2.Alizon S, et al. Virulence evolution and the trade-off hypothesis: history, current state of affairs and the future. J Evol Biol. 2009;22:245–59. doi: 10.1111/j.1420-9101.2008.01658.x. [DOI] [PubMed] [Google Scholar]
  • 3.Ho DD, et al. Rapid turnover of plasma virions and Cd4 lymphocytes in HIV-1 infection. Nature. 1995;373:123–6. doi: 10.1038/373123a0. [DOI] [PubMed] [Google Scholar]
  • 4.Wei X, et al. Viral dynamics in human immunodeficiency virus type 1 infection. Nature. 1995;373:117–22. doi: 10.1038/373117a0. [DOI] [PubMed] [Google Scholar]
  • 5.Wolfs TFW, et al. Naturally-occurring mutations within HIV-1 V3 genomic RNA lead to antigenic variation dependent on a single amino-acid substitution. Virology. 1991;185:195–205. doi: 10.1016/0042-6822(91)90767-6. [DOI] [PubMed] [Google Scholar]
  • 6.Borrow P, et al. Antiviral pressure exerted by HIV-1-specific cytotoxic T lymphocytes (CTLs) during primary infection demonstrated by rapid selection of CTL escape virus. Nat Med. 1997;3:205–11. doi: 10.1038/nm0297-205. [DOI] [PubMed] [Google Scholar]
  • 7.Nowak MA, et al. Antigenic diversity thresholds and the development of AIDS. Science. 1991;254:963–9. doi: 10.1126/science.1683006. [DOI] [PubMed] [Google Scholar]
  • 8.Richman D, et al. Human-immunodeficiency-virus type-1 mutants resistant to nonnucleoside inhibitors of reverse-transcriptase arise in tissue-culture. Proc Natl Acad Sci U S A. 1991;88:11241–5. doi: 10.1073/pnas.88.24.11241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Schuurman R, et al. Rapid changes in human-immunodeficiency-virus type-1 RNA load and appearance of drug-resistant virus populations in persons treated with lamivudine (3TC) J Infect Dis. 1995;171:1411–9. doi: 10.1093/infdis/171.6.1411. [DOI] [PubMed] [Google Scholar]
  • 10.Leslie AJ, et al. HIV evolution: CTL escape mutation and reversion after transmission. Nat Med. 2004;10:282–9. doi: 10.1038/nm992. [DOI] [PubMed] [Google Scholar]
  • 11.Poon AFY, et al. Adaptation to human populations is revealed by within-host polymorphisms in HIV-1 and hepatitis C virus. PLoS Pathog. 2007;3:e45. doi: 10.1371/journal.ppat.0030045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Yerly S, et al. Transmission of antiretroviral-drug-resistant HIV-1 variants. Lancet. 1999;354:729–33. doi: 10.1016/S0140-6736(98)12262-6. [DOI] [PubMed] [Google Scholar]
  • 13.Herbeck JT, et al. Is the virulence of HIV changing? A meta-analysis of trends in prognostic markers of HIV disease progression and transmission. AIDS. 2012;26:193–205. doi: 10.1097/QAD.0b013e32834db418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Fraser C, et al. Variation in HIV-1 set-point viral load: epidemiological analysis and an evolutionary hypothesis. Proc Natl Acad Sci U S A. 2007;104:17441–6. doi: 10.1073/pnas.0708559104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Shirreff G, et al. Transmission selects for HIV-1 strains of intermediate virulence: a modelling approach. PLoS Comput Biol. 2011;7:e1002185. doi: 10.1371/journal.pcbi.1002185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Dieckmann U, et al. Adaptive Dynamics of Infectious Diseases: In Pursuit of Virulence Management. Cambridge: Cambridge University Press; 2002. [Google Scholar]
  • 17.Gandon S, et al. Imperfect vaccines and the evolution of pathogen virulence. Nature. 2001;414:751–6. doi: 10.1038/414751a. [DOI] [PubMed] [Google Scholar]
  • 18.Wolf F, et al. AIDS prognosis based on HIV-1 RNA, CD4+ T-cell count and function: markers with reciprocal predictive value over time after seroconversion. AIDS. 1997;11:1799–806. doi: 10.1097/00002030-199715000-00003. [DOI] [PubMed] [Google Scholar]
  • 19.Korenromp EL, et al. Clinical prognostic value of RNA viral load and CD4 cell counts during untreated HIV-1 infection—a quantitative review. PLoS One. 2009;4:e5950. doi: 10.1371/journal.pone.0005950. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Mellors JW, et al. Prognosis in HIV-1 infection predicted by the quantity of virus in plasma. Science. 1996;272:1167–70. doi: 10.1126/science.272.5265.1167. [DOI] [PubMed] [Google Scholar]
  • 21.Henrard DR, et al. Natural history of HIV-1 cell-free viremia. JAMA. 1995;274:554–8. [PubMed] [Google Scholar]
  • 22.Geskus RB, et al. The HIV RNA setpoint theory revisited. Retrovirology. 2007;4:65. doi: 10.1186/1742-4690-4-65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Carrington M, O’Brien SJ. The influence of HLA genotype on AIDS. Annu Rev Med. 2003;54:535–51. doi: 10.1146/annurev.med.54.101601.152346. [DOI] [PubMed] [Google Scholar]
  • 24.Donnelly CA, et al. Gender difference in HIV-1 RNA viral loads. HIV Med. 2005;6:170–8. doi: 10.1111/j.1468-1293.2005.00285.x. [DOI] [PubMed] [Google Scholar]
  • 25.Vüller M, et al. Increasing clinical virulence in two decades of the Italian HIV epidemic. Plos Pathog. 2009;5:e1000454. doi: 10.1371/journal.ppat.1000454. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Richardson BA, et al. Comparison of human immunodeficiency virus type 1 viral loads in Kenyan women, men, and infants during primary and early infection. J Virol. 2003;77:7120–3. doi: 10.1128/JVI.77.12.7120-7123.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Abu-Raddad LJ, Patnaik P, Kublin JG. Dual infection with HIV and malaria fuels the spread of both diseases in sub-Saharan Africa. Science. 2006;314:1603–6. doi: 10.1126/science.1132338. [DOI] [PubMed] [Google Scholar]
  • 28.Jones LE, Perelson AS. Transient viremia, plasma viral load, and reservoir replenishment in HIV-Infected patients on antiretroviral therapy. J AIDS. 2007;45:483–93. doi: 10.1097/QAI.0b013e3180654836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Fellay J, et al. Common genetic variation and the control of HIV-1 in humans. Plos Genet. 2009;5:e1000791. doi: 10.1371/journal.pgen.1000791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Vüller M, Fraser C, Herbeck JT. A strong case for viral genetic factors in HIV virulence. Viruses (Basel) 2011;3:204–16. doi: 10.3390/v3030204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Tang JM, et al. HLA allele sharing and HIV type 1 viremia in seroconverting Zambians with known transmitting partners. AIDS Res Hum Retroviruses. 2004;20:19–25. doi: 10.1089/088922204322749468. [DOI] [PubMed] [Google Scholar]
  • 32.Hecht FM, et al. HIV RNA level in early infection is predicted by viral load in the transmission source. AIDS. 2010;24:941–5. doi: 10.1097/QAD.0b013e328337b12e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.van der Kuyl AC, et al. HIV RNA levels in transmission sources only weakly predict plasma viral load in recipients. AIDS. 2010;24:1607–8. doi: 10.1097/QAD.0b013e32833b318f. [DOI] [PubMed] [Google Scholar]
  • 34.Hollingsworth TD, et al. HIV-1 transmitting couples have similar viral load set-points in Rakai, Uganda. Plos Pathog. 2010;6:e1000876. doi: 10.1371/journal.ppat.1000876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Fisher RA. The correlation between relatives on the supposition of Mendelian inheritance. Trans R Soc Edinb. 1918;52:399–433. [Google Scholar]
  • 36.Alizon S, et al. Phylogenetic approach reveals that virus genotype largely determines HIV set-point viral load. Plos Pathog. 2010;6:e1001123. doi: 10.1371/journal.ppat.1001123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wawer MJ, et al. A randomized, community trial of intensive sexually transmitted disease control for AIDS prevention, Rakai, Uganda. AIDS. 1998;12:1211–25. doi: 10.1097/00002030-199810000-00014. [DOI] [PubMed] [Google Scholar]
  • 38.Wawer MJ, et al. Control of sexually transmitted diseases for AIDS prevention in Uganda: a randomised community trial. Lancet. 1999;353:525–35. doi: 10.1016/s0140-6736(98)06439-3. [DOI] [PubMed] [Google Scholar]
  • 39.Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–90. doi: 10.1093/bioinformatics/btl446. [DOI] [PubMed] [Google Scholar]
  • 40.Tavaré S. American Mathematical Society: Lectures on Mathematics in the Life Sciences. Vol. 17. American Mathematical Society; 1986. Some Probabilistic and Statistical Problems in the Analysis of DNA Sequences; pp. 57–86. [Google Scholar]
  • 41.Yang ZH. Among-site rate variation and its impact on phylogenetic analyses. Trends Ecol Evol. 1996;11:367–72. doi: 10.1016/0169-5347(96)10041-0. [DOI] [PubMed] [Google Scholar]
  • 42.Posada D, Crandall KA. MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998;14:817–8. doi: 10.1093/bioinformatics/14.9.817. [DOI] [PubMed] [Google Scholar]
  • 43.Etherington GJ, Dicks J, Roberts IN. Recombination Analysis Tool (RAT): a program for the high-throughput detection of recombination. Bioinformatics. 2005;21:278–81. doi: 10.1093/bioinformatics/bth500. [DOI] [PubMed] [Google Scholar]
  • 44.Novitsky V, et al. Intra-host evolutionary rates in HIV-1C env and gag during primary infection. Infect Genet Evol. 2013;19:361–8. doi: 10.1016/j.meegid.2013.02.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Ledergerber B, et al. The Swiss HIV Cohort Study—rationale, organization and selected base-line characteristics. Soz Praventivmed. 1994;39:387–94. doi: 10.1007/BF01299670. [DOI] [PubMed] [Google Scholar]
  • 46.Schoeni-Affolter F, et al. Cohort profile: the Swiss HIV cohort study. Int J Epidemiol. 2010;39:1179–89. doi: 10.1093/ije/dyp321. [DOI] [PubMed] [Google Scholar]
  • 47.Kouyos RD, et al. Molecular epidemiology reveals long-term changes in HIV type 1 subtype B transmission in Switzerland. J Infect Dis. 2010;201:1488–97. doi: 10.1086/651951. [DOI] [PubMed] [Google Scholar]
  • 48.von Wyl V, et al. Emergence of HIV-1 drug resistance in previously untreated patients initiating combination antiretroviral treatment: a comparison of different regimen types. Arch Intern Med. 2007;167:1782–90. doi: 10.1001/archinte.167.16.1782. [DOI] [PubMed] [Google Scholar]
  • 49.Guindon S, Gascuel O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003;52:696–704. doi: 10.1080/10635150390235520. [DOI] [PubMed] [Google Scholar]
  • 50.Sundberg K, et al. Incomparability of results between phylogenetic search programs. pp. 81–4. http://www.ncbi.nlm.nih.gov/CBBresearch/Fellows/Carroll/pubs/sundberg2008incomparability.pdf (8 October 2013, date last accessed), 81–4. [Google Scholar]
  • 51.Gras L, et al. Viral load levels measured at set-point have risen over the last decade of the HIV epidemic in the Netherlands. PLoS One. 2009;4:e7365. doi: 10.1371/journal.pone.0007365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Bezemer D, et al. Transmission networks of HIV-1 among men having sex with men in the Netherlands. AIDS. 2010;24:271–82. doi: 10.1097/QAD.0b013e328333ddee. [DOI] [PubMed] [Google Scholar]
  • 53.Paradis E, Claude J, Strimmer K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics. 2004;20:289–90. doi: 10.1093/bioinformatics/btg412. [DOI] [PubMed] [Google Scholar]
  • 54.R Core Team. R: A language and environment for statistical computing. Vienna/Austria http://www.R-project.org/ (8 October 2013, date last accessed) [Google Scholar]
  • 55.Mantel N. The detection of disease clustering and a generalized regression approach. Cancer Res. 1967;27:209–20. [PubMed] [Google Scholar]
  • 56.Blomberg SP, Garland T, Ives AR. Testing for phylogenetic signal in comparative data: behavioral traits are more labile. Evolution. 2003;57:717–45. doi: 10.1111/j.0014-3820.2003.tb00285.x. [DOI] [PubMed] [Google Scholar]
  • 57.Freckleton RP, Harvey PH, Pagel M. Phylogenetic analysis and comparative data: a test and review of evidence. Am Nat. 2002;160:712–26. doi: 10.1086/343873. [DOI] [PubMed] [Google Scholar]
  • 58.Pavoine S, et al. Testing for phylogenetic signal in phenotypic traits: new matrices of phylogenetic proximities. Theor Popul Biol. 2008;73:79–91. doi: 10.1016/j.tpb.2007.10.001. [DOI] [PubMed] [Google Scholar]
  • 59.Uhlenbeck GE, Ornstein LS. On the theory of the Brownian motion. Phys Rev. 1930;36:823–41. [Google Scholar]
  • 60.Felsenstein J. Phylogenies and quantitative characters. Annu Rev Ecol Syst. 1988;19:445–71. [Google Scholar]
  • 61.Falconer DS. Introduction to Quantitative Genetics. Edinburgh/London: Oliver & Boyd; 1960. [Google Scholar]
  • 62.Housworth EA, Martins EP, Lynch M. The phylogenetic mixed model. Am Nat. 2004;163:84–96. doi: 10.1086/380570. [DOI] [PubMed] [Google Scholar]
  • 63.UNAIDS. HIV and AIDS Estimates. http://www.unaids.org/en/regionscountries/countries/ (8 October 2013, date last accessed) [Google Scholar]
  • 64.Bezemer D, et al. 27 years of the HIV epidemic amongst men having sex with men in the Netherlands: an in depth mathematical model-based analysis. Epidemics. 2010;2:66–79. doi: 10.1016/j.epidem.2010.04.001. [DOI] [PubMed] [Google Scholar]
  • 65.Masel J, et al. Fluctuations in HIV-1 viral load are correlated to CD4(+) T-lymphocyte count during the natural course of infection. J AIDS. 2000;23:375–9. doi: 10.1097/00126334-200004150-00003. [DOI] [PubMed] [Google Scholar]
  • 66.Atkins MC, et al. Fluctuations of HIV load in semen of HIV positive patients with newly acquired sexually transmitted diseases. Br Med J. 1996;313:341–2. doi: 10.1136/bmj.313.7053.341. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Rieder P, et al. HIV-1 transmission after cessation of early antiretroviral therapy among men having sex with men. AIDS (London, England) 2010;24:1177–83. doi: 10.1097/QAD.0b013e328338e4de. [DOI] [PubMed] [Google Scholar]
  • 68.Sanchez-Mazas A. African diversity from the HLA point of view: influence of genetic drift, geography, linguistics, and natural selection. Hum Immunol. 2001;62:937–48. doi: 10.1016/s0198-8859(01)00293-2. [DOI] [PubMed] [Google Scholar]
  • 69.Pavoine S, Ricotta C. Testing for phylogenetic signal in biological traits: the ubiquity of cross-product statistics. Evolution. 2013;67:828–40. doi: 10.1111/j.1558-5646.2012.01823.x. [DOI] [PubMed] [Google Scholar]
  • 70.Freckleton RP. The seven deadly sins of comparative analysis. J Evol Biol. 2009;22:1367–75. doi: 10.1111/j.1420-9101.2009.01757.x. [DOI] [PubMed] [Google Scholar]
  • 71.Hodcroft H, et al. Population-based study indicates viral genetic effect on HIV virulence is small but significant. Conference on Retroviruses and Opportunistic Infections, Seattle, 2012. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Evolution, Medicine, and Public Health are provided here courtesy of Oxford University Press

RESOURCES