Abstract
Select HIV-1-infected individuals develop sera capable of neutralizing diverse viral strains. The molecular basis of this neutralization is currently being deciphered by the isolation of HIV-1-neutralizing antibodies. In one infected donor, three neutralizing antibodies, PGT135–137, were identified by assessment of neutralization from individually sorted B cells and found to recognize an epitope containing an N-linked glycan at residue 332 on HIV-1 gp120. Here we use next-generation sequencing and bioinformatics methods to interrogate the B cell record of this donor to gain a more complete understanding of the humoral immune response. PGT135–137-gene family specific primers were used to amplify heavy-chain and light-chain variable-domain sequences. Pyrosequencing produced 141,298 heavy-chain sequences of IGHV4-39 origin and 87,229 light-chain sequences of IGKV3-15 origin. A number of heavy and light-chain sequences of ∼90% identity to PGT137, several to PGT136, and none of high identity to PGT135 were identified. After expansion of these sequences to include close phylogenetic relatives, a total of 202 heavy-chain sequences and 72 light-chain sequences were identified. These sequences were clustered into populations of 95% identity comprising 15 for heavy chain and 10 for light chain, and a select sequence from each population was synthesized and reconstituted with a PGT137-partner chain. Reconstituted antibodies showed varied neutralization phenotypes for HIV-1 clade A and D isolates. Sequence diversity of the antibody population represented by these tested sequences was notably higher than observed with a 454 pyrosequencing-control analysis on 10 antibodies of defined sequence, suggesting that this diversity results primarily from somatic maturation. Our results thus provide an example of how pathogens like HIV-1 are opposed by a varied humoral immune response, derived from intrinsic mechanisms of antibody development, and embodied by somatic populations of diverse antibodies.
Keywords: antibody bioinformatics, high-throughput sequencing, HIV-1, immunity, N-linked glycan
Introduction
Recent years have seen revolutions in both genomics and computational science (Lander et al., 2001; Venter et al., 2001; Chen et al., 2012). In both of these fields, capabilities are advancing exponentially (Kahn, 2011). The impact of this non-linear development on biology is pervasive and multifaceted. With respect to virus research, the influence has been profound and is the focus of this special issue of Frontiers. Medical interest in viruses is focused on pathogens and their infection, and the biological mirror of infection is the host immune response. Advances in genomics and computational science have the potential for an equally profound impact on our understanding of the immune response. Here we focus on the application of new genomic and computational techniques, particularly 454 pyrosequencing of B cell transcripts (Reddy et al., 2009; Reddy and Georgiou, 2011; Wu et al., 2011) and systems-level bioinformatics (Kitano, 2002), to understand the antibody response to infection.
The human immunodeficiency virus type I, HIV-1, is the etiological agent of a global pandemic, which has killed over 30 million people, and currently infects ∼1% of adults worldwide (UNAIDS, 2010). HIV-1 is a retrovirus and member of the lentivirus genus (Gonda et al., 1985; Sonigo et al., 1985). Global genetic diversity of HIV-1 is extraordinarily high (Starcich et al., 1986; Korber et al., 2001), and this is thought to result from the low fidelity of its genome replication (Preston et al., 1988) as well as the persistent nature of the infection: the diversity of HIV-1 virus within a single individual after 6 years of infection is equivalent to the global diversity of H1N1 influenza observed annually (Korber et al., 2001). Infection by HIV-1 elicits many antibodies, but in general these are not capable of neutralization of diverse strains of HIV-1. However, after several years of infection, 10–25% of infected individuals develop broadly neutralizing antibodies (Li et al., 2007; Gray et al., 2009; Sather et al., 2009; Simek et al., 2009; Stamatatos et al., 2009; Doria-Rose et al., 2010; Gnanakaran et al., 2010). These antibodies provide little or no benefit to the infected host, as the evolution of the virus outpaces the immune response (Parren et al., 1999; Poignard et al., 1999; Wei et al., 2003). Nevertheless these antibodies, when tested in humanized mice or macaque models by passive antibody transfer, impart effective immunity to challenge with HIV-1 or simian/human chimeric immunodeficiency viruses (Mascola et al., 1999, 2000; Parren et al., 2001; Mascola, 2003; Veazey et al., 2003; Hessell et al., 2009a,b; Balazs et al., 2011), indicating the potential for their use as targets for re-elicitation by rationally designed vaccines (reviewed in Walker and Burton, 2010; Kwong et al., 2011). Thus, substantial interest has focused on understanding human antibodies that effectively neutralize diverse strains of HIV-1.
A number of techniques have recently been applied to identification of such antibodies. These methods – including antigen-specific B cell sorting (Scheid et al., 2009; Wu et al., 2010) and direct assessment of neutralization by antibodies secreted from individually sorted B cells (Walker et al., 2009, 2011), each coupled to single B cell sequencing techniques – have so far yielded dozens of broadly HIV-1-neutralizing antibodies. These antibodies represent an extraordinarily sparse sampling of the humoral immune response, which typically generates roughly a billion new B cells in a healthy individual each day. We therefore asked whether the revolutionary new capabilities of next-generation sequencing (Mardis, 2008a,b; Boyd et al., 2010; Hawkins et al., 2010) and computational science could expand this sampling to generate a more complete understanding of the humoral immune response. In principle, memory B cells contain a persistent record of the antibody response to infection. As memory B cells are readily attained from blood, they provide a convenient means to access the antibody record, with B cell transcripts in peripheral blood mononuclear cells (PBMCs) providing a genetic representation. Using three antibodies, PGT135–137 from Protocol G donor 39 (Walker et al., 2011) as an example, we used 454 pyrosequencing of PCR-amplified heavy- and light-chain transcripts to capture a more comprehensive genetic record. We used bioinformatics approaches to interrogate this record, to identify populations of neutralizing antibodies, and to characterize their ontogenies. We link these ontogenies to the natural mechanisms of B cell development to provide a view of how somatic populations of antibodies engender a diverse immunological response to infection.
Materials and Methods
Human specimens
The PBMCs of the HIV-1 infected donor 39 were obtained from the International AIDS Vaccine Initiative (IAVI) protocol G. The same sample was used to isolate broadly neutralizing antibodies PGT135–137 (Walker et al., 2011). Human peripheral blood samples were collected after obtaining informed consent and appropriate Institutional Review Board (IRB) approval.
Sample preparation for 454 pyrosequencing
Ten previously described heavy-chain plasmids with known sequences (Wu et al., 2011) were selected to assess 454 pyrosequencing error. Ten plasmids (100 ng each) were combined in 35 μl water, and 1 μl of the ten-plasmid combination was used to template polymerase chain reactions (PCRs). The heavy and kappa chain PCR samples for 454 pyrosequencing from donor 39 were prepared as described (Wu et al., 2011) with minor modifications. Briefly, mRNA was extracted from 20 million PBMCs into 200 μl of elution buffer (Oligotex kit, Qiagen), then concentrated to 10–30 μl by centrifuging the buffer through a 30 kD micron filter (Millipore). The reverse transcription was performed in one or multiple 35 μl-reactions, each composed of 13 μl of mRNA, 3 μl of oligo(dT)12–18 at 0.5 μg/μl (Invitrogen), 7 μl of 5× first strand buffer (Invitrogen), 3 μl of RNase Out (Invitrogen), 3 μl of 0.1 M DTT (Invitrogen), 3 μl of dNTP mix (each at 10 mM), and 3 μl of SuperScript II (Invitrogen). The reactions were incubated at 42°C for 2 h. The cDNAs from each reaction were combined, applied to the NucleoSpin Extract II kit (Clontech), and eluted in 20 μl of elution buffer. In this way, 1 μl of the cDNA comprised transcripts from 1 million PBMCs. The immunoglobulin gene family-specific PCR was set up in a total volume of 50 μl, using 1 μl of the heavy-chain plasmid mix or 5 μl of the cDNA as template (equivalent of transcripts from 5 million PBMCs). The DNA polymerase systems used was the Platinum Taq High-Fidelity (HiFi) DNA Polymerase System (Invitrogen). According to the instructions of the manufacturer, the reaction mix was composed of water, 5 μl of 10× buffer, and 1 μl of supplied MgSO4, 2 μl of dNTP mix (each at 10 mM), 1–2 μl of primers (Table S1 in Supplementary Material) at 25 μM, and 1 μl of Platinum Taq HiFi DNA polymerase. The primers each contained the appropriate adaptor sequences (XLR-A or XLR-B) for subsequent 454 pyrosequencing. The PCRs were initiated at 95°C for 30 s, followed by 25 cycles of 95°C for 30 s, 58°C for 30 s, and 72°C for 1 min, then incubated at 72°C for 10 min. The PCR products at the expected size (∼500 bp) were gel extracted and purified (Qiagen), followed by further phenol/chloroform purification.
454 pyrosequencing and library preparation
The 454 pyrosequencing was carried out as described previously (Wu et al., 2011). Briefly, PCR products were quantified using Qubit (Life Technologies, Carlsbad, CA, USA). Library concentrations were determined using the KAPA Biosystems qPCR system (Woburn, MA, USA) with 454 pyrosequencing standards provided in the KAPA system. Pyrosequencing of the PCR products was performed on a GS FLX sequencing instrument (Roche-454 Life Sciences, Bradford, CT, USA) using the manufacturer’s suggested methods and reagents. Initial image collection was performed on the GS FLX instrument and subsequent signal processing, quality filtering, and generation of nucleotide sequence and quality scores were performed on an off-instrument linux cluster using 454 application software (version 2.5.3). The amplicon quality filtering parameters were adjusted based on the manufacturer’s recommendations (Roche-454 Life Sciences Application Brief No. 001-2010). Quality scores were assigned to each nucleotide using methodologies incorporated into the 454 application software to convert flowgram intensity values to Phred-based quality scores and as described (Brockman et al., 2008). The quality of each run was assessed by analysis of internal control sequences included in the 454 pyrosequencing reagents. Reports were generated for each region of the PicoTiterPlate (PTP) for both the internal controls and the samples.
Bioinformatics analysis of 454 pyrosequencing-determined antibody sequences
Our previously described bioinformatics pipeline (Wu et al., 2011) was refined and currently consists of five steps. Starting from a 454 pyrosequencing-determined antibodyome, each sequence read was (1) reformatted and labeled with a unique index number; (2) assigned to variable (V), diverse (D), and joining (J) gene families and alleles using an in-house implementation of IgBLAST1, and sequences with E-value > 10−3 for V gene assignment were rejected; (3) subjected to a template-based error-correction procedure, in which 454 pyrosequencing homopolymer errors in V, D, and J regions were detected based on the alignment to their respective germline sequences. Note that only insertion and deletion errors of less than three nucleotides were corrected. D and J gene were corrected only when their gene assignment was reliable, indicated by E-value < 10−3; (4) compared with the a set of template antibody sequences at both nucleotide level and amino-acid level using a global alignment module in CLUSTALW2 (Larkin et al., 2007); (5) subjected to a multiple sequence alignment (MSA)-based scheme to determine the third complementarity-determining region (CDR H3 or L3), which was further compared with a set of template CDR H3 or L3 sequences at nucleotide level, and to determine the sequence boundary of variable domain. For a large population of highly similar sequences, a “divide-and-conquer” procedure could be used to derive a consensus sequence to represent the population and to reduce random sequencing errors. First, a clustering using BLASTClust (Altschul et al., 1997) with a 95% sequence identity cutoff is performed on the sequence population. Then, the largest cluster is divided into 10–50 sets, for each of which a consensus can be derived from MSA. A final consensus is obtained by averaging over the subset consensuses.
Intra-donor phylogenetic analysis use the same procedure as cross-donor phylogenetic analysis, which has been described in detail in previous study (Wu et al., 2011), except that the template antibodies are from the same donor (intra-donor) rather than added exogenously (cross-donor), and intra-donor phylogenetic analysis is equally applicable to heavy and light chains. Briefly, the computational procedure consists of an iterative analysis based on the neighbor-joining (NJ) method (Kuhner and Felsenstein, 1994) implemented in CLUSTALW2 (Larkin et al., 2007) and a final analysis based on the maximum-likelihood (ML) method with molecular clock implemented in DNAMLK2 in the PHYLIP package v3.693. In the NJ-based analysis, donor sequences of a particular germline origin were first randomly shuffled and divided into subsets of no more than 5,000 sequences. Then, PGT135–137 and respective germline sequence, IGHV4-39*07 for heavy chain and IGKV3-15*01 for light chain, were added to each subset. A NJ tree was constructed for each subset using the “Phylogenetic trees” option in CLUSTALW2 (Larkin et al., 2007). The donor sequences that clustered in the smallest branch that contains PGT135–137 were extracted from each NJ tree and combined into a new data set for the next round of analysis. The analysis was repeated until convergence, where all the donor sequences resided within a subtree containing PGT135–137 and no other sequences resided between this subtree and the root, and where further repeat of the analysis did not change the NJ tree. The ML-based analysis was used to confirm the intra-donor dendrogram derived from the NJ-based analysis. Starting from the data set obtained from the last iteration of NJ analysis, the MSA generated by CLUSTALW2 (Larkin et al., 2007) was provided as input to construct a phylogenetic tree using DNAMLK. Usually, any sequences outside the ML-defined subtree were discarded, but in this study we tested light chains identified by NJ method but immediately outside the rooted ML-defined PGT135–137 subtree. The displayed phylogenetic trees were generated using Dendroscope (Huson et al., 2007), ordered to ladderize right and rooted at the germline genes.
A description of the antibodyomics software (Antibodyomics1.0) utilized in this paper is being prepared for publication.
Antibody expression and purification
Antibody production followed previously described procedures (Wu et al., 2011). Briefly, sequences were selected using the respective bioinformatics procedure and checked for sequencing errors using an automatic error-correction procedure followed by manual inspection. The corrected antibody sequences were synthesized (GenScript USA Inc. and Blue Heron Biotech, LLC.) and cloned into the CMV/R expression vector (Barouch and Nabel, 2005) containing the constant regions of IgG1. All synthesized heavy chains were paired with PGT137 light-chain DNA, and synthesized light chains were paired with PGT137 heavy-chain DNA for transfection. Full-length IgGs were expressed from transient transfection of 293F cells and purified using a recombinant protein-A column (Pierce).
HIV-1 neutralization
Neutralization was measured using HIV-1 Env-pseudoviruses to infect TZM-bl cells as described (Li et al., 2005; Wu et al., 2009; Seaman et al., 2010). Neutralization curves were fit by non-linear regression using a five-parameter hill slope equation as described (Seaman et al., 2010). The 50% and 80% inhibitory concentrations (IC50 and IC80) were reported as the antibody concentrations required to inhibit infection by 50% and 80% respectively.
Results
Experiments involving both sequencing technologies and computational analyses are described. Because variable region transcripts of antibodies are over 300 nucleotides in length and because the high similarity between different antibody transcripts precludes assembly of full sequences from fragments, we used 454 pyrosequencing, which is currently one of the few next-generation sequencing technologies to provide reads of sufficient length (Reddy et al., 2009; Reddy and Georgiou, 2011; Wu et al., 2011). However, 454 pyrosequencing is known to suffer from high error rates (Prabakaran et al., 2011). We therefore begin by characterizing the accuracy of 454 pyrosequencing applied to a set of plasmid standards consisting of known HIV-neutralizing antibodies. We then describe 454 pyrosequencing of antibody heavy-chain transcripts from donor 39 (Walker et al., 2011), and analyze these data bioinformatically and functionally. We follow this with a similar analysis of donor 39 light-chain transcripts.
Characterization of 454 pyrosequencing errors on antibody transcripts
To investigate the extent of 454 pyrosequencing errors on the antibodyome analysis, we carried out a sequencing experiment on the heavy chains of 10 selected antibodies (Wu et al., 2011), including five from B cell sorting-based isolation, VRC01, VRC03, VRC-PG04, VRC-CH31, and VRC-CH33, one codon-optimized version of inferred reverted unmutated ancestor of VRC-PG04 (termed VRC-PG04cog), and four identified from previous 454 pyrosequencing study, gVRC-H3d74, gVRC-H6d74, gVRC-H12d74, and gVRC-H15d74. The plasmid sequencing data was processed with the same bioinformatics pipeline used for donor sequencing data (Figure S1 in Supplementary Material). Sequence reads were subjected to an error-correction procedure, which was aimed to fix deletion and insertion errors that cause protein translation problems (see Materials and Methods). Results obtained with and without error correction were compared to examine the effect of error correction on observed sequence variation.
A divergence/identity analysis was first carried out on the 10 plasmid data set, obtained without (Figure 1) and with error correction (Figure S2 in Supplementary Material). Since divergence and identity were calculated at the nucleotide level, error correction appeared to have little effect on the sequence distribution. Ideally, if the 454 pyrosequencing did not produce any errors, especially mutations, the distribution – irrespective of the antibody being used as template – would yield, on divergence/identity plots, 10 discrete points, each corresponding to one of the input sequences. In contrast, divergence/identity plots revealed broad islands centered around each of these 10 antibody sequences (Figure 1). The shape and area of each island provide a visual representation of the extent of the 454 pyrosequencing errors. As shown in Table 1, 5 of the 10 antibodies – those with an identity gap of 25% or greater to the next most closely related sequence – were easily distinguished from each other, while other more closely related variants, e.g., VRC-CH31 and VRC-CH33, overlapped (Figure 1). Based on identity considerations (Table 1) and the scope of each island in divergence/identity plots (Figure 1), a single cutoff of 75% was applied to group 454 pyrosequencing-determined sequences for VRC01, VRC03, VRC-PG04cog, gVRC-H3d74, and gVRC-H6d74.
Table 1.
VRC01 | VRC03 | VRC-PG04 | VRC-CH31 | VRC-CH33 | VRC-PG04cog | gVRC-H3d74 | gVRC-H6d74 | gVRC-H12d74 | gVRC-H15d74 | |
---|---|---|---|---|---|---|---|---|---|---|
VRC01 | 100.0 | 63.8 | 60.5 | 60.1 | 60.8 | 60.7 | 61.2 | 56.4 | 61.3 | 60.2 |
VRC03 | 69.1 | 100.0 | 62.7 | 56.0 | 61.8 | 59.6 | 62.2 | 59.9 | 62.7 | 62.5 |
VRC-PG04 | 62.5 | 59.7 | 100.0 | 61.3 | 60.6 | 67.5 | 68.0 | 68.7 | 95.7 | 95.3 |
VRC-CH31 | 65.3 | 56.4 | 64.3 | 100.0 | 91.3 | 63.4 | 68.8 | 61.9 | 65.6 | 63.0 |
VRC-CH33 | 65.8 | 62.3 | 63.2 | 91.3 | 100.0 | 64.5 | 66.9 | 60.9 | 63.7 | 62.2 |
VRC-PG04cog | 62.0 | 56.4 | 66.4 | 59.5 | 60.6 | 100.0 | 62.5 | 61.7 | 65.9 | 63.8 |
gVRC-H3d74 | 64.2 | 60.8 | 69.1 | 66.7 | 64.9 | 64.5 | 100.0 | 64.9 | 68.5 | 66.7 |
gVRC-H6d74 | 62.0 | 61.3 | 73.3 | 62.8 | 61.8 | 66.7 | 68.0 | 100.0 | 72.3 | 71.9 |
gVRC-H12d74 | 63.1 | 59.7 | 95.7 | 62.3 | 60.8 | 66.9 | 67.5 | 67.7 | 100.0 | 93.5 |
gVRC-H15d74 | 63.6 | 61.3 | 97.6 | 61.6 | 60.8 | 66.4 | 67.2 | 69.2 | 95.7 | 100.0 |
The heavy-chain variable domains of 10 antibodies were sequenced by 454 pyrosequencing, including VRC01, VRC03, VRC-PG04, VRC-CH31, VRC-CH33, a codon-optimized version of inferred reverted unmutated ancestor of VRC-PG04 (termed VRC-PG04cog), and four neutralizing antibodies whose heavy-chain variable domains were identified from donor 74 in a previous study (Wu et al., 2011) – gVRC-H3d74, gVRC-H6d74, gVRC-H12d74, and gVRC-H15d74. The diagonal values, circled and all 100%, separate the upper- and lower-triangular portions of the matrix, which differ in how the sequence identity was calculated, specifically, which sequence was used as the “template” and which as the “query”, as the differing sequences have different lengths.
Each of these five 454 pyrosequencing-determined sequence groups was analyzed for mutations, insertions, and deletions relative to the input plasmid sequence, as well as total number of reads and their redundancy (Table 2). For four of the plasmids ∼50,000 reads were obtained; for gVRC-H6d74, however, only about one fourth as many were obtained, which may relate to a lower efficiency of the primer used for gVRC-H6d74. In terms of redundancy, for three of the plasmids between one fifth and one half of the reads were identical to the input plasmid, whereas for VRC01 and gVRC-H6d74, only a small fraction (<1 and <10%) of the reads were identical to the input plasmid, a result of insertions in most of the sequences. Note that after error correction, 20–3254 more sequences became identical to the input antibodies (Table 2). Overall, for an antibody of typical length, ∼5-nucleotide mutations were observed between 454 pyrosequencing reads and corresponding input sequences. Error correction appeared to cause an increased count of mutation errors while decreasing insertion and deletion errors that produce stop codons and nonsense codons in protein translation. Currently used correction procedure was able to improve the identity of translated protein sequence to respective germline gene by an average of 14.1% (Figures S1C,D in Supplementary Material).
Table 2.
Antibody | Length (nt) | NSeq | NIden | Unnormalized |
Normalized (per 100 nt) |
||||
---|---|---|---|---|---|---|---|---|---|
RMSMut (nt) | RMSIns (nt) | RMSDel (nt) | RMSMut (nt) | RMSIns (nt) | RMSDel (nt) | ||||
NO ERROR CORRECTION | |||||||||
VRC01 | 363 | 47542 | 289 | 5.0 | 2.2 | 1.7 | 1.38 | 0.61 | 0.47 |
VRC03 | 390 | 53734 | 12309 | 4.6 | 4.1 | 0.9 | 1.18 | 1.05 | 0.23 |
VRC-PG04cog | 369 | 43718 | 21281 | 5.0 | 1.7 | 0.5 | 1.36 | 0.46 | 0.14 |
gVRC-H3d74 | 381 | 53147 | 19843 | 6.3 | 1.9 | 1.1 | 1.65 | 0.50 | 0.29 |
gVRC-H6d74 | 399 | 13639 | 1013 | 6.4 | 2.8 | 1.6 | 1.60 | 0.70 | 0.40 |
WITH ERROR CORRECTION | |||||||||
VRC01 | 363 | 47542 | 334 | 5.8 | 1.9 | 1.2 | 1.60 | 0.52 | 0.33 |
VRC03 | 390 | 53734 | 12948 | 4.7 | 4.0 | 0.9 | 1.21 | 1.03 | 0.23 |
VRC-PG04cog | 369 | 43718 | 22021 | 5.1 | 1.6 | 0.4 | 1.38 | 0.43 | 0.11 |
gVRC-H3d74 | 381 | 53147 | 23097 | 6.5 | 1.7 | 0.9 | 1.71 | 0.45 | 0.24 |
gVRC-H6d74 | 399 | 13639 | 1033 | 6.6 | 2.7 | 1.5 | 1.65 | 0.68 | 0.38 |
Columns include antibody name, nucleotide-sequence length of antibody heavy-chain variable domain, number of 454 pyrosequencing-determined heavy-chain variable-domain sequences for this antibody, number of sequences 100% identical to the sequenced antibody heavy-chain variable-domain, root-mean-square (RMS) fluctuation of 454 pyrosequencing-induced mutations, insertions, and deletions with respect to the input antibody sequence, and their values after normalization by a length of 100 nucleotides.
As shown in the percent sequence identity matrix in Table 1 and the divergence/identity plots in Figure 1, only five antibodies can be distinguished from others using a single sequence identity cutoff. After mapping the 454 pyrosequencing-determined heavy-chain variable-domain sequences onto 10 plasmid antibodies, a single cutoff of 75% was applied to extract sequences corresponding to VRC01, VRC03, VRC-PG04cog, gVRC-H3d74, and gVRC-H6d74, respectively.
RMSMut, RMSIns, and RMSDel were calculated using the formula:
,
X denotes the type of sequencing error to be characterized, mutation (Mut), insertion (Ins), and deletion (Del), respectively; denotes the averaged sequencing error; NSeq denotes the total number of sequences within a given antibody group.
The RMS values were normalized using RMSnormalized = RMSunnormalized/lenghunnormalized × 100 to take into account the difference in sequence length.
We then examined the accuracy of bioinformatically selected representative sequences for these five antibody groups. Note that all these sequences have been subjected to a template-based error-correction procedure in the pipeline processing. A “divide-and-conquer” procedure (See Materials and Methods) was used for sequence calculation. Remarkably, the representative sequence was 100% identical to the “true” sequence used as input for 454 pyrosequencing for VRC-PG04cog, gVRC-H3d74, and gVRC-H6d74, while having one 1-nucleotide deletion and two 1-nucleotide insertions for VRC01 and VRC03, respectively. None had mutation errors. Such consensus-based sequence picking procedure may prove useful in the cases where a population of closely related sequences is observed on the divergence/identity plot, as indicated by a densely populated island.
454 pyrosequencing of donor 39 IGHV4 family and bioinformatics analysis of heavy chains
We next performed 454 pyrosequencing of PGT135–137-related heavy-chain transcripts from donor 39 PBMCs. mRNA from ∼5 million PBMCs was used for reverse transcription to produce template cDNA, and PCR was used to amplify IgG and IgM heavy-chain sequences from the IGHV4 family using forward primers that overlapped the end of the V gene leader sequence and the start of the V region and reverse primers covering the start of the constant domain (Table S1 in Supplementary Material).
Next-generation pyrosequencing provided 918,298 reads, which were processed with a bioinformatics pipeline that involved assignment of germline origin genes, 454 pyrosequencing-error correction, and extraction of CDR H3 regions for lineage assignment. Overall about 85.3% of the raw reads spanned over 400 nucleotides, covering the entire variable domain. After computational assignment of V, D, and J gene components, 142,842 sequences were assigned to IGHV4-39 germline family, accounting for ∼16% of the expressed VH4 antibodyome. Each sequence was subjected to an automatic error-correction scheme. For donor 39 heavy chains, the correction procedure improved the accuracy of protein translation, measured by protein sequence identity to inferred gemline gene, by an average of 20.4%. The results for pipeline processing of heavy-chain data set are listed in Figure S3 in Supplementary Material.
First, germline family analyses were performed using two standard methods – IMGT (Brochet et al., 2008) and IgBLAST (see text footnote 1; Table 3). These analyses assigned PGT135–137 gene origins to IGHV4-39 with two possible alleles (*03 or *07), to three potential D genes, and the J gene IGHJ5*02. An analysis of the third complementarity-determining region of the heavy chain (CDR H3) showed 80–90% sequence identity between PGT135–137, suggestive of a common lineage. The likely clonal origin of PGT135–137 indicates that they will all have the same V(D)J origin, with the different origin gene assignments by IMGT and IgBLAST likely due to their high divergence of ∼20% from ancestral gene.
Table 3.
Antibody | V gene (SeqIDtogerm) | D gene | J gene (SeqIDtogerm) | SeqIDtoPGT135 |
---|---|---|---|---|
HEAVY CHAIN (CDR H3 LENGTH = 18) | ||||
PGT135 | IGHV4-39*07 (81.4%) | IGHD3-9*01 | IGHJ5*02 (72.0%) | 100.0% |
PGT136 | IGHV4-39*07 (82.1%) | IGHD2-8*02 | IGHJ5*02 (78.0%) | 83.0% |
PGT137 | IGHV4-39*03 (77.2%) | IGHD2-15*01 | IGHJ5*02 (78.0%) | 82.0% |
LIGHT CHAIN (CDR-L3 LENGTH = 9) | ||||
PGT135 | IGKV3-15*01 (82.4%) | – | IGKJ1*01 (94.3%) | 100.0% |
PGT136 | IGKV3-15*01 (86.4%) | – | IGKJ1*01 (97.1%) | 84.4% |
PGT137 | IGKV3-15*01 (87.8%) | – | IGKJ1*01 (97.1%) | 85.0% |
V, D, and J gene components of PGT135–137 were determined by two servers: IgBLAST (http://www.ncbi.nlm.nih.gov/igblast/) and IMGT/V-Quest (http://www.imgt.org/IMGT_vquest/vquest).
SeqIDtogerm is the nucleotide-sequence identity with respect to the germline gene.
SeqIDtoPGT135 is the nucleotide-sequence identity with respect to PGT135.
Second, a divergence/identity analysis of 454 pyrosequencing-derived sequences assigned to IGHV4-39 origin was performed (Figure 2). The IGHV4-39-related sequences revealed a maximum divergence of 30.4% and an average divergence of 7.7% from germline. An island of sequences was observed at ∼90% identity to PGT137 with divergence of 20–25% from VH4-39, indicative of PGT137-related antibodies with similar evolutionary distance from the origin.
Third, intra-donor phylogenetic analysis (see Materials and Methods) was applied to identify the somatic variants of PGT135–137 from the donor 39 heavy-chain sequencing data. In this analysis, a set of clonally related template antibodies is used to interrogate sequences from the same donor using phylogenetic analysis. Phylogenetic analysis, using a tree rooted by the inferred germline gene IGHV4-39*07, produced a ML dendrogram with 202 heavy-chain variable-domain sequences identified by their co-segregation with PGT135–137 (Figure 3). Most of the intra-donor-identified sequences clustered with PGT137, and one sequence clustered with PGT136.
Fourth, CDR H3 variation was analyzed for the 202 PGT135–137-related heavy-chain variable-domain sequences. One hundred seven were found to have identical CDR H3 sequences, as the same as the nucleotide-sequence consensus. With a maximum of five mutations from the consensus, the average CDR H3 variation was 1.2, indicative of a rather conserved signature of PGT135–137 lineage.
PGT135–137 somatic heavy-chain populations and functional characterization
To gain insight into the functional diversity of the antibodies identified by 454 pyrosequencing and bioinformatics methods, a clustering procedure was used to analyze the 202 identified heavy chains and to select representative sequences for further characterization. We used BLASTClust (Altschul et al., 1997) clustering function and an identity cutoff of 95% to sample the natural variation. We chose this cutoff to be greater than the ∼1.6% “false” sequence variation induced by 454 pyrosequencing errors (Table 2). A total of 15 clusters emerged. In the BLASTClust output, the first sequence of each cluster was selected to “represent” the cluster (Figure 4A) and were synthesized and reconstituted with the PGT137 light chain for functional assessment of HIV-1 neutralization, which was carried out on two viruses sensitive to PGT135–137 antibodies. Out of 15 tested heavy-chain variable domain sequences, when paired with PGT137 light chain, 11 reconstituted antibodies showed neutralization to different extents (Table 4).
Table 4.
Seq. index | Nab name | Neutralization IC50 titers (μg/ml) |
Neutralization IC80 titers (μg/ml) |
||||
---|---|---|---|---|---|---|---|
RW020.2 | UG024.2 | MuLV | RW020.2 | UG024.2 | MuLV | ||
HEAVY-CHAIN VARIANT PAIRED WITH PGT135 LIGHT CHAIN | |||||||
124635 | gVRC-H1d39 | 0.005 | 0.021 | >50 | 0.022 | 0.107 | >50 |
865591 | gVRC-H2d39 | 0.004 | 0.017 | >48 | 0.016 | 0.08 | >48 |
367624 | gVRC-H3d39 | 0.009 | 0.253 | >50 | 0.044 | 1.41 | >50 |
917335 | gVRC-H4d39 | 0.003 | 0.243 | >50 | 0.015 | 3.41 | >50 |
736494 | gVRC-H5d39 | 0.003 | 0.458 | >50 | 0.021 | 4.18 | >50 |
442262 | gVRC-H6d39 | 0.409 | 18.4 | >50 | 12.8 | >50 | >50 |
729986 | gVRC-H7d39 | 2.31 | >50 | >50 | >50 | >50 | >50 |
900425 | – | >50 | >50 | >50 | >50 | >50 | >50 |
174091 | gVRC-H8d39 | >50 | 0.02 | >50 | >50 | 0.072 | >50 |
673138 | gVRC-H9d39 | 1.49 | 1.86 | >50 | 23.6 | 11.5 | >50 |
444070 | gVRC-H10d39 | 0.008 | 0.027 | >50 | 0.03 | 0.138 | >50 |
ConsAA | gVRC-H11d39 | 0.003 | 0.016 | >50 | 0.012 | 0.08 | >50 |
LIGHT-CHAIN VARIANTS PAIRED WITH PGT137 HEAVY CHAIN | |||||||
107548 | gVRC-L1d39 | 0.0007 | 0.008 | >50 | 0.006 | 0.036 | >50 |
219622 | – | >50 | >50 | >50 | >50 | >50 | >50 |
210137 | – | >50 | >50 | >50 | >50 | >50 | >50 |
215528 | – | >50 | >50 | >50 | >50 | >50 | >50 |
425756 | gVRC-L2d39 | <0.0006 | 0.007 | >50 | 0.004 | 0.033 | >50 |
121553 | gVRC-L3d39 | 0.04 | 0.423 | >50 | 0.245 | 2.16 | >50 |
303540 | gVRC-L4d39 | 0.075 | 1.11 | >50 | 0.841 | 6.61 | >50 |
378597 | gVRC-L5d39 | 0.03 | 0.375 | >50 | 0.167 | 1.26 | >50 |
521298 | – | >50 | >50 | >50 | >50 | >50 | >50 |
537707 | gVRC-L6d39 | 0.012 | 0.115 | >50 | 0.057 | 0.436 | >50 |
WILD-TYPE PGT135–137 ANTIBODIES AND VRC01 ANTIBODY AS CONTROLS | |||||||
PGT135 | PGT135 | >50 | 0.002 | >50 | >50 | 0.007 | >50 |
PGT136 | PGT136 | >50 | 0.008 | >50 | >50 | 0.026 | >50 |
PGT137 | PGT137 | 0.001 | 0.007 | >50 | 0.006 | 0.041 | >50 |
VRC01 | VRC01 | 0.157 | 0.207 | >50 | 0.682 | 0.718 | >50 |
Columns include sequence index (for heavy chains, the amino-acid consensus is denoted by “ConsAA”; for controls, the antibody name is used as sequence index), neutralizing antibody name based on the nomenclature used in previous studies (Wu et al., 2011; Zhu et al., under review), neutralization IC50 and IC80 titers for viruses RW020.2 (HIV-1 clade A), UG024.2 (HIV-1 clade D) and MuLV (murine leukemia virus). Two sets of chimeric antibodies, 22 in total, were expressed by pairing the 12 heavy chains derived from 454 pyrosequencing and the PGT137 light chain, and pairing the 10 light chains derived from 454 pyrosequencing and the PGT137 heavy chain.
The wild-type mAbs PGT135–137 and wild-type VRC01 were included as controls.
MuLV stands for murine leukemia virus, which was included as a negative control.
“–” denotes expressed but non-neutralizing sequence after reconstituted with the PGT137-partner chain.
The two largest clusters, with 136 and 46 sequences, respectively, accounted for ∼90% of the sequences (Figure 4B), while 10 of the 15 clusters contained only a single member. A consensus sequence (ConsAA), calculated from the alignment of 15 representative sequences (Figure 4C), was also synthesized. Notably, the reconstituted amino-acid consensus displayed neutralization almost on par with wild-type PGT137 (Table 4).
Despite their apparent clonality, the clustering procedure reveals 15 clusters. The topology of the dendrogram produced from phylogenetic analysis indicates that these 15 clusters represent populations of somatically related antibodies evolving along distinct branches by standard mechanisms of hypermutation (Figure 3). We analyzed these 15 somatic populations for prevalence of mutations, insertions, and deletions (Table S2 in Supplementary Material). Note that the representative sequence of cluster 1 (#844305) contained two insertions in the CDR H3 region which were not seen in other members of the cluster, suggesting that these insertions might be sequencing errors. Indeed, this heavy chain could not be expressed when reconstituted with PGT137 light chain. We also analyzed each of these populations by divergence/identity plot (Figure 5). Overall, sequences chosen to represent the 15 somatic populations showed diverse neutralization characteristics (Table S2 in Supplementary Material). Some antibodies, for example from clusters 2, 3, 14, and 15 (gVRC-H1d39, gVRC-H2d39, gVRC-H9d39, and gVRC-H10d39), neutralized clade A – RW020.2 and clade D – UG024.2 with roughly equal potency. Some antibodies, for example from clusters 4, 5, 8, and 10 (gVRC-H3-H6d39), neutralized clade A-RW020.2 25-150-fold more potently than clade D. While the antibody from cluster 13 (gVRC-H8d39) neutralized clade D – UG024.2 with at least 100-fold greater potency than clade A. These results provide an example for how somatically related antibodies can significantly differ in their neutralization specificities. This begins to provide insight into how populations of somatically related antibodies can engender neutralization breadth significantly different than any individual member.
454 pyrosequencing of donor 39 IGKV3 family and bioinformatics analysis of light chains
We next performed 454 pyrosequencing of PGT135–137-related light-chain transcripts from donor 39 PBMCs. mRNA from ∼5 million PBMCs was used for reverse transcription to produce template cDNA, and PCR was used to amplify light-chain sequences from the IGKV3 family.
The 454 pyrosequencing provided 971,165 reads, which were then processed using a pipeline adapted for κ-chain analysis. For donor 39, about 83.3% of the raw reads were 400 nt or longer, effectively covering the light-chain variable domain. After V and J gene assignment, 91,951 sequences were determined to belong to IGKV3-15 germline family, accounting for 10% of the light chain reads obtained. After error correction, the accuracy of protein translation measured by the protein sequence identity to inferred gemline gene was improved by an average of 16.5%. The results for pipeline processing of light-chain data set are listed in Figure S4 in Supplementary Material.
First, the recombination origins of PGT135–137 light chains were analyzed (Table 3). PGT135–137 light chains were assigned to the same germline V gene allele, IGKV3-15*01, recombined with the same J gene, IGKJ1*01, supporting the notion that the discrepancy in heavy-chain germline assignment was likely an artifact caused by their high divergence.
Second, the divergence/identity analysis of 454 pyrosequencing-derived sequences assigned to the IGKV3-15*01 origin was performed (Figure 6). The IGKV3-15*01-related sequences revealed a maximum divergence of 20.9% and an average divergence of 6.3% from germline. Distinct sequence islands were observed at ∼100% identity to PGT136 and 95% identity to PGT137 – both with divergence of 10–15% from IGKV3-15*01. No distinct sequence island was observed that was closely related to PGT135.
Third, to identify light-chain somatic variants, we performed intra-donor phylogenetic analysis that combined an iterative NJ procedure for the high-throughput screening of sequencing data, and a ML calculation to confirm the NJ analysis and to provide the final dendrogram (see Materials and Methods). Two methods were usually in agreement, e.g., for donor 39 heavy chains, but differed here. The NJ-based analysis yielded 72 sequences within the PGT135–137 subtree, whereas the subsequent ML-based analysis retained 57 of the 72 sequences within the PGT135–137 subtree (Figure 7), providing an example for functional characterization of similar but somatically unrelated sequences.
PGT135–137 somatic light-chain populations and functional characterization
By using the same 95% clustering procedure as for heavy chains, 14 light-chain clusters were identified from the phylogenetic tree. Representative sequences were selected, also as described for heavy chains, from the first 10 clusters for functional characterization (Figure 8A). We analyzed these 10 clusters for prevalence of mutations, insertions, and deletions (Table S3 in Supplementary Material). The largest cluster, lying within the population of PGT137-like sequences, contained 45 sequences or 63% of the subtree sequences (Figure 8B). All selected light-chain sequences possessed CDR L3s of the same length except for the sequences selected from the clusters 2 and 3 (Figure 8C). Out of 10 tested light-chain variable domain sequences, when reconstituted with the PGT137 heavy chain, six antibodies – representing six sequence clusters – showed neutralization of two HIV-1 strains from clade A and clade D. Notably, two of the light chains (gVRC-L1d39 and gVRC-L2d39) showed neutralization breadth slightly better than PGT135–137, and the light-chain variants neutralized clade A about 10-fold more effectively than the clade D (Table 4).
In contrast to the 454 pyrosequencing-identified heavy chains, the six neutralizing light-chain clusters were not localized to a single divergence/identity island (Figure 9). Indeed, neutralization was observed with clusters from at least three diverse locations on the divergence/identity plot. Nevertheless, the topology of the light-chain phylogenetic analysis indicates that these six clusters represent populations of somatically related antibodies (Figure 7).
Discussion
Recently, select antibodies with the ability to neutralize diverse strains of HIV-1 have been identified in HIV-1 infected donors (Walker et al., 2009, 2011; Corti et al., 2010; Wu et al., 2010, 2011; Scheid et al., 2011). Like PGT135–137, antibodies from these donors often appear to be clonally related, to possess similar neutralization characteristics, and to cluster in a localized island (or islands) on identity/diversity plots. These islands observed in 454 pyrosequencing-derived analyses are often nearby but rarely overlap the few antibodies experimentally isolated from the same individual (even if they start with samples of exactly the same time point, as we have done here with donor 39). The differences between antibodies identified from sorting of memory B cells or by 454 pyrosequencing of B cell transcripts suggest that the experimental approaches may capture or sample different B cell population. In addition to exploring differences in phenotype of antibody identified by the two methods, we also explored differences related to the quantity of identified antibody. In particular, we ask whether the less-sparse view of the antibody repertoire provided by next-generation sequencing and systems-level bioinformatics might provide insight into the diversity of the antibody response.
With the heavy chains of PGT135–137, select sequences representing 15 distinct populations, showed dramatically different neutralization characteristics toward clade A and D viruses when reconstituted with the same light chain from PGT137. With the light chains of PGT135–137, select sequences representing 10 distinct populations were not localized to a discrete sequence island, indicating substantial differences in identity and diversity (Figure 8). Thus, even though these antibodies are somatically related, both their neutralization and sequence characteristics can diverge substantially (Table 4). These results demonstrate the utility of next-generation sequencing, which provides a more comprehensive sampling of sequences, and of systems-level bioinformatics approaches, which enable these data to be mined effectively. Overall, data-intensive methods may be generally required to obtain true insight into questions of biological diversity such as the humoral immune response.
Prior next-generation sequencing and bioinformatics analyses have revealed the extraordinary genetic diversity of HIV-1 (Eriksson et al., 2008; Archer et al., 2009; Tsibris et al., 2009; Fischer et al., 2010). These same methods are now beginning to reveal the extraordinary diversity of antibodies generated in response to HIV-1 infection (Wu et al., 2011). Although this response appears to provide little benefit to the HIV-1-infected host (Poignard et al., 1999), if similar responses could be generated through vaccination, then in principle effective protection could be achieved in the setting of initial infection (Burton, 2002; Burton et al., 2004, 2005). The populations of antibodies we identify here may provide broader protection than a monoclonal member of the group. Furthermore, responses to infection or vaccination would be expected to generate diverse populations of antibodies, as we have shown here. Thus, population diversity, even within a single antibody clone or lineage, is likely to have a substantial impact on the effectiveness of the immune response.
Data Deposition
Next-generation sequencing data from donor 39 (heavy and light chains) and also for the 10 plasmid control have been deposited in the National Center for Biotechnology Information Short Reads Archives (SRA) under accession no. SRA055820. Information deposited with GenBank includes the heavy- and light-chain variable region sequences of genomically identified neutralizers: 10 heavy chains, gVRC-H1-10d39 (JX313021-30), amino-acid consensus heavy-chain gVRC-H11d39 (JX444560), and 6 light chains, gVRC-L1-6d39 (JX313030-36).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary Material
The Supplementary Material for this article can be found on line at http://www.frontiersin.org/Virology/10.3389/fmicb.2012.00315/abstract
Acknowledgments
We thank H. Coleman, M. Park, B. Schmidt, and A. Young for 454 pyrosequencing at the NIH Intramural Sequencing Center (NISC), J. Stuckey for assistance with figures. We also thank members of the Structural Biology Section and Structural Bioinformatics Core, Vaccine Research Center, for discussions or comments on the manuscript. We would like to thank all the study participants and research staff at each of the Protocol G clinical centers, and all of the Protocol G team members, the IAVI Human Immunology Laboratory, and all of the Protocol G clinical investigators, specifically, George Miiro, Anton Pozniak, Dale McPhee, Olivier Manigart, Etienne Karita, Andre Inwoley, Walter Jaoko, Jack DeHovitz, Linda-Gail Bekker, Punnee Pitisuttithum, Robert Paris, Jennifer Serwanga, and Susan Allen. Support for this work was provided by the Intramural Research Program of the Vaccine Research Center, National Institute of Allergy and Infectious Diseases and the National Human Genome Research Institute, National Institutes of Health, and by grants from the International AIDS Vaccine Initiative’s Neutralizing Antibody Consortium.
Footnotes
References
- Altschul S. F., Madden T. L., Schaffer A. A., Zhang J. H., Zhang Z., Miller W., Lipman D. J. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 10.1093/nar/25.17.3389 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Archer J., Braverman M. S., Taillon B. E., Desany B., James I., Harrigan P. R., Lewis M., Robertson D. L. (2009). Detection of low-frequency pretherapy chemokine (CXC motif) receptor 4 (CXCR4)-using HIV-1 with ultra-deep pyrosequencing. AIDS 23, 1209–1218 10.1097/QAD.0b013e32832b4399 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Balazs A. B., Chen J., Hong C. M., Rao D. S., Yang L., Baltimore D. (2011). Antibody-based protection against HIV infection by vectored immunoprophylaxis. Nature 481, 81–84 10.1038/nature10660 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barouch D. H., Nabel G. J. (2005). Adenovirus vector-based vaccines for human immunodeficiency virus type 1. Hum. Gene Ther. 16, 149–156 10.1089/hum.2005.16.149 [DOI] [PubMed] [Google Scholar]
- Boyd S. D., Gaeta B. A., Jackson K. J., Fire A. Z., Marshall E. L., Merker J. D., Maniar J. M., Zhang L. N., Sahaf B., Jones C. D., Simen B. B., Hanczaruk B., Nguyen K. D., Nadeau K. C., Egholm M., Miklos D. B., Zehnder J. L., Collins A. M. (2010). Individual variation in the germline Ig gene repertoire inferred from variable region gene rearrangements. J. Immunol. 184, 6986–6992 10.4049/jimmunol.1000445 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brochet X., Lefranc M.-P., Giudicelli V. (2008). IMGT/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis. Nucleic Acids Res. 36, W503–W508 10.1093/nar/gkn316 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brockman W., Alvarez P., Young S., Garber M., Giannoukos G., Lee W. L., Russ C., Lander E. S., Nusbaum C., Jaffe D. B. (2008). Quality scores and SNP detection in sequencing-by-synthesis systems. Genome Res. 18, 763–770 10.1101/gr.070227.107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burton D. R. (2002). Antibodies, viruses and vaccines. Nat. Rev. Immunol. 2, 706–713 10.1038/nri891 [DOI] [PubMed] [Google Scholar]
- Burton D. R., Desrosiers R. C., Doms R. W., Koff W. C., Kwong P. D., Moore J. P., Nabel G. J., Sodroski J., Wilson I. A., Wyatt R. T. (2004). HIV vaccine design and the neutralizing antibody problem. Nat. Immunol. 5, 233–236 10.1038/ni0304-233 [DOI] [PubMed] [Google Scholar]
- Burton D. R., Stanfield R. L., Wilson I. A. (2005). Antibody vs. HIV in a clash of evolutionary titans. Proc. Natl. Acad. Sci. U.S.A. 102, 14943–14948 10.1073/pnas.0505126102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen R., Mias G. I., Li-Pook-Than J., Jiang L., Lam H. Y., Chen R., Miriami E., Karczewski K. J., Hariharan M., Dewey F. E., Cheng Y., Clark M. J., Im H., Habegger L., Balasubramanian S., O’Huallachain M., Dudley J. T., Hillenmeyer S., Haraksingh R., Sharon D., Euskirchen G., Lacroute P., Bettinger K., Boyle A. P., Kasowski M., Grubert F., Seki S., Garcia M., Whirl-Carrillo M., Gallardo M., Blasco M. A., Greenberg P. L., Snyder P., Klein T. E., Altman R. B., Butte A. J., Ashley E. A., Gerstein M., Nadeau K. C., Tang H., Snyder M. (2012). Personal omics profiling reveals dynamic molecular and medical phenotypes. Cell 148, 1293–1307 10.1016/j.cell.2012.02.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corti D., Langedijk J. P., Hinz A., Seaman M. S., Vanzetta F., Fernandez-Rodriguez B. M., Silacci C., Pinna D., Jarrossay D., Balla-Jhagjhoorsingh S., Willems B., Zekveld M. J., Dreja H., O’Sullivan E., Pade C., Orkin C., Jeffs S. A., Montefiori D. C., Davis D., Weissenhorn W., McKnight A., Heeney J. L., Sallusto F., Sattentau Q. J., Weiss R. A., Lanzavecchia A. (2010). Analysis of memory B cell responses and isolation of novel monoclonal antibodies with neutralizing breadth from HIV-1-infected individuals. PLoS ONE 5, e8805. 10.1371/journal.pone.0008805 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doria-Rose N. A., Klein R. M., Daniels M. G., O’Dell S., Nason M., Lapedes A., Bhattacharya T., Migueles S. A., Wyatt R. T., Korber B. T., Mascola J. R., Connors M. (2010). Breadth of human immunodeficiency virus-specific neutralizing activity in sera: clustering analysis and association with clinical variables. J. Virol. 84, 1631–1636 10.1128/JVI.01482-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eriksson N., Pachter L., Mitsuya Y., Rhee S.-Y., Wang C., Gharizadeh B., Ronaghi M., Shafer R. W., Beerenwinkel N. (2008). Viral population estimation using pyrosequencing. PLoS Comput. Biol. 4, e1000074. 10.1371/journal.pcbi.1000074 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fischer W., Ganusov V. V., Giorgi E. E., Hraber P. T., Keele B. F., Leitner T., Han C. S., Gleasner C. D., Green L., Lo C.-C., Nag A., Wallstrom T. C., Wang S., McMichael A. J., Haynes B. F., Hahn B. H., Perelson A. S., Borrow P., Shaw G. M., Bhattacharya T., Korber B. T. (2010). Transmission of single HIV-1 genomes and dynamics of early immune escape revealed by ultra-deep sequencing. PLoS ONE 5, e12303. 10.1371/journal.pone.0012303 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gnanakaran S., Daniels M. G., Bhattacharya T., Lapedes A. S., Sethi A., Li M., Tang H., Greene K., Gao H., Haynes B. F., Cohen M. S., Shaw G. M., Seaman M. S., Kumar A., Gao F., Montefiori D. C., Korber B. (2010). Genetic signatures in the envelope glycoproteins of HIV-1 that associate with broadly neutralizing antibodies. PLoS Comput. Biol. 6, e1000955. 10.1371/journal.pcbi.1000955 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gonda M. A., Wongstaal F., Gallo R. C., Clements J. E., Narayan O., Gilden R. V. (1985). Sequence homology and morphologic similarity of HTLV-III and Visna virus, a pathogenic lenti-virus. Science 227, 173–177 10.1126/science.2981428 [DOI] [PubMed] [Google Scholar]
- Gray E. S., Taylor N., Wycuff D., Moore P. L., Tomaras G. D., Wibmer C. K., Puren A., Decamp A., Gilbert P. B., Wood B., Montefiori D. C., Binley J. M., Shaw G. M., Haynes B. F., Mascola J. R., Morris L. (2009). Antibody specificities associated with neutralization breadth in plasma from human immunodeficiency virus type 1 subtype C-infected blood donors. J. Virol. 83, 8925–8937 10.1128/JVI.01082-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hawkins R. D., Hon G. C., Ren B. (2010). Next-generation genomics: an integrative approach. Nat. Rev. Genet. 11, 476–486 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hessell A. J., Poignard P., Hunter M., Hangartner L., Tehrani D. M., Bleeker W. K., Parren P. W., Marx P. A., Burton D. R. (2009a). Effective, low-titer antibody protection against low-dose repeated mucosal SHIV challenge in macaques. Nat. Med. 15, 951–954 10.1038/nm.1974 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hessell A. J., Rakasz E. G., Poignard P., Hangartner L., Landucci G., Forthal D. N., Koff W. C., Watkins D. I., Burton D. R. (2009b). Broadly neutralizing human anti-HIV antibody 2G12 is effective in protection against mucosal SHIV challenge even at low serum neutralizing titers. PLoS Pathog. 5, e1000433. 10.1371/journal.ppat.1000433 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huson D. H., Richter D. C., Rausch C., Dezulian T., Franz M., Rupp R. (2007). Dendroscope: An interactive viewer for large phylogenetic trees. BMC Bioinformatics 8, 460. 10.1186/1471-2105-8-460 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kahn S. D. (2011). On the future of genomic data. Science 331, 728–729 10.1126/science.1197891 [DOI] [PubMed] [Google Scholar]
- Kitano H. (2002). Systems biology: a brief overview. Science 295, 1662–1664 10.1126/science.1069492 [DOI] [PubMed] [Google Scholar]
- Korber B., Gaschen B., Yusim K., Thakallapally R., Kesmir C., Detours V. (2001). Evolutionary and immunological implications of contemporary HIV-1 variation. Br. Med. Bull. 58, 19–42 10.1093/bmb/58.1.19 [DOI] [PubMed] [Google Scholar]
- Kuhner M. K., Felsenstein J. (1994). A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Mol. Biol. Evol. 11, 459–468 [DOI] [PubMed] [Google Scholar]
- Kwong P. D., Mascola J. R., Nabel G. J. (2011). Rational Design of vaccines to elicit broadly neutralizing antibodies to HIV-1. Cold Spring Harb. Perspect. Med. 1, a007278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lander E. S., Linton L. M., Birren B., Nusbaum C., Zody M. C., Baldwin J., Devon K., Dewar K., Doyle M., Fitzhugh W., Funke R., Gage D., Harris K., Heaford A., Howland J., Kann L., Lehoczky J., Levine R., McEwan P., McKernan K., Meldrim J., Mesirov J. P., Miranda C., Morris W., Naylor J., Raymond C., Rosetti M., Santos R., Sheridan A., Sougnez C., Stange-Thomann N., Stojanovic N., Subramanian A., Wyman D., Rogers J., Sulston J., Ainscough R., Beck S., Bentley D., Burton J., Clee C., Carter N., Coulson A., Deadman R., Deloukas P., Dunham A., Dunham I., Durbin R., French L., Grafham D., Gregory S., Hubbard T., Humphray S., Hunt A., Jones M., Lloyd C., McMurray A., Matthews L., Mercer S., Milne S., Mullikin J. C., Mungall A., Plumb R., Ross M., Shownkeen R., Sims S., Waterston R. H., Wilson R. K., Hillier L. W., McPherson J. D., Marra M. A., Mardis E. R., Fulton L. A., Chinwalla A. T., Pepin K. H., Gish W. R., Chissoe S. L., Wendl M. C., Delehaunty K. D., Miner T. L., Delehaunty A., Kramer J. B., Cook L. L., Fulton R. S., Johnson D. L., Minx P. J., Clifton S. W., Hawkins T., Branscomb E., Predki P., Richardson P., Wenning S., Slezak T., Doggett N., Cheng J. F., Olsen A., Lucas S., Elkin C., Uberbacher E., Frazier M., Gibbs R. A., Muzny D. M., Scherer S. E., Bouck J. B., Sodergren E. J., Worley K. C., Rives C. M., Gorrell J. H., Metzker M. L., Naylor S. L., Kucherlapati R. S., Nelson D. L., Weinstock G. M., Sakaki Y., Fujiyama A., Hattori M., Yada T., Toyoda A., Itoh T., Kawagoe C., Watanabe H., Totoki Y., Taylor T., Weissenbach J., Heilig R., Saurin W., Artiguenave F., Brottier P., Bruls T., Pelletier E., Robert C., Wincker P., Smith D. R., Doucette-Stamm L., Rubenfield M., Weinstock K., Lee H. M., Dubois J., Rosenthal A., Platzer M., Nyakatura G., Taudien S., Rump A., Yang H., Yu J., Wang J., Huang G., Gu J., Hood L., Rowen L., Madan A., Qin S., Davis R. W., Federspiel N. A., Abola A. P., Proctor M. J., Myers R. M., Schmutz J., Dickson M., Grimwood J., Cox D. R., Olson M. V., Kaul R., Raymond C., Shimizu N., Kawasaki K., Minoshima S., Evans G. A., Athanasiou M., Schultz R., Roe B. A., Chen F., Pan H., Ramser J., Lehrach H., Reinhardt R., McCombie W. R., de la Bastide M., Dedhia N., Blöcker H., Hornischer K., Nordsiek G., Agarwala R., Aravind L., Bailey J. A., Bateman A., Batzoglou S., Birney E., Bork P., Brown D. G., Burge C. B., Cerutti L., Chen H. C., Church D., Clamp M., Copley R. R., Doerks T., Eddy S. R., Eichler E. E., Furey T. S., Galagan J., Gilbert J. G., Harmon C., Hayashizaki Y., Haussler D., Hermjakob H., Hokamp K., Jang W., Johnson L. S., Jones T. A., Kasif S., Kaspryzk A., Kennedy S., Kent W. J., Kitts P., Koonin E. V., Korf I., Kulp D., Lancet D., Lowe T. M., McLysaght A., Mikkelsen T., Moran J. V., Mulder N., Pollara V. J., Ponting C. P., Schuler G., Schultz J., Slater G., Smit A. F., Stupka E., Szustakowski J., Thierry-Mieg D., Thierry-Mieg J., Wagner L., Wallis J., Wheeler R., Williams A., Wolf Y. I., Wolfe K. H., Yang S. P., Yeh R. F., Collins F., Guyer M. S., Peterson J., Felsenfeld A., Wetterstrand K. A., Patrinos A., Morgan M. J., de Jong P., Catanese J. J., Osoegawa K., Shizuya H., Choi S., Chen Y. J., International Human Genome Sequencing Consortium (2001). Initial sequencing and analysis of the human genome. Nature 409, 860–921 10.1038/35057062 [DOI] [PubMed] [Google Scholar]
- Larkin M. A., Blackshields G., Brown N. P., Chenna R., McGettigan P. A., McWilliam H., Valentin F., Wallace I. M., Wilm A., Lopez R., Thompson J. D., Gibson T. J., Higgins D. G. (2007). Clustal W and clustal X version 2.0 . Bioinformatics 23, 2947–2948 10.1093/bioinformatics/btm404 [DOI] [PubMed] [Google Scholar]
- Li M., Gao F., Mascola J. R., Stamatatos L., Polonis V. R., Koutsoukos M., Voss G., Goepfert P., Gilbert P., Greene K. M., Bilska M., Kothe D. L., Salazar-Gonzalez J. F., Wei X., Decker J. M., Hahn B. H., Montefiori D. C. (2005). Human immunodeficiency virus type 1 env clones from acute and early subtype B infections for standardized assessments of vaccine-elicited neutralizing antibodies. J. Virol. 79, 10108–10125 10.1128/JVI.79.18.12065-12076.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y., Migueles S. A., Welcher B., Svehla K., Phogat A., Louder M. K., Wu X., Shaw G. M., Connors M., Wyatt R. T., Mascola J. R. (2007). Broad HIV-1 neutralization mediated by CD4-binding site antibodies. Nat. Med. 13, 1032–1034 10.1038/nm1624 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mardis E. R. (2008a). The impact of next-generation sequencing technology on genetics. Trends Genet. 24, 133–141 10.1016/j.tig.2007.12.007 [DOI] [PubMed] [Google Scholar]
- Mardis E. R. (2008b). Next-generation DNA sequencing methods. Annu. Rev. Genomics Hum. Genet. 9, 387–402 10.1146/annurev.genom.9.081307.164359 [DOI] [PubMed] [Google Scholar]
- Mascola J. R. (2003). Defining the protective antibody response for HIV-1. Curr. Mol. Med. 3, 209–216 10.2174/1566524033479799 [DOI] [PubMed] [Google Scholar]
- Mascola J. R., Lewis M. G., Stiegler G., Harris D., Vancott T. C., Hayes D., Louder M. K., Brown C. R., Sapan C. V., Frankel S. S., Lu Y., Robb M. L., Katinger H., Birx D. L. (1999). Protection of macaques against pathogenic simian/human immunodeficiency virus 89.6PD by passive transfer of neutralizing antibodies. J. Virol. 73, 4009–4018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mascola J. R., Stiegler G., Vancott T. C., Katinger H., Carpenter C. B., Hanson C. E., Beary H., Hayes D., Frankel S. S., Birx D. L., Lewis M. G. (2000). Protection of macaques against vaginal transmission of a pathogenic HIV- 1/SIV chimeric virus by passive infusion of neutralizing antibodies. Nat. Med. 6, 207–210 10.1038/72318 [DOI] [PubMed] [Google Scholar]
- Parren P. W., Marx P. A., Hessell A. J., Luckay A., Harouse J., Cheng-Mayer C., Moore J. P., Burton D. R. (2001). Antibody protects macaques against vaginal challenge with a pathogenic R5 simian/human immunodeficiency virus at serum levels giving complete neutralization in vitro. J. Virol. 75, 8340–8347 10.1128/JVI.75.17.8340-8347.2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parren P. W., Moore J. P., Burton D. R., Sattentau Q. J. (1999). The neutralizing antibody response to HIV-1: viral evasion and escape from humoral immunity. AIDS 13, S137–S162 [PubMed] [Google Scholar]
- Poignard P., Sabbe R., Picchio G. R., Wang M., Gulizia R. J., Katinger H., Parren P. W., Mosier D. E., Burton D. R. (1999). Neutralizing antibodies have limited effects on the control of established HIV-1 infection in vivo. Immunity 10, 431–438 10.1016/S1074-7613(00)80043-6 [DOI] [PubMed] [Google Scholar]
- Prabakaran P., Streaker E., Chen W., Dimitrov D. S. (2011). 454 antibody sequencing – error characterization and correction. BMC Res. Notes 4, 404. 10.1186/1756-0500-4-404 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Preston B. D., Poiesz B. J., Loeb L. A. (1988). Fidelity of HIV-1 reverse transcriptase. Science 242, 1168–1171 10.1126/science.2460924 [DOI] [PubMed] [Google Scholar]
- Reddy S. T., Ge X., Miklos A. E., Hughes R. A., Kang S. H., Hoi K. H., Chrysostomou C., Hunicke-Smith S. P., Iverson B. L., Tucker P. W., Ellington A. D., Georgiou G. (2009). Monoclonal antibodies isolated without screening by analyzing the variable-gene repertoire of plasma cells. Nat. Biotechnol. 28, U965–U969 10.1038/nbt.1673 [DOI] [PubMed] [Google Scholar]
- Reddy S. T., Georgiou G. (2011). Systems analysis of adaptive immunity by utilization of high-throughput technologies. Curr. Opin. Biotechnol. 22, 584–589 10.1016/j.copbio.2011.04.015 [DOI] [PubMed] [Google Scholar]
- Sather D. N., Armann J., Ching L. K., Mavrantoni A., Sellhorn G., Caldwell Z., Yu X., Wood B., Self S., Kalams S., Stamatatos L. (2009). Factors associated with the development of cross-reactive neutralizing antibodies during human immunodeficiency virus type 1 infection. J. Virol. 83, 757–769 10.1128/JVI.02036-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scheid J. F., Mouquet H., Feldhahn N., Seaman M. S., Velinzon K., Pietzsch J., Ott R. G., Anthony R. M., Zebroski H., Hurley A., Phogat A., Chakrabarti B., Li Y., Connors M., Pereyra F., Walker B. D., Wardemann H., Ho D., Wyatt R. T., Mascola J. R., Ravetch J. V., Nussenzweig M. C. (2009). Broad diversity of neutralizing antibodies isolated from memory B cells in HIV-infected individuals. Nature 458, 636–640 10.1038/nature07930 [DOI] [PubMed] [Google Scholar]
- Scheid J. F., Mouquet H., Ueberheide B., Diskin R., Klein F., Oliveira T. Y., Pietzsch J., Fenyo D., Abadir A., Velinzon K., Hurley A., Myung S., Boulad F., Poignard P., Burton D. R., Pereyra F., Ho D. D., Walker B. D., Seaman M. S., Bjorkman P. J., Chait B. T., Nussenzweig M. C. (2011). Sequence and structural convergence of broad and potent HIV antibodies that mimic CD4 binding. Science 333, 1633–1637 10.1126/science.1207227 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seaman M. S., Janes H., Hawkins N., Grandpre L. E., Devoy C., Giri A., Coffey R. T., Harris L., Wood B., Daniels M. G., Bhattacharya T., Lapedes A., Polonis V. R., McCutchan F. E., Gilbert P. B., Self S. G., Korber B. T., Montefiori D. C., Mascola J. R. (2010). Tiered categorization of a diverse panel of HIV-1 Env pseudoviruses for neutralizing antibody assessment. J. Virol. 84, 1439–1452 10.1128/JVI.02108-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simek M. D., Rida W., Priddy F. H., Pung P., Carrow E., Laufer D. S., Lehrman J. K., Boaz M., Tarragona-Fiol T., Miiro G., Birungi J., Pozniak A., McPhee D. A., Manigart O., Karita E., Inwoley A., Jaoko W., Dehovitz J., Bekker L. G., Pitisuttithum P., Paris R., Walker L. M., Poignard P., Wrin T., Fast P. E., Burton D. R., Koff W. C. (2009). Human immunodeficiency virus type 1 elite neutralizers: individuals with broad and potent neutralizing activity identified by using a high-throughput neutralization assay together with an analytical selection algorithm. J. Virol. 83, 7337–7348 10.1128/JVI.00110-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sonigo P., Alizon M., Staskus K., Klatzmann D., Cole S., Danos O., Retzel E., Tiollais P., Haase A., Wainhobson S. (1985). Nucleotide-sequence of the visna lentivirus – relationship to the AIDS virus. Cell 42, 369–382 10.1016/S0092-8674(85)80132-X [DOI] [PubMed] [Google Scholar]
- Stamatatos L., Morris L., Burton D. R., Mascola J. R. (2009). Neutralizing antibodies generated during natural HIV-1 infection: good news for an HIV-1 vaccine? Nat. Med. 15, 866–870 [DOI] [PubMed] [Google Scholar]
- Starcich B. R., Hahn B. H., Shaw G. M., McNeely P. D., Modrow S., Wolf H., Parks E. S., Parks W. P., Josephs S. F., Gallo R. C., Wongstaal F. (1986). Identification and characterization of conserved and variable regions in the envelope gene of HTLV-III/LAV, the retrovirus of AIDS. Cell 45, 637–648 10.1016/0092-8674(86)90778-6 [DOI] [PubMed] [Google Scholar]
- Tsibris A. M. N., Korber B., Arnaout R., Russ C., Lo C.-C., Leitner T., Gaschen B., Theiler J., Paredes R., Su Z., Hughes M. D., Gulick R. M., Greaves W., Coakley E., Flexner C., Nusbaum C., Kuritzkes D. R. (2009). Quantitative deep sequencing reveals dynamic HIV-1 escape and large population shifts during CCR5 antagonist therapy in vivo. PLoS ONE 4, e5683. 10.1371/journal.pone.0005683 [DOI] [PMC free article] [PubMed] [Google Scholar]
- UNAIDS. (2010). Joint United Nations Programme on HIV/AIDS. UN Report on the Global AIDS Epidemic 2010. Available at: http://www.unaids.org/globalreport/Global_report.htm [Google Scholar]
- Veazey R. S., Shattock R. J., Pope M., Kirijan J. C., Jones J., Hu Q., Ketas T., Marx P. A., Klasse P. J., Burton D. R., Moore J. P. (2003). Prevention of virus transmission to macaque monkeys by a vaginally applied monoclonal antibody to HIV-1 gp120. Nat. Med. 9, 343–346 10.1038/nm833 [DOI] [PubMed] [Google Scholar]
- Venter J. C., Adams M. D., Myers E. W., Li P. W., Mural R. J., Sutton G. G., Smith H. O., Yandell M., Evans C. A., Holt R. A., Gocayne J. D., Amanatides P., Ballew R. M., Huson D. H., Wortman J. R., Zhang Q., Kodira C. D., Zheng X. Q. H., Chen L., Skupski M., Subramanian G., Thomas P. D., Zhang J. H., Miklos G. L. G., Nelson C., Broder S., Clark A. G., Nadeau C., McKusick V. A., Zinder N., Levine A. J., Roberts R. J., Simon M., Slayman C., Hunkapiller M., Bolanos R., Delcher A., Dew I., Fasulo D., Flanigan M., Florea L., Halpern A., Hannenhalli S., Kravitz S., Levy S., Mobarry C., Reinert K., Remington K., Abu-Threideh J., Beasley E., Biddick K., Bonazzi V., Brandon R., Cargill M., Chandramouliswaran I., Charlab R., Chaturvedi K., Deng Z. M., Di Francesco V., Dunn P., Eilbeck K., Evangelista C., Gabrielian A. E., Gan W., Ge W. M., Gong F. C., Gu Z. P., Guan P., Heiman T. J., Higgins M. E., Ji R. R., Ke Z. X., Ketchum K. A., Lai Z. W., Lei Y. D., Li Z. Y., Li J. Y., Liang Y., Lin X. Y., Lu F., Merkulov G. V., Milshina N., Moore H. M., Naik A. K., Narayan V. A., Neelam B., Nusskern D., Rusch D. B., Salzberg S., Shao W., Shue B. X., Sun J. T., Wang Z. Y., Wang A. H., Wang X., Wang J., Wei M. H., Wides R., Xiao C. L., Yan C. H., Yao A., Ye J., Zhan M., Zhang W., Zhang H., Zhao Q., Zheng L., Zhong F., Zhong W., Zhu S., Zhao S., Gilbert D., Baumhueter S., Spier G., Carter C., Cravchik A., Woodage T., Ali F., An H., Awe A., Baldwin D., Baden H., Barnstead M., Barrow I., Beeson K., Busam D., Carver A., Center A., Cheng M. L., Curry L., Danaher S., Davenport L., Desilets R., Dietz S., Dodson K., Doup L., Ferriera S., Garg N., Gluecksmann A., Hart B., Haynes J., Haynes C., Heiner C., Hladun S., Hostin D., Houck J., Howland T., Ibegwam C., Johnson J., Kalush F., Kline L., Koduru S., Love A., Mann F., May D., McCawley S., McIntosh T., McMullen I., Moy M., Moy L., Murphy B., Nelson K., Pfannkoch C., Pratts E., Puri V., Qureshi H., Reardon M., Rodriguez R., Rogers Y. H., Romblad D., Ruhfel B., Scott R., Sitter C., Smallwood M., Stewart E., Strong R., Suh E., Thomas R., Tint N. N., Tse S., Vech C., Wang G., Wetter J., Williams S., Williams M., Windsor S., Winn-Deen E., Wolfe K., Zaveri J., Zaveri K., Abril J. F., Guigó R., Campbell M. J., Sjolander K. V., Karlak B., Kejariwal A., Mi H., Lazareva B., Hatton T., Narechania A., Diemer K., Muruganujan A., Guo N., Sato S., Bafna V., Istrail S., Lippert R., Schwartz R., Walenz B., Yooseph S., Allen D., Basu A., Baxendale J., Blick L., Caminha M., Carnes-Stine J., Caulk P., Chiang Y. H., Coyne M., Dahlke C., Mays A., Dombroski M., Donnelly M., Ely D., Esparham S., Fosler C., Gire H., Glanowski S., Glasser K., Glodek A., Gorokhov M., Graham K., Gropman B., Harris M., Heil J., Henderson S., Hoover J., Jennings D., Jordan C., Jordan J., Kasha J., Kagan L., Kraft C., Levitsky A., Lewis M., Liu X., Lopez J., Ma D., Majoros W., McDaniel J., Murphy S., Newman M., Nguyen T., Nguyen N., Nodell M., Pan S., Peck J., Peterson M., Rowe W., Sanders R., Scott J., Simpson M., Smith T., Sprague A., Stockwell T., Turner R., Venter E., Wang M., Wen M., Wu D., Wu M., Xia A., Zandieh A., Zhu X. (2001). The sequence of the human genome. Science 291, 1304–1351 10.1126/science.1058040 [DOI] [PubMed] [Google Scholar]
- Walker L. M., Burton D. R. (2010). Rational antibody-based HIV-1 vaccine design: current approaches and future directions. Curr. Opin. Immunol. 22, 358–366 10.1016/j.coi.2010.02.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walker L. M., Huber M., Doores K. J., Falkowska E., Pejchal R., Julien J. P., Wang S. K., Ramos A., Chan-Hui P. Y., Moyle M., Mitcham J. L., Hammond P. W., Olsen O. A., Phung P., Fling S., Wong C. H., Phogat S., Wrin T., Simek M. D., Koff W. C., Wilson I. A., Burton D. R., Poignard P. (2011). Broad neutralization coverage of HIV by multiple highly potent antibodies. Nature 477, 466–470 10.1038/nature10373 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walker L. M., Phogat S. K., Chan-Hui P. Y., Wagner D., Phung P., Goss J. L., Wrin T., Simek M. D., Fling S., Mitcham J. L., Lehrman J. K., Priddy F. H., Olsen O. A., Frey S. M., Hammond P. W., Kaminsky S., Zamb T., Moyle M., Koff W. C., Poignard P., Burton D. R. (2009). Broad and potent neutralizing antibodies from an African donor reveal a new HIV-1 vaccine target. Science 326, 285–289 10.1126/science.1178746 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wei X., Decker J. M., Wang S., Hui H., Kappes J. C., Wu X., Salazar-Gonzalez J. F., Salazar M. G., Kilby J. M., Saag M. S., Komarova N. L., Nowak M. A., Hahn B. H., Kwong P. D., Shaw G. M. (2003). Antibody neutralization and escape by HIV-1. Nature 422, 307–312 10.1038/nature01470 [DOI] [PubMed] [Google Scholar]
- Wu X., Yang Z. Y., Li Y., Hogerkorp C. M., Schief W. R., Seaman M. S., Zhou T., Schmidt S. D., Wu L., Xu L., Longo N. S., McKee K., O’Dell S., Louder M. K., Wycuff D. L., Feng Y., Nason M., Doria-Rose N., Connors M., Kwong P. D., Roederer M., Wyatt R. T., Nabel G. J., Mascola J. R. (2010). Rational design of envelope identifies broadly neutralizing human monoclonal antibodies to HIV-1. Science 329, 856–861 10.1126/science.1190485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu X., Zhou T., O’Dell S., Wyatt R. T., Kwong P. D., Mascola J. R. (2009). Mechanism of human immunodeficiency virus type 1 resistance to monoclonal antibody B12 that effectively targets the site of CD4 attachment. J. Virol. 83, 10892–10907 10.1128/JVI.00260-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu X., Zhou T., Zhu J., Zhang B., Georgiev I., Wang C., Chen X., Longo N. S., Louder M., McKee K., O’Dell S., Perfetto S., Schmidt S. D., Shi W., Wu L., Yang Y., Yang Z. Y., Yang Z., Zhang Z., Bonsignori M., Crump J. A., Kapiga S. H., Sam N. E., Haynes B. F., Simek M., Burton D. R., Koff W. C., Doria-Rose N. A., Connors M., Mullikin J. C., Nabel G. J., Roederer M., Shapiro L., Kwong P. D., Mascola J. R. (2011). Focused evolution of HIV-1 neutralizing antibodies revealed by structures and deep sequencing. Science 333, 1593–1602 10.1126/science.1204117 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.