Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2020 Jul 10;16(7):e1007731. doi: 10.1371/journal.pcbi.1007731

AncesTree: An interactive immunoglobulin lineage tree visualizer

Mathilde Foglierini 1,2,*, Leontios Pappas 1, Antonio Lanzavecchia 1, Davide Corti 3, Laurent Perez 1
Editor: Mihaela Pertea4
PMCID: PMC7375605  PMID: 32649725

Abstract

High-throughput sequencing of human immunoglobulin genes allows analysis of antibody repertoires and the reconstruction of clonal lineage evolution. The study of antibodies (Abs) affinity maturation is of specific interest to understand the generation of Abs with high affinity or broadly neutralizing activities. Moreover, phylogenic analysis enables the identification of the key somatic mutations required to achieve optimal antigen binding. The Immcantation framework provides a start-to-finish set of analytical methods for high-throughput adaptive immune receptor repertoire sequencing (AIRR-Seq; Rep-Seq) data. Furthermore, Immcantation’s Change-O package has developed IgPhyML, an algorithm designed to build specifically immunoglobulin (Ig) phylogenic trees. Meanwhile Phylip, an algorithm that has been originally developed for applications in ecology and macroevolution, can also be used for the phylogenic reconstruction of antibodies maturation pathway. To complement Ig lineages made by IgPhyML or Dnaml (Phylip), we developed AncesTree, a graphic user interface (GUI) that aims to give researchers the opportunity to interactively explore antibodies clonal evolution. AncesTree displays interactive immunoglobulins phylogenic tree, Ig related mutations and sequence alignments using additional information coming from specialized antibody tools. The GUI is a Java standalone application allowing interaction with Ig tree that can run under Windows, Linux and Mac OS.


This is a PLOS Computational Biology Software paper.

Introduction

Development of Next Generation Sequencing (NGS) methodology and its use for high-throughput sequencing of the Adaptive Immune Receptor Repertoire (AIRR-seq) has provided unprecedented molecular insight into the complexity of the humoral adaptive immune response by generating Ig data sets of 100 million to billions of reads. Different computational methods have been developed to exploit and analyze these data [1]. Retracing the antigen-driven evolution of Ig repertoires by inferring antibody evolution lineages is a powerful method to understand how vaccines or pathogens shape the humoral immune response [25]. Indeed, Abs maturation is the result of clonal selection during B cell expansion and a clonal lineage is defined as immunoglobulin sequences originating from the same recombination event occurring between the V, D and J segments [6]. B cell receptor (BCR) engagement by a given antigen will trigger somatic hypermutations (SHMs) events generating a large BCR diversity. This process leads to antibodies with mutated Ig variable regions, thus forming a specific B-cell lineage that extends from the naive unmutated B-cells, to somatically hypermutated and class switched memory B or plasma-cells [7]. Lineage tree building requires a common preprocessing step, the clonal lineage assignment [8]. A common starting approach is to initially cluster sequences by their V and J genes and by their CDR3 length. Commonly used tools capable of aligning Ig sequences are MiXCR, IMGT, IgBlast, SONAR, IGoR and iHMMunealign [913]. One of the major drawbacks of the previously mentioned tools is the reliance of the initial alignment with the germline and the exclusion of insertion/deletions (indels) events in the lineage. To circumvent those problems other methods were developed: i) Partis and SONAR [14, 15] can perform both unseeded and seeded lineage assignment ii) Clonify, using a hierarchical clustering approach, performs unseeded lineage assignment [16]. Nonetheless, there is no consensus as to which phylogenetic method is optimal to infer the ancestral evolutionary relationships among Ig sequences [17, 18]. Literally, several methods have been used, such as Levenshtein distance (LD), neighbor joining (NJ), maximum parsimony (MP), maximum likelihood (ML), and Bayesian inference (BEAST) [1922]. The DNA Maximum Likelihood program (Dnaml) of the PHYLIP package [23], is a ML method that has been originally developed for applications in ecology. It is also commonly used to infer B cell clonal lineages [2429]. Visualization of the phylogeny is performed using Dendroscope [30, 31]. Meanwhile, a framework was developed to provide a start-to-finish toolbox to process high-throughput AIRR-seq datasets. The Immcantation framework (https://immcantation.readthedocs.io/en/stable/) is currently the gold standard for antibody repertoire analysis. The Change-O tool [32], which is part of Immcantation, was developed to make i) a V(D)J reference alignment standardization after sequences annotation by IMGT/High-VQUEST [33] or IgBlast ii) clonal clustering iii) germline reconstruction iv) conversion and annotation. The IgPhyML algorithm, which is part of Change-O, allows the reconstruction of phylogenic tree by implementing substitution models that correct for the context-sensitive nature of SHM, and combines information from multiple lineages to give more precisely estimated repertoire-wide model parameter estimates. Currently, there is no efficient bioinformatics tool allowing an interactive display of phylogenic tree inferred from Ig sequences. Here we developed AncesTree, an Ig lineage tree visualizer that also integrates information coming from most used antibody bioinformatics tools: IgBlast, IMGT, Change-O, Kabat numbering [34] and BASELINe [35]. AncesTree enables users to interact with a tree containing up to thousands Ig sequences, which were generated by Dnaml or IgPhyML, via the GUI. It is a standalone application that is platform independent and only need JAVA JRE 12 or higher as prerequisite software installed.

Design and implementation

The AncesTree workflow is presented in Fig 1, it consists of three different main steps: Input, Processing and Outputs. Importantly, phylogenetic tree analyses coming from two different tools can be used by AncesTree (Dnaml or Immcantation). If the Dnaml workflow is used, AncesTree will parse the Dnaml output text file. If the Immcantation workflow is used (i.e. RepSeq data), the Change-O tab file in AIRR format, the IgPhyML tab file and it related fasta file (with the reconstructed intermediate sequences) are used as input. Once AncesTree processes the input file(s), it will generate a tree in a graphic interface to allow direct interactivity. Features specific for Ig analysis are included in the GUI.

Fig 1. AncesTree workflow.

Fig 1

Dnaml workflow is used (left part of the figure), DNA sequences are aligned with Clustal Ω and processed by Dnaml, a phylogenetic tree is generated. If the Immcantation workflow is used (right part of the figure), reads are processed through Change-O pipeline and IgPhyML will generate the trees. AncesTree processes the different inputs and reconstructs the phylogenic tree with all information related to Ig. The tree is displayed in a GUI and an Extensible Markup Language (XML) file is produced (that could be used as direct input into AncesTree).

Input

AncesTree has two possibilities to display an Ig lineage tree: Dnaml workflow or Immcantation workflow (Fig 1). The first one is the Dnaml workflow. The required input for AncesTree usage is the output text file generated by Dnaml. Optionally, a fasta file with data obtained from IMGT can also be used to have full AncesTree features. A clonal family is composed of heavy (or light) V(D)J sequences and their related unmutated common ancestor (UCA). The UCA can be inferred with Antigen Receptor Probabilistic Parser (ARPP) UA Inference software [36] or Cloanalyst [37]. Then, sequences are aligned with Clustal Ω [38] and the generated file in PHYLIP format can be provided for Dnaml. Next, Dnaml is launched with the following settings: ‘O’ for the outgroup root with the number corresponding to the UCA position provided in the PHYLIP input text file and ‘5’ to reconstruct hypothetical sequences. The generated ‘outfile’ text file can be used as input for AncesTree. To visualize the different frameworks (FW) and complementary-determining (CDR) regions that composed the Ig variable region, a fasta file can be uploaded. The user provides a fasta file containing the following information: the UCA V(D)J sequence in IMGT format including gaps, and the end positions of each region included in the fasta identifier (separated by a space). This information is easily retrieved using IMGT/V-QUEST [33] with the UCA nucleotide sequence as input. The second possibility for AncesTree to display an Ig lineage tree is through the Immcantation workflow. In most of the cases, this workflow is applied to RepSeq data coming from NGS experiments, but it can also be applied to a small set of sequences. Change-O is processed to run the following steps: i) align to V(D)J reference (sequences are annotated by IMGT/High-VQUEST or IgBlast prior to this step), ii) filter in the productive sequences, cluster sequences by clone, reconstruct germline by clone and iii) convert the change-O file into AIRR format [39]. To infer phylogenic trees with the clones of interest, IgPhyML is run with the ‘—asr’ option, allowing to reconstruct the intermediate sequences of the tree. Of note, this option is only available with the docker image develop version of Immcantation. AncesTree takes as input the Change-O file in AIRR format, the related IgPhyML tab file and the fasta IgPhyML file produced by the ‘—asr’ option. Finally, the user has to specify with a drop down menu which clone id (from the IgPhyML tab file) he wants to be displayed into AncesTree GUI.

Processing

AncesTree parses the Dnaml output file or the Change-O and IgPhyML files. The Dnaml file is a text file and does not required a tree in Newick format. Naturally, the relationship between the different nodes of the tree is already stored, in addition to the sequence of each node in the Dnaml output text file. Conversely, in the case of IgPhyML input, the tree is reconstructed from a newick format. The theoretical intermediate reconstructed sequences are renamed branch points (BPs). In cases of ambiguous nucleotide notation (IUPAC nomenclature) with the Dnaml input file, AncesTree selects the nucleotide with the highest probability based on the Ig sequences retrieved after this BP. Of note, IgPhyML already makes this correction with the ‘—asr’ option set to 0.1. AncesTree has the ability to collapse a node if the sequences are identical, for example in the case of a theoretical BP corresponding to an existing Ig. Moreover, AncesTree will also draw different nodes clustered together in the case of identical Ig sequences, thus providing a clear topology view of the tree.

Outputs

After running AncesTree, a sub-folder is automatically created in the ‘output’ folder using the name of the Dnaml output file or the name of the IgPhyML file with the selected clone id. The resulting folder will contain all produced files such as a XML file that can be used for direct loading into the GUI.

AncesTree displays the processed tree in the main panel of the GUI (Fig 2A). The number of nucleotide and amino acid mutations are written on the edge between each node/sequence (with amino acid mutations shown in parenthesis) and it is clickable, enabling the opening of a new window frame that displays the detailed location of each mutation (Fig 2B). Of note, the color of the box around each mutated codon indicates whether the mutation is replacement (R) in red or silent (S) in green. This information is also available as R/S numbers under each region. The user can view the amino acid (a.a.) mutations, and have access by default to the Kabat numbering of the related a.a. position (without internet access, AncesTree will use the a.a. absolute position). To obtain the nucleotide or protein sequence of a node, the user can click on it (Fig 2C). The user also has the possibility to enter the EC50 for the specified Ig. The sequence alignments (DNA or protein) are accessible in a new frame via the ‘Menu’ button on the top (Fig 2D). Finally, the alignment view is customizable: the sequences can be selected or deselected, as well as different positions or regions. Different color modes can be chosen.

Fig 2. Snapshot of AncesTree GUI.

Fig 2

(A) The tree generated by Dnaml is displayed in the main panel. The BASELINe analysis for the clonal family is displayed in the right upper corner. (B) The mutations between two nodes can be displayed in a separate window and they are positioned using IMGT sequence annotation. (C) The user can have access to each specific node to obtain the related sequences (DNA or protein) and add comments. (D) An alignment is generated with the UCA appearing in the first lane, and a ruler indicates the different regions that compose an Ig sequence.

If the user is interested in a BASELINe analysis of a clonal family of interest, and if the optional input fasta file (with the UCA VDJ sequence including gaps) was provided with the Dnaml input, AncesTree will automatically generates the fasta input file needed for this software (http://selection.med.yale.edu/baseline/). Once BASELINe has finished to process, the output file can be loaded into AncesTree to have a nice graphic view of antigen-driven selection occurring for this particular clonal family. All graph generated can be exported in PNG or EPS format and the alignment can be exported as Tab-separated Values (TSV) file.

Results

To demonstrate the utility of AncesTree we analyzed a case study by performing the analysis of an Ig lineage tree targeting the fusion protein (F) of the Respiratory Syncytial Virus (RSV). RSV is an enveloped RNA virus belonging to the recently defined Pneumoviridae family [40]. Infection of healthy adults by RSV typically results in mild respiratory symptoms. However, viral infection of infants and older adults, accounts for a substantial hospitalization burden in both age groups [41]. Indeed, RSV infection is the second cause of infant mortality worldwide after malaria [42]. Understanding the immunological basis for the development of potent neutralizing antibodies is a key step for the development of an effective vaccine for RSV.

Case study: Exploration of Ig lineage targeting the Fusion protein of the Respiratory Syncytial Virus (F-RSV)

To demonstrate the practical use of AncesTree, we re-analyzed an Ig dataset generated post infection by Respiratory Syncytial Viral infection (HRSV). The dataset was collected by isolating antibodies direct against the F-RSV protein, a class I fusion protein mediating viral entry into host cells [43]. The Ig sequences were clustered by grouping antibodies sharing the same VH and VL gene usage, HCDR3 length and identity (at least 85% for HCDR3). Among the clusters generated, we chose Igs targeting the antigenic site V of F-RSV located near amino acid 447 between the α3 helix and β3/β4 hairpin of F-RSV in prefusion (Fig 3A). About 70% of the mAbs targeting this site use the same VH and VL germline pair (VH1–18 and VK2–30) [4345]. We identified an Ig family of interest containing potent neutralizers targeting site V with one outlier, the mAb ADI-14576, being less potent and with a 10-fold decrease in binding affinity (Fig 3B). We used Dnaml to generate a VH sequences phylogenic tree and launched AncesTree to analyze and interact with the produced phylogenic tree (Fig 3C). The EC50 (ng/ml) related to the neutralization assay against RSV subtype A is reported in each node (of note, EC50 against subtype B is in the same range for each Ig). Surprisingly, a common mutation 92: G->A (kabat position 31: S ->N) is shared between all the Igs, except for ADI-14576 that does not share this mutation. The alignment of the Ig protein sequences highlights clearly this shared mutation (Fig 3D). A result suggesting that ADI-14576 underwent less affinity maturation and therefore diverges from all the other family members. Interestingly, the 31: S->N mutation is located in the HCDR1 and asparagine residues are often involved in protein binding sites. It is tempting to speculate that the Serine to Asparagine substitution is in part responsible for the higher potency and binding titer of the antibodies.

Fig 3. Clonal family against F-RSV protein antigenic site V.

Fig 3

(A) Shown is the prefusion conformation of F-RSV trimer (PDB ID: 4MMU) [46]. The antigenic sites are colored, site Ø (red), I (blue), II (yellow), III (green), IV (purple) and V (orange). (B) Table showing the different characteristic of a mAbs clonal family isolated from an infant (≥ 6 months) after RSV infection. The Igs neutralization titers are shown as well as their related Germline annotations. ADI-14576 is highlighted because of is lower neutralization value in comparison to the other mAbs of the same clonal family. Phylogenetic analysis of the VH chain of a clonal family F-RSV specific. (C) Phylogenic tree displayed in AncesTree where the user clicked on the mutation shared by all Igs below BP3 node (31: S->N). (D) Protein alignment of the different Ig sequences, the mutation 31: S->N is boxed.

Concluding remarks

To summarize, we developed an intuitive, easy and interactive GUI allowing the visualization and exploration of antibody clonal evolution. Our application is open access and platform independent. AncesTree only needs the file(s) that can be produced either by Dnaml or by IgPhyML, which is one of the most used tool for antibody repertoire analysis. In the latter, AncesTree processed the Change-O file in AIRR format that allows to share RepSeq data in a standardized format. The possibility to visualize tree independently of the used pipeline allows a broad AncesTree usage. AncesTree was successfully tested with phylogenic trees up to 500 unique sequences processed by IgPhyML and more than a thousand sequences processed by Dnaml. Our interface will provide the users a practical tool containing several useful features that will be of high utility for the immunologists’ community and especially those with little or no computational skills.

Acknowledgments

The authors acknowledge present and past members of the Lanzavecchia’s group for comments and feedback on the software.

Data Availability

Software, documentation, source code and examples are available at https://github.com/MathildeFogPerez/ancestree

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1.Miho E, Yermanos A, Weber CR, Berger CT, Reddy ST, Greiff V. Computational Strategies for Dissecting the High-Dimensional Complexity of Adaptive Immune Repertoires. Frontiers in Immunology. 2018;9(224). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Wang C, Liu Y, Cavanagh MM, Le Saux S, Qi Q, Roskin KM, et al. B-cell repertoire responses to varicella-zoster vaccination in human identical twins. Proc Natl Acad Sci U S A. 2015;112(2):500–5. 10.1073/pnas.1415875112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Hoehn KB, Fowler A, Lunter G, Pybus OG. The Diversity and Molecular Evolution of B-Cell Receptors during Infection. Mol Biol Evol. 2016;33(5):1147–57. 10.1093/molbev/msw015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Zhu J, Ofek G, Yang Y, Zhang B, Louder MK, Lu G, et al. Mining the antibodyome for HIV-1–neutralizing antibodies with next-generation sequencing and phylogenetic pairing of heavy/light chains. Proceedings of the National Academy of Sciences. 2013;110(16):6470–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Jackson KJ, Liu Y, Roskin KM, Glanville J, Hoh RA, Seo K, et al. Human responses to influenza vaccination show seroconversion signatures and convergent antibody rearrangements. Cell host & microbe. 2014;16(1):105–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Tonegawa S. Somatic generation of antibody diversity. Nature. 1983;302(5909):575–81. 10.1038/302575a0 [DOI] [PubMed] [Google Scholar]
  • 7.Adler R. Janeway's immunobiology. Choice: Current Reviews for Academic Libraries. 2008;45(10):1793–4. [Google Scholar]
  • 8.Yermanos AD, Dounas AK, Stadler T, Oxenius A, Reddy ST. Tracing Antibody Repertoire Evolution by Systems Phylogeny. Frontiers in Immunology. 2018;9(2149). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Bolotin DA, Poslavsky S, Mitrophanov I, Shugay M, Mamedov IZ, Putintseva EV, et al. MiXCR: software for comprehensive adaptive immunity profiling. Nature methods. 2015;12(5):380–1. 10.1038/nmeth.3364 [DOI] [PubMed] [Google Scholar]
  • 10.Lefranc MP, Giudicelli V, Duroux P, Jabado-Michaloud J, Folch G, Aouinti S, et al. IMGT(R), the international ImMunoGeneTics information system(R) 25 years on. Nucleic Acids Res. 2015;43(Database issue):D413–22. 10.1093/nar/gku1056 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Ye J, Ma N, Madden TL, Ostell JM. IgBLAST: an immunoglobulin variable domain sequence analysis tool. Nucleic Acids Res. 2013;41(Web Server issue):W34–40. 10.1093/nar/gkt382 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Marcou Q, Mora T, Walczak AM. High-throughput immune repertoire analysis with IGoR. Nat Commun. 2018;9(1):561 10.1038/s41467-018-02832-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Gaeta BA, Malming HR, Jackson KJ, Bain ME, Wilson P, Collins AM. iHMMune-align: hidden Markov model-based alignment and identification of germline genes in rearranged immunoglobulin gene sequences. Bioinformatics. 2007;23(13):1580–7. 10.1093/bioinformatics/btm147 [DOI] [PubMed] [Google Scholar]
  • 14.Ralph DK, Matsen FAt. Likelihood-Based Inference of B Cell Clonal Families. PLoS Comput Biol. 2016;12(10):e1005086 10.1371/journal.pcbi.1005086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Schramm CA, Sheng Z, Zhang Z, Mascola JR, Kwong PD, Shapiro L. SONAR: A High-Throughput Pipeline for Inferring Antibody Ontogenies from Longitudinal Sequencing of B Cell Transcripts. Front Immunol. 2016;7:372 10.3389/fimmu.2016.00372 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Briney B, Le K, Zhu J, Burton DR. Clonify: unseeded antibody lineage assignment from next-generation sequencing data. Scientific Reports. 2016;6(1):23901. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Greiff V, Miho E, Menzel U, Reddy ST. Bioinformatic and Statistical Analysis of Adaptive Immune Repertoires. Trends in immunology. 2015;36(11):738–49. 10.1016/j.it.2015.09.006 [DOI] [PubMed] [Google Scholar]
  • 18.Yaari G, Vander Heiden JA, Uduman M, Gadala-Maria D, Gupta N, Stern JN, et al. Models of somatic hypermutation targeting and substitution based on synonymous mutations from high-throughput immunoglobulin sequencing data. Front Immunol. 2013;4:358 10.3389/fimmu.2013.00358 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Stern JN, Yaari G, Vander Heiden JA, Church G, Donahue WF, Hintzen RQ, et al. B cells populating the multiple sclerosis brain mature in the draining cervical lymph nodes. Sci Transl Med. 2014;6(248):248ra107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Barak M, Zuckerman NS, Edelman H, Unger R, Mehr R. IgTree: creating Immunoglobulin variable region gene lineage trees. J Immunol Methods. 2008;338(1–2):67–74. 10.1016/j.jim.2008.06.006 [DOI] [PubMed] [Google Scholar]
  • 21.Andrews SF, Kaur K, Pauli NT, Huang M, Huang Y, Wilson PC. High preexisting serological antibody levels correlate with diversification of the influenza vaccine response. J Virol. 2015;89(6):3308–17. 10.1128/JVI.02871-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Wu X, Zhang Z, Schramm CA, Joyce MG, Kwon YD, Zhou T, et al. Maturation and Diversity of the VRC01-Antibody Lineage over 15 Years of Chronic HIV-1 Infection. Cell. 2015;161(3):470–85. 10.1016/j.cell.2015.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Felsenstein J. PHYLIP—phylogeny inference package (version 3.2). Cladistics. 1989;5:164–6. [Google Scholar]
  • 24.Liao HX, Lynch R, Zhou T, Gao F, Alam SM, Boyd SD, et al. Co-evolution of a broadly neutralizing HIV-1 antibody and founder virus. Nature. 2013;496(7446):469–76. 10.1038/nature12053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Pappas L, Foglierini M, Piccoli L, Kallewaard NL, Turrini F, Silacci C, et al. Rapid development of broadly influenza neutralizing antibodies through redundant mutations. Nature. 2014;516(7531):418–22. 10.1038/nature13764 [DOI] [PubMed] [Google Scholar]
  • 26.Kallewaard NL, Corti D, Collins PJ, Neu U, McAuliffe JM, Benjamin E, et al. Structure and Function Analysis of an Antibody Recognizing All Influenza A Subtypes. Cell. 2016;166(3):596–608. 10.1016/j.cell.2016.05.073 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Tan J, Pieper K, Piccoli L, Abdi A, Perez MF, Geiger R, et al. A LAIR1 insertion generates broadly reactive antibodies against malaria variant antigens. Nature. 2016;529(7584):105–9. 10.1038/nature16450 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Di Niro R, Lee SJ, Vander Heiden JA, Elsner RA, Trivedi N, Bannock JM, et al. Salmonella Infection Drives Promiscuous B Cell Activation Followed by Extrafollicular Affinity Maturation. Immunity. 2015;43(1):120–31. 10.1016/j.immuni.2015.06.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Revell LJ, Chamberlain SA. Rphylip: an R interface for PHYLIP. Methods in Ecology and Evolution. 2014;5(9):976–81. [Google Scholar]
  • 30.Huson DH, Richter DC, Rausch C, Dezulian T, Franz M, Rupp R. Dendroscope: An interactive viewer for large phylogenetic trees. BMC Bioinformatics. 2007;8:460 10.1186/1471-2105-8-460 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wu X, Zhang Z, Schramm CA, Joyce MG, Kwon YD, Zhou T, et al. Maturation and Diversity of the VRC01-Antibody Lineage over 15 Years of Chronic HIV-1 Infection. Cell. 2015;161(3):470–85. 10.1016/j.cell.2015.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Gupta NT, Vander Heiden JA, Uduman M, Gadala-Maria D, Yaari G, Kleinstein SH. Change-O: a toolkit for analyzing large-scale B cell immunoglobulin repertoire sequencing data. Bioinformatics. 2015;31(20):3356–8. 10.1093/bioinformatics/btv359 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Brochet X, Lefranc MP, Giudicelli V. IMGT/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis. Nucleic Acids Res. 2008;36(Web Server issue):W503–8. 10.1093/nar/gkn316 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Abhinandan KR, Martin AC. Analysis and improvements to Kabat and structurally correct numbering of antibody variable domains. Mol Immunol. 2008;45(14):3832–9. 10.1016/j.molimm.2008.05.022 [DOI] [PubMed] [Google Scholar]
  • 35.Yaari G, Uduman M, Kleinstein SH. Quantifying selection in high-throughput Immunoglobulin sequencing data sets. Nucleic Acids Res. 2012;40(17):e134 10.1093/nar/gks457 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kepler TB. Reconstructing a B-cell clonal lineage. I. Statistical inference of unobserved ancestors. F1000Res. 2013;2:103 10.12688/f1000research.2-103.v1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kepler TB, Munshaw S, Wiehe K, Zhang R, Yu JS, Woods CW, et al. Reconstructing a B-Cell Clonal Lineage. II. Mutation, Selection, and Affinity Maturation. Front Immunol. 2014;5:170 10.3389/fimmu.2014.00170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Sievers F, Higgins DG. Clustal omega. Curr Protoc Bioinformatics. 2014;48:3.13.1–6. [DOI] [PubMed] [Google Scholar]
  • 39.Breden F, Luning Prak ET, Peters B, Rubelt F, Schramm CA, Busse CE, et al. Reproducibility and Reuse of Adaptive Immune Receptor Repertoire Data. Frontiers in Immunology. 2017;8(1418). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Collins PL, Fearns R, Graham BS. Respiratory syncytial virus: virology, reverse genetics, and pathogenesis of disease. Curr Top Microbiol Immunol. 2013;372:3–38. 10.1007/978-3-642-38919-1_1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Widmer K, Griffin MR, Zhu Y, Williams JV, Talbot HK. Respiratory syncytial virus- and human metapneumovirus-associated emergency department and hospital burden in adults. Influenza Other Respir Viruses. 2014;8(3):347–52. 10.1111/irv.12234 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Lozano R, Naghavi M, Foreman K, Lim S, Shibuya K, Aboyans V, et al. Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet. 2012;380(9859):2095–128. 10.1016/S0140-6736(12)61728-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Goodwin E, Gilman MSA, Wrapp D, Chen M, Ngwuta JO, Moin SM, et al. Infants Infected with Respiratory Syncytial Virus Generate Potent Neutralizing Antibodies that Lack Somatic Hypermutation. Immunity. 2018;48(2):339-49.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Gilman MS, Castellanos CA, Chen M, Ngwuta JO, Goodwin E, Moin SM, et al. Rapid profiling of RSV antibody repertoires from the memory B cells of naturally infected adult donors. Science immunology. 2016;1(6). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Mousa JJ, Kose N, Matta P, Gilchuk P, Crowe JE Jr. A novel pre-fusion conformation-specific neutralizing epitope on the respiratory syncytial virus fusion protein. Nat Microbiol. 2017;2:16271 10.1038/nmicrobiol.2016.271 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.McLellan JS, Chen M, Joyce MG, Sastry M, Stewart-Jones GB, Yang Y, et al. Structure-based design of a fusion glycoprotein vaccine for respiratory syncytial virus. Science. 2013;342(6158):592–8. 10.1126/science.1243283 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007731.r001

Decision Letter 0

Mihaela Pertea

25 Apr 2020

Dear Mrs Foglierini,

Thank you very much for submitting your manuscript "AncesTree: an interactive immunoglobulin lineage tree visualizer" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Mihaela Pertea

Software Editor

PLOS Computational Biology

Mihaela Pertea

Software Editor

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors present a graphical user interface for visualizing B cell clonal lineage trees. The program takes results from Phylip, a computational phylogenetics package for inferring evolutionary trees, as an input and allows the user to visually explore the constructed trees by providing features such as annotation of somatic hypermutations or IC50 on the tree edges and annotation of tree nodes (i.e. Ig sequences) with IMGT and BASELINe tools.

The interface provides a good user experience in terms of available features but lacks broader support of alternative B-cell phylogenetic inference tools.

The program is being distributed as a .jar-package which makes it easy to install and run on the provided tutorial data.

Major issues:

- The authors built their interface upon Phylip, which is indeed a widely used tool for constructing phylogenetic trees. However, adding support of other AIRR data processing packages (such as Immcantation, including Change-O and IgPhyML that also uses Phylip algorithm) would greatly increase the number of potential users that can profit from this great UI. Indeed, the AncesTree should be able to run on any tree independent of how the tree was constructed. Thus, the importance given to phylip, especially in the abstract, is not understandable.

- An issue that can hinder the use of the program is the need for a user to decompose the repertoire into clonal lineages by themselves prior to Ancestree analysis, which may be done by Change-O or Clonify. Why is the lineage decomposition not provided out of the box by Ancestree?

- The link to the wiki https://bitbucket.org/mathildefog/ancestree/wiki/ is closed: a user from the outside would not have access to it.

- The graphical interface has a lot of minor visual bugs. Each of the bugs is not critical on its own, but in total, they spoil the impression that the appearance of the interface leaves (see attachment).

- The authors did not include a quantification of how large datasets may be to be feasibly analysed with AncesTree?

- Figure 3 is not a data figure (or unrelated to the overall purpose of the paper) and should therefore be merged with Figure 4 or moved to Supplementary.

Minor issues:

- The trees in the tutorial video and the example data differ.

- The manuscript text requires language editing (many typos/grammar mistakes)

- Add reference to change-o

Reviewer #2: The authors of "AncesTree: an interactive immunoglobulin lineage tree visualizer" describe a software tool aimed at facilitating the analysis of B-cell receptor (BCR) evolution and exploration of BCR lineage trees. I find this tool to be of high utility for the community, especially immunologists with little or no computational skills. While I find this tool quite versatile, it seems there is an issue with supported input formats.

1) There is a standard coming from AIRR community [https://docs.airr-community.org/en/latest/] that provides a comprehensive description of T- and B-cell receptor repertoire sequencing data. In order to be useful for a wider community I suggest incorporating support for AIRR format in the software.

2) In line with previous point, tools like MIXCR and IgBlast are widely used to analyze large amounts of B-cell receptor sequencing data, while IMGT web server is limited to a certain number of input reads and is unfeasible for large sequencing datasets. Authors should consider implementing a mean to import results produced by MIXCR and IgBlast.

3) There is no comparison with Immcantation suite [https://immcantation.readthedocs.io/en/stable/], nor it is mentioned in discussion. This suite is widely used for B-cell lineage analysis so authors should comment on AncesTree features that are not present in this software.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions, please see http://journals.plos.org/compbiol/s/submission-guidelines#loc-materials-and-methods

Attachment

Submitted filename: AncesTree_review_visual_bug.pdf

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007731.r003

Decision Letter 1

Mihaela Pertea

14 Jun 2020

Dear Mrs Foglierini,

We are pleased to inform you that your manuscript 'AncesTree: an interactive immunoglobulin lineage tree visualizer' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Mihaela Pertea

Software Editor

PLOS Computational Biology

Mihaela Pertea

Software Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have addressed all comments – if not already done, it would be great if they could add to the manuscript that there is no theoretical upper limit of sequence input.

Reviewer #2: After careful consideration, I find my request regarding MIXCR to be out of scope of present study: it is up the developer to implement an AIRR-compliable format, implementing IgBlast support is enough to resolve the issue I've raised. I have no further comments.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007731.r004

Acceptance letter

Mihaela Pertea

1 Jul 2020

PCOMPBIOL-D-20-00234R1

AncesTree: An interactive immunoglobulin lineage tree visualizer

Dear Dr Foglierini,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Laura Mallard

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: AncesTree_review_visual_bug.pdf

    Attachment

    Submitted filename: Rebuttal_letter_Foglierini_et_al.docx

    Data Availability Statement

    Software, documentation, source code and examples are available at https://github.com/MathildeFogPerez/ancestree


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES