Abstract
Identifying resistance to antiretroviral drugs is crucial for ensuring the successful treatment of patients infected with viruses such as human immunodeficiency virus (HIV) or hepatitis C virus (HCV). In contrast to Sanger sequencing, next-generation sequencing (NGS) can detect resistance mutations in minority populations. Thus, genotypic resistance testing based on NGS data can offer novel, treatment-relevant insights. Since existing web services for analyzing resistance in NGS samples are subject to long processing times and follow strictly rules-based approaches, we developed geno2pheno[ngs-freq], a web service for rapidly identifying drug resistance in HIV-1 and HCV samples. By relying on frequency files that provide the read counts of nucleotides or codons along a viral genome, the time-intensive step of processing raw NGS data is eliminated. Once a frequency file has been uploaded, consensus sequences are generated for a set of user-defined prevalence cutoffs, such that the constructed sequences contain only those nucleotides whose codon prevalence exceeds a given cutoff. After locally aligning the sequences to a set of references, resistance is predicted using the well-established approaches of geno2pheno[resistance] and geno2pheno[hcv]. geno2pheno[ngs-freq] can assist clinical decision making by enabling users to explore resistance in viral populations with different abundances and is freely available at http://ngs.geno2pheno.org.
INTRODUCTION
Drug resistance mutations can emerge rapidly in patients infected with pathogens such as human immunodeficiency virus type 1 (HIV-1) or hepatitis C virus (HCV). Since viral resistance can severely impact the success of antiretroviral therapy, genotypic resistance testing is performed when treatment is initiated or in case of treatment failure. Genotypic resistance tests consist of two steps: sequencing the relevant segments of the viral genome followed by the interpretation of drug resistance based on the amino-acid sequence (1). There exist two approaches for interpreting drug resistance: rules-based interpretation systems and algorithm-driven interpretation systems. While rules-based interpretation systems rely on the knowledge of expert panels, algorithm-driven systems are based on statistical models that are trained on clinical or virological data using machine learning algorithms. The spectrum of expert opinions has given rise to several sets of rules, for example, the rule sets from ANRS, HIVdb (2), HIV-GRADE and the Rega institute, all of which are available via the HIV-GRADE website (3). Similarly, algorithm-driven approaches differ among each other with respect to the applied machine learning algorithms and the data sets that are used for training the models. For example, geno2pheno[resistance] (4–7) uses support vector regression and classification, while the more recent SHIVA software (8) employs random forests.
Despite their differences, all existing genotypic resistance interpretation systems share one commonality: They interpret data from Sanger sequencing, a technology that has dominated the field due to its cost effectiveness and low rate of errors. However, with a detection limit of 10%-20% (9,10), Sanger sequencing is unable to identify resistance mutations in minority populations. Next-generation sequencing (NGS), on the other hand, allows for the identification of variants even at low abundances (11,12). Due to the potential clinical relevance of minority resistance mutations (13,14) and the decreasing costs of NGS, the implementation of NGS for viral resistance analysis has increased considerably in recent years. Still, few web services for interpreting NGS data with respect to drug resistance are available. To our best knowledge, the only existing web services for this purpose are PASeq and HyDRA (15). These tools require the raw sequencing data resulting from subjecting an HIV-1 sample to NGS (e.g. as a FASTQ, gzipped FASTQ, or SFF file). After a sample has been uploaded, a processing pipeline performs the following tasks: (i) reads are trimmed in order to remove low-quality positions; (ii) reads are mapped to a reference sequence; (iii) the abundance of mutations is quantified and (iv) resistance is inferred. In contrast to web services for interpreting Sanger sequences, which provide results immediately, PASeq and HyDRA perform more time-intensive computations and notify users via email when the results become available. Both PASeq and HyDRA support only rules-based interpretations and use Stanford's popular HIVdb by default, although HyDRA also allows for the consideration of user-defined sets of rules.
In this work, we present geno2pheno[ngs-freq], a web service for identifying resistance in NGS samples of HIV-1 and HCV that is based on the well-established methods of geno2pheno[resistance] (4–7) and geno2pheno[hcv] (16). geno2pheno[ngs-freq] does not require the input of raw sequencing data and instead relies on frequency files that tabulate either the counts of observed nucleotides or codons along a viral genome. In contrast to raw NGS data, whose sheer size may prevent some labs from performing resistance analyses over the internet, frequency files are quite small. Since geno2pheno[ngs-freq] does not need to map thousands of reads to a reference sequence, batches of frequency files can be analyzed quickly and results can be inspected immediately.
MATERIALS AND METHODS
In the following sections, we introduce the frequency file format, illustrate the mechanisms behind geno2pheno[ngs-freq], and outline how we validated the web service.
Format of input files
Frequency files are CSV files containing either the counts of observed codons (Supplementary File S1) or nucleotides (Figure 1A and Supplementary File S2) along a viral genome. These files can be generated via custom or available NGS processing pipelines such as VirVarSeq (17) or MinVar (18). In the following, we consider a frequency file as a matrix whose number of rows is determined by the number of genomic positions and whose number of columns is defined either by the number of nucleotides or triplets. Let be the nucleotide alphabet and let be the triplet alphabet. Nucleotide frequency files contain entries that denote the number of reads supporting the nucleotide at position , while codon frequency files are defined by entries where relates to triplets instead.
Workflow
Once a user has supplied a set of prevalence cutoffs, a frequency file, and an optional sample identifier, geno2pheno[ngs-freq] performs the following steps: (i) generation of a consensus sequence for every prevalence cutoff; (ii) inference of the viral species and (iii) identification of viral resistance for every consensus sequence. Once all computations have completed, users can contrast the estimated impact of identified variants at low abundances with those at greater abundances by selecting two cutoffs for displaying the results, the personal and the reference cutoff. By default, the personal cutoff is set to 2%. The default setting should ensure tolerance with regard to sequencing errors for all NGS platforms suitable for viral samples (19,20), and, at the same time, allow for the consideration of clinically relevant minority variants (21). The default reference cutoff for samples from HIV-1 is set to 10% (22) and to 15% for samples from HCV (23) such that results based on the reference cutoff should agree well with those that would be obtained via Sanger sequencing.
Generation of consensus sequences
For each prevalence cutoff provided for a frequency file , the web service generates a consensus sequence in the following manner. Let indicate the depth of coverage at position in the frequency file. If is a codon frequency file, the ratio at which the codon occurs at position is determined by , and, if is a single-nucleotide file, as described in Supplementary Text S3. We construct the consensus sequence by considering only , the set of observations whose prevalence is at least , and setting position of the consensus sequence as . In case that is empty (i.e. no frequencies are greater or equal to the cutoff), we use the greedy criterion, instead. The function translates nucleotides or codons into their corresponding IUPAC representation (24). Given a prevalence cutoff of and the observed prevalence ratios , and (Figure 1B), we would set (Figure 1C).
In order to correctly extract the target amplicon from a frequency file, we truncate the sequence by combining a relative and an absolute cutoff. Let denote the median depth of coverage over all positions with non-zero coverage. We set the coverage cutoff to and use it to define the start of the target region as . If is undefined, no further computations are performed. Otherwise, we define the end of the region as . If is undefined, we set to the value of .
Please note that geno2pheno[ngs-freq] provides warnings for individual positions with , when the 25th percentile of a genomic region is smaller than 100, or when stop codons or frameshift mutations are found in a genomic region. In the following, we use the term default consensus sequence to denote the consensus sequence of a sample that was constructed according to the corresponding default reference cutoff (i.e. for HIV-1 and for HCV samples).
Inference of the viral species
We identify the viral species from which an input sample originates by aligning its default consensus sequence to the genomic segments of the reference sequences for HIV-1 and HCV, HXB2 (25) and H77 (26), respectively. To ensure that we perform resistance analyses only for the supported viral species, we consider only high-quality alignments, i.e. alignments with high similarities between query and reference sequence. If no high-quality alignments are available, it is assumed that the sample derives from a species that is not supported and no further computations are performed. Otherwise, the annotated species of the reference sequence with the greatest alignment score is used. An alignment is considered a high-similarity alignment if it satisfies two similarity criteria, which are defined by dividing the number of matching amino acids in the alignment either by the length of the alignment (alignment similarity) or by the length of the reference sequence (reference similarity). For HIV-1 sequences, a minimal alignment similarity of 60% and a minimal reference similarity of 50% is used for all regions, except for the reverse transcriptase (RT). Since all major drug resistance mutations are located within the first half of the gene, the RT region is frequently merely partially amplified. Thus, we require a reference similarity of only 20% for the RT. Due to the greater phylogenetic divergence of HCV, we require an alignment similarity of 40% and a reference similarity of 20% for all HCV regions.
Identification of viral resistance
Viral resistance of HIV-1 and HCV samples is interpreted using the approaches of geno2pheno[resistance] (4–7) and geno2pheno[hcv] (16), respectively. geno2pheno[resistance] provides two types of approaches. The original g2p[resistance] approach relies on support vector regression models with linear kernel functions. These models were trained on genotype-phenotype pairs consisting of Sanger sequences from HIV-1 and corresponding measurements of drug-specific resistance factors (RF), which quantify the fold change in the half maximal inhibitory concentration of a mutated sample with respect to the wildtype (6,27). The more recently developed approach of g2p[drug-exposure] is based on support vector classification models. These models were trained using clinical data consisting of Sanger sequences and corresponding labels indicating whether a sequence originates from a patient that had received treatment with a specific drug (7). This approach estimates a quantity that is correlated with the degree of drug exposure, the so-called drug-exposure score (DES). Because RFs and DESs vary considerably across drugs, geno2pheno[resistance] standardizes these quantities to z-scores providing the number of standard deviations that a value is above/below the mean of therapy-naïve patients. Finally, each z-score is transformed to one of three interpretable, clinically-motivated levels of resistance (5): susceptible, intermediate, or resistant.
geno2pheno[hcv], on the other hand, relies on a set of drug- and genotype/subtype-specific rules that was chosen by an expert panel through extensive reviewing and weighting of literature related to HCV drug resistance. The level of drug resistance associated with an input sequence is determined by scanning the amino acids of nonstructural protein 3 (NS3), nonstructural protein 5A (NS5A), and non-structural protein 5B (NS5B) for matches to any of the rules and reporting the worst-case resistance level. For example, given a virus with subtype 1b, the mutation 41R would not affect susceptibility to the NS3 inhibitor asunaprevir, however, susceptibility would be considered to be reduced if both 41R and 80R were present. geno2pheno[ngs-freq] uses the following outcomes for classifying the resistance of HCV samples to individual drugs: susceptible, substitution on scored position (substitution at a position for which a rule exists), resistance-associated mutation in closest subtype (for rare subtypes only: existence of a rule in the closest non-rare subtype), reduced susceptibility, resistant, and unlicensed (drug is not approved for the identified subtype).
Please note that both approaches yield warnings when resistance-associated positions are missing from the constructed consensus sequences.
Validation
For validating the web server, we analyzed a total of 3844 frequency files of which 926 files represented samples from HIV-1 (24.1%) and 2918 files represented samples from HCV (75.9%). Resistance interpretations were obtained for 922 of 926 HIV-1 samples (99.6%) and 2898 of 2918 HCV samples (99.3%). For the remaining samples, geno2pheno[ngs-freq] did not provide a result due to low depth of coverage. Since we had re-implemented the approach of geno2pheno[hcv] during the development of geno2pheno[ngs-freq], we investigated the concordance between the predictions of geno2pheno[ngs-freq] and geno2pheno[hcv] using the default consensus sequences constructed from the 2866 successfully analyzed HCV frequency files. We did not perform an analogous validation for the HIV-1 samples because predictions for HIV-1 samples are based on the current version of geno2pheno[resistance].
Technical details
The geno2pheno backend is implemented in C++ and relies on an Oracle database for data storage. We implemented the frontend with Typescript and the React library. The web interface allows for the analysis of batches containing at most 20 files.
RESULTS
Validation
The levels of resistance that were predicted with geno2pheno[ngs-freq] and geno2pheno[hcv] for the default consensus sequences had an agreement of 99.7%. The median runtimes required for analyzing HIV-1 and HCV samples were 6 seconds and 4 seconds, respectively.
Case studies
In this section, we provide two case studies that illustrate how geno2pheno[ngs-freq] can offer insights that may impact clinical decision making. While non-nucleoside reverse transcriptase inhibitor (NNRTI) resistance mutations at low abundances are associated with virological failure (14,28–30), it is still generally unclear how minority resistant variants influence the treatment outcomes of HIV-1 infected persons (31–36). The impact of HCV minority resistant variants is less studied than for HIV-1 but the presence of minority resistant variants has recently been shown to deteriorate the outcomes in subtype 1 patients being treated with NS5A inhibitors (37). Although treatment choices based on minority resistant variants should be taken with particular care as noted previously (38), regimens that could be impaired by resistant minorities can be excluded if suitable alternative treatment options are available.
The case studies can be replicated by visiting http://ngs.geno2pheno.org, ensuring that the default prevalence cutoffs (2%, 10% and 15%) are selected, and loading the frequency files that are provided through Supplementary Files S1 and S2. The HIV-1 case study was performed using the g2p[resistance] model, which predicts phenotypic drug resistance (27), while the HCV case study was performed using the geno2pheno[hcv] rule set (16).
Case study 1: HIV-1 resistance interpretation
This case study (Supplementary File S1) is based on a plasma isolate from an HIV-1 infected patient with a viral load of 102 000 copies/ml. The plot of viral drug resistance (Figure 2) reveals that the major viral population (at the reference cutoff of 10%) seems to be susceptible to nearly all drugs. When considering also minor viral populations (at the personal cutoff of 2%) we find highly increased levels of resistance to ABC, ddI and 3TC, drugs from the class of nucleoside reverse transcriptase inhibitors (NRTIs). Using the resistance table, we can determine that the increased level of resistance is caused by the well-studied resistance mutation M184V, which occurs at a population prevalence of 2.36%. M184V is not only known for enhancing the susceptibility to the NRTIs ZDV, d4T and TDF, but also for delaying the emergence of resistance to these drugs (39). Therefore, a combination therapy consisting of two such NRTIs and one protease inhibitor such as TDF + ZDV + DRV would be a reasonable choice. An alternative treatment with fewer side effects could consist of TDF + FTC + DRV. The idea behind this treatment is that FTC could stabilize M184V such that susceptibility to TDF is ensured (39). Moreover, although M184V is associated with a >100-fold reduction in susceptibility to FTC in vitro (40), FTC exhibits residual activity in the presence of M184V in vivo (41,42). Therefore, even if the minority population characterized by M184V were to become the major viral population over time, FTC would still be residually active.
Case study 2: HCV resistance interpretation
The visualization of resistance for the provided HCV sample (Supplementary File S2) reveals an interesting scenario (Supplementary Figure S4). Although the viral population at the 15% cutoff seems to be susceptible to all direct- acting antiviral agents targeting NS5A, the population at the 2% cutoff seems to be resistant to most NS5A inhibitors due to the presence of the resistance mutation 30R, which was found at a prevalence of 6.1%. Using this information, the treating clinician may decide to avoid the use of the NS5A inhibitors DCV, EBR, LDV, and OBV, for which resistance was reported, and instead use VEL, to which the viral population seems to be susceptible.
DISCUSSION
In contrast to existing web services for interpreting viral resistance for NGS samples, geno2pheno[ngs-freq] uses frequency files instead of raw NGS data, which offers many benefits. First, due to their small size (kilobytes vs megabytes), samples can be uploaded quickly even in settings with limited bandwidth. Second, resistance interpretation does not require the time-intensive step of processing the raw NGS data, allowing for rapid analyses (a few seconds vs several minutes or hours). Third, the use of frequency files offers greater flexibility than basing the analysis on raw sequencing data since interpretation engines based on the latter data apply pre-determined pipelines for the basic processing of the NGS samples. By relying on frequency files, geno2pheno[ngs-freq] does not impose limitations on the manner in which NGS samples are processed.
Of course, using frequency files also entails loss of information. For nucleotide frequency files, amino acid frequencies need to be estimated and spurious amino acids may be generated. Imagine that the codons ATA (Ile) and TTT (Phe) are observed at the same genomic position. In this case, the triplets ATT (Ile) and TTA (Leu) would be considered in addition to the observed nucleotides when constructing the consensus sequence. Thus, the unobserved amino acid Leu would be erroneously taken into account during the resistance interpretation, which may lead to an incorrect estimate of drug resistance. Therefore, we generally recommend the use of codon frequency files because this file format retains information on the abundance of tri-nucleotides. Thus, amino-acid frequencies are represented correctly and the appropriate translation of codons containing multiple ambiguous positions can be determined. Another limitation of frequency files is that they do not allow for quasispecies reconstruction. While quasispecies reconstruction may offer insights in some scenarios, the mediocre precision/recall trade-off of most methods for inferring quasispecies (43) suggests that these approaches are not yet mature enough for routine use. However, more recent approaches seem more promising (44,45). Last, future resistance interpretation systems based on read-based models may provide another incentive for the use of raw sequencing data.
CONCLUSIONS AND FUTURE WORK
We have developed geno2pheno[ngs-freq], a free and publicly accessible web server for the rapid genotypic interpretation of viral drug resistance in NGS samples. geno2pheno[ngs-freq] is the first service that enables the application of geno2pheno[resistance] on NGS samples and, to the best of our knowledge, provides the first interpretation engine for NGS samples from HCV. We developed a new visualization of drug resistance (Figure 2) that enhances the interpretability of both algorithm-driven and rules-based interpretation engines. Due to its reliance on frequency files, geno2pheno[ngs-freq] can be integrated into existing NGS pipelines for interpreting viral resistance. By providing a means of exploring drug resistance for viral populations at multiple prevalence levels, we expect that geno2pheno[ngs-freq] will contribute to making clinical decisions and researching the impact of low-prevalence resistance mutations.
In future versions of geno2pheno[ngs-freq], we plan to improve the interpretability of the service, for example by annotating called variants using data from the literature. We also intend to incorporate further prediction models, particularly for identifying the coreceptor that is used by HIV (46,47) and for predicting the susceptibility of HIV-1 towards integrase strand transfer inhibitors. At a later point in time, support for samples from other viral species such as HBV, for which the emergence of resistance is relevant (48,49), could be added. Last, we are working towards providing an application programming interface that will be made available for the use in research settings.
DATA AVAILABILITY
geno2pheno[ngs-freq] (http://ngs.geno2pheno.org) identifies viral resistance based on the approaches of geno2pheno[resistance] (www.geno2pheno.org) and geno2pheno[hcv] (http://hcv.geno2pheno.org). geno2pheno[ngs-freq] relies on geno2pheno[mutext] (http://align.geno2pheno.org) for performing local pairwise alignments.
Supplementary Material
ACKNOWLEDGEMENTS
We would like to express our gratitude to everyone who was involved in testing geno2pheno[ngs-freq]: Markus List, Christian Brossette, Tomas Bastys, Peter Ebert, Fatemeh Behjati, Florian Schmidt, Anna Hake, Sebastian Keller, Nora Speicher, and Robert Ehret. We are indebted to Eugen Schülter for his work on the alignment software geno2pheno[mutext]. This work would not have been possible without the effort of everyone that worked on geno2pheno over the last two decades. Therefore, we would like to acknowledge the contributions of Tobias Sing, Niko Beerenwinkel, Bastian Beggel, Sven-Eric Schelhorn, and our collaborators in the fields of virology and medicine.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Bundesministerium für Gesundheit [MASTER HIV/Hep]; Deutsches Zentrum für Infektionsforschung [TTU 05.809 HCV Treatment Optimizaton]. Funding for open access charge: Max Planck Gesellschaft.
Conflict of interest statement. None declared.
REFERENCES
- 1. Vercauteren J., Vandamme A.-M.. Algorithms for the interpretation of HIV-1 genotypic drug resistance information. Antiviral Res. 2006; 71:335–342. [DOI] [PubMed] [Google Scholar]
- 2. Tang M.W., Liu T.F., Shafer R.W.. The HIVdb system for HIV-1 genotypic resistance interpretation. Intervirology. 2012; 55:98–101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Obermeier M., Pironti A., Berg T., Braun P., Däumer M., Eberle J., Ehret R., Kaiser R., Kleinkauf N., Korn K. et al. . HIV-GRADE: a publicly available, rules-based drug resistance interpretation algorithm integrating bioinformatic knowledge. Intervirology. 2012; 55:102–107. [DOI] [PubMed] [Google Scholar]
- 4. Beerenwinkel N., Däumer M., Oette M., Korn K., Hoffmann D., Kaiser R., Lengauer T., Selbig J., Walter H.. Geno2pheno: estimating phenotypic drug resistance from HIV-1 genotypes. Nucleic Acids Res. 2003; 31:3850–3855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Pironti A., Walter H., Pfeifer N., Knops E., Lübke N., Büch J., Di Giambenedetto S., Kaiser R., Lengauer T. EuResist Network Study Group . Determination of Phenotypic Resistance Cutoffs From Routine Clinical Data. JAIDS J. Acquir. Immune Defic. Syndr. 2017; 74:e129–e137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Lengauer T., Sing T.. Bioinformatics-assisted anti-HIV therapy. Nat. Rev. Microbiol. 2006; 4:790–797. [DOI] [PubMed] [Google Scholar]
- 7. Pironti A., Pfeifer N., Walter H., Jensen B.-E.O., Zazzi M., Gomes P., Kaiser R., Lengauer T.. Using drug exposure for predicting drug resistance – a data-driven genotypic interpretation tool. PLoS One. 2017; 12:e0174992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Riemenschneider M., Hummel T., Heider D.. SHIVA - a web application for drug resistance and tropism testing in HIV. BMC Bioinformatics. 2016; 17:314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Tsiatis A.C., Norris-Kirby A., Rich R.G., Hafez M.J., Gocke C.D., Eshleman J.R., Murphy K.M.. Comparison of Sanger Sequencing, Pyrosequencing,and Melting Curve Analysis for the Detection ofKRASMutations: Diagnostic and Clinical Implications. J. Mol. Diagn. 2010; 12:425–432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Davidson C.J., Zeringer E., Champion K.J., Gauthier M.-P., Wang F., Boonyaratanakornkit J., Jones J.R., Schreiber E.. Improving the limit of detection for Sanger sequencing: A comparison of methodologies for KRAS variant detection. Biotechniques. 2012; 53:182–188. [DOI] [PubMed] [Google Scholar]
- 11. Lin M.-T., Mosier S.L., Thiess M., Beierl K.F., Debeljak M., Tseng L.-H., Chen G., Yegnasubramanian S., Ho H., Cope L. et al. . Clinical validation of KRAS, BRAF, and EGFR mutation detection using next-generation sequencing. Am. J. Clin. Pathol. 2014; 141:856–866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Fox E.J., Reid-Bayliss K.S., Emond M.J., Loeb L.A. Accuracy of Next Generation Sequencing Platforms. Next Gener. Seq. Appl. 2014; 1:1000106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Vrancken B., Trovão N., Baele G., van Wijngaerden E., Vandamme A.-M., van Laethem K., Lemey P.. Quantifying Next Generation Sequencing Sample Pre-Processing Bias in HIV-1 Complete Genome Sequencing. Viruses. 2016; 8:12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Cozzi-Lepri A., Noguera-Julian M., Di Giallonardo F., Schuurman R., Däumer M., Aitken S., Ceccherini-Silberstein F., Monforte A.D.A., Geretti A.M., Booth C.L. et al. . Low-frequency drug-resistant HIV-1 and risk of virological failure to first-line NNRTI-based ART: a multicohort European case-control study using centralized ultrasensitive 454 pyrosequencing. J. Antimicrob. Chemother. 2015; 70:930–940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Noguera-Julian M., Edgil D., Harrigan P.R., Sandstrom P., Godfrey C., Paredes R.. Next-Generation Human Immunodeficiency Virus Sequencing for Patient Management and Drug Resistance Surveillance. J. Infect. Dis. 2017; 17:19752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Kalaghatgi P., Sikorski A.M., Knops E., Rupp D., Sierra S., Heger E., Neumann-Fraune M., Beggel B., Walker A., Timm J. et al. . Geno2pheno[HCV] – A Web-based Interpretation System to Support Hepatitis C Treatment Decisions in the Era of Direct-Acting Antiviral Agents. PLoS One. 2016; 11:e0155869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Verbist B.M.P., Thys K., Reumers J., Wetzels Y., Van der Borght K., Talloen W., Aerssens J., Clement L., Thas O.. VirVarSeq: a low-frequency virus variant detection pipeline for Illumina sequencing using adaptive base-calling accuracy filtering. Bioinformatics. 2015; 31:94–101. [DOI] [PubMed] [Google Scholar]
- 18. Huber M., Metzner K.J., Geissberger F.D., Shah C., Leemann C., Klimkait T., Böni J., Trkola A., Zagordi O.. MinVar: A rapid and versatile tool for HIV-1 drug resistance genotyping by deep sequencing. J. Virol. Methods. 2017; 240:7–13. [DOI] [PubMed] [Google Scholar]
- 19. Loman N.J., Misra R. V, Dallman T.J., Constantinidou C., Gharbia S.E., Wain J., Pallen M.J.. Performance comparison of benchtop high-throughput sequencing platforms. Nat. Biotechnol. 2012; 30:434–439. [DOI] [PubMed] [Google Scholar]
- 20. Archer J., Weber J., Henry K., Winner D., Gibson R., Lee L., Paxinos E., Arts E.J., Robertson D.L., Mimms L. et al. . Use of Four Next-Generation Sequencing Platforms to Determine HIV-1 Coreceptor Tropism. PLoS One. 2012; 7:e49602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Goodman D., Margot N., McColl D., Miller M., Borroto-Esoda K., Svarovskaia E.. Pre-Existing Low-Levels of the K103N HIV-1 RT Mutation Above a Threshold is Associated with Virological Failure in Treatment-Naïve Patients Undergoing EFV-Containing Antiretroviral Treatment. 18th HIV Drug Resistance Workshop. 2009; Fort Myers. [Google Scholar]
- 22. Thielen A. One year of routine HIV-1 drug resistance testing by deep sequencing: insights from comparative Sanger sequencing. 12th European HIV & Hepatitis Workshop. 2014; Barcelona. [Google Scholar]
- 23. Pawlotsky J.-M. Hepatitis C Virus Resistance to Direct-Acting Antiviral Drugs in Interferon-Free Regimens. Gastroenterology. 2016; 151:70–86. [DOI] [PubMed] [Google Scholar]
- 24. Cornish-Bowden A. Nomenclature for incompletely specified bases in nucleic acid sequences: recommendations 1984. Nucleic Acids Res. 1985; 13:3021–3030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Ratner L., Haseltine W., Patarca R., Livak K.J., Starcich B., Josephs S.F., Doran E.R., Rafalski J.A., Whitehorn E.A., Baumeister K. et al. . Complete nucleotide sequence of the AIDS virus, HTLV-III. Nature. 1985; 313:277–284. [DOI] [PubMed] [Google Scholar]
- 26. Kolykhalov A.A., Agapov E. V, Blight K.J., Mihalik K., Feinstone S.M., Rice C.M.. Transmission of Hepatitis C by Intrahepatic Inoculation with Transcribed RNA. Science. 1997; 277:570–574. [DOI] [PubMed] [Google Scholar]
- 27. Beerenwinkel N., Däumer M., Oette M., Korn K., Hoffmann D., Kaiser R., Lengauer T., Selbig J., Walter H.. Geno2pheno: estimating phenotypic drug resistance from HIV-1 genotypes. Nucleic Acids Res. 2003; 31:3850–3855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Van Laethem K., De Munter P., Schrooten Y., Verbesselt R., Van Ranst M., Van Wijngaerden E., Vandamme A.-M.. No response to first-line tenofovir+lamivudine+efavirenz despite optimization according to baseline resistance testing: Impact of resistant minority variants on efficacy of low genetic barrier drugs. J. Clin. Virol. 2007; 39:43–47. [DOI] [PubMed] [Google Scholar]
- 29. Delobel P., Saliou A., Nicot F., Dubois M., Trancart S., Tangre P., Aboulker J.-P., Taburet A.-M., Molina J.-M., Massip P. et al. . Minor HIV-1 Variants with the K103N Resistance Mutation during Intermittent Efavirenz-Containing Antiretroviral Therapy and Virological Failure. PLoS One. 2011; 6:e21655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Simen B.B., Simons J.F., Hullsiek K.H., Novak R.M., MacArthur R.D., Baxter J.D., Huang C., Lubeski C., Turenchalk G.S., Braverman M.S. et al. . Low-Abundance Drug-Resistant Viral Variants in Chronically HIV-Infected, Antiretroviral Treatment–Naive Patients Significantly Impact Treatment Outcomes. J. Infect. Dis. 2009; 199:693–701. [DOI] [PubMed] [Google Scholar]
- 31. Johnson J.A., Li J.-F., Wei X., Lipscomb J., Irlbeck D., Craig C., Smith A., Bennett D.E., Monsour M., Sandstrom P. et al. . Minority HIV-1 Drug Resistance Mutations Are Present in Antiretroviral Treatment–Naïve Populations and Associate with Reduced Treatment Efficacy. PLoS Med. 2008; 5:e158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Lataillade M., Chiarella J., Yang R., Schnittman S., Wirtz V., Uy J., Seekins D., Krystal M., Mancini M., McGrath D. et al. . Prevalence and Clinical Significance of HIV Drug Resistance Mutations by Ultra-Deep Sequencing in Antiretroviral-Naïve Subjects in the CASTLE Study. PLoS One. 2010; 5:e10952. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Stekler J.D., Ellis G.M., Carlsson J., Eilers B., Holte S., Maenza J., Stevens C.E., Collier A.C., Frenkel L.M.. Prevalence and Impact of Minority Variant Drug Resistance Mutations in Primary HIV-1 Infection. PLoS One. 2011; 6:e28952. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Peuchant O., Thiébaut R., Capdepont S., Lavignolle-Aurillac V., Neau D., Morlat P., Dabis F., Fleury H., Masquelier B. ANRS CO3 Aquitaine Cohort . Transmission of HIV-1 minority-resistant variants and response to first-line antiretroviral therapy. AIDS. 2008; 22:1417–1423. [DOI] [PubMed] [Google Scholar]
- 35. Moscona R., Ram D., Wax M., Bucris E., Levy I., Mendelson E., Mor O.. Comparison between next-generation and Sanger-based sequencing for the detection of transmitted drug-resistance mutations among recently infected HIV-1 patients in Israel, 2000–2014. J. Int. AIDS Soc. 2017; 20:21846. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Callegaro A., Di Filippo E., Astuti N., Ortega P.A.S., Rizzi M., Farina C., Valenti D., Maggiolo F.. Early clinical response and presence of viral resistant minority variants: a proof of concept study. J. Int. AIDS Soc. 2014; 17:19759. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Zeuzem S., Mizokami M., Pianko S., Mangia A., Han K.-H., Martin R., Svarovskaia E., Dvory-Sobol H., Doehle B., Hedskog C. et al. . NS5A resistance-associated substitutions in patients with genotype 1 hepatitis C virus: Prevalence and effect on treatment outcome. J. Hepatol. 2017; 66:910–918. [DOI] [PubMed] [Google Scholar]
- 38. Johnson J.A., Geretti A.M.. Low-frequency HIV-1 drug resistance mutations can be clinically significant but must be interpreted with caution. J. Antimicrob. Chemother. 2010; 65:1322–1326. [DOI] [PubMed] [Google Scholar]
- 39. Gallant J.E. The M184V mutation: what it does, how to prevent it, and what to do with it when it's there. AIDS Read. 2006; 16:556–559. [PubMed] [Google Scholar]
- 40. Schinazi R.F., Lloyd R.M., Nguyen M.H., Cannon D.L., McMillan A., Ilksoy N., Chu C.K., Liotta D.C., Bazmi H.Z., Mellors J.W.. Characterization of Human Immunodeficiency Viruses Resistant to Oxathiolane-Cytosine Nucleosides. Antimicrob. Agents Chemother. 1993; 37:875–881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Campbell T.B., Shulman N.S., Johnson S.C., Zolopa A.R., Young R.K., Bushman L., Fletcher C.V., Lanier E.R., Merigan T.C., Kuritzkes D.R.. Antiviral Activity of Lamivudine in Salvage Therapy for Multidrug-Resistant HIV-1 Infection. Clin. Infect. Dis. 2005; 41:236–242. [DOI] [PubMed] [Google Scholar]
- 42. Castagna A., Danise A., Menzo S., Galli L., Gianotti N., Carini E., Boeri E., Galli A., Cernuschi M., Hasson H. et al. . Lamivudine monotherapy in HIV-1-infected patients harbouring a lamivudine-resistant virus: a randomized pilot study (E-184V study). AIDS. 2006; 20:795–803. [DOI] [PubMed] [Google Scholar]
- 43. Prosperi M.C.F., Yin L., Nolan D.J., Lowe A.D., Goodenow M.M., Salemi M.. Empirical validation of viral quasispecies assembly algorithms: state-of-the-art and challenges. Sci. Rep. 2013; 3:2837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Töpfer A., Marschall T., Bull R.A., Luciani F., Schönhuth A., Beerenwinkel N.. Viral Quasispecies Assembly via Maximal Clique Enumeration. PLoS Comput. Biol. 2014; 10:e1003515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Posada-Cespedes S., Seifert D., Beerenwinkel N.. Recent advances in inferring viral diversity from high-throughput sequencing data. Virus Res. 2017; 239:17–32. [DOI] [PubMed] [Google Scholar]
- 46. Lengauer T., Sander O., Sierra S., Thielen A., Kaiser R.. Bioinformatics prediction of HIV coreceptor usage. Nat. Biotechnol. 2007; 25:1407–1410. [DOI] [PubMed] [Google Scholar]
- 47. Döring M., Borrego P., Büch J., Martins A., Friedrich G., Camacho R.J., Eberle J., Kaiser R., Lengauer T., Taveira N. et al. . A genotypic method for determining HIV-2 coreceptor usage enables epidemiological studies and clinical decision support. Retrovirology. 2016; 13:85. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Shaw T., Bartholomeusz A., Locarnini S.. HBV drug resistance: Mechanisms, detection and interpretation. J. Hepatol. 2006; 44:593–606. [DOI] [PubMed] [Google Scholar]
- 49. Beggel B., Neumann-Fraune M., Döring M., Lawyer G., Kaiser R., Verheyen J., Lengauer T.. Genotyping hepatitis B virus dual infections using population-based sequence data. J. Gen. Virol. 2012; 93:1899–1907. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
geno2pheno[ngs-freq] (http://ngs.geno2pheno.org) identifies viral resistance based on the approaches of geno2pheno[resistance] (www.geno2pheno.org) and geno2pheno[hcv] (http://hcv.geno2pheno.org). geno2pheno[ngs-freq] relies on geno2pheno[mutext] (http://align.geno2pheno.org) for performing local pairwise alignments.