Abstract
Human immunodeficiency virus type 1 (HIV-1) sequences are accumulating in the literature at a rapid pace. For this ever-expanding resource to be maximally useful, researchers must maintain a high level of quality assurance in experimental design, in the conduct of experiments, and in analyses. Here we present detailed analyses of problematic sets of HIV-1 sequences in the database that include sequence anomalies suggestive of mislabeling or sample contamination. These data are examined in the context of currently available HIV-1 sequence information to provide an example of how to identify potentially flawed data. Indicators of potential problems are (i) nearly identical sequences that are reported to derive from unlinked individuals and that are markedly distinct from other sequences from the putative source, or (ii) sequences that are nearly identical to those of laboratory strains. We outline methods that researchers can use to perform preliminary laboratory and computational analyses that could help identify problematic data and thus help ensure the integrity of sequence databases.
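The two indicators named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' method: the function names, the input layout, the uncorrected p-distance measure, and the 1% threshold are all assumptions chosen for illustration, and the sequences in any real use would need to be aligned beforehand. For indicator (i) the sketch checks only near-identity between sequences from different sources; it does not test whether those sequences are also distinct from others from the putative source.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences.

    Positions where either sequence has a gap ('-') are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper())
             if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def flag_suspect_sequences(samples, lab_strains, threshold=0.01):
    """Return a list of (reason, name, other_name) tuples.

    samples:     dict of name -> (source_id, aligned_sequence)
    lab_strains: dict of name -> aligned_sequence
    threshold:   hypothetical cutoff for "nearly identical" (assumption)
    """
    flags = []
    # Indicator (i), first half: near-identical sequences from different sources.
    for (n1, (s1, q1)), (n2, (s2, q2)) in combinations(samples.items(), 2):
        if s1 != s2 and p_distance(q1, q2) <= threshold:
            flags.append(("unlinked_near_identical", n1, n2))
    # Indicator (ii): near-identity to a laboratory strain.
    for name, (_source, seq) in samples.items():
        for lab_name, lab_seq in lab_strains.items():
            if p_distance(seq, lab_seq) <= threshold:
                flags.append(("matches_lab_strain", name, lab_name))
    return flags
```

Any pair flagged this way is only a candidate for follow-up, for example by phylogenetic analysis or by resampling and resequencing; a sensible cutoff would depend on the genomic region and on the diversity expected within and between individuals.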