Skip to main content
. 2010 Sep 22;84(23):12458–12462. doi: 10.1128/JVI.01789-10

TABLE 1.

Selected endogenous sequences in vertebrate genomes related to single-stranded DNA viruses

Virus group and vertebrate species Initial genomic search using TBLASTN
Best sequence homology identified using BLASTX
Predicted nucleotide drift (%) Integration label Age (million yr) or timing of integration based on sequence aging
Chromosomal or scaffold location Protein BLAST E value/% sequence identity Most similar virusa Protein Coordinates No. of stop codons/frameshifts
Circoviruses
    Cat Scaffold_62068 Rep 6E−05/37 Canary circovirus Rep 4-283 3/7 in 268 aab 14.2 fcECLG-1 82
Scaffold_24038 Rep 6E−06/51 Columbid circovirus Rep 44-317 4/5 in 231 aac 15.2 fcECLG-2 87
    Dog Chr5d Rep 7E−16/46 Raven circovirus Rep 16-263 6/5 in 250 aa 17.6 cfECLG-1 98
Chr22 Rep 1E−14/43 Beak and feather disease virus Rep 7-264 2/1 in 261 aac 4.5 cfECLG-2 54
    Opossum Chr3 Rep 4E−46/44 Finch circovirus Rep 2-291 0/2 in 282 aa 2.3 mdECLG 12
Cap 6-36 0/0 in 30 aa
Dependoviruses
    Dog ChrX Rep 6E−05/55 AAV5 Rep 239-445 3/4 in 200 aa 14.0 cfEDLG-1 78
    Dolphin GeneScaffold1475 Rep 8E−39/39 Avian AAV DA1 Rep 79-486 3/4 in 379 aac 6.6 ttEDLG-2 55
Cap 4E−61/47 Cap 1-738 4/7 in 678 aac
    Elephant Scaffold_4 Rep 0/55 AAV5 Rep 3-589 0/0 in 579 aa 0.0 laEDLG Recent
    Hyrax GeneScaffold5020 Cap 3E−34/53 AAV3 Cap 485-735 0/5 in 256 aa 7.0 pcEDLG-1 29
Scaffold_19252 Rep 9E−72/47 Bovine AAV Rep 2-348 8/4 in 348 aa 14.3 pcEDLG-2 60
    Megabat Scaffold_5601 Rep 2E−13/31 AAV2 Rep 315-479 1/5 in 175 aa 13.1 pvEDLG-3 76
    Microbat GeneScaffold2026 Rep 1E−117/50 AAV2 Rep 1-617 2/5 in 612 aa 5.8 mlEDLG-1 27
Cap 9E−33/51 Cap 1-731 2/9 in 509 aac
Scaffold_146492 Cap 6E−32/42 AAV2 Cap 479-732 0/3 in 252 aa 4.2 mlEDLG-2 19
    Mouse Chr1 Rep 2E−06/34 AAV2 Rep 4-206 3/5 in 191 aa 17.1 mmEDLG-1 39
Chr3 Rep 2E−24/31 AAV5 Rep 71-478 12/7 in 389 aa 16.5 mmEDLG-2 37
Cap 2E−22/45 Cap 22-724 12/10 in 649aac
Chr8 Rep 1E−08/46 AAV2 Rep 314-473 3/3 in 147 aa 13.8 mmEDLG-3 31
Cap 1-137 1/2 in 114 aa
    Panda Scaffold2359 Rep 2E−06/37 Bovine AAV Rep 238-426 2/3 in 186 aa 10.4 amEDLG-1 59
    Pika Scaffold_9941 Rep 4E−14/28 AAV5 Rep 126-415 2/2 in 282 aa 5.4 opEDLG 14
    Platypus Chr2 Rep 9E−10/35 Bovine AAV Rep 297-437 4/3 in 138 aa 17.1 oaEDLG-1 79
Cap 272-419 1/2 in 150 aac
Contig12430 Rep 2E−09/47 Bovine AAV Rep 353-450 3/1 in 123 aa 12.0 oaEDLG-2 55
Cap 2E−05/32 Cap 253-367 2/1 in 116 aa
    Rabbit Chr10 Rep 3E−97/39 AAV2 Rep 1-619 3/9 in 613 aa 9.3 ocEDLG 43
Cap 5E−50/45 Cap 1-723 10/9 in 675 aa
    Rat Chr13 Rep 2E−09/33 AAV2 Rep 4-175 2/4 in 177 aa 13.3 rnEDLG-1 28
Chr2 Rep 4E−18/40 AAV5 Rep 1-461 12/12 in 454 aa 22.7 rnEDLG-2 51
Chr19 Rep 2E−07/33 AAV5 Rep 329-464 2/4 in 136 aa 16.1 rnEDLG-3 35
Cap 31-133 2/1 in 93 aa
    Tarsier Scaffold_178326 Rep 4E−14/23 AAV5 Rep 96-465 2/3 in 356 aa 5.3 tsEDLG 23
Parvoviruses
    Guinea pig Scaffold_188 Rep 3E−24/46 Porcine parvovirus Rep 313-567 5/3 in 250 aa 12.3 cpEPLG-1 40
Cap 1E−16/36 Cap 10-689 11/12 in 672 aa
Scaffold_27 Rep 1E−50/39 Canine parvovirus Rep 11-640 1/4 in 616 aa 5.3 cpEPLG-2 17
Cap 1E−38/39 Porcine parvovirus Cap 3-719 2/14 in 700 aa
    Tenrec Scaffold_260946 Rep 2E−20/38 LuIII virus Rep 406-598 4/4 in 190 aa 19.0 etEPLG-2 60
Cap 11-639 16/15 in 595 aa
    Rat Chr5 Rep 6E−10/56 Canine parvovirus Rep 1-282 0/0 in 312 aa 0.6 rnEPLG Recent
Cap 0/62 Cap 637-667 0/2 in 760 aa
Rep 0/63 1-751
    Opossum Chr3 Rep 2E−39/33 LuIII virus Rep 7-570 11/3 in 502 aa 10.9 mdEPLG-2 56
Cap 7E−8/33 Cap 11-729 14/7 in 704 aa
Chr6 Rep 6E−58/44 Porcine parvovirus Rep 16-563 3/7 in 534 aac 4.6 mdEPLG-3 24
Cap 6E−60/38 Cap 10-715 2/5 in 707 aac
    Wallaby Scaffold_108040 Rep 4E−74/62 Canine parvovirus Rep 341-645 0/0 in 287 aa 1.3 meEPLG-3 7
Cap 8E−37/32 Cap 35-738 0/4 in 687 aa
Scaffold_72496 Rep 2E−61/42 Porcine parvovirus Rep 23-567 4/3 in 531 aa 5.7 meEPLG-6 30
Cap 2E−31/38 Cap 10-532 6/4 in 514 aa
Scaffold_88340 Rep 7E−37/55 Mouse parvovirus 1 Rep 344-566 0/3 in 223 aa 6.7 meEPLG-16 36
Cap 7E−22/33 Cap 11-713 6/9 in 700 aa
a

Some ambiguity in choosing the most similar virus is possible. We generally used the alignment with the lowest E value in the BLAST results. However, one or two points in the exponent of an E value were sometimes sacrificed to achieve a longer sequence alignment.

b

aa, amino acids.

c

These sequences have long insertions compared to the present-day viruses. In all cases tested, these insertions originated from short interspersed elements (SINEs). These insertions were excluded from the counts of stop codons and frameshifts and the estimation of integration age.

d

Chr, chromosome.