TABLE 1.
Virus group and vertebrate species | Initial genomic search using TBLASTN |
Best sequence homology identified using BLASTX |
Predicted nucleotide drift (%) | Integration label | Age (million yr) or timing of integration based on sequence aging | |||||
---|---|---|---|---|---|---|---|---|---|---|
Chromosomal or scaffold location | Protein | BLAST E value/% sequence identity | Most similar virusa | Protein | Coordinates | No. of stop codons/frameshifts | ||||
Circoviruses | ||||||||||
Cat | Scaffold_62068 | Rep | 6E−05/37 | Canary circovirus | Rep | 4-283 | 3/7 in 268 aab | 14.2 | fcECLG-1 | 82 |
Scaffold_24038 | Rep | 6E−06/51 | Columbid circovirus | Rep | 44-317 | 4/5 in 231 aac | 15.2 | fcECLG-2 | 87 | |
Dog | Chr5d | Rep | 7E−16/46 | Raven circovirus | Rep | 16-263 | 6/5 in 250 aa | 17.6 | cfECLG-1 | 98 |
Chr22 | Rep | 1E−14/43 | Beak and feather disease virus | Rep | 7-264 | 2/1 in 261 aac | 4.5 | cfECLG-2 | 54 | |
Opossum | Chr3 | Rep | 4E−46/44 | Finch circovirus | Rep | 2-291 | 0/2 in 282 aa | 2.3 | mdECLG | 12 |
Cap | 6-36 | 0/0 in 30 aa | ||||||||
Dependoviruses | ||||||||||
Dog | ChrX | Rep | 6E−05/55 | AAV5 | Rep | 239-445 | 3/4 in 200 aa | 14.0 | cfEDLG-1 | 78 |
Dolphin | GeneScaffold1475 | Rep | 8E−39/39 | Avian AAV DA1 | Rep | 79-486 | 3/4 in 379 aac | 6.6 | ttEDLG-2 | 55 |
Cap | 4E−61/47 | Cap | 1-738 | 4/7 in 678 aac | ||||||
Elephant | Scaffold_4 | Rep | 0/55 | AAV5 | Rep | 3-589 | 0/0 in 579 aa | 0.0 | laEDLG | Recent |
Hyrax | GeneScaffold5020 | Cap | 3E−34/53 | AAV3 | Cap | 485-735 | 0/5 in 256 aa | 7.0 | pcEDLG-1 | 29 |
Scaffold_19252 | Rep | 9E−72/47 | Bovine AAV | Rep | 2-348 | 8/4 in 348 aa | 14.3 | pcEDLG-2 | 60 | |
Megabat | Scaffold_5601 | Rep | 2E−13/31 | AAV2 | Rep | 315-479 | 1/5 in 175 aa | 13.1 | pvEDLG-3 | 76 |
Microbat | GeneScaffold2026 | Rep | 1E−117/50 | AAV2 | Rep | 1-617 | 2/5 in 612 aa | 5.8 | mlEDLG-1 | 27 |
Cap | 9E−33/51 | Cap | 1-731 | 2/9 in 509 aac | ||||||
Scaffold_146492 | Cap | 6E−32/42 | AAV2 | Cap | 479-732 | 0/3 in 252 aa | 4.2 | mlEDLG-2 | 19 | |
Mouse | Chr1 | Rep | 2E−06/34 | AAV2 | Rep | 4-206 | 3/5 in 191 aa | 17.1 | mmEDLG-1 | 39 |
Chr3 | Rep | 2E−24/31 | AAV5 | Rep | 71-478 | 12/7 in 389 aa | 16.5 | mmEDLG-2 | 37 | |
Cap | 2E−22/45 | Cap | 22-724 | 12/10 in 649aac | ||||||
Chr8 | Rep | 1E−08/46 | AAV2 | Rep | 314-473 | 3/3 in 147 aa | 13.8 | mmEDLG-3 | 31 | |
Cap | 1-137 | 1/2 in 114 aa | ||||||||
Panda | Scaffold2359 | Rep | 2E−06/37 | Bovine AAV | Rep | 238-426 | 2/3 in 186 aa | 10.4 | amEDLG-1 | 59 |
Pika | Scaffold_9941 | Rep | 4E−14/28 | AAV5 | Rep | 126-415 | 2/2 in 282 aa | 5.4 | opEDLG | 14 |
Platypus | Chr2 | Rep | 9E−10/35 | Bovine AAV | Rep | 297-437 | 4/3 in 138 aa | 17.1 | oaEDLG-1 | 79 |
Cap | 272-419 | 1/2 in 150 aac | ||||||||
Contig12430 | Rep | 2E−09/47 | Bovine AAV | Rep | 353-450 | 3/1 in 123 aa | 12.0 | oaEDLG-2 | 55 | |
Cap | 2E−05/32 | Cap | 253-367 | 2/1 in 116 aa | ||||||
Rabbit | Chr10 | Rep | 3E−97/39 | AAV2 | Rep | 1-619 | 3/9 in 613 aa | 9.3 | ocEDLG | 43 |
Cap | 5E−50/45 | Cap | 1-723 | 10/9 in 675 aa | ||||||
Rat | Chr13 | Rep | 2E−09/33 | AAV2 | Rep | 4-175 | 2/4 in 177 aa | 13.3 | rnEDLG-1 | 28 |
Chr2 | Rep | 4E−18/40 | AAV5 | Rep | 1-461 | 12/12 in 454 aa | 22.7 | rnEDLG-2 | 51 | |
Chr19 | Rep | 2E−07/33 | AAV5 | Rep | 329-464 | 2/4 in 136 aa | 16.1 | rnEDLG-3 | 35 | |
Cap | 31-133 | 2/1 in 93 aa | ||||||||
Tarsier | Scaffold_178326 | Rep | 4E−14/23 | AAV5 | Rep | 96-465 | 2/3 in 356 aa | 5.3 | tsEDLG | 23 |
Parvoviruses | ||||||||||
Guinea pig | Scaffold_188 | Rep | 3E−24/46 | Porcine parvovirus | Rep | 313-567 | 5/3 in 250 aa | 12.3 | cpEPLG-1 | 40 |
Cap | 1E−16/36 | Cap | 10-689 | 11/12 in 672 aa | ||||||
Scaffold_27 | Rep | 1E−50/39 | Canine parvovirus | Rep | 11-640 | 1/4 in 616 aa | 5.3 | cpEPLG-2 | 17 | |
Cap | 1E−38/39 | Porcine parvovirus | Cap | 3-719 | 2/14 in 700 aa | |||||
Tenrec | Scaffold_260946 | Rep | 2E−20/38 | LuIII virus | Rep | 406-598 | 4/4 in 190 aa | 19.0 | etEPLG-2 | 60 |
Cap | 11-639 | 16/15 in 595 aa | ||||||||
Rat | Chr5 | Rep | 6E−10/56 | Canine parvovirus | Rep | 1-282 | 0/0 in 312 aa | 0.6 | rnEPLG | Recent |
Cap | 0/62 | Cap | 637-667 | 0/2 in 760 aa | ||||||
Rep | 0/63 | 1-751 | ||||||||
Opossum | Chr3 | Rep | 2E−39/33 | LuIII virus | Rep | 7-570 | 11/3 in 502 aa | 10.9 | mdEPLG-2 | 56 |
Cap | 7E−8/33 | Cap | 11-729 | 14/7 in 704 aa | ||||||
Chr6 | Rep | 6E−58/44 | Porcine parvovirus | Rep | 16-563 | 3/7 in 534 aac | 4.6 | mdEPLG-3 | 24 | |
Cap | 6E−60/38 | Cap | 10-715 | 2/5 in 707 aac | ||||||
Wallaby | Scaffold_108040 | Rep | 4E−74/62 | Canine parvovirus | Rep | 341-645 | 0/0 in 287 aa | 1.3 | meEPLG-3 | 7 |
Cap | 8E−37/32 | Cap | 35-738 | 0/4 in 687 aa | ||||||
Scaffold_72496 | Rep | 2E−61/42 | Porcine parvovirus | Rep | 23-567 | 4/3 in 531 aa | 5.7 | meEPLG-6 | 30 | |
Cap | 2E−31/38 | Cap | 10-532 | 6/4 in 514 aa | ||||||
Scaffold_88340 | Rep | 7E−37/55 | Mouse parvovirus 1 | Rep | 344-566 | 0/3 in 223 aa | 6.7 | meEPLG-16 | 36 | |
Cap | 7E−22/33 | Cap | 11-713 | 6/9 in 700 aa |
Some ambiguity in choosing the most similar virus is possible. We generally used the alignment with the lowest E value in the BLAST results. However, one or two points in the exponent of an E value were sometimes sacrificed to achieve a longer sequence alignment.
aa, amino acids.
These sequences have long insertions compared to the present-day viruses. In all cases tested, these insertions originated from short interspersed elements (SINEs). These insertions were excluded from the counts of stop codons and frameshifts and the estimation of integration age.
Chr, chromosome.