Skip to main content
[Preprint]. 2024 Oct 24:2024.10.03.616531. [Version 2] doi: 10.1101/2024.10.03.616531

Table 2:

Virus sequences identified in Drosophila museum collection specimens.

Date Collected Location Sample ID Virus GenBank Sequence Accession Nearest GenBank1 % Query Coverage2,3 % Identity4,5 Completeness Average Coverage6 Estimated Evolutionary Rate7
Known
1908 Illinois lllinois_1908 Drosophila melanogaster sigmavirus - MH384277 73 99.4 Partial sequence 2 -
1908 Illinois lllinois_1908 Tobacco mosaic virus OR820564 KY810785 100 99.7 Coding complete sequence 850 7.13E-05
1908 Illinois lllinois_1908 Vera virus OR820566, OR820565 MT742171, MT742172 100 99.9–100 Coding complete sequence 9–34 1.09E-05 – 0.00E
1908 Illinois lllinois_1908 Galbut virus OR820562, OR820561, OR820563 OR729093, MT742165, OR729068 100 98.9–99.3 Coding complete sequence 7–14 6.43E-05 – 1.00E-04
1908 Illinois lllinois_1908 Chaq-like virus OR820560 MT742173 100 99.9 Coding complete sequence 18 6.36E-06
1915 Illinois lllinois_1915_1 Galbut virus - MT742164, MT742161, MT742162 37 96.1 Partial sequence 4 -
1915 Illinois lllinois_1915_3 Galbut virus - MT742164, MT742161, MT742162 39 96.5 Partial sequence I -
1919 Minnesota Minnesota_1919_3 Dansoman virus OR820572, OR820573 MH384295, MH384270 100 97.4 – 98.3 Coding complete sequence 40–46 2.93E-04 – 3.93E-04
1919 Minnesota Minnesota_1919_3 Chaq virus OR820571 MH384367 100 96.9 Coding complete sequence 317 4.11E-04
1919 Minnesota Minnesota_919_3 Galbut virus OR820575, OR820574, OR820576 MT742164, MT742165, OR729068 100 99.4 – 99.6 Coding complete sequence 247–523 4.55E-05 – 9.31E-05
1927 New York NewYork_1927 Drosophila melanogaster sigmavirus - NC_038281 26 99.2 Partial sequence 2 -
1930 Illinois lllinois_1930 Chaq virus OR820567 MH384311 100 97.1 Coding complete sequence 32 4.88E-04
1930 Illinois lllinois_1930 Galbut virus OR820569, OR820568, OR820570 OR729094, MT742165, OR729070 100 89.3 – 99.7 Coding complete sequence 29–50 3.64E-05 – 1.19E-03
1942 New Jersey NewJersey_1942 Craigies Hill virus OR820579, OR820578 MH384377, MH384349 100 90.4 – 98.1 Coding complete sequence 22–40 2.95E-04 – 1.46E-03
1942 New Jersey NewJersey_1942 Galbut virus - MT742164, MT742161, MT742162 18 97.1 Partial sequence 17 -
1942 New Jersey NewJersey_1942 Chaq-like virus OR820577 MT742173 100 99.9 Coding complete sequence 490 9.21E-06
1942 New Jersey NewJersey_1942 Vera virus OR820582, OR820581 MT742171, MT742172 100 99.9–100 Coding complete sequence 463–686 1.58E-05 – 0.00E
1942 New Jersey NewJersey_1942 Drosophila melanogaster sigmavirus OR820580 MH384306 100 99.2 Coding complete sequence 46 1.88E-04
1953 Hawai’i Hawaii_1953_1 Galbut virus - MT742164, MT742161, MT742162 65 92.4 Partial sequence 3 -
1953 Hawai’i Hawaii_1953_2 Galbut virus - MT742164, MT742161, MT742162 63 99.7 Partial sequence 3 -
1963 Pennsylvania Pennsylvania_1963_1 Vera virus OR820593, OR820592 MT742168, MT742172 100 99.6–100 Coding complete sequence 2512–2727 6.55E-05 – 0.00E
1963 Pennsylvania Pennsylvania_1963_1 Chaq-like virus OR820591 MT742173 100 99.8 Coding complete sequence 5323 3.64E-05
1963 Pennsylvania Pennsylvania_1963_2 Galbut virus OR820588, OR820587, OR820589 MH384303, MH384304, MH384276 100 96.4 – 99.2 Coding complete sequence 8–81 1.73E-04 – 7.98E-04
1963 Pennsylvania Pennsylvania_1963_2 Nora virus OR820590 JX220408 100 97.1 Coding complete sequence 16 5.98E-04
2000 California California_2000_2 La Jolla virus OR820598 MH384285 97 95.35 Partial sequence 11 5.81E-03
2000 California California_2000_2 Chaq virus OR820594 MT742163 100 82.8 Coding complete sequence 62 1.01 E-02
2000 California Ca I ifornia_2000_2 Galbut virus OR820596, OR820595, OR820597 MH384283, MH384336, MH384366 100 83.5 – 92.4 Coding complete sequence 150–271 6.00E-04 – 2.07E-02
2000 California California_2000_1 Galbut virus - MT742164, MT742161, MT742162 37 95 Partial sequence 2 -
2006 North Carolina NorthCarolina_2006_1 Drosophila C virus OR820583 OK188767 100 98.3 Coding complete sequence 264 1.86E-03
2006 North Carolina NorthCarolina_2006_2 Bloomfield virus - MF416371, KP714091, KP714093, KP714094 50 99.6–100 Partial sequence 0.4–2 -
2011 California California_2011_2 Galbut virus OR820600, OR820599, OR820601 MH384283, MH384336, MH384366 100 83.8 – 99.4 Coding complete sequence 16–18 2.00E-03 – 2.61E-02
Novel
2003 Pennsylvania Pennsylvania_2003_1 Drosophila-associated sobemo-like virus OR820603, OR820602 UYL94340.1, QHA33877.1 100 38.1 – 70.6 Coding complete sequence 84–136 -
2010 Ontario, CAN Canada_2010_1 Puslinch virus OR820586, OR820585, QR820584 YP_009362026.1, YP_010840683.1, YP_009362024.1 100 32.8 – 39.1 Coding complete sequence 81–188 -
1

Nearest GenBank sequence is provided for RdRp of Drosophila-associated sobemo-like virus and the L, M and S segment of Puslinch virus.

2

%Query coverage from BLASTN alignment

3

% Query coverage from BLASTN alignment for novel virus sequences was determined based on nearest GenBank sequence.

4

nt, percentage nucleotide identy to closest GenBank sequence identified via BLASTn.

5

For novel virus sequences, % nt identity to the reference is shown.

6

Average mapped read coverage across contigs as reported in Geneious.

7

Calculated using the GenBank sequence with date collected data.