Table 2.
Genomic coordinates of the glue genes in 20 Drosophila species
Species | Sgs1 | Sgs3 | Sgs4 | Sgs5 Sgs5bis* | Sgs7 | Sgs8 | Eig71Ee |
---|---|---|---|---|---|---|---|
D. melanogaster | CG3047 | CG11720 | CG12181 | CG7596 CG7587* | CG18087 | CG6132 | CG7604 |
D. simulans | GB:CM002910 4,752,550–4,754,973 | Dsim\GD14311 | Dsim\GD16637 | Dsim\GD19170 Dsim\GD19169* | Dsim\GD17634 | Dsim\GD28639 | Dsim\GD12546 |
D. sechellia | Dsec\GM18501 (M) | Dsec\GM25279 (M) | GB:CH480825 2,852,711–2,853,386 (M) | Dsec\GM15245 Dsec\GM15244* | Dsec\GM25278 | Dsec\GM24748 | NW_001999689 7,761,215–7,759,941 |
D. mauritiana | 2L: 4721427–4,722,731 | 3L: 11002313–11,003,109 | X: 2864998–2,865,616 (M) | 3R: 7695225–7,694,660 relictual Sgs5bis 3R: 7696600–7,695,629 | 3L: 10999955–11,000,249 | no | 3L: 15018149–15,017,249 |
D. yakuba | NT_167062 10,588,365–10,585,585 | Dyak\Sgs3 | Dyak\GE28681 | Dyak\GE25481 Dyak\GE25480* | Dyak\GE20214 Dyak\GE21218 | Dyak\Sgs8 | Dyak\GE19823 |
D. santomea | 2L: 10595909–10,588,129 | 3L: 11541799–11,542,678 (M) | X: 5242740–5,241,688 (M) | 3R: 1975190–1,975,883 3R: 1974195–1,974,756* | 3L: 11539572–11,539,861 3L: 11536774–11,536,485 | 3L: 11537383–11,537,681 | 3L: 18202978–18,201,736 |
D. erecta | no | Dere\Sgs3 | Dere\GG27095 | no Sgs5 Dere\GG22329* | Dere\GG13918 | Dere\Sgs8 | Dere\GG13528 |
D. eugracilis | AFPQ02004874 817,906–819,883 | KB465257 3,401,691–3,402,412 3,385,186–3,386,300 | no | KB464468 62,658–63,338 61,657–62,202* | KB465257 3,378,701–3,378,995 | KB465257 3,378,110–3,377,822 | KB464880 383,836–382,228 (XM_017230731) |
D. takahashii | KB461520 248,469–250,276 | KB460792 317,161–317,949 | no | KB461611 188,299–187,637 189,545–188,599* | KB461234 120,246–120,467 | KB461234 119,117–118,896 | XM_017142344 |
D. ficusphila | KB457325 1,315,471–1,313,145 | KB457563 3,180,441–3,179,541 KB457373 332,100–331,262 3,199,436–3,198,351 | no | KB457381 2,059,719–2,058,971 2,061,615–2,060,148* | no | no | KB457515 1,660,700–1,661,809 (XM_017197540) |
D. biarmipes | KB462641 1,521,394–1,523,538 | KB462590 1,536,842–1,537,624 (M) KB462646 54,238–53,374 (M) | no | KB462814 8,082,338–8,083,047 8,081,336–8,081,891* | KB462646 76,095–75,801 | KB462646 77,216–77,501 | KB462754 733,209–734,564 |
D. suzukii | KI419149 6,645,021–6,638,237 | no | no | KI420542 10,372–9639 11,441–10,912* | KI419359 22,757–22,464 KI420769 54,293–54,584 KI420610 25,121–25,412 55,385–55,094 | KI420769 53,260–52,976 | XM_017082231 |
D. elegans | KB458429 2,603,084–2,605,600 | KB458268 2,467,758–2,468,497 KB458387 820,622–819,957 KB458387 18,429–17,499 | no | KB458458 2,864,199–2,863,401 no Sgs5bis | no | no | no |
D. rhopaloa | KB450401 (Nterm) KB452165 (Cterm) | KB450817 117,692–118,515 KB452471 215,593–216,424 KB451944** | no | KB451039 15,186–16,018 no Sgs5bis | no | no | no |
D. kikkawai | no | KB459615 1,331,679–1,331,220 KB459522 291,906–292,542 | no | KB459676 1,112,222–1,111,011 1,113,233–1,112,671* | no | no | KB459876 1,106,397–1,107,027 (Nterm) |
D. ananassae | no | NW_001939300 3,959,435–3,957,637 NW_001939293 5,806,878–5,808,646 | no | NW_001939291 17,741,832–17,741,201 17,742,892–17,742,284* | no | no | GF10382 (Nterm): NW_001939293 11,506,744–11,507,112 |
D. bipectinata | no | KB464001 557,673–558,039 KB464098 1,120,437–1,121,198 | no | KB464382 185,749–186,362 184,743–185,354* | KB464098 1,109,828–1,110,127 | KB464098 1,109,077–1,108,802 | KB464259 2,466,431–2,466,234 (ortholog of GF10382) |
D. pseudoobscura | no | GA23425, GA23426, GA23878 | no | no Sgs5 Dpse\GA20459* | no | no | no |
D. willistoni | no | NW_002032853 3,296,683–3,295,766 NW_002032860 11,643,758–11,641,972 | no | no | NW_002032853 2,792,051–2,792,347 2,793,811–2,794,107 | no | no |
D. virilis | no | NW_002014431 6,839,085–6,838,999 (GJ27025) 6,841,799–6,840,888 (GJ26085) | no | no Sgs5 NW_002014424 14,511,533–14,512,083* (modified from GJ24445) | no | no | no |
An asterisk (*) marks annotations and coordinates of the Sgs5bis gene; “M” indicates that part of the coding sequence was inferred manually by sequencing PCR amplicons of the relevant regions; “no” means that the gene sequence was not found by BLAST searches; Nterm and Cterm denote the N-terminal and C-terminal regions, respectively. A double asterisk (**) indicates that the contig probably contains two paralogs of Sgs3 with uncertain sequences.
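
For readers who want to reuse the coordinates in Table 2 programmatically, the following minimal Python sketch shows one way the cell notation could be parsed. The names `Interval` and `parse_intervals` are hypothetical and are not part of the original work. The sketch assumes that a cell containing only “no” has no interval, that a trailing “*” flags an Sgs5bis interval, and that thousands separators may be present or absent. Treating a start greater than the end as a reverse-strand feature is a further assumption that the table itself does not state.

```python
import re
from typing import NamedTuple


class Interval(NamedTuple):
    start: int         # first coordinate, as printed in the table
    end: int           # second coordinate, as printed in the table
    is_sgs5bis: bool   # True when the interval is followed by "*"

    @property
    def length(self) -> int:
        # Span in base pairs, independent of the printed order.
        return abs(self.end - self.start) + 1

    @property
    def reversed_order(self) -> bool:
        # True when start > end as printed. Reading this as the minus
        # strand is an assumption, not something the table states.
        return self.start > self.end


# Two numbers (with or without thousands separators) joined by an
# en dash or a hyphen, optionally followed by "*".
_INTERVAL = re.compile(r"(\d[\d,]*)\s*[–-]\s*(\d[\d,]*)\s*(\*?)")


def parse_intervals(cell: str) -> list[Interval]:
    """Return every coordinate interval found in one table cell."""
    if cell.strip() == "no":
        return []
    intervals = []
    for first, second, star in _INTERVAL.findall(cell):
        intervals.append(
            Interval(
                int(first.replace(",", "")),
                int(second.replace(",", "")),
                star == "*",
            )
        )
    return intervals


if __name__ == "__main__":
    # Sgs5 / Sgs5bis cell of D. eugracilis.
    for iv in parse_intervals("KB464468 62,658–63,338 61,657–62,202*"):
        print(iv, iv.length, iv.reversed_order)
```

Cells that hold only gene annotations (for example `Dsim\GD14311`) or accession numbers without a range yield an empty list, so scaffold and annotation names would need separate handling.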