2019 Jan 29;19:36. doi: 10.1186/s12862-019-1364-9

Table 3.

Characteristics of glue proteins in the species studied (except Sgs7 and Sgs8)

| Protein | Species | Length (aa) | Kind of repeat | Approx. no. of repeats | N glyc | O glyc | Disordered repeats |
|---|---|---|---|---|---|---|---|
| Sgs1 | melanogaster | 1286 | PTTTTPR/STTTTSTSR | ca 85 | 2 | > 25 | yes |
| | simulans | 785 | CAPTTTTPR | ca 40 | 1 | > 25 | yes |
| | mauritiana | 412 | CAPTTTTPR | ca 13 | 1 | > 25 | yes |
| | sechellia | 492 | CAPTTTTPR | ca 22 | 1 | > 25 | yes |
| | santomea | uncertain sequence | | | | | |
| | yakuba | 619? | RPPTTSPSC | uncertain | | > 25 | |
| | elegans | 837 | T rich stretches | | 0 | > 25 | yes |
| | rhopaloa | ca 624 | T rich stretches | | 1 | > 25 | yes |
| | ficusphila | 758 | CAPTTTPST | ca 59 | 0 | > 25 | yes |
| | takahashii | 585 | TSTTTTPR | ca 25 | 1 | > 25 | yes |
| | eugracilis | 635 | PRCTTTTT | ca 39 | 0 | > 25 | yes |
| | biarmipes | 696 | VPTT/KCQMTTSSSAPTTAAPTATSTTAATTSTP | 3/ca 12 | 1 | > 25 | yes |
| | suzukii | 2245 | VPTT/RCPITTSTSAPTTTTATTTSTSTSTTSTP | 8/ca 63 | 1 | > 25 | yes |
| Sgs3 | melanogaster | 307 | KPTTT | ca 31 | 0 | > 25 | yes |
| | simulans | 188 | a few T rich stretches | | 0 | > 25 | yes |
| | mauritiana | 183 | CAPPTRPPCTSPTTTTTTTTTT | ca 5 | 1 | > 25 | yes |
| | sechellia | 172 | CKPTTTTTT | ca 8 | 0 | > 25 | yes |
| | santomea | 273 | PTTTTTTTRR | ca 6 | 0 | > 25 | yes |
| | yakuba | 273 | PTTTTTTTRR | ca 6 | 0 | > 25 | yes |
| | erecta | 333 | TTRR | ca 35 | 3 | > 25 | yes |
| | elegans a | 216 | CAPTTTTTTTQR | ca 7 | 0 | > 25 | yes |
| | elegans b | 202 | KATT | ca 24 | 0 | > 25 | yes |
| | elegans c | 287 | PTTTTTKK | ca 23 | 1 | > 25 | yes |
| | ficusphila a | 266 | CAPTTTTTT | ca 12 | 0 | > 25 | yes |
| | ficusphila b | 259 | T rich stretches | | 0 | > 25 | yes |
| | ficusphila c | 335 | CKPPTTS/KPSKPT | ca 10/ca 28 | 1 | > 25 | yes |
| | takahashii | 585 | PTTTSTTR | ca 27 | 1 | > 25 | yes |
| | eugracilis a | 214 | CAPTTTTTTTTT | ca 7 | 0 | > 25 | yes |
| | eugracilis b | 348 | PTK | ca 65 | 2 | > 25 | yes |
| | biarmipes a | 244 | KKPXTT | ca 21 | 0 | > 25 | yes |
| | biarmipes b | 302 | T rich stretches | | 0 | > 25 | yes |
| | rhopaloa a | 254 | ATTK | ca 21 | 0 | > 25 | yes |
| | rhopaloa b | 256 | T rich stretches | | 0 | > 25 | yes |
| | rhopaloa c | 253 | CAPTTTTTT | ca 12 | 0 | > 25 | yes |
| | rhopaloa d | incomplete 5’ | CAPTTTTTT | ca 9 | 0 | > 25 | yes |
| | kikkawai a | 129 | KPQP | ca 10 | 0 | 2 | yes |
| | kikkawai b | 190 | KPQPP | ca 16 | 0 | 6 | yes |
| | ananassae a | 579 | KPTTP | ca 55 | 1 | > 25 | yes |
| | ananassae b | 566 | PTR/PTE/PTV | ca 71/42/22 | 2 | > 25 | yes |
| | bipectinata a | 272 | T rich stretches/PTKSTR | ca 8 | 0 | > 25 | yes |
| | bipectinata b | 254 | QPPTKSTPKPT | ca 8 | 0 | > 25 | yes |
| | pseudoobscura a | 207 | KPT | ca 23 | 0 | > 25 | yes |
| | pseudoobscura b | 229 | KPTTTP | ca 14 | 0 | > 25 | yes |
| | pseudoobscura c | 224 | KPT | ca 33 | 0 | > 25 | yes |
| | willistoni | 283 | P/T-rich stretch | | 0 | > 25 | yes |
| | willistoni sgs3-like | 546 | CVTTRSSTPTP/CGPTPSPSPT | ca 15/17 | 0 | > 25 | yes |
| | virilis a | 242 | RTTTTPTTTT | ca 12 | 0 | > 25 | yes |
| | virilis b | 283 | KPTTTRRT/KTIPTTTP | ca 11/9 | 2 | > 25 | yes |
| Sgs4 | melanogaster | 287 | CRTEPPT | ca 19 | 0 | > 25 | yes* |
| | simulans | 266 | CDTEPPT | ca 8 | 0 | > 25 | yes* |
| | mauritiana | 360 | CNTEPPT | ca 31 | 0 | > 25 | yes* |
| | sechellia | 255 | CNTEPPT/CDTEPPT | ca 5/4 | 0 | > 25 | yes* |
| | santomea | 351 | C(K/R)T(E/T)PPT / CKTKPPCTTV | ca 14/9 | 0 | > 25 | yes* |
| | yakuba | 361 | C(K/R)T(E/T)PPT | ca 23 | 0 | > 25 | yes* |
| | erecta | 280 | CRTEPPT/NAPTRRT | ca 8/7 | 1 | > 25 | yes* |
| Sgs5 and 5bis | melanogaster | 163 | no repeats | | 0 | 2 | NA |
| | melanogaster bis | 142 | no repeats | | 0 | 0 | NA |
| | simulans | 169 | PE/TE | ca 6 | 0 | 8 | yes |
| | simulans bis | 142 | no repeats | | 0 | 0 | NA |
| | mauritiana | 169 | PE/TE | ca 6 | 0 | 10 | yes |
| | sechellia | 169 | PE/TE | ca 6 | 0 | 10 | yes |
| | sechellia bis | 142 | no repeats | | 0 | 0 | NA |
| | santomea | 192 | TE | ca 7 | 0 | 8 | yes |
| | santomea bis | 142 | no repeats | | 0 | 0 | NA |
| | yakuba | 192 | TE | ca 7 | 0 | 12 | yes |
| | erecta bis | 142 | no repeats | | 0 | 0 | NA |
| | ficusphila | 208 | DP or EP, ES, ET | ca 28 | 0 | 22 | yes |
| | ficusphila bis | 142 | no repeats | | 0 | 0 | NA |
| | takahashii | 217 | EP or EE | ca 12 | 0 | 19 | yes |
| | takahashii bis | 161 | no repeats | | 0 | 3 | NA |
| | biarmipes | 190 | PED or PET | ca 10 | 0 | 17 | yes |
| | biarmipes bis | 143 | no repeats | | 0 | 1 | NA |
| | elegans | 223 | EP | ca 27 | 0 | 11 | yes |
| | eugracilis | 187 | PE | ca 16 | 0 | 14 | yes |
| | eugracilis bis | 142 | no repeats | | 0 | 0 | NA |
| | suzukii | 203 | PETE | ca 11 | 0 | 23 | yes |
| | suzukii bis | 142? | no repeats | | 0 | 1 | NA |
| | kikkawai | 362 | PEDEED | ca 37 | 0 | 11 | yes |
| | kikkawai bis | 146 | no repeats | | 0 | 2 | NA |
| | rhopaloa | 236 | EP | ca 38 | 0 | 9 | yes |
| | ananassae | 172 | almost no repeats | | 0 | 2 | NA |
| | ananassae bis | 146 | no repeats | | 0 | 0 | NA |
| | bipectinata | 162 | almost no repeats | | 0 | 3 | NA |
| | bipectinata bis | 146 | no repeats | | 0 | 1 | NA |
| | pseudoobscura bis | 144 | no repeats | | 0 | 0 | NA |
| | virilis | 143 | no repeats | | 0 | 0 | NA |
| Eig71Ee | melanogaster | 445 | CTCTESTT/(R/K)TNPT | ca 9/ca 7 | 8 | > 25 | yes |
| | simulans | 321 | CTCTDSTT(R/K)KTNPT | ca 4/ca 2 | 2 | > 25 | yes |
| | sechellia | 408 | CTDSTTKTTNPPCT | ca 8 | 3 | > 25 | yes |
| | mauritiana | 284 | no clear repeats | | 0 | > 25 | yes |
| | yakuba | 417 | CTESTTQKPNPPSTQKTRPPCG | ca 5 | 1 | > 25 | yes |
| | santomea | 394 | CTESTTQKPNPPSTEKTRPPCG | ca 3 | 1 | > 25 | yes |
| | erecta | 454 | CTESTTRRTKPPSTRKTRPP | ca 5 | 0 | > 25 | yes |
| | ficusphila | 384 | TE(K/R)T | ca 11 | 1 | > 25 | yes |
| | takahashii | 302 | CTEKTTQKPEPP | ca 7 | 0 | > 25 | yes |
| | biarmipes | 434 | no clear repeats | | 6 | > 25 | yes |
| | suzukii | 346 | no clear repeats | | 0 | > 25 | yes |
| | eugracilis | 447 | CTETTTQKTNPP | ca 5 | 0 | > 25 | yes |

N glyc and O glyc: numbers of predicted N-glycosylation and O-glycosylation sites, obtained with NetNGlyc (http://www.cbs.dtu.dk/services/NetNGlyc/) and NetOGlyc (http://www.cbs.dtu.dk/services/NetOGlyc/), respectively. *: predicted as disordered, except by IUPred and PrDOS.
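
To make the repeat notation in the table concrete, the following minimal sketch shows one way a motif written in that notation, such as `C(K/R)T(E/T)PPT` or `KKPXTT`, could be turned into a regular expression and counted in a protein sequence. It is not the authors' procedure: the function names `motif_to_regex` and `count_repeats` are hypothetical, the example sequences are made up, and only exact, non-overlapping matches are counted. The table reports approximate numbers ("ca") and may include imperfect repeats, so this count need not reproduce it.

```python
import re


def motif_to_regex(motif: str) -> str:
    """Translate table notation into a regular expression.

    Assumed conventions (inferred from the table, not stated by the authors):
    '(K/R)' means K or R at that position; 'X' means any residue.
    """
    def alternatives(match: re.Match) -> str:
        # '(K/R)' -> '[KR]'
        return "[" + match.group(1).replace("/", "") + "]"

    pattern = re.sub(r"\(([A-Z](?:/[A-Z])+)\)", alternatives, motif)
    return pattern.replace("X", ".")


def count_repeats(sequence: str, motif: str) -> int:
    """Count exact, non-overlapping occurrences of the motif."""
    return len(re.findall(motif_to_regex(motif), sequence))


# Made-up toy sequence, for illustration only (not a real Sgs4 protein).
toy = "CKTEPPTCRTTPPTAAACRTEPPT"
print(motif_to_regex("C(K/R)T(E/T)PPT"))      # C[KR]T[ET]PPT
print(count_repeats(toy, "C(K/R)T(E/T)PPT"))  # 3
```

Cells that list two motifs separated by a slash, such as `CRTEPPT/NAPTRRT`, would need to be split and counted separately before calling `count_repeats`.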