TABLE 2.
Potentially expressed ORFs in GpSGHV
| ORF | Position | Length (amino acids) | Intergenic distance (bp)ᵃ | Best BLASTP match: name, source | Best BLASTP match: accession no. | Best BLASTP match: score | Best BLASTP match: E value | Best BLASTP match: identity (%) | Best BLASTP match: length (amino acids) | Best BLASTP match: MegAlign identity (%)ᵇ | Conserved domain(s) or signature(s)ᶜ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| SGHV001 | 1>2091 | 696 | 7 | p74 protein, Spodoptera litura NPV | CAA09849 | 90 | 6.E−16 | 131/656 (19) | 657 | 20.8 | TM, baculo-p74 |
| SGHV002 | 3071<2088 | 327 | −4 | ||||||||
| SGHV003 | 3785<3234 | 183 | 162 | ||||||||
| SGHV004 | 4448<4185 | 87 | 399 | | | | | | | | TM |
| SGHV005 | 5439<4378 | 353 | −71 | ODV-E66, Epiphyas postvittana nucleopolyhedrovirus | NP_203211 | 62 | 2.E−08 | 46/192 (23) | 680 | 15.3 | TM, SP, lyase, Baculo-E66 |
| SGHV006 | 6576<5491 | 361 | 51 | | | | | | | | DUF676 hydrolase, lipase |
| SGHV007 | 6714>7778 | 354 | 137 | ||||||||
| SGHV008 | 8618<7803 | 271 | 24 | | | | | | | | Y045_METJA 13-219 |
| SGHV009 | 8631>10871 | 746 | 12 | Hypothetical protein MAL7P1.132, Plasmodium falciparum 3D7 | XP_001349148 | 50 | 6.E−04 | 60/225 (26) | 2,041 | 22.4 | |
| SGHV010 | 14205<10891 | 1,104 | 19 | ORF MSV156, Melanoplus sanguinipes EPV | NP_048227 | 67 | 9.E−09 | 98/449 (21) | 1,127 | 15.4 | Coiled-coil region, kinetochore Spc7 |
| SGHV011 | 14780<14202 | 192 | −4 | ||||||||
| SGHV012 | 15897<14836 | 353 | 55 | ||||||||
| SGHV013 | 16552<16376 | 58 | 478 | ||||||||
| SGHV014 | 16895<16719 | 58 | 166 | | | | | | | | TM, SP |
| SGHV015 | 17011<16853 | 52 | −43 | | | | | | | | TM |
| SGHV016 | 17068>17436 | 122 | 56 | ||||||||
| SGHV017 | 18627<17449 | 392 | 12 | | | | | | | | ZFC3HC4 ring |
| SGHV018 | 18941>19168 | 75 | 313 | ||||||||
| SGHV019 | 21510<20179 | 443 | 1,010 | Hypothetical protein TA04315, Theileria annulata strain Ankara | XP_954808 | 55 | 9.E−06 | 32/131 (24) | 791 | 18.7 | |
| SGHV020 | 23524>24054 | 176 | 2,013 | ||||||||
| SGHV021 | 25329<24049 | 426 | −6 | ||||||||
| SGHV022 | 25337>26368 | 343 | 7 | ||||||||
| SGHV023 | 26422>27552 | 376 | 53 | | | | | | | | AARP2CN (NUC121) domain SM00785 |
| SGHV024 | 29113<27557 | 518 | 4 | Mv-ORF74 peptide, Maruca vitrata MNPV | YP_950804 | 49 | 1.E−03 | 31/103 (30) | 224 | 24.6 | TM |
| SGHV025 | 29205>29957 | 250 | 91 | ||||||||
| SGHV026 | 30329>31375 | 348 | 371 | ||||||||
| SGHV027 | 31398>32774 | 458 | 22 | Chitinase Chit1 precursor, Glossina morsitans morsitans | AAL65401 | 226 | 3.E−57 | 129/365 (35) | 460 | 33 | TM, Glycoside hydrolase family 18 catalytic domain |
| SGHV028 | 33269>33862 | 197 | 494 | ||||||||
| SGHV029 | 33890>34573 | 227 | 27 | | | | | | | | Staphylococcal AgrD protein SM00794 |
| SGHV030 | 34978<34577 | 133 | 3 | ||||||||
| SGHV031 | 35917<35060 | 285 | 81 | | | | | | | | HDAC interaction domain, histone deacetylase |
| SGHV032 | 36744<35965 | 259 | 47 | ||||||||
| SGHV033 | 38089<37043 | 348 | 298 | ||||||||
| SGHV034 | 39187<38138 | 349 | 48 | ORF AMV 260, Amsacta moorei EPV | NP_065042 | 44 | 2.E−02 | 45/173 (26) | 504 | 19.2 | |
| SGHV035 | 39460<39197 | 87 | 9 | ORF MSV 238, Melanoplus sanguinipes EPV | NP_048309 | 57 | 3.E−07 | 28/80 (35) | 292 | 33.3 | |
| SGHV036 | 40088<39741 | 115 | 280 | ORF 67, shrimp white spot syndrome virus thymidylate synthase | NP_477589 | 62 | 1.E−08 | 28/78 (35) | 289 | 37.4 | |
| SGHV037 | 40351<40187 | 54 | 98 | ||||||||
| SGHV038 | 44374<40850 | 1,174 | 498 | | | | | | | | SP |
| SGHV039 | 45446<44439 | 335 | 64 | | | | | | | | TM, SP |
| SGHV040 | 45768>48473 | 901 | −1 | ORF AMV 130, Amsacta moorei EPV | NP_064912 | 44 | 5.E−02 | 44/141 (31) | 1,384 | 19.1 | |
| SGHV041 | 49765<48524 | 413 | 50 | ORF MSV 214, Melanoplus sanguinipes EPV, SCG gene family protein | NP_048285 | 44 | 2.E−02 | 61/319 (21) | 386 | 18.2 | |
| SGHV042 | 50183<49809 | 124 | 43 | | | | | | | | TM |
| SGHV043 | 50652<50218 | 144 | 34 | | | | | | | | TM |
| SGHV044 | 51903<50824 | 359 | 171 | | | | | | | | SP |
| SGHV045 | 57046<51860 | 1,728 | −44 | ||||||||
| SGHV046 | 57316>58917 | 533 | 269 | | | | | | | | PPASE, inorganic pyrophosphatase signature |
| SGHV047 | 58933>60147 | 404 | 15 | Hypothetical protein CBG22662, Caenorhabditis briggsae | CAE74824 | 56 | 4.E−06 | 57/311 (18) | 743 | 16.6 | |
| SGHV048 | 60195>60986 | 263 | 47 | ORF 033, Helicoverpa armigera nucleopolyhedrovirus G4 ADP-pyrophosphatase | NP_075102 | 59 | 4.E−07 | 56/231 (24) | 238 | 18.8 | |
| SGHV049 | 60935>61207 | 90 | −52 | ||||||||
| SGHV050 | 61314>62189 | 291 | 106 | ||||||||
| SGHV051 | 64184<62202 | 660 | 12 | ORF MSV 152, Melanoplus sanguinipes EPV putative core protein | NP_048223 | 47 | 3.E−03 | 52/218 (25) | 1,306 | 20 | TM |
| SGHV052 | 65228<64308 | 306 | 123 | Hypothetical protein, Plasmodium falciparum | XP_001351434 | 53 | 2.E−05 | 69/251 (27) | 540 | 23.9 | |
| SGHV053 | 66323<65241 | 360 | 12 | Per os infectivity factor 2, Gryllus bimaculatus nudivirus | YP_001111333 | 66 | 3.E−09 | 68/274 (24) | 378 | 21.2 | Baculo-44 |
| SGHV054 | 67507<66473 | 344 | 149 | ORF AMV 054, Amsacta moorei EPV (putative RNA polymerase-associated transcriptional specificity factor) | NP_064836 | 45 | 5.E−03 | 90/379 (23) | 822 | 29.9 | |
| SGHV055 | 69398<67500 | 632 | −8 | ORF AMV253, Amsacta moorei EPV (possible surface protein) | NP_065035 | 55 | 1.E−05 | 100/456 (21) | 485 | 20.6 | Internal repeat |
| SGHV056 | 69464>70111 | 215 | 65 | ||||||||
| SGHV057 | 70161>71117 | 318 | 49 | Rhoptry protein, Plasmodium yoelii yoelii strain 17XNL | XP_725453 | 57 | 2.E−06 | 51/167 (30) | 2,664 | 30.9 | |
| SGHV058 | 71829<71593 | 78 | 475 | | | | | | | | TM, SP |
| SGHV059 | 75616<74639 | 325 | 2,809 | RpoD, Plasmodium falciparum | CAA64574 | 55 | 5.E−06 | 74/308 (24) | 960 | 28.9 | |
| SGHV060 | 76249<75623 | 208 | 6 | ||||||||
| SGHV061 | 77789<76305 | 494 | 55 | ||||||||
| SGHV062 | 77753>90874 | 4,373 | −37 | ORF 147, Trichoplusia ni ascovirus 2c | YP_803369 | 123 | 3.E−25 | 271/1374 (19) | 1,481 | 22.5 | |
| SGHV063 | 92270<90876 | 464 | 1 | | | | | | | | TM, Pfam:CBF, SP |
| SGHV064 | 94234<92447 | 595 | 176 | ORF AMV130, Amsacta moorei EPV (putative ATP-binding cassette transporter) | NP_064912 | 44 | 4.E−02 | 81/354 (22) | 1,384 | 22.4 | |
| SGHV065 | 98511<94246 | 1,421 | 11 | ORF AMV039, Amsacta moorei EPV (putative ATPase/DNA helicase) | NP_064821 | 46 | 2.E−02 | 72/302 (23) | 532 | 22.8 | |
| SGHV066 | 98557>98877 | 106 | 45 | ||||||||
| SGHV067 | 99700<98921 | 259 | 43 | ||||||||
| SGHV068 | 100046<99720 | 108 | 19 | ||||||||
| SGHV069 | 100917<100105 | 270 | 58 | | | | | | | | TM, SP |
| SGHV070 | 102415<101105 | 436 | 187 | ||||||||
| SGHV071 | 102502>104328 | 608 | 86 | ||||||||
| SGHV072 | 105120<104311 | 269 | 1,808 | | | | | | | | TM |
| SGHV073 | 105372<105133 | 79 | 12 | ||||||||
| SGHV074 | 105419>107557 | 712 | 46 | ORF 105, Choristoneura occidentalis GV (helicase-2) | YP_654526 | 45 | 1.E−02 | 25/75 (33) | 461 | 20.6 | ATPases |
| SGHV075 | 108415<107600 | 271 | −1 | ||||||||
| SGHV076 | 108439>109074 | 211 | 23 | ORF101, Helicoverpa zea single nucleocapsid NPV per os infectivity factor 3 | NP_542724 | 64 | 4.E−09 | 38/154 (24) | 199 | 21.9 | TM, Pfam:DUF666, SP |
| SGHV077 | 109141>112320 | 1,059 | 66 | ORF AMV130, Amsacta moorei EPV (putative ATP-binding cassette transporter) | NP_064912 | 45 | 4.E−02 | 80/368 (21) | 1,384 | 17.8 | |
| SGHV078 | 113291<112581 | 236 | 260 | | | | | | | | TM |
| SGHV079 | 113342>116203 | 953 | 50 | Alcelaphine herpesvirus 1 DNA polymerase | NP_065512 | 240 | 3.E−61 | 208/740 (28) | 1,026 | 24.9 | |
| SGHV080 | 116825<116220 | 201 | 16 | | | | | | | | Coiled-coil region |
| SGHV081 | 117346<116831 | 171 | 5 | | | | | | | | TM |
| SGHV082 | 117839<117360 | 159 | 13 | ||||||||
| SGHV083 | 119926<117842 | 694 | 2 | ORF AMV 214, Amsacta moorei EPV | NP_064996 | 46 | 7.E−03 | 69/271 (25) | 404 | 20.8 | |
| SGHV084 | 120018>120677 | 219 | 2,175 | ||||||||
| SGHV085 | 120898>121665 | 255 | 220 | ||||||||
| SGHV086 | 121756>123534 | 592 | 90 | ||||||||
| SGHV087 | 123845<123555 | 96 | 20 | | | | | | | | C2C2 zinc finger |
| SGHV088 | 123967>125925 | 652 | 121 | ||||||||
| SGHV089 | 125940>126482 | 180 | 14 | | | | | | | | Coiled-coil region |
| SGHV090 | 126855>127091 | 78 | 372 | | | | | | | | TM |
| SGHV091 | 127188>127994 | 268 | 96 | | | | | | | | TM, SP |
| SGHV092 | 128014>128265 | 83 | 19 | ||||||||
| SGHV093 | 128273>129262 | 329 | 7 | | | | | | | | TM |
| SGHV094 | 129284>130105 | 273 | 21 | ||||||||
| SGHV095 | 130116>130589 | 157 | 10 | ||||||||
| SGHV096 | 130626>131771 | 381 | 36 | | | | | | | | TM, SP |
| SGHV097 | 131758>132942 | 394 | −14 | | | | | | | | TM |
| SGHV098 | 132962>133309 | 115 | 19 | ||||||||
| SGHV099 | 134004<133522 | 160 | 212 | ||||||||
| SGHV100 | 134769<134335 | 144 | 330 | | | | | | | | TM, SP |
| SGHV101 | 134768>135088 | 106 | −2 | | | | | | | | TM, SP |
| SGHV102 | 135141>137099 | 652 | 52 | Per os infectivity factor 1, Neodiprion abietis nucleopolyhedrovirus | YP_667927 | 79 | 1.E−12 | 92/355 (25) | 537 | 19.4 | EGF-like domain, TM, SP |
| SGHV103 | 138292<137162 | 376 | 62 | | | | | | | | TM, SP |
| SGHV104 | 138341>140323 | 660 | 48 | | | | | | | | TM, coiled-coil region |
| SGHV105 | 141381<140503 | 292 | 179 | | | | | | | | Coiled-coil region |
| SGHV106 | 142793<141384 | 469 | 2 | | | | | | | | Coiled-coil region, Fib-alpha, fibrinogen alpha chain |
| SGHV107 | 144369<142804 | 521 | 10 | Lymphocystis disease virus isolate China cell division protein 48 | YP_073712 | 67 | 2.E−09 | 47/156 (30) | 690 | 20.7 | |
| SGHV108 | 145984<144347 | 545 | −23 | Lymphocystis disease virus isolate China cell division protein 48 | YP_073712 | 53 | 5.E−05 | 41/162 (25) | 690 | 13.8 | |
| SGHV109 | 147225<146368 | 285 | 383 | | | | | | | | TM, SP |
| SGHV110 | 147867<147262 | 201 | 36 | mp-nase, Spodoptera litura granulovirus | YP_001256988 | 60 | 5.E−08 | 37/104 (35) | 464 | 28.9 | TM, zinc-dependent metalloprotease, matrixin signature peptidase_M10 |
| SGHV111 | 148623<147964 | 219 | 96 | | | | | | | | TM, SP, zinc protease |
| SGHV112 | 149132<148629 | 167 | 5 | | | | | | | | TM, SP |
| SGHV113 | 149968<149123 | 281 | −10 | Hypothetical protein PY00593, Plasmodium yoelii yoelii strain 17XNL | XP_725532 | 50 | 2.E−04 | 58/231 (25) | 1,647 | 23.8 | |
| SGHV114 | 150262>151584 | 440 | 293 | ORF MSV016, Melanoplus sanguinipes EPV (leucine-rich repeat gene family protein) | NP_048087 | 55 | 7.E−06 | 88/377 (23) | 572 | 20.5 | |
| SGHV115 | 151638>152870 | 410 | 53 | ||||||||
| SGHV116 | 152888>153967 | 359 | 17 | ORF AMV134, Amsacta moorei EPV (leucine-rich repeat gene family protein) | NP_064916 | 52 | 6.E−05 | 55/236 (23) | 535 | 18.4 | |
| SGHV117 | 154687<153968 | 239 | 0 | ORF 099L, infectious spleen and kidney necrosis virus | NP_612321 | 47 | 1.E−03 | 19/52 (36) | 107 | 29 | |
| SGHV118 | 154959<154711 | 82 | 23 | | | | | | | | ZF C3HC4 type, ring finger |
| SGHV119 | 155648<154995 | 217 | 35 | ||||||||
| SGHV120 | 155692>156747 | 351 | 43 | ||||||||
| SGHV121 | 157037>157798 | 253 | 289 | | | | | | | | TM |
| SGHV122 | 157992>158342 | 116 | 193 | | | | | | | | TM, SP |
| SGHV123 | 159162>159632 | 156 | 819 | | | | | | | | ZnF_C2H2 domain, coiled-coil region |
| SGHV124 | 160975>162828 | 617 | 1,342 | ORF 168, Xestia c-nigrum GV | NP_059316 | 46 | 6.E−03 | 20/63 (31) | 198 | 22.2 | |
| SGHV125 | 163098>163592 | 164 | 269 | ORF 067, Ectropis obliqua NPV Cg30 | YP_874260 | 50 | 6.E−05 | 35/131 (26) | 269 | 25.9 | |
| SGHV126 | 163629>163937 | 102 | 36 | ORF 149, Anticarsia gemmatalis NPV pe38-like | YP_803543 | 44 | 3.E−03 | 25/66 (37) | 209 | 29.4 | |
| SGHV127 | 164989>165237 | 82 | 1,051 | | | | | | | | ZnF_C2H2 domain |
| SGHV128 | 165802>166026 | 74 | 564 | ||||||||
| SGHV129 | 166271>166507 | 78 | 244 | | | | | | | | Coiled-coil region |
| SGHV130 | 166702>167232 | 176 | 194 | ||||||||
| SGHV131 | 167329>169824 | 831 | 96 | ORF MSV156, Melanoplus sanguinipes EPV | NP_048227 | 51 | 4.E−04 | 114/463 (24) | 1,127 | 19.1 | |
| SGHV132 | 169957>170157 | 66 | 132 | ||||||||
| SGHV133 | 170364>171623 | 419 | 206 | | | | | | | | EGF-like domain signature 1 |
| SGHV134 | 171764>172063 | 99 | 140 | ||||||||
| SGHV135 | 172083>172313 | 76 | 19 | ||||||||
| SGHV136 | 172640>172804 | 54 | 326 | | | | | | | | Coiled-coil region |
| SGHV137 | 172868>173140 | 90 | 63 | ||||||||
| SGHV138 | 173153>173515 | 120 | 12 | | | | | | | | Coiled-coil region |
| SGHV139 | 173797>173994 | 65 | 281 | | | | | | | | Coiled-coil region |
| SGHV140 | 174002>175228 | 408 | 7 | | | | | | | | Coiled-coil region |
| SGHV141 | 175407<175141 | 88 | −88 | | | | | | | | TM |
| SGHV142 | 175631>176704 | 357 | 223 | ||||||||
| SGHV143 | 176745>178037 | 430 | 40 | | | | | | | | Coiled-coil region, DUF572, family of unknown function |
| SGHV144 | 179093<178719 | 124 | 681 | ORF 086, Trichoplusia ni ascovirus 2c | YP_803309 | 55 | 1.E−06 | 25/63 (39) | 116 | 33.6 | |
| SGHV145 | 179383>180018 | 211 | 289 | Similar to Plasmodium falciparum trophozoite antigen r45-like protein, Danio rerio | XP_001343112 | 61 | 4.E−08 | 64/174 (36) | 334 | 38.5 | |
| SGHV146 | 180034>180669 | 211 | 15 | ||||||||
| SGHV147 | 181201>181731 | 176 | 531 | ORF 179, shrimp white spot syndrome virus | NP_477701 | 45 | 2.E−03 | 25/116 (21) | 221 | 19.3 | |
| SGHV148 | 182102>182929 | 275 | 370 | ORF 179, shrimp white spot syndrome virus | NP_477701 | 55 | 3.E−06 | 41/190 (21) | 221 | 19.9 | |
| SGHV149 | 183239>183601 | 120 | 309 | Sensory appendage protein 5, Manduca sexta | AAF16716 | 53 | 6.E−06 | 38/113 (33) | 231 | 20.8 | Internal repeat |
| SGHV150 | 184602<183613 | 329 | 11 | ||||||||
| SGHV151 | 184795>185043 | 82 | 192 | ||||||||
| SGHV152 | 185143>185514 | 123 | 99 | ||||||||
| SGHV153 | 185514>185747 | 77 | −1 | ||||||||
| SGHV154 | 185801>186817 | 338 | 53 | ||||||||
| SGHV155 | 186879>187079 | 66 | 61 | ||||||||
| SGHV156 | 187447<187274 | 57 | 194 | ||||||||
| SGHV157 | 187856<187626 | 76 | 178 | ||||||||
| SGHV158 | 188655<187951 | 234 | 94 | | | | | | | | TM |
| SGHV159 | 188579>188788 | 69 | −77 | ||||||||
| SGHV160 | 188838>190025 | 395 | 49 | Possible surface protein AMV253, Amsacta moorei entomopoxvirus | NP_065035 | 50 | 2.E−04 | 68/274 (24) | 485 | 21.5 | TM |
ᵃ −, overlap between adjacent ORFs.
ᵇ Amino acid identity based on MegAlign ClustalW analysis of entire ORFs.
ᶜ SP, signal peptides; TM, transmembrane domains.
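
The Length and Intergenic distance columns can be reproduced from the Position column with simple arithmetic. The Python sketch below is illustrative only and rests on assumptions inferred from the listed values rather than stated in the table: that `>` and `<` mark the coding strand, that the coordinate span includes the stop codon, and that the intergenic distance is the number of bases between the right end of one ORF and the left end of the next (negative when they overlap). The function names are hypothetical, and the first ORF is skipped because its distance depends on the genome length, which the table does not give.

```python
import re


def parse_position(position):
    """Split a Position entry such as '1>2091' or '3071<2088'.

    Returns (left, right, strand). Reading '>' as the forward strand and
    '<' as the reverse strand is an assumption, not stated in the table.
    """
    match = re.fullmatch(r"(\d+)([<>])(\d+)", position)
    if match is None:
        raise ValueError(f"unrecognized position: {position!r}")
    first, direction, second = int(match.group(1)), match.group(2), int(match.group(3))
    strand = "+" if direction == ">" else "-"
    return min(first, second), max(first, second), strand


def protein_length(position):
    """Amino acid count, assuming the span includes one stop codon."""
    left, right, _ = parse_position(position)
    return (right - left + 1) // 3 - 1


def intergenic_distance(previous, current):
    """Bases between two adjacent ORFs; negative values mean overlap."""
    _, previous_right, _ = parse_position(previous)
    current_left, _, _ = parse_position(current)
    return current_left - previous_right - 1


# Positions of SGHV001 to SGHV005, copied from the table.
positions = ["1>2091", "3071<2088", "3785<3234", "4448<4185", "5439<4378"]
for previous, current in zip(positions, positions[1:]):
    print(current, protein_length(current), intergenic_distance(previous, current))
```

For these five rows the computed lengths and distances agree with the tabulated values, including the two negative distances that footnote a flags as overlaps.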