TABLE 1.
Clone(s) | Match(es), % identity, and GenBank accession no. | Kozak sequencee | Putative signal sequenceb | No. of internal transmembrane domains | SignalP-NNc
|
Signal P-HMMd | |||
---|---|---|---|---|---|---|---|---|---|
C | S | Y | s | ||||||
p43F4 | Tetraspanin: human CD63, 39%, and Sm23, 35% | ttaggg_ | MFGACMKNVCLLTTYCILLSILMVAEIAAGIFA IV... | 2 | Y | Y | Y | Y | SP |
p64A3 | Tetraspanin: bovine CD9, 35%; M81720 | ttcatt_ | MKGCIQCLRVILVVFNFLVVLIGLSVLGFS VY... | 2 | Y | Y | Y | Y | SP |
p97D2 | S. mansoni eggshell protein EGG2, 100%; M21607 | xxaagc_ | MKQSLTLVFLVAIGYATA HT... | None | Y | Y | Y | Y | SP |
p25G7, p112E4 | S. mansoni eggshell protein EGG3, 100%; J03982 | tgaaaa_ | MKQSLTLVFLVAIGYATA YT... | None | Y | Y | Y | Y | SP |
p33F5 | S. mansoni EST,a 100%; AA233973 | cgcatc_ | MKYFICVIITVIIGVALS YS... | None | Y | Y | Y | Y | SP |
p83H3 | S. mansoni EST,a 100%; BE505109 | tcaatc_ | MNRFFWTVTQCTILLVIICNLNTMKA TS... | None | Y | Y | Y | Y | SP |
p45H5 | S. mansoni EST, 100%; BG931230 | tcggag_ | MAASHACLDLRALLSVVGLLLASA GR... | None | Y | N | Y | Y | SP |
p76E2 | S. mansoni EST, 100%; AA559399 | gtgccc_ | MRFPAGTYDELQIPQGSWSELHKQHNKLYNKFFIVSATIATALFAAA FY... | None | N | Y | Y | N | SA |
p43G5 | Unknown | gtgtat_ | MRKMPRFLSIHSGFLHILLS FY... | None | N | N | Y | Y | SA |
p30C2 | Unknown | atactg_ | MMMIILMVLLSVIRIIIVGLISLVVVKG KL... | None | Y | Y | Y | Y | SP |
p25D5, p43F7, p101E3 | Antisense S. mansoni ORF-RF2, 100%; M14309 | tccata_ | MIHNSTGASVTASIMAGFVSATFAAFATFATTLTLTATTAITLSIIVAFTTTFSKAVAT VT... | 2 | Y | Y | Y | Y | SA |
p17C5 | Antisense S. mansoni GST, 100%; M98271 | atattc_ | MFYIFSLTNQLFNTIVFLVCFTHHMMFL RH... | None | N | N | Y | Y | NS |
p90C8 | S. mansoni actin I, 100%; M80334 | MTQIMFETFNVPAMYVAIQAVLSLYASGRTTG IV... | None | Y | Y | N | N | NS | |
p28C7 | Human protein kinase DYR2,f 46%; Q92630 | MHKAVIFSFPMIIWLIDMKFS NP... | N/Ag | Y | N | Y | Y | NS | |
p25C4 | S. mansoni Sm23,f 100%; M34453 | MLLFYCLDNCALA NN... | N/A | N | N | Y | N | NS |
Clone sequence extends 5′ of the matching EST.
The putative cleavage point of each signal peptide is denoted by a space followed by the first two N-terminal residues of the processed protein.
SignalP-NN is based on neural networks.
SignalP-HMM is based on hidden Markov models. SP, N-terminal signal peptide; SA, signal anchor; NS, nonsecretory.
The Kozak consensus sequence for eukaryotes is GCCRCC_ where R is A or G and _ is an ATG start codon.
Incorrect reading frame drove surface expression.
N/A, not applicable.