Supplemental File 3 BLAST results for Unigenes in Supplemental Figure 3A-C. A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPECPWN401N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_10027 Length=492 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002315491.1| predicted protein [Populus trichocarpa] >... 175 6e-52 emb|CBI22961.3| unnamed protein product [Vitis vinifera] 181 9e-52 ref|XP_003528629.1| PREDICTED: uncharacterized protein LOC100... 179 5e-51 ref|XP_003550543.1| PREDICTED: uncharacterized protein LOC100... 177 1e-50 ref|NP_851004.1| tetratricopeptide repeat domain-containing p... 177 2e-50 ALIGNMENTS >ref|XP_002315491.1| predicted protein [Populus trichocarpa] gb|EEF01662.1| predicted protein [Populus trichocarpa] Length=222 Score = 175 bits (443), Expect = 6e-52 Identities = 87/104 (84%), Positives = 94/104 (90%), Gaps = 0/104 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAVVLLEK GD T+EVLS+RASCYKEVGEYKKAVADC+KVLE + NV+VL Sbjct 118 DAIKWLSWAVVLLEKTGDKASTMEVLSTRASCYKEVGEYKKAVADCSKVLEHDDANVSVL 177 Query 310 VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMA 179 VQRALLYESMEKY+LGAEDLR V+ IDP NRVARSTVHRLTKMA Sbjct 178 VQRALLYESMEKYRLGAEDLRVVLKIDPANRVARSTVHRLTKMA 221 >emb|CBI22961.3| unnamed protein product [Vitis vinifera] Length=451 Score = 181 bits (458), Expect = 9e-52 Identities = 89/105 (85%), Positives = 98/105 (93%), Gaps = 0/105 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAVVLLEKAGD GT+EVL+ RASCYKEVGEYKKAVADC+KVLE + KNV+VL Sbjct 347 DAIKWLSWAVVLLEKAGDDAGTMEVLTCRASCYKEVGEYKKAVADCSKVLEHDEKNVSVL 406 Query 310 VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMAS 176 VQRALLYES+EKYKLGAEDLRTV+ DPGNRVARST+HRLTKMA+ Sbjct 407 VQRALLYESIEKYKLGAEDLRTVLKFDPGNRVARSTIHRLTKMAA 451 >ref|XP_003528629.1| PREDICTED: uncharacterized protein LOC100799789 [Glycine max] Length=478 Score = 179 bits (454), Expect = 5e-51 Identities = 88/104 (85%), Positives = 96/104 (92%), Gaps = 0/104 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAV+LL+KAGD+ TVEVLS RASCYKEVGEYKKAVADCTKVLE + NV+VL Sbjct 374 DAIKWLSWAVILLQKAGDSAATVEVLSCRASCYKEVGEYKKAVADCTKVLENDETNVSVL 433 Query 310 VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMA 179 VQRALLYESMEKY+LGAEDLRTV+ IDPGNR+ARSTVHRL KMA Sbjct 434 VQRALLYESMEKYRLGAEDLRTVLKIDPGNRIARSTVHRLAKMA 477 >ref|XP_003550543.1| PREDICTED: uncharacterized protein LOC100800725 [Glycine max] Length=442 Score = 177 bits (450), Expect = 1e-50 Identities = 89/104 (86%), Positives = 95/104 (91%), Gaps = 0/104 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAVVLLEKAGD+ T EVLSSRASCYKEVGEYKKAVADCTKVLE + NV+VL Sbjct 338 DAIKWLSWAVVLLEKAGDSATTGEVLSSRASCYKEVGEYKKAVADCTKVLENDETNVSVL 397 Query 310 VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMA 179 VQRALLYESMEKY+LGAEDLRTV+ IDPGNR+AR TVHRL KMA Sbjct 398 VQRALLYESMEKYRLGAEDLRTVLKIDPGNRIARGTVHRLAKMA 441 >ref|NP_851004.1| tetratricopeptide repeat domain-containing protein [Arabidopsis thaliana] gb|AAS49048.1| At3g16760 [Arabidopsis thaliana] gb|AAT71974.1| At3g16760 [Arabidopsis thaliana] dbj|BAF00191.1| hypothetical protein [Arabidopsis thaliana] gb|AEE75862.1| tetratricopeptide repeat domain-containing protein [Arabidopsis thaliana] Length=475 Score = 177 bits (450), Expect = 2e-50 Identities = 85/104 (82%), Positives = 96/104 (92%), Gaps = 0/104 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAV+L+++AGD G+ EVLS+RASCYKEVGEYKKAVADCTKVL+ + KNV +L Sbjct 371 DAIKWLSWAVILMDRAGDEAGSAEVLSTRASCYKEVGEYKKAVADCTKVLDHDKKNVTIL 430 Query 310 VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMA 179 VQRALLYESMEKYKLGAEDLR V+ IDPGNR+ARSTVHRLTKMA Sbjct 431 VQRALLYESMEKYKLGAEDLRMVLKIDPGNRIARSTVHRLTKMA 474 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 1,855,251,573 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 276337787 Number of extensions: 6330351 Number of successful extensions: 19991 Number of sequences better than 1e-10: 1 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 19864 Number of HSP's successfully gapped: 1 Length of query: 492 Length of database: 6150218869 Length adjustment: 125 Effective length of query: 367 Effective length of database: 3910333369 Effective search space: 152503001391 Effective search space used: 152503001391 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits)


A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE1P5Z101N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_1010

Length=931


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|NP_566697.1|  cysteine-rich repeat secretory protein 38 [A...   177    2e-50
dbj|BAJ34612.1|  unnamed protein product [Thellungiella haloph...   176    6e-50
ref|XP_002883339.1|  hypothetical protein ARALYDRAFT_479720 [A...   175    2e-49
ref|XP_003597035.1|  Cysteine-rich repeat secretory protein [M...   173    7e-49
ref|XP_002515716.1|  DUF26 domain-containing protein 2 precurs...   169    2e-47

ALIGNMENTS
>ref|NP_566697.1| cysteine-rich repeat secretory protein 38 [Arabidopsis thaliana]
sp|Q9LRJ9.1|CRR38_ARATH RecName: Full=Cysteine-rich repeat secretory protein 38; Flags:
Precursor
dbj|BAB01391.1| unnamed protein product [Arabidopsis thaliana]
gb|AAK59407.1| unknown protein [Arabidopsis thaliana]
gb|ABD91491.1| At3g22060 [Arabidopsis thaliana]
gb|AEE76585.1| cysteine-rich repeat secretory protein 38 [Arabidopsis thaliana]
Length=252

Score =  177 bits (449),  Expect = 2e-50
Identities = 100/229 (44%), Positives = 130/229 (57%), Gaps = 9/229 (4%)
Frame = -3

Query  908  SQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEGPNKVYGL  729
SQ ++ L H C++   ++T  + Y +NL  L   L  K P    F   S G  PN V GL
Sbjct  29   SQNNAFLFHKCSDIEGSFTSKSLYESNLNNLFSQLSYKVPST-GFAASSTGNTPNNVNGL  87

Query  728  AQCRGDVSPDDCDYCLYDARCKITQRCYKRSA-VVWYDNCQLKYSDVNFLGKIDKENDFV  552
A CRGD S  DC  CL  A  ++ QRC    A +VWYDNC +KYS  NF GKID EN F
Sbjct  88   ALCRGDASSSDCRSCLETAIPELRQRCPNNKAGIVWYDNCLVKYSSTNFFGKIDFENRFY  147

Query  551  MINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKNVTIHGMVLC  372
+ NVKNVS  +   F   +  LLT+L+K+A+ + N+ +FA GEK     K   ++G+V C
Sbjct  148  LYNVKNVS--DPSTFNSQTKALLTELTKKATTRDNQKLFATGEKNIGKNK---LYGLVQC  202

Query  371  TQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFL  225
T+DL    CK CL+G++ ELP         G RVVG SC  RYE Y F+
Sbjct  203  TRDLKSITCKACLNGIIGELP--NCCDGKEGGRVVGGSCNFRYEIYPFV  249


>dbj|BAJ34612.1| unnamed protein product [Thellungiella halophila]
Length=255

Score =  176 bits (446),  Expect = 6e-50
Identities = 97/229 (42%), Positives = 132/229 (58%), Gaps = 9/229 (4%)
Frame = -3

Query  908  SQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEGPNKVYGL  729
SQ ++ L H C++   N+T  + Y +NL  L   +  + P +  F   S G  P+ V GL
Sbjct  32   SQNNAFLYHKCSDIEGNFTSKSPYESNLDSLFRRISYRVPSS-GFAASSAGNSPDNVNGL  90

Query  728  AQCRGDVSPDDCDYCLYDARCKITQRCYKRSA-VVWYDNCQLKYSDVNFLGKIDKENDFV  552
A CRGD S  DC  CL  A  ++ QRC    A ++WYDNC +KYS  NF GKID EN F
Sbjct  91   ALCRGDASSSDCGSCLATAIPELRQRCPNNKAGIIWYDNCLVKYSSTNFFGKIDYENRFY  150

Query  551  MINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKNVTIHGMVLC  372
+ NV NVS  +   F   +  LLT+L+++A+   N+ +FA GEK  + KK   ++G+V C
Sbjct  151  LYNVNNVS--DPASFNTQTKALLTELTQKATTGDNQKLFATGEKNLEKKK---LYGLVQC  205

Query  371  TQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFL  225
T+DL  + CK CLDG++ ELP         G RVVG SC  RYE Y F+
Sbjct  206  TRDLRRESCKACLDGIIGELP--NCCDGKEGGRVVGGSCNFRYEIYPFV  252


>ref|XP_002883339.1| hypothetical protein ARALYDRAFT_479720 [Arabidopsis lyrata subsp.
lyrata]
gb|EFH59598.1| hypothetical protein ARALYDRAFT_479720 [Arabidopsis lyrata subsp.
lyrata]
Length=252

Score =  175 bits (443),  Expect = 2e-49
Identities = 100/230 (43%), Positives = 130/230 (57%), Gaps = 9/230 (4%)
Frame = -3

Query  908  SQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEGPNKVYGL  729
SQ ++ L H C++   ++T  + Y +NL  L   L  K P    F   S G  P+ V GL
Sbjct  29   SQNNAFLFHKCSDIEGSFTSKSPYESNLNNLFPQLSYKVPST-GFATSSAGITPDNVNGL  87

Query  728  AQCRGDVSPDDCDYCLYDARCKITQRCYKRSA-VVWYDNCQLKYSDVNFLGKIDKENDFV  552
A CRGD S  DC  CL  A  +I QRC    A ++WYDNC +KYS  NF GKID EN F
Sbjct  88   ALCRGDASSSDCSSCLATAIPEIRQRCPSNKAGIIWYDNCLVKYSSTNFFGKIDFENRFY  147

Query  551  MINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKNVTIHGMVLC  372
+ NV NVS  +   F   +  LLT L+K+A+   N+ +FA GEK    KK   ++G+V C
Sbjct  148  LYNVNNVS--DPSTFNTQTKALLTKLTKKATTGDNQKLFATGEKNIGMKK---LYGLVQC  202

Query  371  TQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFLN  222
T+DL  + CK CL+G++ ELP         G RVVG SC  RYE Y F+N
Sbjct  203  TRDLKSEACKACLNGIIGELP--NCCDGKEGGRVVGGSCNFRYEIYPFVN  250


>ref|XP_003597035.1| Cysteine-rich repeat secretory protein [Medicago truncatula]
gb|AES67286.1| Cysteine-rich repeat secretory protein [Medicago truncatula]
Length=246

Score =  173 bits (438),  Expect = 7e-49
Identities = 104/240 (43%), Positives = 140/240 (58%), Gaps = 16/240 (7%)
Frame = -3

Query  926  LIPTTISQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEGP  747
LI TT+  G+  L H+C+ TS+N+T  + Y +NLK LI+ L  KTP    FG GS
Sbjct  19   LIQTTL--GTDPLFHICS-TSENFTAHSPYESNLKTLINSLIYKTPST-GFGSGSIDLTQ  74

Query  746  ---NKVYGLAQCRGDVSPDDCDYCLYDARCKITQRC-YKRSAVVWYDNCQLKYSDVNFLG  579
K YGLA CRGDVS  +C  C+  A  +I   C YK+ A++WYDNC  KY D +F G
Sbjct  75   YQNQKAYGLALCRGDVSTSECKTCVSQATKEILNVCPYKKGAIIWYDNCMFKYLDNDFFG  134

Query  578  KIDKENDFVMINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKN  399
KID  N F ++NV+NVS   K  F + +  LL+ L+ +AS   N  ++A GE    + +
Sbjct  135  KIDNTNKFALLNVQNVSDPIK--FNNMTNDLLSFLANEAS--MNHKLYATGELKIGESER  190

Query  398  VTIHGMVLCTQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFLND  219
V  +G+  CT+D+S  DCK CLDG + ELP         G RVVG SC +RYE Y F+ +
Sbjct  191  V--YGLTQCTRDISSVDCKKCLDGAISELP--NCCDGKKGGRVVGGSCNIRYEIYPFVRE  246


>ref|XP_002515716.1| DUF26 domain-containing protein 2 precursor, putative [Ricinus
communis]
gb|EEF46663.1| DUF26 domain-containing protein 2 precursor, putative [Ricinus
communis]
Length=240

Score =  169 bits (428),  Expect = 2e-47
Identities = 97/237 (41%), Positives = 137/237 (58%), Gaps = 12/237 (5%)
Frame = -3

Query  929  LLIPTTISQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEG  750
LL+ T    G   L H C++ S+N+T +  Y +NL  L  +LY + P+   FG GS G
Sbjct  14   LLVQTVY--GDDPLFHFCSS-SENFTANGPYESNLNKLAGNLYFQVPKE-GFGFGSSGRD  69

Query  749  PNKVYGLAQCRGDVSPDDCDYCLYDARCKITQRC-YKRSAVVWYDNCQLKYSDVNFLGKI  573
P++ YGLA CRGDVS  DC  C+ +A  +I +RC   ++A++WYDNC  KYSD  + G+I
Sbjct  70   PDQAYGLALCRGDVSSSDCKTCVVEASSEIRKRCPTNKAAIIWYDNCLYKYSDKKYFGQI  129

Query  572  DKENDFVMINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKNVT  393
D  N F M NV+ V+ + +  F   +  LL++L+ QA   +  Y     +KL    K
Sbjct  130  DNRNKFYMWNVRVVNDSAE--FNQKTKELLSELASQAYVTSKLYATGESDKLGKSNK---  184

Query  392  IHGMVLCTQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFLN  222
++G+V CT+DLS  DC+ CLDG++ ELP         G RVV  SC  RYE Y F+N
Sbjct  185  LYGLVQCTRDLSSGDCRKCLDGIITELP--SCCDGKEGGRVVSGSCNFRYEIYPFVN  239


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 471020381
Number of extensions: 9864198
Number of successful extensions: 26830
Number of sequences better than 1e-10: 62
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 26544
Number of HSP's successfully gapped: 63
Length of query: 931
Length of database: 6150218869
Length adjustment: 140
Effective length of query: 791
Effective length of database: 3641547109
Effective search space: 619063008530
Effective search space used: 619063008530
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 176 (72.4 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE1VWB5012


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_1041

Length=926


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002518985.1|  conserved hypothetical protein [Ricinus c...   174    1e-46
ref|XP_003534287.1|  PREDICTED: inactive poly [ADP-ribose] pol...   174    2e-46
emb|CAN61759.1|  hypothetical protein VITISV_006105 [Vitis vin...   173    4e-46
ref|XP_003516978.1|  PREDICTED: inactive poly [ADP-ribose] pol...   164    5e-43
ref|XP_002890975.1|  hypothetical protein ARALYDRAFT_473413 [A...   164    5e-43

ALIGNMENTS
>ref|XP_002518985.1| conserved hypothetical protein [Ricinus communis]
gb|EEF43518.1| conserved hypothetical protein [Ricinus communis]
Length=536

Score =  174 bits (440),  Expect = 1e-46
Identities = 97/183 (53%), Positives = 120/183 (66%), Gaps = 5/183 (3%)
Frame = +3

Query  24   SLSCFDVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNM  203
S+   DVDEN  RH+V C VIMG +E ++ GS Q  P+SE FDSGVD L+NP  YVVWNM
Sbjct  355  SVKFCDVDENGVRHIVFCRVIMGKMELVQPGSTQSHPSSENFDSGVDDLQNPGQYVVWNM  414

Query  204  NMNSHIYAECVVSFRATSVVEEP--VFGKEKNVGIPQFSSYGEPQAQALGESVEAKSPNS  377
NMN+HIY E +VSF+ +   E    +F       +    S G+  +Q    S   ++P S
Sbjct  415  NMNTHIYPEFIVSFKVSLNAEGDMLLFFVNSQTRLESGGSLGKASSQG---SSNTRTPKS  471

Query  378  PWMPFPVLFAAIADKVTPESMYLVRTNYALFKNKKMSRDAFVKNLRLIVGDNLLKSAITS  557
P+MPFPVLFAAI +KV  E M LV T+Y  F+  KMSR  F+K+LRLIVGD LLKS ITS
Sbjct  472  PFMPFPVLFAAIRNKVPSEQMKLVLTDYKQFQANKMSRGDFIKSLRLIVGDALLKSTITS  531

Query  558  LQS  566
LQS
Sbjct  532  LQS  534


>ref|XP_003534287.1| PREDICTED: inactive poly [ADP-ribose] polymerase RCD1-like [Glycine
max]
Length=583

Score =  174 bits (440),  Expect = 2e-46
Identities = 99/191 (52%), Positives = 125/191 (65%), Gaps = 15/191 (8%)
Frame = +3

Query  39   DVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNMNMNSH  218
DVDEN  RH+ LC VIMGN+E LR G+DQF P+S E+D+GVD +E P  YVVWNMNMN+H
Sbjct  379  DVDENGVRHLALCRVIMGNMEILRPGTDQFHPSSCEYDNGVDAIECPQYYVVWNMNMNTH  438

Query  219  IYAECVVSFRATSVVEEPVFGKE-KNV-GI------------PQFSSYGEPQAQALGESV  356
IY E VVSF+ +S  E    G E KNV G+             + S+    +A ++  S
Sbjct  439  IYPEFVVSFKVSSDAEGHFCGSEGKNVSGVNTACDGPHGLLNSESSTVDNGKAPSMVSST  498

Query  357  EAKSPNSPWMPFPVLFAAIADKVTPESMYLVRTNYALFKNKKMSRDAFVKNLRLIVGDNL  536
K P SPWMPFPVL  AI D+V P  M +++T Y  F++K +SRD FVK LRLIVGD L
Sbjct  499  -PKVPKSPWMPFPVLLDAIRDQVPPTGMDVIKTYYEQFRSKHISRDDFVKMLRLIVGDGL  557

Query  537  LKSAITSLQSK  569
L++ IT+LQ K
Sbjct  558  LRTTITNLQYK  568


>emb|CAN61759.1| hypothetical protein VITISV_006105 [Vitis vinifera]
Length=604

Score =  173 bits (439),  Expect = 4e-46
Identities = 103/212 (49%), Positives = 127/212 (60%), Gaps = 30/212 (14%)
Frame = +3

Query  24   SLSCFDVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNM  203
S++  DVDEN  +H+VLC VIMGN+E +  GS Q  P+SE FDSGVD L+NP  Y++WNM
Sbjct  366  SVNYCDVDENGVQHIVLCRVIMGNMELVHPGSGQCHPSSENFDSGVDDLQNPKHYIIWNM  425

Query  204  NMNSHIYAECVVSFRATSVV--EEPVFGKEKN-----------------------VGIPQ  308
NMN+HIY E VVSF+ +S V  E  + G E N                       VG+
Sbjct  426  NMNTHIYPEYVVSFKVSSRVGAEGYLIGNESNYDISGVTTCQGQSQGHSKLGLHPVGLXN  485

Query  309  FSSYGEPQAQALGESVEAKS-----PNSPWMPFPVLFAAIADKVTPESMYLVRTNYALFK  473
S       ++LG++    S     P SPWMPFP+LFAAI+ KV  + M LV T Y LF+
Sbjct  486  DSHPTPSLGRSLGKATTLGSSTLRVPKSPWMPFPMLFAAISKKVPLKDMQLVDTQYELFR  545

Query  474  NKKMSRDAFVKNLRLIVGDNLLKSAITSLQSK  569
KK+SR  FVK LRLIVGD LLKS IT LQ K
Sbjct  546  RKKISRADFVKKLRLIVGDTLLKSTITHLQCK  577


>ref|XP_003516978.1| PREDICTED: inactive poly [ADP-ribose] polymerase RCD1-like [Glycine
max]
Length=583

Score =  164 bits (416),  Expect = 5e-43
Identities = 94/191 (49%), Positives = 124/191 (65%), Gaps = 15/191 (8%)
Frame = +3

Query  39   DVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNMNMNSH  218
DVDEN  RH+ LC VIMGN+E L+ G+ QF P+S E+D+GVD +E P  YVVWNMNMN+H
Sbjct  379  DVDENGVRHLALCRVIMGNMEILQPGTGQFHPSSCEYDNGVDSIECPRYYVVWNMNMNTH  438

Query  219  IYAECVVSFRATSVVEEPVFGKE-KNV-GI------------PQFSSYGEPQAQALGESV  356
IY E VVSF+ +S  E    G E KNV G+             + S+    +A ++  S
Sbjct  439  IYPEFVVSFKVSSDAEGHFCGSEGKNVSGVNSACQGPHGLLHSESSTVDNGKAPSMVAST  498

Query  357  EAKSPNSPWMPFPVLFAAIADKVTPESMYLVRTNYALFKNKKMSRDAFVKNLRLIVGDNL  536
K P SPWMPFP+L  AI ++V P+ M +++  Y  F++K +SRD FVK LRLIVGD L
Sbjct  499  -PKVPKSPWMPFPLLLDAIRNQVPPKGMDVIKIYYEQFRSKHISRDDFVKMLRLIVGDGL  557

Query  537  LKSAITSLQSK  569
L++ IT+LQ K
Sbjct  558  LRTTITNLQFK  568


>ref|XP_002890975.1| hypothetical protein ARALYDRAFT_473413 [Arabidopsis lyrata subsp.
lyrata]
gb|EFH67234.1| hypothetical protein ARALYDRAFT_473413 [Arabidopsis lyrata subsp.
lyrata]
Length=588

Score =  164 bits (416),  Expect = 5e-43
Identities = 90/192 (47%), Positives = 121/192 (63%), Gaps = 17/192 (9%)
Frame = +3

Query  39   DVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNMNMNSH  218
DVDEN  R+MVLC VIMGN+E LR    QF    EE+D+GVD +ENP  Y+VWN+NMN+H
Sbjct  378  DVDENGVRYMVLCRVIMGNMELLRGDKAQFFSGGEEYDNGVDDIENPKNYIVWNINMNTH  437

Query  219  IYAECVVSFRATSV--VEEPVFGKEKNVGI---------PQFSSY----GEPQAQALGES  353
I+ E VV F+ +++   E  +  K  N G+         PQ  S     G   A ++G S
Sbjct  438  IFPEFVVRFKLSNLPNAEGNLIAKRDNSGVTLEGPKNPPPQVESNHGAGGSGSANSVGSS  497

Query  354  VEAKSPNSPWMPFPVLFAAIADKVTPESMYLVRTNYALFKNKKMSRDAFVKNLRLIVGDN  533
P SPWMPFP LFAAI+ KV  + M L+  +Y   ++KKM+R  FV+ LR+IVGD+
Sbjct  498  --TTRPKSPWMPFPTLFAAISHKVAEKDMSLINADYQQLRDKKMTRAEFVRKLRVIVGDD  555

Query  534  LLKSAITSLQSK  569
LL+S IT+LQ++
Sbjct  556  LLRSTITTLQNQ  567


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 450260976
Number of extensions: 9268664
Number of successful extensions: 19694
Number of sequences better than 1e-10: 4
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 19677
Number of HSP's successfully gapped: 4
Length of query: 926
Length of database: 6150218869
Length adjustment: 140
Effective length of query: 786
Effective length of database: 3641547109
Effective search space: 611779914312
Effective search space used: 611779914312
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 176 (72.4 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPECVPUY016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_10651

Length=476


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002469870.1|  candidate beta-glucosidase from glycoside...  72.0    3e-12
gb|ABK24629.1|  unknown [Picea sitchensis]                         68.6    4e-11

ALIGNMENTS
>ref|XP_002469870.1| candidate beta-glucosidase from glycoside hydrolase family 1
[Postia placenta Mad-698-R]
gb|EED84949.1| candidate beta-glucosidase from glycoside hydrolase family 1
[Postia placenta Mad-698-R]
Length=480

Score = 72.0 bits (175),  Expect = 3e-12
Identities = 29/39 (74%), Positives = 32/39 (82%), Gaps = 0/39 (0%)
Frame = -3

Query  474  SNEDGRGPSIWDAFCRAPGNICDGSNADVAVDQYHRYKE  358
+NE GRGPSIWD FC+ PGNI DGSN D+A D YHRYKE
Sbjct  23   ANEGGRGPSIWDTFCKVPGNIRDGSNGDIATDSYHRYKE  61


>gb|ABK24629.1| unknown [Picea sitchensis]
Length=477

Score = 68.6 bits (166),  Expect = 4e-11
Identities = 29/39 (74%), Positives = 32/39 (82%), Gaps = 0/39 (0%)
Frame = -3

Query  474  SNEDGRGPSIWDAFCRAPGNICDGSNADVAVDQYHRYKE  358
+ E G+GPSIWD+F R PG I DGSN DVAVDQYHRYKE
Sbjct  34   AKEGGKGPSIWDSFSRTPGKILDGSNGDVAVDQYHRYKE  72


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 272888281
Number of extensions: 6189810
Number of successful extensions: 12786
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 12722
Number of HSP's successfully gapped: 0
Length of query: 476
Length of database: 6150218869
Length adjustment: 120
Effective length of query: 356
Effective length of database: 3999928789
Effective search space: 151997293982
Effective search space used: 151997293982
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPDY6ESF01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_107

Length=1296


Score     E
Sequences producing significant alignments:                       (Bits)  Value

emb|CAH59737.1|  hypothetical protein [Plantago major]              119    2e-28
ref|XP_003554330.1|  PREDICTED: uncharacterized protein LOC100...  92.8    1e-18
ref|NP_001241600.1|  uncharacterized protein LOC100804857 [Gly...  92.4    2e-18
ref|XP_002274395.1|  PREDICTED: uncharacterized protein LOC100...  87.8    1e-16
ref|XP_003537192.1|  PREDICTED: uncharacterized protein LOC100...  74.3    3e-12

ALIGNMENTS
>emb|CAH59737.1| hypothetical protein [Plantago major]
Length=196

Score =  119 bits (298),  Expect = 2e-28
Identities = 82/199 (41%), Positives = 110/199 (55%), Gaps = 16/199 (8%)
Frame = -2

Query  1076  MEVFIPYKTSHCPETCISDGSSATD-PHNLIQIKDDLSMND-LKTCLSELMNVEEAGISS  903
MEV +P  TS      IS  S  +D P N       + + D LKT   + + +E+  IS
Sbjct  1     MEVCVPCNTS--AGEYISKNSGFSDGPLNQPHCNSRIPLVDCLKTSSPQHLEIEDDEISP  58

Query  902   GGFNLAPEKYYIST---------EVRESEGKDSEKCLSKSATFQPPCGPKLSEDVFVGEK  750
G +  + +   +S          +  ES    S KCL KSATF P   PKLS +   G +
Sbjct  59    GCYLASDKTCEVSLNEGNGGRELDKSESNASSSTKCLDKSATFPPCRDPKLSTERLCGRR  118

Query  749   RNHEEHLTHEVSEGNGSASTFNTCNTQSTSLPVHLKPISAMKGSREKQQGKPLPKKLSVT  570
+ H +  T  V+E N  A + N C ++  SLP   K +S++KGSREKQ G P PKKLSV+
Sbjct  119   KKHTKR-TDAVAEKNDMAESPNRCPSRCNSLPTPSKLVSSLKGSREKQ-GMP-PKKLSVS  175

Query  569   WAPDVYDPIPTSVSHVPSD  513
WAPDVYDP+PTSVSHVP++
Sbjct  176   WAPDVYDPVPTSVSHVPNN  194


>ref|XP_003554330.1| PREDICTED: uncharacterized protein LOC100783136 [Glycine max]
Length=254

Score = 92.8 bits (229),  Expect = 1e-18
Identities = 93/278 (33%), Positives = 128/278 (46%), Gaps = 57/278 (21%)
Frame = -2

Query  983  IKDDLSMNDLKTCLSELMNVEEAGIS------SGGFNLAPEKYYISTEVRESEGKDSEKC  822
+ DD  +++L    +E + +++A  S      +  +N+  EK  +   +++ E   +  C
Sbjct  10   VLDDNFISNLGRTFNESLQIQDAHKSVLASEENDIYNVNEEK--LCEAMKDQETNINMTC  67

Query  821  LSKSATFQPPC-------GPKLSEDVFVGEKRNHEEHLTHEVSEGNGSASTFNTCNTQST  663
L KSATF  P          K ++    G    H  H T+ +S
Sbjct  68   LKKSATFPIPNTMLPSSPSDKEADTSVTGPLNEHSAHQTYSLS----------------V  111

Query  662  SLPVHLKPISAMKGSREKQQGKPLPKKLSVTWAPDVYDPIPTSVSH-VPSDKNQRYRnkk  486
S P  LK ISAMKGSREK  G  +  KL+V WA DVYDPIPT +SH V S+K Q+
Sbjct  112  SPPAPLKLISAMKGSREKHGGSQV--KLNVKWASDVYDPIPTLLSHTVRSNKKQQ-----  164

Query  485  ygkynkhkssgkssrgskskdskkqgrkssGAGSNKLKQPFHL--DSGVVLREPPQVG--  318
+  +  K    K Q   SS  GSNK KQ   L   SG+  +
Sbjct  165  ---------KSRKKKPEKKNGKKGQKGNSSRGGSNKDKQVRKLGGTSGLCYKSMDSCDKV  215

Query  317  -----PLDYNNVGSQDPFCGSSFLKESVTKLHYPVAEA  219
LD   V SQD +CG+SFLK+SVT++HY VAEA
Sbjct  216  LGASTELDALEVRSQDSYCGTSFLKKSVTEVHYSVAEA  253


>ref|NP_001241600.1| uncharacterized protein LOC100804857 [Glycine max]
gb|ACU17939.1| unknown [Glycine max]
Length=257

Score = 92.4 bits (228),  Expect = 2e-18
Identities = 95/273 (35%), Positives = 136/273 (50%), Gaps = 47/273 (17%)
Frame = -2

Query  983  IKDDLSMNDLKTCLSELMNVEEAGIS------SGGFNLAPEKYYISTEVRESEGKDSEKC  822
+ DD  +++L    SE +++++A  S      S  +N+  EK  I   +++   K +  C
Sbjct  13   VHDDNFISNLGRTFSESLHIQDAQKSLLASEGSDIYNVNEEK--ICKAMKDQATKVNMAC  70

Query  821  LSKSATFQPPCG--PKLSEDVFVGEKRNHEEHLTHEVSEGNGSASTFNTCNTQSTSLPVH  648
L KSATF  P    P  S D      +  +  +T  + E + +  T++    +S SLP
Sbjct  71   LKKSATFPIPNTMLPSSSSD------KEADTSVTEPLYE-HSAHQTYS----RSVSLPAP  119

Query  647  LKPISAMKGSREKQQGKPLPKKLSVTWAPDVYDPIPTSVSH-VPSDKNQRYRnkkygkyn  471
LK I A+KGSREK  G  +  KL+V WA DVYDP+PT +SH V S+K Q+
Sbjct  120  LKLIPAIKGSREKHGGSQV--KLNVKWAADVYDPVPTLLSHTVRSNKKQQ----------  167

Query  470  khkssgkssrgskskdskkqgrkssGAGSNKLKQPFHL--DSGVVLREPPQ-------VG  318
+  +  K    K Q   SS  GS+K KQ   L   SG+  +
Sbjct  168  ----KSRKKKPEKKNGKKGQKGNSSRGGSSKDKQFRKLGGTSGLCYKSMDSCDKVLGVAT  223

Query  317  PLDYNNVGSQDPFCGSSFLKESVTKLHYPVAEA  219
LD  +V SQD +CG+SFLK+SVT+LHY VAEA
Sbjct  224  ELDALDVRSQDSYCGTSFLKKSVTELHYSVAEA  256


>ref|XP_002274395.1| PREDICTED: uncharacterized protein LOC100248230 isoform 3 [Vitis
vinifera]
ref|XP_002274341.1| PREDICTED: uncharacterized protein LOC100248230 isoform 1 [Vitis
vinifera]
emb|CAN77855.1| hypothetical protein VITISV_037693 [Vitis vinifera]
emb|CBI32610.3| unnamed protein product [Vitis vinifera]
Length=293

Score = 87.8 bits (216),  Expect = 1e-16
Identities = 90/257 (35%), Positives = 125/257 (49%), Gaps = 34/257 (13%)
Frame = -2

Query  944  LSELMN--VEEAGI----SSGGFNLAPEKYYISTEVRESEGKDSEKCLSKSATFQPPCGP  783
+ EL N  +E++G+    S+   N+  E   +     ES  +  +K   K ATF  P
Sbjct  57   VKELANSCIEQSGMDVISSTEAENICGE---LKVRQTESPTRSYQKSFCKCATF--PSSG  111

Query  782  KLSEDVFVGEKRNHEEHLTHEVSEGNGSASTFNTCNTQSTSLPVHLKPISAMKGSREKQQ  603
K S     GE+  + +    E    N S  + +   +++ SLP  LK +SAMKGSR+K+
Sbjct  112  KTSLAGSSGEEDGNPDATLQE----NYSLKSLSPDISRTASLPTPLKLVSAMKGSRDKE-  166

Query  602  GKPLPKKLSVTWAPDVYDPIPTSVSHV--------PSDKNQRYRnkkygkynkhkssgks  447
G PL KKL+VTWAPDVYDP PT VSH          S  N++    K        S+GK
Sbjct  167  GIPL-KKLNVTWAPDVYDPPPTIVSHTVRNCKKQQQSKNNRKNGKHKQKGKAARGSNGKD  225

Query  446  srgskskdskkqgrkssGAGSNKLKQPFHLDSGVVLREPPQVGPLDYNNVGSQDPFCGSS  267
+  +           S    +KL     +DS    RE      L+   VGS D +CGSS
Sbjct  226  KKQLRKIVGSGDRGFRSFEVRDKLIASNFIDSS---RE------LEDFEVGSPDGYCGSS  276

Query  266  FLKESVTKLHYPVAEAT  216
FL++SV K+H+ VAEAT
Sbjct  277  FLRKSVAKVHFSVAEAT  293


>ref|XP_003537192.1| PREDICTED: uncharacterized protein LOC100812338 [Glycine max]
Length=239

Score = 74.3 bits (181),  Expect = 3e-12
Identities = 77/271 (28%), Positives = 120/271 (44%), Gaps = 43/271 (16%)
Frame = -2

Query  1004  DPHNLIQIKDDLSMNDLKTCLSELMNVEEA--------GISSGGFNLAPEKYYISTEVRE  849
D  + +    D  ++ L+   SE +++ +A        G   G  ++         E++E
Sbjct  2     DTRSPVNAVHDNIISKLEVTFSESLHIHDAQNSEHASEGDHIGNCDVGERNLREGFELQE  61

Query  848   SEGKDSEKCLSKSATFQPPCGPKLSEDVFVGEKRNHEEHLTHEVSEGNGSASTFNTCNTQ  669
+  K   KCL + +TF  P       D+ +    + EE      S    S     +C
Sbjct  62    T--KLEIKCLKECSTFPYP-------DMMLPSSSSDEE--ADASSPSKQSPRQNYSC---  107

Query  668   STSLPVHLKPISAMKGSREKQQGKPLPKKLSVTWAPDVYDPIPTSVSHVPSDKNQRYRnk  489
S SLP   K +SAMKGSREK++G  +  KL+V WAPDVYDP+PT +SH   +K Q+
Sbjct  108   SVSLPAPRKLVSAMKGSREKERGSQM--KLTVKWAPDVYDPVPTLLSHTVKNKKQQ----  161

Query  488   kygkynkhkssgkssrgskskdskkqgrkssGAGSNKLKQPFHLDSGVVLREPPQVGPLD  309
+  +  K    K Q    S  GS+K KQ              +V
Sbjct  162   ----------KPRIKKSEKKNGKKGQKVSYSKRGSSKDKQ----YRNRWFYSHDEVFEAS  207

Query  308   YNNVGSQDPFCGSS-FLKESVTKLHYPVAEA  219
+N  + D +CG+S +L+ S+TK+H+ + EA
Sbjct  208   SDNAANHDSYCGTSYYLETSLTKVHWSIGEA  238


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 682390971
Number of extensions: 14727534
Number of successful extensions: 32988
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 32979
Number of HSP's successfully gapped: 0
Length of query: 1296
Length of database: 6150218869
Length adjustment: 144
Effective length of query: 1152
Effective length of database: 3569870773
Effective search space: 1028122782624
Effective search space used: 1028122782624
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 178 (73.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEDH95R013


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_11016

Length=467


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002530622.1|  chaperone protein DNAj, putative [Ricinus...   113    1e-28
ref|XP_003551367.1|  PREDICTED: chaperone protein dnaJ 11, chl...   107    1e-26
ref|XP_003544555.1|  PREDICTED: chaperone protein dnaJ 11, chl...   104    1e-25
ref|XP_002300579.1|  predicted protein [Populus trichocarpa] >...   102    2e-25
ref|NP_187939.1|  chaperone DnaJ-domain containing protein [Ar...  99.4    2e-23

ALIGNMENTS
>ref|XP_002530622.1| chaperone protein DNAj, putative [Ricinus communis]
gb|EEF31765.1| chaperone protein DNAj, putative [Ricinus communis]
Length=168

Score =  113 bits (282),  Expect = 1e-28
Identities = 59/98 (60%), Positives = 71/98 (72%), Gaps = 3/98 (3%)
Frame = -1

Query  437  NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIEIRNAYATLSDPE  258
+LYE+LR++R AS  EIKTAYR+LAKLYHPDAA R  D    GRDF+EI NAY TLSDP
Sbjct  72   SLYEILRIKRTASLMEIKTAYRSLAKLYHPDAAVR-EDVETDGRDFMEIHNAYETLSDPA  130

Query  257  ARSAYDNNLEVSLQRLRLRSTTGSRGRFYPTRNWETDQ  144
AR+ YD +L+ + +  R R   G  G +YPTR WETDQ
Sbjct  131  ARALYDLSLDAASR--RRRPAVGFTGGYYPTRRWETDQ  166


>ref|XP_003551367.1| PREDICTED: chaperone protein dnaJ 11, chloroplastic-like [Glycine
max]
Length=151

Score =  107 bits (267),  Expect = 1e-26
Identities = 64/113 (57%), Positives = 75/113 (66%), Gaps = 8/113 (7%)
Frame = -1

Query  458  APAVDNR----NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIEI  291
A AVD +    +LYEVLRV R+AS TEIK+AYR+LAKLYHPDAA +      G  DFI++
Sbjct  37   AEAVDTQRPAASLYEVLRVERDASPTEIKSAYRSLAKLYHPDAAVQRSPETDGDGDFIQL  96

Query  290  RNAYATLSDPEARSAYDNNLEVSL-QRLRLRSTTGSRGR---FYPTRNWETDQ  144
RNAY TLSDP AR+ YD  L  +   R R  ST+ SR     FY TR WETDQ
Sbjct  97   RNAYETLSDPSARAMYDRTLAAAHGGRHRRFSTSLSRNHSSAFYTTRRWETDQ  149


>ref|XP_003544555.1| PREDICTED: chaperone protein dnaJ 11, chloroplastic-like [Glycine
max]
Length=142

Score =  104 bits (260),  Expect = 1e-25
Identities = 59/113 (52%), Positives = 76/113 (67%), Gaps = 10/113 (9%)
Frame = -1

Query  458  APAVDNR---NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIEIR  288
A A+D+R   +LYEVLR+++NAS  EIK+AYR LAK+YHPD+A R   S +  RDFIEI
Sbjct  30   ATAIDSRRAASLYEVLRIKQNASAVEIKSAYRNLAKVYHPDSALR--RSESDERDFIEIH  87

Query  287  NAYATLSDPEARSAYDNNLEVSLQRLR-----LRSTTGSRGRFYPTRNWETDQ  144
+AY TLSDP AR+ YD +L  +    R     + +  GS G +Y TR WETDQ
Sbjct  88   DAYETLSDPSARALYDLSLMAARDDNRSFSSLVAAPNGSSGFYYQTRKWETDQ  140


>ref|XP_002300579.1| predicted protein [Populus trichocarpa]
gb|EEE85384.1| predicted protein [Populus trichocarpa]
Length=99

Score =  102 bits (255),  Expect = 2e-25
Identities = 54/98 (55%), Positives = 64/98 (65%), Gaps = 1/98 (1%)
Frame = -1

Query  437  NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIEIRNAYATLSDPE  258
+LYE+LRV   AS  EIKTAYR+LAK+YHPDA     D  + G DFIEI NAY TLSDP
Sbjct  1    SLYEILRVNPTASQVEIKTAYRSLAKVYHPDAMLDRDDEPSEGVDFIEIHNAYETLSDPA  60

Query  257  ARSAYDNNLEVSLQRLRLRSTTGSRGRFYPTRNWETDQ  144
AR+ YD +L  + +    R   G  G +Y TR WETDQ
Sbjct  61   ARAVYDMSLSAAARDF-YRRAVGYSGGYYTTRRWETDQ  97


>ref|NP_187939.1| chaperone DnaJ-domain containing protein [Arabidopsis thaliana]
dbj|BAB02800.1| DnaJ-like protein [Arabidopsis thaliana]
gb|AAM64632.1| DnaJ protein, putative [Arabidopsis thaliana]
gb|AAP88343.1| At3g13310 [Arabidopsis thaliana]
gb|AEE75332.1| chaperone DnaJ-domain containing protein [Arabidopsis thaliana]
Length=157

Score = 99.4 bits (246),  Expect = 2e-23
Identities = 58/111 (52%), Positives = 71/111 (64%), Gaps = 10/111 (9%)
Frame = -1

Query  467  PAFAPAVDNR--NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIE  294
PA   +V  R  +LYE+L+V   AS TEIKTAYR+LAK+YHPDA      S + GRDF+E
Sbjct  52   PAVTESVRRRVSSLYELLKVNETASLTEIKTAYRSLAKVYHPDA------SESDGRDFME  105

Query  293  IRNAYATLSDPEARSAYDNNLEVSLQRLRLRSTTGSRGRFY-PTRNWETDQ  144
I  AYATL+DP  R+ YD+ L V  +R+      G  GR Y  TR WETDQ
Sbjct  106  IHKAYATLADPTTRAIYDSTLRVPRRRVH-AGAMGRSGRVYATTRRWETDQ  155


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 236168012
Number of extensions: 4973328
Number of successful extensions: 13151
Number of sequences better than 1e-10: 1
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 12733
Number of HSP's successfully gapped: 1
Length of query: 467
Length of database: 6150218869
Length adjustment: 117
Effective length of query: 350
Effective length of database: 4053686041
Effective search space: 154040069558
Effective search space used: 154040069558
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEDUEFD016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_11358

Length=459


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003531875.1|  PREDICTED: uncharacterized protein LOC100...   142    5e-41
ref|XP_002531002.1|  conserved hypothetical protein [Ricinus c...   139    1e-39
ref|XP_003552604.1|  PREDICTED: uncharacterized protein LOC100...   134    7e-38
ref|XP_002330916.1|  predicted protein [Populus trichocarpa] >...   134    1e-37
gb|ACG30695.1|  B12D protein [Zea mays]                             131    1e-36

ALIGNMENTS
>ref|XP_003531875.1| PREDICTED: uncharacterized protein LOC100527287 [Glycine max]
gb|ACU16358.1| unknown [Glycine max]
Length=86

Score =  142 bits (358),  Expect = 5e-41
Identities = 64/85 (75%), Positives = 76/85 (89%), Gaps = 0/85 (0%)
Frame = +1

Query  25   MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ  204
M RW+KPEVYPL+AAM+FVTG+CVFQL+RNV+ NPDVR+NK  RS AVLEN+EEGEKYA+
Sbjct  1    MGRWMKPEVYPLLAAMTFVTGMCVFQLTRNVLGNPDVRINKTRRSMAVLENREEGEKYAE  60

Query  205  HSLRKFLRTRTPEIMPALNRFFSQN  279
H LRKFLRTR PEIMP +N FFS++
Sbjct  61   HGLRKFLRTRPPEIMPTINHFFSED  85


>ref|XP_002531002.1| conserved hypothetical protein [Ricinus communis]
gb|EEF31390.1| conserved hypothetical protein [Ricinus communis]
Length=86

Score =  139 bits (349),  Expect = 1e-39
Identities = 62/85 (73%), Positives = 75/85 (88%), Gaps = 0/85 (0%)
Frame = +1

Query  25   MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ  204
M RW+KPEVYPL+AAM+FVT LC FQL+RN+  NPDVR+NKAHR TAVLEN+ EGE+YA+
Sbjct  1    MARWIKPEVYPLMAAMTFVTSLCAFQLTRNMFLNPDVRINKAHRRTAVLENEVEGEQYAE  60

Query  205  HSLRKFLRTRTPEIMPALNRFFSQN  279
H LRKFLRTR PEIMP++N FFS++
Sbjct  61   HGLRKFLRTRPPEIMPSINHFFSED  85


>ref|XP_003552604.1| PREDICTED: uncharacterized protein LOC100785946 [Glycine max]
Length=86

Score =  134 bits (337),  Expect = 7e-38
Identities = 59/85 (69%), Positives = 74/85 (87%), Gaps = 0/85 (0%)
Frame = +1

Query  25   MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ  204
M RW+KPEVYPL+AAM+FV+ +CVFQL+RN++ NPDVR+NK  RS  VL+N+EEGEKYA+
Sbjct  1    MGRWMKPEVYPLLAAMTFVSSMCVFQLTRNMLGNPDVRINKTRRSMPVLDNREEGEKYAE  60

Query  205  HSLRKFLRTRTPEIMPALNRFFSQN  279
H LRKFLRTR PEIMP +N FFS++
Sbjct  61   HGLRKFLRTRPPEIMPTINHFFSED  85


>ref|XP_002330916.1| predicted protein [Populus trichocarpa]
gb|EEF10241.1| predicted protein [Populus trichocarpa]
Length=86

Score =  134 bits (336),  Expect = 1e-37
Identities = 61/85 (72%), Positives = 73/85 (86%), Gaps = 0/85 (0%)
Frame = +1

Query  25   MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ  204
M RW+KPEVYPL+AAM+ VT LC+FQL+RNV  NPDVRVNKA+R   VLEN+EEGE+YA+
Sbjct  1    MGRWMKPEVYPLLAAMTCVTSLCIFQLTRNVFMNPDVRVNKANRGMGVLENKEEGERYAE  60

Query  205  HSLRKFLRTRTPEIMPALNRFFSQN  279
H LRKFLRTR PEIMP +N FFS++
Sbjct  61   HGLRKFLRTRPPEIMPTVNHFFSED  85


>gb|ACG30695.1| B12D protein [Zea mays]
Length=94

Score =  131 bits (329),  Expect = 1e-36
Identities = 59/90 (66%), Positives = 76/90 (84%), Gaps = 0/90 (0%)
Frame = +1

Query  25   MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ  204
M RW+KP+VYPLIAAMSFVTG+CVFQL+RNV+ NPDVRV+K  R +AVL+N  EG++Y+Q
Sbjct  1    MGRWLKPDVYPLIAAMSFVTGMCVFQLARNVLMNPDVRVSKTSRQSAVLDNAGEGQRYSQ  60

Query  205  HSLRKFLRTRTPEIMPALNRFFSQNNDN*T  294
H+ R+FL T+ PE+ PALN FFS +N+N T
Sbjct  61   HAFRRFLATQRPEVFPALNSFFSDSNNNNT  90


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 213584297
Number of extensions: 4126945
Number of successful extensions: 10139
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 10131
Number of HSP's successfully gapped: 0
Length of query: 459
Length of database: 6150218869
Length adjustment: 116
Effective length of query: 343
Effective length of database: 4071605125
Effective search space: 150649389625
Effective search space used: 150649389625
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEDW0HV012


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_11591

Length=452


Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|ACB12048.1|  pathogenesis-related protein [Rehmannia glutin...  92.8    4e-21
gb|ABR10301.1|  pathogen-related protein STH-2 [Salvia miltior...  81.3    1e-16
ref|XP_003609710.1|  Pathogenesis-related protein [Medicago tr...  74.3    3e-14
ref|XP_003609709.1|  Pathogenesis-related protein [Medicago tr...  74.3    6e-14
ref|XP_002271428.2|  PREDICTED: pathogenesis-related protein S...  70.9    6e-13

ALIGNMENTS
>gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa]
Length=154

Score = 92.8 bits (229),  Expect = 4e-21
Identities = 50/90 (56%), Positives = 64/90 (71%), Gaps = 0/90 (0%)
Frame = -2

Query  451  RIDALDAekgttkfttteGPWLGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALK  272
RI+A+D +   +K+T  EGP LGDKIES+ ++ KFE+ S GGC  KIV EY+TKGD+ LK
Sbjct  65   RIEAVDIDNQVSKYTVIEGPMLGDKIESIHYEQKFEDSSDGGCVAKIVCEYHTKGDIQLK  124

Query  271  DEDIKEIIDGTKGFYAAAEAYLTANPTVCA  182
+E +K I D   GFY  +E YL ANP VCA
Sbjct  125  EEGVKAINDQALGFYTLSEEYLHANPNVCA  154


>gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza]
Length=160

Score = 81.3 bits (199),  Expect = 1e-16
Identities = 43/90 (48%), Positives = 60/90 (67%), Gaps = 0/90 (0%)
Frame = -2

Query  451  RIDALDAekgttkfttteGPWLGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALK  272
R+D +D EK + K+T  EG  LGDK+E + +D+KFE+   GGC +K+ +EY+TKG   L
Sbjct  71   RVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKVTSEYHTKGGYELA  130

Query  271  DEDIKEIIDGTKGFYAAAEAYLTANPTVCA  182
DED+K   + + G Y + E YL ANP VCA
Sbjct  131  DEDLKGAKEQSLGMYKSCEDYLLANPHVCA  160


>ref|XP_003609710.1| Pathogenesis-related protein [Medicago truncatula]
gb|AES91907.1| Pathogenesis-related protein [Medicago truncatula]
Length=160

Score = 74.3 bits (181),  Expect = 3e-14
Identities = 42/90 (47%), Positives = 56/90 (62%), Gaps = 0/90 (0%)
Frame = -2

Query  451  RIDALDAekgttkfttteGPWLGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALK  272
+ID LD E    K+T  EG  LGDK+ES+ ++VKFE  + GGC  K+ + Y T GD  +K
Sbjct  71   KIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVK  130

Query  271  DEDIKEIIDGTKGFYAAAEAYLTANPTVCA  182
+ED+KE  + T G Y   E+YL  NP V A
Sbjct  131  EEDVKEGRESTIGIYEVVESYLLENPQVYA  160


>ref|XP_003609709.1| Pathogenesis-related protein [Medicago truncatula]
gb|AES91906.1| Pathogenesis-related protein [Medicago truncatula]
Length=229

Score = 74.3 bits (181),  Expect = 6e-14
Identities = 42/90 (47%), Positives = 56/90 (62%), Gaps = 0/90 (0%)
Frame = -2

Query  451  RIDALDAekgttkfttteGPWLGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALK  272
+ID LD E    K+T  EG  LGDK+ES+ ++VKFE  + GGC  K+ + Y T GD  +K
Sbjct  140  KIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVK  199

Query  271  DEDIKEIIDGTKGFYAAAEAYLTANPTVCA  182
+ED+KE  + T G Y   E+YL  NP V A
Sbjct  200  EEDVKEGRESTIGIYEVVESYLLENPQVYA  229


>ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera]
emb|CBI22930.3| unnamed protein product [Vitis vinifera]
Length=160

Score = 70.9 bits (172),  Expect = 6e-13
Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 0/65 (0%)
Frame = -2

Query  388  LGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAAEAY  209
LGD++ES+V+++KFEE   GGC  K  +EY+TKG+  +K+E I+E  +   G Y   EAY
Sbjct  92   LGDQLESIVYEMKFEESGDGGCICKTRSEYHTKGEFEIKEESIREGKEKAMGVYKLVEAY  151

Query  208  LTANP  194
L ANP
Sbjct  152  LLANP  156


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 179470603
Number of extensions: 3135620
Number of successful extensions: 6465
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 6463
Number of HSP's successfully gapped: 0
Length of query: 452
Length of database: 6150218869
Length adjustment: 113
Effective length of query: 339
Effective length of database: 4125362377
Effective search space: 152638407949
Effective search space used: 152638407949
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEDYGS6012


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_12004

Length=442


Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|ACB12048.1|  pathogenesis-related protein [Rehmannia glutin...  86.7    8e-19
gb|ABR10301.1|  pathogen-related protein STH-2 [Salvia miltior...  80.1    2e-16
ref|XP_003609710.1|  Pathogenesis-related protein [Medicago tr...  71.2    4e-13
ref|XP_002271428.2|  PREDICTED: pathogenesis-related protein S...  70.9    5e-13
ref|XP_003609709.1|  Pathogenesis-related protein [Medicago tr...  71.2    7e-13

ALIGNMENTS
>gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa]
Length=154

Score = 86.7 bits (213),  Expect = 8e-19
Identities = 41/72 (57%), Positives = 50/72 (69%), Gaps = 0/72 (0%)
Frame = +3

Query  3    GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA  182
G  LG+KIE + Y+ KFE+ S GGC  KIV EY+TKGD+ LK+E +K I D   GFY  +
Sbjct  83   GPMLGDKIESIHYEQKFEDSSDGGCVAKIVCEYHTKGDIQLKEEGVKAINDQALGFYTLS  142

Query  183  EAYLTANPTVCA  218
E YL ANP VCA
Sbjct  143  EEYLHANPNVCA  154


>gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza]
Length=160

Score = 80.1 bits (196),  Expect = 2e-16
Identities = 35/72 (49%), Positives = 48/72 (67%), Gaps = 0/72 (0%)
Frame = +3

Query  3    GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA  182
G  LG+K+E + YD+KFE+   GGC +K+ +EY+TKG   L DED+K   + + G Y +
Sbjct  89   GDMLGDKLEKICYDMKFEDTEDGGCVVKVTSEYHTKGGYELADEDLKGAKEQSLGMYKSC  148

Query  183  EAYLTANPTVCA  218
E YL ANP VCA
Sbjct  149  EDYLLANPHVCA  160


>ref|XP_003609710.1| Pathogenesis-related protein [Medicago truncatula]
gb|AES91907.1| Pathogenesis-related protein [Medicago truncatula]
Length=160

Score = 71.2 bits (173),  Expect = 4e-13
Identities = 33/72 (46%), Positives = 45/72 (63%), Gaps = 0/72 (0%)
Frame = +3

Query  3    GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA  182
G  LG+K+E + Y+VKFE  + GGC  K+ + Y T GD  +K+ED+KE  + T G Y
Sbjct  89   GDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVKEEDVKEGRESTIGIYEVV  148

Query  183  EAYLTANPTVCA  218
E+YL  NP V A
Sbjct  149  ESYLLENPQVYA  160


>ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera]
emb|CBI22930.3| unnamed protein product [Vitis vinifera]
Length=160

Score = 70.9 bits (172),  Expect = 5e-13
Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 0/68 (0%)
Frame = +3

Query  3    GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA  182
G  LG+++E +VY++KFEE   GGC  K  +EY+TKG+  +K+E I+E  +   G Y
Sbjct  89   GGVLGDQLESIVYEMKFEESGDGGCICKTRSEYHTKGEFEIKEESIREGKEKAMGVYKLV  148

Query  183  EAYLTANP  206
EAYL ANP
Sbjct  149  EAYLLANP  156


>ref|XP_003609709.1| Pathogenesis-related protein [Medicago truncatula]
gb|AES91906.1| Pathogenesis-related protein [Medicago truncatula]
Length=229

Score = 71.2 bits (173),  Expect = 7e-13
Identities = 33/72 (46%), Positives = 45/72 (63%), Gaps = 0/72 (0%)
Frame = +3

Query  3    GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA  182
G  LG+K+E + Y+VKFE  + GGC  K+ + Y T GD  +K+ED+KE  + T G Y
Sbjct  158  GDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVKEEDVKEGRESTIGIYEVV  217

Query  183  EAYLTANPTVCA  218
E+YL  NP V A
Sbjct  218  ESYLLENPQVYA  229


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 177980684
Number of extensions: 3148074
Number of successful extensions: 7265
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 7261
Number of HSP's successfully gapped: 0
Length of query: 442
Length of database: 6150218869
Length adjustment: 110
Effective length of query: 332
Effective length of database: 4179119629
Effective search space: 154627426273
Effective search space used: 154627426273
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEE208N016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_12099

Length=440


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003545085.1|  PREDICTED: uncharacterized protein LOC100...  78.2    4e-15
ref|XP_002300574.1|  predicted protein [Populus trichocarpa] >...  77.8    6e-15
ref|XP_003518561.1|  PREDICTED: uncharacterized protein LOC100...  75.5    4e-14
ref|XP_002863234.1|  hypothetical protein ARALYDRAFT_497148 [A...  72.8    3e-13
ref|XP_003634761.1|  PREDICTED: uncharacterized protein LOC100...  73.9    8e-13

ALIGNMENTS
>ref|XP_003545085.1| PREDICTED: uncharacterized protein LOC100812046 [Glycine max]
Length=311

Score = 78.2 bits (191),  Expect = 4e-15
Identities = 39/93 (42%), Positives = 54/93 (58%), Gaps = 3/93 (3%)
Frame = -3

Query  426  RVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVG--IFVFRIDAsvgelggsg  253
R+ NLHWRFRGNE + V+ F VQI WDVHDWL++ N  G+G   FVF+          +
Sbjct  218  RIMNLHWRFRGNEIVMVNNFPVQIFWDVHDWLFT-NDLGLGPAFFVFKPIFLETTSDFNS  276

Query  252  rveggedlGNGFWDVMTDEPCSVGFCHFLYAWK  154
G+   +++ D   + GFCH+LYAW+
Sbjct  277  IECPERGGGSSKHELLEDNSSTQGFCHYLYAWR  309


>ref|XP_002300574.1| predicted protein [Populus trichocarpa]
gb|EEE85379.1| predicted protein [Populus trichocarpa]
Length=307

Score = 77.8 bits (190),  Expect = 6e-15
Identities = 41/95 (43%), Positives = 53/95 (56%), Gaps = 1/95 (1%)
Frame = -3

Query  429  IRVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVGIFVFRIDAsvgelggsgr  250
IRV NL+WRFRGNE + VD+  VQI WDVHDWL+SG+ +  G+F+ +  A  G     G
Sbjct  213  IRVMNLNWRFRGNENMKVDDVGVQIFWDVHDWLFSGSSTSHGLFILKPAAQEGGDDKVGE  272

Query  249  veggedlGNGFWDVMTDEPCSV-GFCHFLYAWKTE  148
G +D   +   S  GF H + AWK E
Sbjct  273  GRHCRGNDGGMYDSPKERSSSTPGFFHVINAWKYE  307


>ref|XP_003518561.1| PREDICTED: uncharacterized protein LOC100776841 [Glycine max]
Length=311

Score = 75.5 bits (184),  Expect = 4e-14
Identities = 38/94 (40%), Positives = 54/94 (57%), Gaps = 3/94 (3%)
Frame = -3

Query  426  RVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVG--IFVFRIDAsvgelggsg  253
R+ NLHWRFRGNE + V+   VQI WDVHDWL++ N  G+G   FVF+          +
Sbjct  218  RIMNLHWRFRGNEILMVNNLPVQIFWDVHDWLFT-NDLGLGPAFFVFKPVFLETTSDSNS  276

Query  252  rveggedlGNGFWDVMTDEPCSVGFCHFLYAWKT  151
G+   +++ +   + GFCH+LYAW+T
Sbjct  277  IECLERSGGSNKRELLEENSSTQGFCHYLYAWRT  310


>ref|XP_002863234.1| hypothetical protein ARALYDRAFT_497148 [Arabidopsis lyrata subsp.
lyrata]
gb|EFH39493.1| hypothetical protein ARALYDRAFT_497148 [Arabidopsis lyrata subsp.
lyrata]
Length=282

Score = 72.8 bits (177),  Expect = 3e-13
Identities = 38/94 (40%), Positives = 55/94 (59%), Gaps = 14/94 (15%)
Frame = -3

Query  429  IRVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVGIFVFRIDAsvgelggsgr  250
++V NL W+FRGN+T+ VD+  VQ+ WDV+DWL+S   +G G+F+F+ ++
Sbjct  203  VQVRNLQWKFRGNQTVLVDKEPVQVFWDVYDWLFSSPGTGHGLFIFKPES----------  252

Query  249  veggedlGNGFWDVMTDEPCSVGFCHFLYAWKTE  148
G  +  NG  D  +    S  FC FLYAWK E
Sbjct  253  --GESETSNGTKD--SSVSSSSDFCLFLYAWKLE  282


>ref|XP_003634761.1| PREDICTED: uncharacterized protein LOC100852612 [Vitis vinifera]
Length=697

Score = 73.9 bits (180),  Expect = 8e-13
Identities = 36/92 (39%), Positives = 48/92 (52%), Gaps = 2/92 (2%)
Frame = -3

Query  429  IRVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVGIFVFRIDAsvgelggsgr  250
IRV NLHWRFRGNET+ ++   +QI WDVHDWL++    G  +F+F+  A
Sbjct  213  IRVMNLHWRFRGNETVFLNNLPIQIFWDVHDWLFNSPSLGHALFIFKPGAPEYSSDSDLD  272

Query  249  veggedlGNG--FWDVMTDEPCSVGFCHFLYA  160
G   +D +     +  FCHFLYA
Sbjct  273  GTNHSGEAVGSDIYDSLWTTTSAADFCHFLYA  304


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 228184065
Number of extensions: 4528817
Number of successful extensions: 29740
Number of sequences better than 1e-10: 2
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 29737
Number of HSP's successfully gapped: 2
Length of query: 440
Length of database: 6150218869
Length adjustment: 110
Effective length of query: 330
Effective length of database: 4179119629
Effective search space: 150448306644
Effective search space used: 150448306644
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE20RD501S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_1213

Length=899


Score     E
Sequences producing significant alignments:                       (Bits)  Value

dbj|BAF47193.1|  embryonic element binding Factor 7 [Daucus ca...   176    4e-49
gb|ADZ28107.1|  ethylene response factor 4 [Malus x domestica]      173    3e-48
gb|ADE41144.1|  AP2 domain class transcription factor [Malus x...   173    3e-48
gb|ADE41127.1|  AP2 domain class transcription factor [Malus x...   166    2e-45
gb|ABB89755.1|  putative dehydration-responsive element bindin...   164    1e-44

ALIGNMENTS
>dbj|BAF47193.1| embryonic element binding Factor 7 [Daucus carota]
Length=320

Score =  176 bits (445),  Expect = 4e-49
Identities = 117/182 (64%), Positives = 133/182 (73%), Gaps = 21/182 (12%)
Frame = +3

Query  3    PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF  182
P KLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAE+AALAYD+AAYKLRGD+ARLNF
Sbjct  153  PTKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEEAALAYDRAAYKLRGDFARLNF  212

Query  183  PHLKHQLNQD-STFKPLQSSVDAKLQAICQNLAKQGTTEKKSVKPCLDSDKQSEIPKVEN  359
PHLK  L+Q+ STFKPL S+VDAKLQAICQNL K+   +K + KP      +    KVE
Sbjct  213  PHLK--LDQELSTFKPLHSTVDAKLQAICQNLNKE-PKQKSAKKP---KTSKPAFVKVEE  266

Query  360  ASSDSFNQAVKVEDDVLssskarsaspesDITFHDFSESCFDECGDFFLHKCPSVEIDWA  539
AS+ S                +  +SPES+I+F DFSE CFDE  +F L K PSVEIDW
Sbjct  267  ASNGS--------------DLSGGSSPESEISFLDFSEPCFDESENFMLQKFPSVEIDWE  312

Query  540  AL  545
AL
Sbjct  313  AL  314


>gb|ADZ28107.1| ethylene response factor 4 [Malus x domestica]
Length=323

Score =  173 bits (439),  Expect = 3e-48
Identities = 112/192 (58%), Positives = 135/192 (70%), Gaps = 11/192 (6%)
Frame = +3

Query  3    PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF  182
P KLYRGVRQRHWGKWVAEIRLP+NRTRLWLGTFDTAE+AALAYDKAA+KLRGD+ARLNF
Sbjct  132  PTKLYRGVRQRHWGKWVAEIRLPRNRTRLWLGTFDTAEEAALAYDKAAFKLRGDFARLNF  191

Query  183  PHLKHQ----LNQDSTFKPLQSSVDAKLQAICQNLAKQGTTEKKSVKPCLDSDKQSEI-P  347
PHL+HQ          +KPL SSVDAKLQAICQ+L    T +  + +PC  ++ +  +
Sbjct  192  PHLRHQGALVCGDFGHYKPLHSSVDAKLQAICQSLGANSTKQGNTGEPCPVAETKPVVSA  251

Query  348  KVENASSDSFNQAVKVEDDVL------ssskarsaspesDITFHDFSESCFDECGDFFLH  509
+E    DSF   +K E +         S    S+SPESDITF DFS+S +DE  +F L
Sbjct  252  PLEAKMDDSFKSELKSESEAFSSSSYSPSRSDESSSPESDITFLDFSDSQWDEAENFGLE  311

Query  510  KCPSVEIDWAAL  545
K PSVEIDW+A+
Sbjct  312  KYPSVEIDWSAI  323


>gb|ADE41144.1| AP2 domain class transcription factor [Malus x domestica]
Length=323

Score =  173 bits (439),  Expect = 3e-48
Identities = 111/192 (58%), Positives = 134/192 (70%), Gaps = 11/192 (6%)
Frame = +3

Query  3    PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF  182
P KLYRGVRQRHWGKWVAEIRLP+NRTRLWLGTFDTAE+AALAYDKAA+KLRGD+ARLNF
Sbjct  132  PTKLYRGVRQRHWGKWVAEIRLPRNRTRLWLGTFDTAEEAALAYDKAAFKLRGDFARLNF  191

Query  183  PHLKHQ----LNQDSTFKPLQSSVDAKLQAICQNLAKQGTTEKKSVKPCLDSDKQSEI-P  347
PHL+HQ          +KPL SSVDAKLQAICQ+L    T +  + +PC  ++ +  +
Sbjct  192  PHLRHQGALVCGDFGHYKPLHSSVDAKLQAICQSLGANSTKQGNTGEPCPVAETKPVVSA  251

Query  348  KVENASSDSFNQAVKVEDDVL------ssskarsaspesDITFHDFSESCFDECGDFFLH  509
+E    DSF   +K E +              S+SPESDITF DFS+S +DE  +F L
Sbjct  252  PLEAKMDDSFKSELKSESEAFSSSSYSPFRSDESSSPESDITFLDFSDSQWDEAENFGLE  311

Query  510  KCPSVEIDWAAL  545
K PSVEIDW+A+
Sbjct  312  KYPSVEIDWSAI  323


>gb|ADE41127.1| AP2 domain class transcription factor [Malus x domestica]
Length=338

Score =  166 bits (420),  Expect = 2e-45
Identities = 112/195 (57%), Positives = 132/195 (68%), Gaps = 14/195 (7%)
Frame = +3

Query  3    PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF  182
P KLYRGVRQRHWGKWVAEIRLP+NRTRLWLGTFDTAE+AALAYDKAA+KLRGD+A LNF
Sbjct  131  PTKLYRGVRQRHWGKWVAEIRLPRNRTRLWLGTFDTAEEAALAYDKAAFKLRGDFACLNF  190

Query  183  PHLKHQLNQDS----TFKPLQSSVDAKLQAICQNLAKQGTTEKKSVKPC-LDSDKQSEIP  347
PHLKHQ   DS     +KPL SSVDAKLQ ICQ+LA   T +  + +PC +   K
Sbjct  191  PHLKHQGAHDSGAFGHYKPLHSSVDAKLQEICQSLAANSTKQGNTGEPCSVPETKPMVSA  250

Query  348  KVENASSDSFNQAVKVEDDVL---------ssskarsaspesDITFHDFSESCFDECGDF  500
+E    DSF   +K E +           S     S++PE+DITF DFS+S +DE  +F
Sbjct  251  PLETRMDDSFKTELKSEWEAFSSLSFSPSRSDRSDESSTPETDITFLDFSDSQWDEAENF  310

Query  501  FLHKCPSVEIDWAAL  545
L K PSVEID+  L
Sbjct  311  GLEKYPSVEIDFLHL  325


>gb|ABB89755.1| putative dehydration-responsive element binding protein [Broussonetia
papyrifera]
Length=330

Score =  164 bits (415),  Expect = 1e-44
Identities = 115/200 (58%), Positives = 140/200 (70%), Gaps = 19/200 (10%)
Frame = +3

Query  3    PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF  182
P KLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAE+AALAYDKAAYKLRGD+ARLNF
Sbjct  131  PNKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEEAALAYDKAAYKLRGDFARLNF  190

Query  183  PHLKHQ----LNQDSTFKPLQSSVDAKLQAICQNLA---KQGTTE--------KKSVKPC  317
PHL+H+      +   +KPL SSVDAKLQAICQ+LA   KQG+ +        K  ++P
Sbjct  191  PHLRHEGAHVSGEFGEYKPLHSSVDAKLQAICQSLANSQKQGSAKEACSEPEVKPVIEPK  250

Query  318  LDSDK----QSEIPKVENASSDSFNQAVKVEDDVLssskarsaspesDITFHDFSESCFD  485
+ SD     + E+     +SS S + ++ +   +   S A S+SPESD+T  DFS+S +D
Sbjct  251  MASDNSPKGELEVSSSSLSSSSSLSLSLSLSSPLSDESSAGSSSPESDVTLLDFSDSHWD  310

Query  486  ECGDFFLHKCPSVEIDWAAL  545
+F L K PSVEIDW AL
Sbjct  311  GNENFGLGKYPSVEIDWDAL  330


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 431153293
Number of extensions: 8524744
Number of successful extensions: 19148
Number of sequences better than 1e-10: 6
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 19121
Number of HSP's successfully gapped: 6
Length of query: 899
Length of database: 6150218869
Length adjustment: 140
Effective length of query: 759
Effective length of database: 3641547109
Effective search space: 579005990331
Effective search space used: 579005990331
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 176 (72.4 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEES6GZ016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_12313

Length=436


Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|ACB12048.1|  pathogenesis-related protein [Rehmannia glutin...  80.1    2e-16
ref|XP_003609710.1|  Pathogenesis-related protein [Medicago tr...  75.9    8e-15
gb|ABR10301.1|  pathogen-related protein STH-2 [Salvia miltior...  75.1    2e-14
ref|XP_003609709.1|  Pathogenesis-related protein [Medicago tr...  75.9    2e-14
ref|XP_002271428.2|  PREDICTED: pathogenesis-related protein S...  73.2    8e-14

ALIGNMENTS
>gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa]
Length=154

Score = 80.1 bits (196),  Expect = 2e-16
Identities = 42/87 (48%), Positives = 61/87 (70%), Gaps = 0/87 (0%)
Frame = +2

Query  2    RIDALDAekgttkftttegPWLGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALK  181
RI+A+D +   +K+T  EGP LGDKIES+ ++ KFE+ S GGC+ KI  +Y+TKGD+ LK
Sbjct  65   RIEAVDIDNQVSKYTVIEGPMLGDKIESIHYEQKFEDSSDGGCVAKIVCEYHTKGDIQLK  124

Query  182  DDDIKEFVERTKAFYDAAEAYLIANPN  262
++ +K   ++   FY  +E YL ANPN
Sbjct  125  EEGVKAINDQALGFYTLSEEYLHANPN  151


>ref|XP_003609710.1| Pathogenesis-related protein [Medicago truncatula]
gb|AES91907.1| Pathogenesis-related protein [Medicago truncatula]
Length=160

Score = 75.9 bits (185),  Expect = 8e-15
Identities = 41/86 (48%), Positives = 57/86 (66%), Gaps = 0/86 (0%)
Frame = +2

Query  2    RIDALDAekgttkftttegPWLGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALK  181
+ID LD E    K+T  EG  LGDK+ES+ ++VKFE  + GGCL K+A+ Y T GD  +K
Sbjct  71   KIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVK  130

Query  182  DDDIKEFVERTKAFYDAAEAYLIANP  259
++D+KE  E T   Y+  E+YL+ NP
Sbjct  131  EEDVKEGRESTIGIYEVVESYLLENP  156


>gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza]
Length=160

Score = 75.1 bits (183),  Expect = 2e-14
Identities = 38/87 (44%), Positives = 60/87 (69%), Gaps = 0/87 (0%)
Frame = +2

Query  2    RIDALDAekgttkftttegPWLGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALK  181
R+D +D EK + K+T  EG  LGDK+E + +D+KFE+   GGC++K+ ++Y+TKG   L
Sbjct  71   RVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKVTSEYHTKGGYELA  130

Query  182  DDDIKEFVERTKAFYDAAEAYLIANPN  262
D+D+K   E++   Y + E YL+ANP+
Sbjct  131  DEDLKGAKEQSLGMYKSCEDYLLANPH  157


>ref|XP_003609709.1| Pathogenesis-related protein [Medicago truncatula]
gb|AES91906.1| Pathogenesis-related protein [Medicago truncatula]
Length=229

Score = 75.9 bits (185),  Expect = 2e-14
Identities = 41/86 (48%), Positives = 57/86 (66%), Gaps = 0/86 (0%)
Frame = +2

Query  2    RIDALDAekgttkftttegPWLGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALK  181
+ID LD E    K+T  EG  LGDK+ES+ ++VKFE  + GGCL K+A+ Y T GD  +K
Sbjct  140  KIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVK  199

Query  182  DDDIKEFVERTKAFYDAAEAYLIANP  259
++D+KE  E T   Y+  E+YL+ NP
Sbjct  200  EEDVKEGRESTIGIYEVVESYLLENP  225


>ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera]
emb|CBI22930.3| unnamed protein product [Vitis vinifera]
Length=160

Score = 73.2 bits (178),  Expect = 8e-14
Identities = 30/66 (45%), Positives = 48/66 (73%), Gaps = 0/66 (0%)
Frame = +2

Query  65   LGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALKDDDIKEFVERTKAFYDAAEAY  244
LGD++ES+V+++KFEE   GGC+ K  ++Y+TKG+  +K++ I+E  E+    Y   EAY
Sbjct  92   LGDQLESIVYEMKFEESGDGGCICKTRSEYHTKGEFEIKEESIREGKEKAMGVYKLVEAY  151

Query  245  LIANPN  262
L+ANP+
Sbjct  152  LLANPD  157


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  May 9, 2012  2:26 PM
Number of letters in database: 6,200,364,692
Number of sequences in database:  18,076,563

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 18076563
Number of Hits to DB: 167970995
Number of extensions: 2784139
Number of successful extensions: 7347
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 7343
Number of HSP's successfully gapped: 0
Length of query: 436
Length of database: 6200364692
Length adjustment: 109
Effective length of query: 327
Effective length of database: 4230019325
Effective search space: 152280695700
Effective search space used: 152280695700
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE2YTBV01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_1307

Length=886


Score     E
Sequences producing significant alignments:                       (Bits)  Value

emb|CBI25361.3|  unnamed protein product [Vitis vinifera]           307    4e-102
ref|XP_002280352.1|  PREDICTED: macro domain-containing protei...   307    1e-101
ref|XP_002319871.1|  predicted protein [Populus trichocarpa] >...   302    2e-100
ref|XP_003588477.1|  Appr-1-p processing enzyme family protein...   295    1e-96
gb|AAB87596.2|  expressed protein [Arabidopsis thaliana]            293    2e-96

ALIGNMENTS
>emb|CBI25361.3| unnamed protein product [Vitis vinifera]
Length=190

Score =  307 bits (787),  Expect = 4e-102
Identities = 148/183 (81%), Positives = 165/183 (90%), Gaps = 0/183 (0%)
Frame = -3

Query  878  ELKLSPSSVLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAACY  699
+L LSP+S LKIQKGDIT W VDGSSDAIVNPANERMLGGGGADGAIH+AAGPEL AACY
Sbjct  7    QLALSPTSSLKIQKGDITKWFVDGSSDAIVNPANERMLGGGGADGAIHRAAGPELVAACY  66

Query  698  KVLEVKPGVRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVANE  519
KV EV+PG+RCPTGEARIT GF+LPA+HVIHTVGPIYDVD NP+ASL++AY N L +A E
Sbjct  67   KVPEVRPGIRCPTGEARITQGFKLPAAHVIHTVGPIYDVDSNPEASLKSAYANCLSLAKE  126

Query  518  NNIQYIAFTAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKARE  339
NN+QYIAF AISCGV+GYPY+EAA VAISTVKEF  ++KEVHFVLFSDDIY+VWL KA E
Sbjct  127  NNVQYIAFPAISCGVFGYPYDEAATVAISTVKEFGKDLKEVHFVLFSDDIYNVWLNKANE  186

Query  338  LLQ  330
LLQ
Sbjct  187  LLQ  189


>ref|XP_002280352.1| PREDICTED: macro domain-containing protein VPA0103 [Vitis vinifera]
emb|CAN70345.1| hypothetical protein VITISV_012577 [Vitis vinifera]
Length=231

Score =  307 bits (787),  Expect = 1e-101
Identities = 148/183 (81%), Positives = 165/183 (90%), Gaps = 0/183 (0%)
Frame = -3

Query  878  ELKLSPSSVLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAACY  699
+L LSP+S LKIQKGDIT W VDGSSDAIVNPANERMLGGGGADGAIH+AAGPEL AACY
Sbjct  48   QLALSPTSSLKIQKGDITKWFVDGSSDAIVNPANERMLGGGGADGAIHRAAGPELVAACY  107

Query  698  KVLEVKPGVRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVANE  519
KV EV+PG+RCPTGEARIT GF+LPA+HVIHTVGPIYDVD NP+ASL++AY N L +A E
Sbjct  108  KVPEVRPGIRCPTGEARITQGFKLPAAHVIHTVGPIYDVDSNPEASLKSAYANCLSLAKE  167

Query  518  NNIQYIAFTAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKARE  339
NN+QYIAF AISCGV+GYPY+EAA VAISTVKEF  ++KEVHFVLFSDDIY+VWL KA E
Sbjct  168  NNVQYIAFPAISCGVFGYPYDEAATVAISTVKEFGKDLKEVHFVLFSDDIYNVWLNKANE  227

Query  338  LLQ  330
LLQ
Sbjct  228  LLQ  230


>ref|XP_002319871.1| predicted protein [Populus trichocarpa]
gb|EEE95794.1| predicted protein [Populus trichocarpa]
Length=180

Score =  302 bits (774),  Expect = 2e-100
Identities = 147/175 (84%), Positives = 161/175 (92%), Gaps = 0/175 (0%)
Frame = -3

Query  854  VLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAACYKVLEVKPG  675
+LKI KGDIT WSVDGSSDAIVNPANERMLGGGGADGAIH+AAGP+LR ACY V EV+PG
Sbjct  5    LLKISKGDITKWSVDGSSDAIVNPANERMLGGGGADGAIHRAAGPQLRDACYTVPEVRPG  64

Query  674  VRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVANENNIQYIAF  495
VRCPTGEARITPGF LPA  VIHTVGPIYDVD NP+ASLRNAYRNSL +A +NNI+YIAF
Sbjct  65   VRCPTGEARITPGFNLPAFRVIHTVGPIYDVDGNPEASLRNAYRNSLILAKDNNIKYIAF  124

Query  494  TAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKARELLQ  330
AISCGVYGYPYEEAA VAISTVKEFA ++KEVHFVLFSD+IY+VWL+KA+ELLQ
Sbjct  125  PAISCGVYGYPYEEAAKVAISTVKEFADDLKEVHFVLFSDEIYNVWLEKAKELLQ  179


>ref|XP_003588477.1| Appr-1-p processing enzyme family protein [Medicago truncatula]
gb|AES58728.1| Appr-1-p processing enzyme family protein [Medicago truncatula]
Length=233

Score =  295 bits (755),  Expect = 1e-96
Identities = 144/184 (78%), Positives = 157/184 (85%), Gaps = 0/184 (0%)
Frame = -3

Query  881  VELKLSPSSVLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAAC  702
V   LS S+ L IQKGDIT WS+DGS+DAIVNPANERMLGGGGADGAIH+AAGP+L  AC
Sbjct  49   VRFPLSSSNALIIQKGDITKWSIDGSTDAIVNPANERMLGGGGADGAIHRAAGPDLLRAC  108

Query  701  YKVLEVKPGVRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVAN  522
V EV+PGVRCPTGEARITPGF LPASHVIHTVGPIYDVD NP ASL +AYRNSL+VA
Sbjct  109  RNVPEVRPGVRCPTGEARITPGFLLPASHVIHTVGPIYDVDSNPAASLASAYRNSLRVAK  168

Query  521  ENNIQYIAFTAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKAR  342
ENNIQYIAF AISCGVYGYPY+EAA VAIST+KEF  + KEVHFVLF  DIYD WL K+
Sbjct  169  ENNIQYIAFPAISCGVYGYPYDEAATVAISTIKEFQNDFKEVHFVLFMSDIYDTWLNKSD  228

Query  341  ELLQ  330
ELL+
Sbjct  229  ELLK  232


>gb|AAB87596.2| expressed protein [Arabidopsis thaliana]
Length=193

Score =  293 bits (749),  Expect = 2e-96
Identities = 138/180 (77%), Positives = 161/180 (89%), Gaps = 0/180 (0%)
Frame = -3

Query  869  LSPSSVLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAACYKVL  690
LS SS+LKI KGDIT WSVD SSDAIVNPANERMLGGGGADGAIH+AAGP+LRAACY+V
Sbjct  12   LSDSSLLKILKGDITKWSVDSSSDAIVNPANERMLGGGGADGAIHRAAGPQLRAACYEVP  71

Query  689  EVKPGVRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVANENNI  510
EV+PGVRCPTGEARITPGF LPAS VIHTVGPIYD D NP+ SL N+Y+NSL+VA ENNI
Sbjct  72   EVRPGVRCPTGEARITPGFNLPASRVIHTVGPIYDSDVNPQESLTNSYKNSLRVAKENNI  131

Query  509  QYIAFTAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKARELLQ  330
+YIAF AISCG+YGYP++EAA + IST+K+F+ + KEVHFVLF+DDI+ VW+ KA+E+LQ
Sbjct  132  KYIAFPAISCGIYGYPFDEAAAIGISTIKQFSTDFKEVHFVLFADDIFSVWVNKAKEVLQ  191


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 443984567
Number of extensions: 9097265
Number of successful extensions: 23242
Number of sequences better than 1e-10: 94
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 23059
Number of HSP's successfully gapped: 94
Length of query: 886
Length of database: 6150218869
Length adjustment: 139
Effective length of query: 747
Effective length of database: 3659466193
Effective search space: 570876726108
Effective search space used: 570876726108
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 176 (72.4 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEF2CHP012


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_13435

Length=408


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002893532.1|  hypothetical protein ARALYDRAFT_473065 [A...   109    4e-28
ref|XP_002262673.1|  PREDICTED: glutaredoxin-C9-like [Vitis vi...   106    1e-26
emb|CAN60168.1|  hypothetical protein VITISV_003664 [Vitis vin...   104    2e-26
ref|NP_174170.1|  glutaredoxin-C9 [Arabidopsis thaliana] >sp|Q...   105    2e-26
ref|XP_002331955.1|  glutaredoxin C9 [Populus trichocarpa] >gb...   105    5e-26

ALIGNMENTS
>ref|XP_002893532.1| hypothetical protein ARALYDRAFT_473065 [Arabidopsis lyrata subsp.
lyrata]
gb|EFH69791.1| hypothetical protein ARALYDRAFT_473065 [Arabidopsis lyrata subsp.
lyrata]
Length=121

Score =  109 bits (273),  Expect = 4e-28
Identities = 51/92 (55%), Positives = 68/92 (74%), Gaps = 0/92 (0%)
Frame = -3

Query  397  KNVKILVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNEPDVaallgiegla  218
++V+++V ENAVIV GR+GCCMCHVV+ LL G GVNP +L+++E+ E +V   L   G
Sbjct  20   ESVRMVVEENAVIVIGRRGCCMCHVVRRLLLGLGVNPAVLEIEEEREEEVLRELERIGGG  79

Query  217  aGVNFPAVFVGGELFGGIDEIMGAHITGELVP  122
V  PAV+VGG LFGG+D +M  HI+GELVP
Sbjct  80   DTVKLPAVYVGGRLFGGLDRVMATHISGELVP  111


>ref|XP_002262673.1| PREDICTED: glutaredoxin-C9-like [Vitis vinifera]
Length=141

Score =  106 bits (265),  Expect = 1e-26
Identities = 54/95 (57%), Positives = 69/95 (73%), Gaps = 4/95 (4%)
Frame = -3

Query  394  NVKILVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNE----PDVaallgie  227
N+  +V+ENAVIVFGR+GCCM HVVK LL G GVNP + +V+E++E     ++  +   E
Sbjct  37   NITNMVSENAVIVFGRRGCCMTHVVKRLLLGLGVNPAVCEVNEEDEIGVLDELGMVGAGE  96

Query  226  glaaGVNFPAVFVGGELFGGIDEIMGAHITGELVP  122
G    V FPAVF+GG LFGG+D +M AHITGELVP
Sbjct  97   GKQGAVQFPAVFIGGRLFGGLDRVMAAHITGELVP  131


>emb|CAN60168.1| hypothetical protein VITISV_003664 [Vitis vinifera]
Length=101

Score =  104 bits (260),  Expect = 2e-26
Identities = 53/91 (58%), Positives = 67/91 (74%), Gaps = 4/91 (4%)
Frame = -3

Query  382  LVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNE----PDVaallgieglaa  215
+V+ENAVIVFGR+GCCM HVVK LL G GVNP + +V+E++E     ++  +   EG
Sbjct  1    MVSENAVIVFGRRGCCMTHVVKRLLLGLGVNPAVCEVNEEDEIGVLDELGMVGAGEGKQG  60

Query  214  GVNFPAVFVGGELFGGIDEIMGAHITGELVP  122
V FPAVF+GG LFGG+D +M AHITGELVP
Sbjct  61   AVQFPAVFIGGRLFGGLDRVMAAHITGELVP  91


>ref|NP_174170.1| glutaredoxin-C9 [Arabidopsis thaliana]
sp|Q9SGP6.1|GRXC9_ARATH RecName: Full=Glutaredoxin-C9; Short=AtGrxC9; AltName: Full=Protein
ROXY 19
gb|AAF16751.1|AC010155_4 F3M18.8 [Arabidopsis thaliana]
gb|AAG40382.1|AF325030_1 At1g28480 [Arabidopsis thaliana]
dbj|BAF01232.1| hypothetical protein [Arabidopsis thaliana]
gb|ABK32150.1| At1g28480 [Arabidopsis thaliana]
gb|ACO50423.1| glutaredoxin [Arabidopsis thaliana]
gb|AEE30981.1| glutaredoxin-C9 [Arabidopsis thaliana]
Length=137

Score =  105 bits (262),  Expect = 2e-26
Identities = 52/95 (55%), Positives = 70/95 (74%), Gaps = 3/95 (3%)
Frame = -3

Query  397  KNVKILVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNEPDV---aallgie  227
+ V+++V ENAVIV GR+GCCMCHVV+ LL G GVNP +L++DE+ E +V      +G++
Sbjct  33   ERVRMVVEENAVIVIGRRGCCMCHVVRRLLLGLGVNPAVLEIDEEREDEVLSELENIGVQ  92

Query  226  glaaGVNFPAVFVGGELFGGIDEIMGAHITGELVP  122
G    V  PAV+VGG LFGG+D +M  HI+GELVP
Sbjct  93   GGGGTVKLPAVYVGGRLFGGLDRVMATHISGELVP  127


>ref|XP_002331955.1| glutaredoxin C9 [Populus trichocarpa]
gb|EEF11863.1| glutaredoxin C9 [Populus trichocarpa]
Length=152

Score =  105 bits (261),  Expect = 5e-26
Identities = 52/97 (54%), Positives = 65/97 (67%), Gaps = 6/97 (6%)
Frame = -3

Query  394  NVKILVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNEPDV------aallg  233
+V+ LV EN+VIVFG++GCCMCHVVK LL G GVNP + +V+E+ E DV
Sbjct  46   HVQKLVLENSVIVFGKRGCCMCHVVKRLLLGLGVNPPVFEVEEKEEDDVIKELSMIDSDR  105

Query  232  ieglaaGVNFPAVFVGGELFGGIDEIMGAHITGELVP  122
V FP VFVGG+LFGG++ +M  HITGELVP
Sbjct  106  GGEGVDQVQFPVVFVGGKLFGGLERVMATHITGELVP  142


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  May 9, 2012  2:26 PM
Number of letters in database: 6,200,364,692
Number of sequences in database:  18,076,563

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 18076563
Number of Hits to DB: 173621065
Number of extensions: 3259378
Number of successful extensions: 8445
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 8439
Number of HSP's successfully gapped: 0
Length of query: 408
Length of database: 6200364692
Length adjustment: 101
Effective length of query: 307
Effective length of database: 4374631829
Effective search space: 153112114015
Effective search space used: 153112114015
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE2Z4R101N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_1393

Length=876


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002518452.1|  peroxisomal membrane protein 2, pxmp2, pu...   238    4e-75
ref|XP_002316547.1|  predicted protein [Populus trichocarpa] >...   234    1e-73
ref|NP_001237804.1|  uncharacterized protein LOC100499909 [Gly...   233    3e-73
ref|XP_003534581.1|  PREDICTED: peroxisomal membrane protein P...   229    8e-72
gb|ACU19135.1|  unknown [Glycine max]                               222    6e-69

ALIGNMENTS
>ref|XP_002518452.1| peroxisomal membrane protein 2, pxmp2, putative [Ricinus communis]
gb|EEF43839.1| peroxisomal membrane protein 2, pxmp2, putative [Ricinus communis]
Length=185

Score =  238 bits (607),  Expect = 4e-75
Identities = 147/185 (79%), Positives = 166/185 (90%), Gaps = 0/185 (0%)
Frame = -3

Query  841  MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG  662
MGSVAKKGLQ YL QLQ HPLRTK +TAG LSA+SDI+AQK+SGIQKLQL+RLLLKV+FG
Sbjct  1    MGSVAKKGLQLYLLQLQHHPLRTKAITAGFLSAVSDIIAQKISGIQKLQLRRLLLKVLFG  60

Query  661  AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW  482
+AYLGPFGHF H++LDKIFKGKKDTKTVAKKVV EQLTSSP NN+LFM+YYG ++E RPW
Sbjct  61   SAYLGPFGHFLHIILDKIFKGKKDTKTVAKKVVVEQLTSSPWNNMLFMIYYGVIVERRPW  120

Query  481  IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM  302
+HVK++IKKEYP VQ T+W+FWPVVGW+NHQYVPLQ RVI   VVACFW IFLNLRARSM
Sbjct  121  MHVKARIKKEYPKVQLTSWTFWPVVGWINHQYVPLQLRVIFHMVVACFWGIFLNLRARSM  180

Query  301  TLTKG  287
L KG
Sbjct  181  ALAKG  185


>ref|XP_002316547.1| predicted protein [Populus trichocarpa]
gb|EEE97159.1| predicted protein [Populus trichocarpa]
Length=185

Score =  234 bits (597),  Expect = 1e-73
Identities = 144/185 (78%), Positives = 167/185 (90%), Gaps = 0/185 (0%)
Frame = -3

Query  841  MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG  662
MGSVAKKGLQQY+ QLQ+HPLRTK +TAGVLSA+SDIV+QKLSGIQKLQ+KR+LLKV+FG
Sbjct  1    MGSVAKKGLQQYMLQLQQHPLRTKAITAGVLSALSDIVSQKLSGIQKLQIKRILLKVLFG  60

Query  661  AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW  482
YLGPFGH+ H++LDK+FKGKKDT TVAKKV  EQLT+SP NNL+FMVYYG VI+GRPW
Sbjct  61   FGYLGPFGHYLHILLDKLFKGKKDTTTVAKKVAVEQLTASPWNNLVFMVYYGMVIDGRPW  120

Query  481  IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM  302
+ VK+K+KKEYPAVQFT+W+FWPVVGWVNHQY+P QFRVI  S++A  W IFLNLRARSM
Sbjct  121  LQVKTKLKKEYPAVQFTSWTFWPVVGWVNHQYIPQQFRVIFHSLIAVGWGIFLNLRARSM  180

Query  301  TLTKG  287
LTKG
Sbjct  181  ALTKG  185


>ref|NP_001237804.1| uncharacterized protein LOC100499909 [Glycine max]
gb|ACU14136.1| unknown [Glycine max]
Length=185

Score =  233 bits (595),  Expect = 3e-73
Identities = 137/184 (74%), Positives = 170/184 (92%), Gaps = 0/184 (0%)
Frame = -3

Query  841  MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG  662
MGS+AKKGL  Y+ QLQ+HPLRTKV+TAGVLSAISD+V+QKL+GIQKLQLKRLL KVIFG
Sbjct  1    MGSLAKKGLNNYVKQLQQHPLRTKVITAGVLSAISDVVSQKLTGIQKLQLKRLLFKVIFG  60

Query  661  AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW  482
AAYLGPFGHF+H++LDKIFKGK+D+KTVAKKV+ EQLTS+P NNLLFM+YYG V+EG+PW
Sbjct  61   AAYLGPFGHFFHLILDKIFKGKRDSKTVAKKVLIEQLTSNPWNNLLFMIYYGLVVEGQPW  120

Query  481  IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM  302
++VK+K+KK+YP+VQ+T+W+ WPVVGW+NH+++PL FRV+ QS+VA FW +FLNLRARSM
Sbjct  121  VNVKAKVKKDYPSVQYTSWTVWPVVGWINHKFMPLHFRVVFQSLVAFFWGVFLNLRARSM  180

Query  301  TLTK  290
L K
Sbjct  181  ALIK  184


>ref|XP_003534581.1| PREDICTED: peroxisomal membrane protein PMP22-like [Glycine max]
Length=185

Score =  229 bits (585),  Expect = 8e-72
Identities = 135/184 (73%), Positives = 169/184 (92%), Gaps = 0/184 (0%)
Frame = -3

Query  841  MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG  662
MGS+AKKGL  Y+ QLQ+HPLRTKV+TAGVLSAISD+V+QKL+GIQK+QLKRLL KVIFG
Sbjct  1    MGSLAKKGLNNYVKQLQQHPLRTKVITAGVLSAISDVVSQKLTGIQKIQLKRLLFKVIFG  60

Query  661  AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW  482
AAYLGPFGHF+H++LDKIFKGK+D+KTVAKKV+ EQLTS+P NNLLFM+YYG V+EG+PW
Sbjct  61   AAYLGPFGHFFHLILDKIFKGKRDSKTVAKKVLIEQLTSNPWNNLLFMIYYGLVVEGQPW  120

Query  481  IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM  302
++VK+K+KK+Y +VQ+T+W+ WPVVGW+NH+++PL FRV+ QS+VA FW +FLNLRARSM
Sbjct  121  VNVKAKVKKDYLSVQYTSWTVWPVVGWINHKFMPLHFRVVFQSLVAFFWGVFLNLRARSM  180

Query  301  TLTK  290
L K
Sbjct  181  ALIK  184


>gb|ACU19135.1| unknown [Glycine max]
Length=185

Score =  222 bits (566),  Expect = 6e-69
Identities = 132/184 (72%), Positives = 166/184 (90%), Gaps = 0/184 (0%)
Frame = -3

Query  841  MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG  662
MGS+AKKGL  Y+ QLQ+HPLRTKV+TAGVLSAISD+V+QKL+GIQK+QLKRLL KVIFG
Sbjct  1    MGSLAKKGLNNYVKQLQQHPLRTKVITAGVLSAISDVVSQKLTGIQKIQLKRLLFKVIFG  60

Query  661  AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW  482
AAY GPFGH +H++LDKIFKGK+D+KTVAKKV+ EQLTS+P NNLLFM+YYG V+EG+PW
Sbjct  61   AAYPGPFGHLFHLILDKIFKGKRDSKTVAKKVLIEQLTSNPWNNLLFMIYYGLVVEGQPW  120

Query  481  IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM  302
++VK+K+KK+Y +VQ+T+W+ WPVVGW+NH+++PL FRV+ QS+VA FW +FLNLRAR M
Sbjct  121  VNVKAKVKKDYLSVQYTSWTVWPVVGWINHKFMPLHFRVVFQSLVAFFWGVFLNLRARFM  180

Query  301  TLTK  290
L K
Sbjct  181  ALIK  184


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 1,855,251,573
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 438472506
Number of extensions: 9096450
Number of successful extensions: 17563
Number of sequences better than 1e-10: 2
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 17537
Number of HSP's successfully gapped: 2
Length of query: 876
Length of database: 6150218869
Length adjustment: 139
Effective length of query: 737
Effective length of database: 3659466193
Effective search space: 559898327529
Effective search space used: 559898327529
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 176 (72.4 bits)






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPDZ3HTU016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_147

Length=1254


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003606383.1|  Aquaporin PIP2-7 [Medicago truncatula] >g...   404    9e-137
emb|CAB61749.1|  putative water channel protein [Cicer arietinum]   393    4e-133
emb|CAB45651.1|  putative plasma membrane intrinsic protein [P...   392    6e-132
ref|NP_001105639.1|  aquaporin PIP2-7 [Zea mays] >sp|Q9ATM4.1|...   390    3e-131
ref|NP_001237286.1|  PIP2,2 [Glycine max] >gb|AAX86046.1| PIP2...   389    1e-130

ALIGNMENTS
>ref|XP_003606383.1| Aquaporin PIP2-7 [Medicago truncatula]
gb|AES88580.1| Aquaporin PIP2-7 [Medicago truncatula]
Length=285

Score =  404 bits (1038),  Expect = 9e-137
Identities = 197/203 (97%), Positives = 200/203 (99%), Gaps = 0/203 (0%)
Frame = -3

Query  1252  IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL  1073
IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL
Sbjct  83    IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL  142

Query  1072  AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  893
AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV
Sbjct  143   AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  202

Query  892   LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA  713
LAPLPIGFAVFMVHLATIPVTGTGINPARSFG AVI+N+ K WDDQWI+WVGPFIGAAVA
Sbjct  203   LAPLPIGFAVFMVHLATIPVTGTGINPARSFGSAVIYNDGKIWDDQWIFWVGPFIGAAVA  262

Query  712   AIYHQYILRGSAIKALGSFRSNA  644
AIYHQYILRGSAIKALGSFRSNA
Sbjct  263   AIYHQYILRGSAIKALGSFRSNA  285


>emb|CAB61749.1| putative water channel protein [Cicer arietinum]
Length=237

Score =  393 bits (1009),  Expect = 4e-133
Identities = 191/203 (94%), Positives = 195/203 (96%), Gaps = 0/203 (0%)
Frame = -3

Query  1252  IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL  1073
IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSL+RAVFYMAAQ AGAI GTGL
Sbjct  35    IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLLRAVFYMAAQCAGAISGTGL  94

Query  1072  AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  893
AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV
Sbjct  95    AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  154

Query  892   LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA  713
LAPLPIGFAVFMVHLATIP+TGTGINPARSFG AVI+N  K WDDQWI+WVGP IGA VA
Sbjct  155   LAPLPIGFAVFMVHLATIPITGTGINPARSFGSAVIYNEGKIWDDQWIFWVGPIIGATVA  214

Query  712   AIYHQYILRGSAIKALGSFRSNA  644
AIYHQYILRGSAIKALGSFRSNA
Sbjct  215   AIYHQYILRGSAIKALGSFRSNA  237


>emb|CAB45651.1| putative plasma membrane intrinsic protein [Pisum sativum]
Length=285

Score =  392 bits (1006),  Expect = 6e-132
Identities = 187/203 (92%), Positives = 195/203 (96%), Gaps = 0/203 (0%)
Frame = -3

Query  1252  IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL  1073
IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSL+RAVFYMAAQ AGA+CGTGL
Sbjct  83    IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLLRAVFYMAAQCAGAVCGTGL  142

Query  1072  AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  893
AKGFQK++FDRYGGGANF+HDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV
Sbjct  143   AKGFQKSFFDRYGGGANFIHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  202

Query  892   LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA  713
LAPLPIGFAVFMVHLATIP+TGTGINPARSFG AVI N  K WDDQW++WVGP IGA VA
Sbjct  203   LAPLPIGFAVFMVHLATIPITGTGINPARSFGSAVILNQGKIWDDQWVFWVGPIIGATVA  262

Query  712   AIYHQYILRGSAIKALGSFRSNA  644
AIYHQYILRGSAIKALGSFRSNA
Sbjct  263   AIYHQYILRGSAIKALGSFRSNA  285


>ref|NP_001105639.1| aquaporin PIP2-7 [Zea mays]
sp|Q9ATM4.1|PIP27_MAIZE RecName: Full=Aquaporin PIP2-7; AltName: Full=Plasma membrane
intrinsic protein 2-7; AltName: Full=ZmPIP2-7; AltName: Full=ZmPIP2;7
gb|AAK26763.1| plasma membrane integral protein ZmPIP2-7 [Zea mays]
gb|ACU23788.1| unknown [Glycine max]
Length=287

Score =  390 bits (1002),  Expect = 3e-131
Identities = 187/203 (92%), Positives = 196/203 (97%), Gaps = 0/203 (0%)
Frame = -3

Query  1252  IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL  1073
IAWAFGGMIFILVYCTAGISGGHINPAVTFGLF+GRKVSL+RA+ YM AQ AGAICG GL
Sbjct  85    IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFLGRKVSLVRALLYMIAQCAGAICGAGL  144

Query  1072  AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  893
AKGFQK++++RYGGG N V DGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV
Sbjct  145   AKGFQKSFYNRYGGGVNTVSDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  204

Query  892   LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA  713
LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNN+KAWDDQWIYWVGPF+GAAVA
Sbjct  205   LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNDKAWDDQWIYWVGPFVGAAVA  264

Query  712   AIYHQYILRGSAIKALGSFRSNA  644
AIYHQYILRGSAIKALGSFRSNA
Sbjct  265   AIYHQYILRGSAIKALGSFRSNA  287


>ref|NP_001237286.1| PIP2,2 [Glycine max]
gb|AAX86046.1| PIP2,2 [Glycine max]
Length=287

Score =  389 bits (998),  Expect = 1e-130
Identities = 186/203 (92%), Positives = 195/203 (96%), Gaps = 0/203 (0%)
Frame = -3

Query  1252  IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL  1073
IAWAFGGMIFILVYCTAGISGGHINPAVTFGLF+GRKVSL+RA+ YM AQ AGAICG GL
Sbjct  85    IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFLGRKVSLVRALLYMIAQCAGAICGAGL  144

Query  1072  AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  893
AKGFQK++++RYGGG N V DGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV
Sbjct  145   AKGFQKSFYNRYGGGVNTVSDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV  204

Query  892   LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA  713
LAPLPIGFAVFM HLATIPVTGTGINPARSFGPAVIFNN+KAWDDQWIYWVGPF+GAAVA
Sbjct  205   LAPLPIGFAVFMAHLATIPVTGTGINPARSFGPAVIFNNDKAWDDQWIYWVGPFVGAAVA  264

Query  712   AIYHQYILRGSAIKALGSFRSNA  644
AIYHQYILRGSAIKALGSFRSNA
Sbjct  265   AIYHQYILRGSAIKALGSFRSNA  287


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 704560993
Number of extensions: 16073247
Number of successful extensions: 44958
Number of sequences better than 1e-10: 350
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 44043
Number of HSP's successfully gapped: 352
Length of query: 1254
Length of database: 6150218869
Length adjustment: 143
Effective length of query: 1111
Effective length of database: 3587789857
Effective search space: 986642210675
Effective search space used: 986642210675
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 178 (73.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEF4CNU01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_15159

Length=369


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|NP_001051179.1|  Os03g0734300 [Oryza sativa Japonica Group...  81.3    9e-18
ref|XP_002263203.1|  PREDICTED: uncharacterized protein LOC100...  70.1    1e-13
ref|NP_001105983.1|  LOC100037813 precursor [Zea mays] >gb|ABN...  64.7    1e-11
ref|XP_003566353.1|  PREDICTED: uncharacterized protein LOC100...  63.9    3e-11
ref|XP_002521828.1|  serine-type endopeptidase inhibitor, puta...  63.5    4e-11

ALIGNMENTS
>ref|NP_001051179.1| Os03g0734300 [Oryza sativa Japonica Group]
gb|AAT78791.1| putative roteinase inhibitor [Oryza sativa Japonica Group]
gb|ABF98725.1| type II proteinase inhibitor family protein, expressed [Oryza
sativa Japonica Group]
dbj|BAF13093.1| Os03g0734300 [Oryza sativa Japonica Group]
gb|EAY91770.1| hypothetical protein OsI_13414 [Oryza sativa Indica Group]
gb|EAZ28490.1| hypothetical protein OsJ_12473 [Oryza sativa Japonica Group]
dbj|BAG97220.1| unnamed protein product [Oryza sativa Japonica Group]
Length=82

Score = 81.3 bits (199),  Expect = 9e-18
Identities = 38/81 (47%), Positives = 56/81 (69%), Gaps = 5/81 (6%)
Frame = -3

Query  367  ALKVGTILLVFVCGAILSGGNLKNVDAQ--KICPQFCYDSVAYMTCPSSGDQHLTPKCNC  194
++K+   + + +CG ++ G ++++ +AQ  K CPQFCYD + YMTCPS+G QHL P CNC
Sbjct  3    SIKLALPMALLLCGLMVIG-SIQSAEAQGGKFCPQFCYDGLEYMTCPSTGSQHLKPACNC  61

Query  193  CLA-STGCILYEADGTPI-CT  137
C+A   GC+LY  +G  I CT
Sbjct  62   CIAGEKGCVLYLNNGQVINCT  82


>ref|XP_002263203.1| PREDICTED: uncharacterized protein LOC100267991 [Vitis vinifera]
emb|CBI29703.3| unnamed protein product [Vitis vinifera]
Length=78

Score = 70.1 bits (170),  Expect = 1e-13
Identities = 34/74 (46%), Positives = 45/74 (61%), Gaps = 2/74 (3%)
Frame = -3

Query  364  LKVGTILLVFVCGAILSGGNLKNVDAQKICPQFCYDSVAYMTCPSSGDQHLTPKCNCCLA  185
+K    +L+ VCG +L G   K+  A K CP +C D V YMTC SSG++ LT  CNCCLA
Sbjct  4    MKFSVFVLLLVCGVVLLGETSKSFGA-KACPLYCLD-VDYMTCVSSGEEKLTAPCNCCLA  61

Query  184  STGCILYEADGTPI  143
C L+  DG+ +
Sbjct  62   PKQCTLHLVDGSEV  75


>ref|NP_001105983.1| LOC100037813 precursor [Zea mays]
gb|ABN54444.1| putative serine type endopeptidase inhibitor [Zea mays]
Length=78

Score = 64.7 bits (156),  Expect = 1e-11
Identities = 29/64 (45%), Positives = 40/64 (63%), Gaps = 3/64 (5%)
Frame = -3

Query  340  VFVCGAILSGGNLKNVDAQKICPQFCYDSVAYMTCPSSGDQHLTPKCNCCLASTGCILYE  161
+ + G +L G   + +D    CPQFC D V Y+TCPSSG + L  +CNCC+   GC L+
Sbjct  13   LLLIGVVLLGQ--QGIDGAVACPQFCLD-VDYVTCPSSGSEKLPARCNCCMTPKGCTLHL  69

Query  160  ADGT  149
+DGT
Sbjct  70   SDGT  73


>ref|XP_003566353.1| PREDICTED: uncharacterized protein LOC100841120 [Brachypodium
distachyon]
Length=79

Score = 63.9 bits (154),  Expect = 3e-11
Identities = 32/70 (46%), Positives = 43/70 (61%), Gaps = 3/70 (4%)
Frame = -3

Query  358  VGTILLVFVCGAILSGGNLKNVDAQKICPQFCYDSVAYMTCPSSGDQHLTPKCNCCLAST  179
+   LL+F  GA+L G + K       CPQ+C + V Y TCPSSG + L  +CNCC+A
Sbjct  8    IACALLLF--GAVLLGQDGKAGMEAVACPQYCLE-VEYTTCPSSGSEKLPARCNCCMAPK  64

Query  178  GCILYEADGT  149
GC L+ +DGT
Sbjct  65   GCTLHLSDGT  74


>ref|XP_002521828.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
gb|EEF40638.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
Length=75

Score = 63.5 bits (153),  Expect = 4e-11
Identities = 29/60 (48%), Positives = 36/60 (60%), Gaps = 1/60 (2%)
Frame = -3

Query  322  ILSGGNLKNVDAQKICPQFCYDSVAYMTCPSSGDQHLTPKCNCCLASTGCILYEADGTPI  143
+L G       + K CP +C D V YMTC SSGD+ L P CNCCLA   C L+ +DGT +
Sbjct  12   LLYGAISLQATSGKACPLYCLD-VEYMTCQSSGDEKLNPSCNCCLAPKNCTLHLSDGTSL  70


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 186386003
Number of extensions: 3776373
Number of successful extensions: 6771
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 6759
Number of HSP's successfully gapped: 0
Length of query: 369
Length of database: 6150218869
Length adjustment: 89
Effective length of query: 280
Effective length of database: 4555420393
Effective search space: 154884293362
Effective search space used: 154884293362
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE32YJ101N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_1550

Length=859


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003546558.1|  PREDICTED: CASP-like protein 9-like [Glyc...   271    6e-88
ref|XP_003535007.1|  PREDICTED: LOW QUALITY PROTEIN: CASP-like...   256    4e-82
ref|NP_001241476.1|  CASP-like protein 9 [Glycine max] >sp|C6T...   233    6e-73
ref|XP_003531310.1|  PREDICTED: CASP-like protein 9-like [Glyc...   230    6e-72
ref|XP_003630011.1|  hypothetical protein MTR_8g089300 [Medica...   229    9e-72

ALIGNMENTS
>ref|XP_003546558.1| PREDICTED: CASP-like protein 9-like [Glycine max]
Length=201

Score =  271 bits (693),  Expect = 6e-88
Identities = 131/155 (85%), Positives = 145/155 (94%), Gaps = 0/155 (0%)
Frame = +1

Query  40   LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHNLVMIALDILG  219
LVM  NKQTKSFV+ TVG+TP+TATL+AKFNQTPAFVFFVIANGNASLHNLVMI +++LG
Sbjct  45   LVMTFNKQTKSFVVATVGSTPITATLAAKFNQTPAFVFFVIANGNASLHNLVMIVMEVLG  104

Query  220  PQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCNR  399
P+YDYKGLRLALIAILDM+TMALASAGDGAATFMS LG+NGNSHARWDKICDKFE+YCNR
Sbjct  105  PRYDYKGLRLALIAILDMMTMALASAGDGAATFMSELGKNGNSHARWDKICDKFETYCNR  164

Query  400  GGGALIASFIGFILLLIITVMSISKLLKPNRINHA  504
GG ALI SF+GFILL II+VMS+ KLLK NRINHA
Sbjct  165  GGAALIVSFVGFILLFIISVMSVIKLLKSNRINHA  199


>ref|XP_003535007.1| PREDICTED: LOW QUALITY PROTEIN: CASP-like protein 9-like [Glycine
max]
Length=207

Score =  256 bits (655),  Expect = 4e-82
Identities = 129/156 (83%), Positives = 144/156 (92%), Gaps = 1/156 (1%)
Frame = +1

Query  40   LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHN-LVMIALDIL  216
LVMA NKQTKSFV+ TVG+TP+TAT +AKFNQTPAFVFFVIANGNA+LHN LVMIA++IL
Sbjct  50   LVMAFNKQTKSFVVATVGSTPITATFAAKFNQTPAFVFFVIANGNAALHNNLVMIAMEIL  109

Query  217  GPQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCN  396
G +YDYKG RLALIAILDM+TMALAS GDGAATFMS LG+NGNSHA+WDKICDKFE+YC+
Sbjct  110  GTRYDYKGPRLALIAILDMMTMALASDGDGAATFMSELGKNGNSHAKWDKICDKFETYCD  169

Query  397  RGGGALIASFIGFILLLIITVMSISKLLKPNRINHA  504
RG  ALI SF+GFILLLII+VMSI KLLKPNRINHA
Sbjct  170  RGVVALIVSFVGFILLLIISVMSIIKLLKPNRINHA  205


>ref|NP_001241476.1| CASP-like protein 9 [Glycine max]
sp|C6TG62.1|CSPL9_SOYBN RecName: Full=CASP-like protein 9
gb|ACU20814.1| unknown [Glycine max]
Length=194

Score =  233 bits (593),  Expect = 6e-73
Identities = 111/152 (73%), Positives = 131/152 (86%), Gaps = 0/152 (0%)
Frame = +1

Query  40   LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHNLVMIALDILG  219
LVMA NKQTK  V+ T+G  P+T TL+A F  TPAF+FFVI N  AS +NL++I ++ILG
Sbjct  43   LVMAFNKQTKGMVVATIGTNPVTITLTAMFQHTPAFIFFVIVNAIASFYNLLVIGVEILG  102

Query  220  PQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCNR  399
PQYDYKGLRL LIAILD++TMALA+ GDGAATFM+ LGRNGNSHARWDKICDKFE+YCNR
Sbjct  103  PQYDYKGLRLGLIAILDVMTMALAATGDGAATFMAELGRNGNSHARWDKICDKFEAYCNR  162

Query  400  GGGALIASFIGFILLLIITVMSISKLLKPNRI  495
GG AL+ASF+G ILLL++TVMSI+KLLK NRI
Sbjct  163  GGVALVASFVGLILLLVVTVMSITKLLKLNRI  194


>ref|XP_003531310.1| PREDICTED: CASP-like protein 9-like [Glycine max]
Length=194

Score =  230 bits (586),  Expect = 6e-72
Identities = 111/152 (73%), Positives = 130/152 (86%), Gaps = 0/152 (0%)
Frame = +1

Query  40   LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHNLVMIALDILG  219
LVMA NKQTKS V+ T+G  P+T TL+A F  TPAF FFVI N  AS +N+V+I ++ILG
Sbjct  43   LVMAFNKQTKSMVVATIGTNPVTITLTAMFQHTPAFTFFVIVNAIASFYNMVVIGVEILG  102

Query  220  PQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCNR  399
PQYDYK LRL LIAILD++TMALA+ GDGAATFM+ LGRNGNSHARWDKICDKFE+YCNR
Sbjct  103  PQYDYKELRLGLIAILDVMTMALAATGDGAATFMAELGRNGNSHARWDKICDKFEAYCNR  162

Query  400  GGGALIASFIGFILLLIITVMSISKLLKPNRI  495
GG ALIASF+G ILLL++TVMSI+K+LK NRI
Sbjct  163  GGVALIASFVGLILLLVVTVMSITKMLKLNRI  194


>ref|XP_003630011.1| hypothetical protein MTR_8g089300 [Medicago truncatula]
gb|AET04487.1| hypothetical protein MTR_8g089300 [Medicago truncatula]
Length=194

Score =  229 bits (585),  Expect = 9e-72
Identities = 112/151 (74%), Positives = 133/151 (88%), Gaps = 0/151 (0%)
Frame = +1

Query  40   LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHNLVMIALDILG  219
LVM+LNKQTK+FV+ T+G+TP+T  L+AKF  TPAFV+FV+ NG  SLHNLVMIA+ ILG
Sbjct  43   LVMSLNKQTKTFVVATIGSTPITVPLTAKFQHTPAFVYFVVPNGIVSLHNLVMIAMYILG  102

Query  220  PQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCNR  399
P++  KGL+LALIA+ D + +ALAS+GDGAAT MS LGRNGNSHA+W+KICDKFESYCNR
Sbjct  103  PKFHNKGLQLALIAVFDTMALALASSGDGAATAMSELGRNGNSHAKWNKICDKFESYCNR  162

Query  400  GGGALIASFIGFILLLIITVMSISKLLKPNR  492
GGG+LIASFIG ILLLIITVMSI+KLLK NR
Sbjct  163  GGGSLIASFIGLILLLIITVMSINKLLKLNR  193


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 404990501
Number of extensions: 7972247
Number of successful extensions: 19031
Number of sequences better than 1e-10: 2
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 19025
Number of HSP's successfully gapped: 2
Length of query: 859
Length of database: 6150218869
Length adjustment: 139
Effective length of query: 720
Effective length of database: 3659466193
Effective search space: 537941530371
Effective search space used: 537941530371
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 176 (72.4 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEF6YZM013


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_15676

Length=357


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002303808.1|  predicted protein [Populus trichocarpa] >...  89.7    2e-21
dbj|BAC98492.1|  AG-motif binding protein-2 [Nicotiana tabacum]    94.0    4e-21
ref|XP_002442575.1|  hypothetical protein SORBIDRAFT_08g022276...  91.3    4e-20
dbj|BAE99493.1|  GATA transcription factor 1 [Arabidopsis thal...  88.2    6e-20
ref|XP_003579113.1|  PREDICTED: GATA transcription factor 2-li...  90.9    6e-20

ALIGNMENTS
>ref|XP_002303808.1| predicted protein [Populus trichocarpa]
gb|EEE78787.1| predicted protein [Populus trichocarpa]
Length=55

Score = 89.7 bits (221),  Expect = 2e-21
Identities = 40/44 (91%), Positives = 42/44 (95%), Gaps = 0/44 (0%)
Frame = -1

Query  357  RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSH  226
RAGP GPKTLCNACGVRYKSGRL+PEYRPANSPTFSS +HSNSH
Sbjct  3    RAGPDGPKTLCNACGVRYKSGRLVPEYRPANSPTFSSKLHSNSH  46


>dbj|BAC98492.1| AG-motif binding protein-2 [Nicotiana tabacum]
Length=289

Score = 94.0 bits (232),  Expect = 4e-21
Identities = 56/77 (73%), Positives = 63/77 (82%), Gaps = 6/77 (8%)
Frame = -1

Query  357  RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSHrkvvemrrkkevVPGE  178
RAGP+GPKTLCNACGVRYKSGRLLPEYRPANSPTFS  VHSNSHRKV+EMR++K    G
Sbjct  219  RAGPLGPKTLCNACGVRYKSGRLLPEYRPANSPTFSPTVHSNSHRKVLEMRKQKI---GV  275

Query  177  EGDMVNVAVVSCGYSLG  127
G M++ A   CGY +G
Sbjct  276  GGMMIHEA---CGYRVG  289


>ref|XP_002442575.1| hypothetical protein SORBIDRAFT_08g022276 [Sorghum bicolor]
gb|EES16413.1| hypothetical protein SORBIDRAFT_08g022276 [Sorghum bicolor]
Length=306

Score = 91.3 bits (225),  Expect = 4e-20
Identities = 40/44 (91%), Positives = 42/44 (95%), Gaps = 0/44 (0%)
Frame = -1

Query  357  RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSH  226
RAGP+GPKTLCNACGVRYKSGRLLPEYRPANSPTF S +HSNSH
Sbjct  245  RAGPLGPKTLCNACGVRYKSGRLLPEYRPANSPTFMSCIHSNSH  288


>dbj|BAE99493.1| GATA transcription factor 1 [Arabidopsis thaliana]
Length=134

Score = 88.2 bits (217),  Expect = 6e-20
Identities = 38/44 (86%), Positives = 42/44 (95%), Gaps = 0/44 (0%)
Frame = -1

Query  357  RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSH  226
RAGP GPKTLCNACGVRYKSGRL+PEYRPANSPTF++ +HSNSH
Sbjct  68   RAGPAGPKTLCNACGVRYKSGRLVPEYRPANSPTFTAELHSNSH  111


>ref|XP_003579113.1| PREDICTED: GATA transcription factor 2-like [Brachypodium distachyon]
Length=321

Score = 90.9 bits (224),  Expect = 6e-20
Identities = 40/44 (91%), Positives = 42/44 (95%), Gaps = 0/44 (0%)
Frame = -1

Query  357  RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSH  226
R GP+GPKTLCNACGVRYKSGRLLPEYRPANSPTFSS +HSNSH
Sbjct  263  RTGPLGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSYMHSNSH  306


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  May 9, 2012  2:26 PM
Number of letters in database: 6,200,364,692
Number of sequences in database:  18,076,563

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 18076563
Number of Hits to DB: 151315773
Number of extensions: 2768988
Number of successful extensions: 5604
Number of sequences better than 1e-10: 6
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 5604
Number of HSP's successfully gapped: 6
Length of query: 357
Length of database: 6200364692
Length adjustment: 86
Effective length of query: 271
Effective length of database: 4645780274
Effective search space: 153310749042
Effective search space used: 153310749042
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEF903P01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_16638

Length=336


Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|AAO86692.1|  small blue copper protein Bcp1 [Paraboea crass...   119    2e-31
gb|ADV57644.1|  copper binding protein 9 [Gossypium hirsutum]       107    4e-27
emb|CBI33124.3|  unnamed protein product [Vitis vinifera]           105    4e-26
ref|XP_002265268.1|  PREDICTED: uncharacterized protein LOC100...   105    8e-26
ref|XP_002271669.2|  PREDICTED: blue copper protein [Vitis vin...   103    1e-25

ALIGNMENTS
>gb|AAO86692.1| small blue copper protein Bcp1 [Paraboea crassifolia]
Length=201

Score =  119 bits (299),  Expect = 2e-31
Identities = 58/91 (64%), Positives = 69/91 (76%), Gaps = 0/91 (0%)
Frame = -1

Query  333  GSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPMA  154
G   A +I SWLSGRVFRVGDK+ F+   T D+IVEL+  +E  +CDL NPIRMY D
Sbjct  38   GWNSASNISSWLSGRVFRVGDKLWFSVPATADSIVELQSLEELATCDLRNPIRMYADGSN  97

Query  153  HVVLEKEGTRYFTSKNPESCKNGLKLPVSVQ  61
HV L+KEGTRYF+S N ESCKNG+KLPV+VQ
Sbjct  98   HVTLDKEGTRYFSSGNLESCKNGMKLPVTVQ  128


>gb|ADV57644.1| copper binding protein 9 [Gossypium hirsutum]
Length=149

Score =  107 bits (266),  Expect = 4e-27
Identities = 50/91 (55%), Positives = 64/91 (70%), Gaps = 0/91 (0%)
Frame = -1

Query  336  RGSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPM  157
RG  P+ D+ SW SGR+FRVGDKI F  S  +++IVE++  DE+ SCD+ NPIRMYT  +
Sbjct  33   RGWDPSFDVASWSSGRIFRVGDKICFPYSAAQESIVEVKSKDEYESCDVGNPIRMYTVGL  92

Query  156  AHVVLEKEGTRYFTSKNPESCKNGLKLPVSV  64
+ L+ EG RYF S  PESCK GLKL V +
Sbjct  93   DGIELDGEGIRYFMSSKPESCKKGLKLRVEL  123


>emb|CBI33124.3| unnamed protein product [Vitis vinifera]
Length=186

Score =  105 bits (262),  Expect = 4e-26
Identities = 49/92 (53%), Positives = 68/92 (74%), Gaps = 0/92 (0%)
Frame = -1

Query  336  RGSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPM  157
RG   + D+++WLS +VFRVGDKI F  SG ++ +VEL+  +EF SCD++NPIR YT+ +
Sbjct  31   RGWDTSSDVQAWLSNKVFRVGDKIWFIYSGGQEGVVELKSREEFDSCDVSNPIRTYTEGL  90

Query  156  AHVVLEKEGTRYFTSKNPESCKNGLKLPVSVQ  61
V++  EG RYFTS  P+SCK+GL+L V VQ
Sbjct  91   DAVLMGSEGIRYFTSSKPKSCKDGLRLLVEVQ  122


>ref|XP_002265268.1| PREDICTED: uncharacterized protein LOC100255445 [Vitis vinifera]
Length=224

Score =  105 bits (262),  Expect = 8e-26
Identities = 49/92 (53%), Positives = 68/92 (74%), Gaps = 0/92 (0%)
Frame = -1

Query  336  RGSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPM  157
RG   + D+++WLS +VFRVGDKI F  SG ++ +VEL+  +EF SCD++NPIR YT+ +
Sbjct  31   RGWDTSSDVQAWLSNKVFRVGDKIWFIYSGGQEGVVELKSREEFDSCDVSNPIRTYTEGL  90

Query  156  AHVVLEKEGTRYFTSKNPESCKNGLKLPVSVQ  61
V++  EG RYFTS  P+SCK+GL+L V VQ
Sbjct  91   DAVLMGSEGIRYFTSSKPKSCKDGLRLLVEVQ  122


>ref|XP_002271669.2| PREDICTED: blue copper protein [Vitis vinifera]
Length=171

Score =  103 bits (257),  Expect = 1e-25
Identities = 49/92 (53%), Positives = 63/92 (68%), Gaps = 0/92 (0%)
Frame = -1

Query  336  RGSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPM  157
RG   + ++  WLS +VFRVGDKI F  S  ++ + EL   +EF SCD++NPI+MYTD +
Sbjct  31   RGWAKSSEVRDWLSDKVFRVGDKIWFIYSAAQEGVAELRSKEEFESCDVSNPIKMYTDGL  90

Query  156  AHVVLEKEGTRYFTSKNPESCKNGLKLPVSVQ  61
V L+ EG RYFTS   ESCK+GLKL V VQ
Sbjct  91   DSVPLDGEGIRYFTSSKTESCKDGLKLHVDVQ  122


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 194106224
Number of extensions: 4248433
Number of successful extensions: 12638
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 12607
Number of HSP's successfully gapped: 0
Length of query: 336
Length of database: 6150218869
Length adjustment: 80
Effective length of query: 256
Effective length of database: 4716692149
Effective search space: 150934148768
Effective search space used: 150934148768
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE383FU01N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_2050

Length=814


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002266452.2|  PREDICTED: disease resistance response pr...   158    2e-44
ref|XP_002323339.1|  predicted protein [Populus trichocarpa] >...   152    5e-42
gb|ABK93789.1|  unknown [Populus trichocarpa]                       152    8e-42
ref|XP_002880356.1|  hypothetical protein ARALYDRAFT_480959 [A...   146    1e-39
ref|NP_850009.1|  disease resistance-responsive, dirigent doma...   145    2e-39

ALIGNMENTS
>ref|XP_002266452.2| PREDICTED: disease resistance response protein 206-like [Vitis
vinifera]
emb|CAN62170.1| hypothetical protein VITISV_027159 [Vitis vinifera]
emb|CBI17737.3| unnamed protein product [Vitis vinifera]
Length=182

Score =  158 bits (399),  Expect = 2e-44
Identities = 85/160 (53%), Positives = 111/160 (69%), Gaps = 3/160 (2%)
Frame = -2

Query  690  SQRSKNDFGFWPRPKITKLLFYFYDKPSSPDATAEVIVNGTTPFGFGTVVMIDDALLKTA  511
SQ+   +       K++ L FYF+D  S  + TA  I  G     FG  +M+DDAL +
Sbjct  26   SQQFAEEIATMRLEKVSHLHFYFHDILSGKNPTATQIA-GPKKGHFGVTMMVDDALTEGP  84

Query  510  NKKSKIIGRAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVLQTSGRELP  331
SK++GRAQG YA++ Q+  ALLMV+N+ F EG +YNGSS+SVLGRNPV+    RE+P
Sbjct  85   EPSSKLLGRAQGLYALSAQQEPALLMVMNFAFMEG-KYNGSSISVLGRNPVMHAV-REMP  142

Query  330  IVGGTGVFRFARGFALASTKWFDPRTGDAIVEYKVTVLHF  211
IVGG+G+FR+ARG+ALA T WFD +TGDAIVEY V+VLHF
Sbjct  143  IVGGSGLFRYARGYALAHTVWFDGKTGDAIVEYNVSVLHF  182


>ref|XP_002323339.1| predicted protein [Populus trichocarpa]
gb|EEF05100.1| predicted protein [Populus trichocarpa]
Length=178

Score =  152 bits (383),  Expect = 5e-42
Identities = 87/168 (52%), Positives = 109/168 (65%), Gaps = 7/168 (4%)
Frame = -2

Query  702  QQFSSQRSKNDFGFWPRPKITKLLFYFYDKPSSPDATAEVI----VNGTTPFGFGTVVMI  535
Q+F+   S    G   + K++ L FYF+D  S  + TA  I    +  T+  GFG V MI
Sbjct  14   QRFTKYLSPATLGL-KKEKLSHLHFYFHDIVSGKNPTAVRIARADMTNTSSTGFGMVAMI  72

Query  534  DDALLKTANKKSKIIGRAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVL  355
DD L  T    SK++GRAQGFYA A Q    LLM +N+VF EG ++NGS+LSVLGRN V
Sbjct  73   DDPLTMTPELSSKLVGRAQGFYASASQNDVGLLMTMNFVFMEG-KFNGSTLSVLGRNSVF  131

Query  354  QTSGRELPIVGGTGVFRFARGFALASTKWFDPRTGDAIVEYKVTVLHF  211
T  RE+PIVGG+G+FRFARG+A AST  FD  TGDA+VEY V V H+
Sbjct  132  STV-REMPIVGGSGLFRFARGYAQASTHMFDRTTGDAVVEYNVYVFHY  178


>gb|ABK93789.1| unknown [Populus trichocarpa]
Length=194

Score =  152 bits (383),  Expect = 8e-42
Identities = 87/168 (52%), Positives = 109/168 (65%), Gaps = 7/168 (4%)
Frame = -2

Query  702  QQFSSQRSKNDFGFWPRPKITKLLFYFYDKPSSPDATAEVI----VNGTTPFGFGTVVMI  535
Q+F+   S    G   + K++ L FYF+D  S  + TA  I    +  T+  GFG V MI
Sbjct  30   QRFTKYLSPATLGL-KKEKLSHLHFYFHDIVSGKNPTAVRIARADMTNTSSTGFGMVAMI  88

Query  534  DDALLKTANKKSKIIGRAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVL  355
DD L  T    SK++GRAQGFYA A Q    LLM +N+VF EG ++NGS+LSVLGRN V
Sbjct  89   DDPLTMTPELSSKLVGRAQGFYASASQNDVGLLMTMNFVFMEG-KFNGSTLSVLGRNSVF  147

Query  354  QTSGRELPIVGGTGVFRFARGFALASTKWFDPRTGDAIVEYKVTVLHF  211
T  RE+PIVGG+G+FRFARG+A AST  FD  TGDA+VEY V V H+
Sbjct  148  STV-REMPIVGGSGLFRFARGYAQASTHMFDRTTGDAVVEYNVYVFHY  194


>ref|XP_002880356.1| hypothetical protein ARALYDRAFT_480959 [Arabidopsis lyrata subsp.
lyrata]
gb|EFH56615.1| hypothetical protein ARALYDRAFT_480959 [Arabidopsis lyrata subsp.
lyrata]
Length=187

Score =  146 bits (368),  Expect = 1e-39
Identities = 73/151 (48%), Positives = 104/151 (69%), Gaps = 6/151 (4%)
Frame = -2

Query  654  RPKITKLLFYFYDKPSSPDATAEVIVNGT----TPFGFGTVVMIDDALLKTANKKSKIIG  487
+ K+T L FYF+D  S  + TA  +  GT    +P  FG V M+DDAL +TA+ KSK++G
Sbjct  39   KEKVTNLQFYFHDTLSGKNPTAVKVAQGTDTDKSPTLFGAVFMVDDALTETADPKSKLVG  98

Query  486  RAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVLQTSGRELPIVGGTGVF  307
RAQG Y  + ++   L+M +++ F +G  Y  S++S++G+N  +    RE+PIVGGTG+F
Sbjct  99   RAQGLYGSSCKEEVGLIMAMSFCFEDGP-YKDSTISMIGKNSAMNPI-REMPIVGGTGMF  156

Query  306  RFARGFALASTKWFDPRTGDAIVEYKVTVLH  214
R ARG+A+A T WFDP+TGDAIV Y VTV+H
Sbjct  157  RMARGYAIAKTNWFDPKTGDAIVGYNVTVVH  187


>ref|NP_850009.1| disease resistance-responsive, dirigent domain-containing protein
[Arabidopsis thaliana]
gb|AAO64191.1| putative disease resistance response protein/dirigent protein
[Arabidopsis thaliana]
gb|AAT71988.1| At2g21100 [Arabidopsis thaliana]
dbj|BAF00324.1| putative disease resistance response protein [Arabidopsis thaliana]
gb|AEC07123.1| disease resistance-responsive, dirigent domain-containing protein
[Arabidopsis thaliana]
Length=187

Score =  145 bits (367),  Expect = 2e-39
Identities = 72/151 (48%), Positives = 104/151 (69%), Gaps = 6/151 (4%)
Frame = -2

Query  654  RPKITKLLFYFYDKPSSPDATAEVIVNGT----TPFGFGTVVMIDDALLKTANKKSKIIG  487
+ K+T L FYF+D  S  + TA  +  GT    +P  FG V M+DDAL +TA+ KSK++G
Sbjct  39   KDKVTNLQFYFHDTLSGKNPTAVKVAQGTDTEKSPTLFGAVFMVDDALTETADPKSKLVG  98

Query  486  RAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVLQTSGRELPIVGGTGVF  307
RAQG Y  + ++   L+M +++ F +G  Y  S++S++G+N  +    RE+PIVGGTG+F
Sbjct  99   RAQGLYGSSCKEEVGLIMAMSFCFEDGP-YKDSTISMIGKNSAMNPI-REMPIVGGTGMF  156

Query  306  RFARGFALASTKWFDPRTGDAIVEYKVTVLH  214
R ARG+A+A T WFDP+TGDAIV Y VT++H
Sbjct  157  RMARGYAIARTNWFDPKTGDAIVGYNVTIMH  187


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 423147972
Number of extensions: 9121653
Number of successful extensions: 24229
Number of sequences better than 1e-10: 23
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 24052
Number of HSP's successfully gapped: 23
Length of query: 814
Length of database: 6150218869
Length adjustment: 138
Effective length of query: 676
Effective length of database: 3677385277
Effective search space: 489092241841
Effective search space used: 489092241841
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 176 (72.4 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE3E3ET012


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_2072

Length=813


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002263257.2|  PREDICTED: BRASSINOSTEROID INSENSITIVE 1-...  97.1    8e-21
gb|ABK24079.1|  unknown [Picea sitchensis]                         96.3    9e-21
gb|AAZ91738.1|  leucine rich repeat protein 1 [Nicotiana tabacum]  95.5    2e-20
gb|ABK21187.1|  unknown [Picea sitchensis] >gb|ACN40548.1| unk...  93.6    8e-20
gb|ABK24062.1|  unknown [Picea sitchensis]                         92.4    2e-19

ALIGNMENTS
>ref|XP_002263257.2| PREDICTED: BRASSINOSTEROID INSENSITIVE 1-associated receptor
kinase 1-like [Vitis vinifera]
emb|CBI40693.3| unnamed protein product [Vitis vinifera]
Length=250

Score = 97.1 bits (240),  Expect = 8e-21
Identities = 68/144 (47%), Positives = 87/144 (60%), Gaps = 11/144 (8%)
Frame = -3

Query  643  ALNAFK---DDPTNTTSSLD-------KWFHIDY-IEGDVQRVNIMNRELSGALVKRLGE  497
ALNA K   +DP N   S +       KWFH+       V RV+++N  LSG LV +LG+
Sbjct  31   ALNALKSNLEDPNNVLQSWNATLVNPCKWFHVTRNSHNSVTRVDLVNANLSGQLVPQLGQ  90

Query  496  SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn  317
+QYL LHNN I+G IP ELGNLTNL  LDL  N+LNG IP  LG L  L +L LN+N
Sbjct  91   LTNLQYLELHNNNISGKIPKELGNLTNLVSLDLSMNNLNGTIPDTLGKLTKLRFLRLNNN  150

Query  316  nLSGRVPEVLTRLPNLRTVNLKGN  245
L+G +P  LT +  L+ ++L  N
Sbjct  151  ALTGTIPMSLTAVITLQVLDLSNN  174


>gb|ABK24079.1| unknown [Picea sitchensis]
Length=216

Score = 96.3 bits (238),  Expect = 9e-21
Identities = 71/149 (48%), Positives = 94/149 (63%), Gaps = 12/149 (8%)
Frame = -3

Query  643  ALNAFKD---DPTNTTSSLDK-------WFHIDYIEGD-VQRVNIMNRELSGALVKRLGE  497
AL+AF+    DP N   S D        WFH+   + + V RV++ N  LSG LV  LG
Sbjct  32   ALHAFRRSLLDPDNVLQSWDPTLVNPCTWFHVTCDQNNRVIRVDLGNSNLSGHLVPELGM  91

Query  496  SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn  317
+ +QYL L+ N I G+I  ELGNL NL  LDL+NN L G IP+ LGNLK+L +L +N+N
Sbjct  92   LEHLQYLELYKNNITGNILEELGNLKNLISLDLYNNKLTGEIPRSLGNLKSLVFLRINNN  151

Query  316  nLSGRVPEVLTRLPNLRTVNLKGNPNLCG  230
L+G++P  LT LPNL+ V++  N NLCG
Sbjct  152  MLTGQIPRGLTSLPNLKVVDISSN-NLCG  179


>gb|AAZ91738.1| leucine rich repeat protein 1 [Nicotiana tabacum]
Length=232

Score = 95.5 bits (236),  Expect = 2e-20
Identities = 71/149 (48%), Positives = 93/149 (62%), Gaps = 12/149 (8%)
Frame = -3

Query  643  ALNAFKD---DPTNTTSSLDK-------WFHIDY-IEGDVQRVNIMNRELSGALVKRLGE  497
ALNA K    DP N   S D        WFH+    E  V RV++ N  LSG LV +LG+
Sbjct  34   ALNALKTNLADPNNVLQSWDPTLVNPCTWFHVTCNSENSVTRVDLGNANLSGQLVPQLGQ  93

Query  496  SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn  317
+QYL L++N I+G IP ELGNLTNL  LDL+ N LNG IP  LG L+ L +L LN+N
Sbjct  94   LPNLQYLELYSNNISGRIPFELGNLTNLVSLDLYLNRLNGPIPDTLGKLQKLRFLRLNNN  153

Query  316  nLSGRVPEVLTRLPNLRTVNLKGNPNLCG  230
+L+GR+P +LT + +L+ ++L  N NL G
Sbjct  154  SLNGRIPMLLTTVISLQVLDLSNN-NLTG  181


>gb|ABK21187.1| unknown [Picea sitchensis]
gb|ACN40548.1| unknown [Picea sitchensis]
Length=216

Score = 93.6 bits (231),  Expect = 8e-20
Identities = 66/149 (44%), Positives = 97/149 (65%), Gaps = 12/149 (8%)
Frame = -3

Query  643  ALNAFK---DDPTNTTSSLDK-------WFHIDYIEGD-VQRVNIMNRELSGALVKRLGE  497
AL+AF+    DP N   S D        WFHI   + + V R+++ N  LSG+LV  LG
Sbjct  32   ALHAFRRSLSDPLNVLQSWDPTLVNPCTWFHITCNQDNRVTRIDLGNSNLSGSLVPELGR  91

Query  496  SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn  317
+ +QYL L+ N+I GSIP E GNL +L  +DL+NN++ G IP+ LGNLK+L +L LN+N
Sbjct  92   LEHLQYLELYKNRIGGSIPEEFGNLKSLISMDLYNNNITGEIPRSLGNLKSLVFLRLNNN  151

Query  316  nLSGRVPEVLTRLPNLRTVNLKGNPNLCG  230
+L+G++P  LT++ NL+  ++  N +LCG
Sbjct  152  SLTGQIPRELTKISNLKVSDVSNN-DLCG  179


>gb|ABK24062.1| unknown [Picea sitchensis]
Length=216

Score = 92.4 bits (228),  Expect = 2e-19
Identities = 65/149 (44%), Positives = 97/149 (65%), Gaps = 12/149 (8%)
Frame = -3

Query  643  ALNAFK---DDPTNTTSSLDK-------WFHIDYIEGD-VQRVNIMNRELSGALVKRLGE  497
AL+AF+    DP N   S D        WFHI   + + V R+++ N  LSG+L+  LG
Sbjct  32   ALHAFRRSLSDPLNVLQSWDPTLVNPCTWFHITCNQDNRVTRIDLGNSNLSGSLMPELGR  91

Query  496  SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn  317
+ +QYL L+ N+I GSIP E GNL +L  +DL+NN++ G IP+ LGNLK+L +L LN+N
Sbjct  92   LEHLQYLELYKNRIGGSIPEEFGNLKSLISMDLYNNNITGEIPRSLGNLKSLVFLRLNNN  151

Query  316  nLSGRVPEVLTRLPNLRTVNLKGNPNLCG  230
+L+G++P  LT++ NL+  ++  N +LCG
Sbjct  152  SLTGQIPRELTKISNLKVSDVSNN-DLCG  179


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 378401929
Number of extensions: 7587992
Number of successful extensions: 21346
Number of sequences better than 1e-10: 25
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 19932
Number of HSP's successfully gapped: 25
Length of query: 813
Length of database: 6150218869
Length adjustment: 138
Effective length of query: 675
Effective length of database: 3677385277
Effective search space: 489092241841
Effective search space used: 489092241841
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 176 (72.4 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEGC8UN01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_20761

Length=260


Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|ACB12048.1|  pathogenesis-related protein [Rehmannia glutin...  76.3    1e-15
gb|ABR10301.1|  pathogen-related protein STH-2 [Salvia miltior...  71.2    8e-14
ref|XP_002271428.2|  PREDICTED: pathogenesis-related protein S...  63.9    3e-11

ALIGNMENTS
>gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa]
Length=154

Score = 76.3 bits (186),  Expect = 1e-15
Identities = 37/66 (56%), Positives = 45/66 (68%), Gaps = 0/66 (0%)
Frame = +1

Query  16   KIESFVYDVKFEEASGGGCTIKIVNEYNTKGDAALKDGDIKETVDRTKGFYVAAEAYLIA  195
KIES  Y+ KFE++S GGC  KIV EY+TKGD  LK+  +K   D+  GFY  +E YL A
Sbjct  89   KIESIHYEQKFEDSSDGGCVAKIVCEYHTKGDIQLKEEGVKAINDQALGFYTLSEEYLHA  148

Query  196  NSNVCA  213
N NVCA
Sbjct  149  NPNVCA  154


>gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza]
Length=160

Score = 71.2 bits (173),  Expect = 8e-14
Identities = 30/66 (45%), Positives = 44/66 (67%), Gaps = 0/66 (0%)
Frame = +1

Query  16   KIESFVYDVKFEEASGGGCTIKIVNEYNTKGDAALKDGDIKETVDRTKGFYVAAEAYLIA  195
K+E   YD+KFE+   GGC +K+ +EY+TKG   L D D+K   +++ G Y + E YL+A
Sbjct  95   KLEKICYDMKFEDTEDGGCVVKVTSEYHTKGGYELADEDLKGAKEQSLGMYKSCEDYLLA  154

Query  196  NSNVCA  213
N +VCA
Sbjct  155  NPHVCA  160


>ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera]
emb|CBI22930.3| unnamed protein product [Vitis vinifera]
Length=160

Score = 63.9 bits (154),  Expect = 3e-11
Identities = 28/61 (46%), Positives = 42/61 (69%), Gaps = 0/61 (0%)
Frame = +1

Query  16   KIESFVYDVKFEEASGGGCTIKIVNEYNTKGDAALKDGDIKETVDRTKGFYVAAEAYLIA  195
++ES VY++KFEE+  GGC  K  +EY+TKG+  +K+  I+E  ++  G Y   EAYL+A
Sbjct  95   QLESIVYEMKFEESGDGGCICKTRSEYHTKGEFEIKEESIREGKEKAMGVYKLVEAYLLA  154

Query  196  N  198
N
Sbjct  155  N  155


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 131337325
Number of extensions: 2704680
Number of successful extensions: 6378
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 6376
Number of HSP's successfully gapped: 0
Length of query: 260
Length of database: 6150218869
Length adjustment: 56
Effective length of query: 204
Effective length of database: 5146750165
Effective search space: 154402504950
Effective search space used: 154402504950
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEH7DTE01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_22366

Length=240


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003536253.1|  PREDICTED: uncharacterized protein At4g14...  70.1    5e-14
gb|ABK93173.1|  unknown [Populus trichocarpa]                      67.8    5e-13
ref|XP_002284531.1|  PREDICTED: uncharacterized protein At4g14...  67.4    7e-13
emb|CAN71216.1|  hypothetical protein VITISV_033484 [Vitis vin...  67.4    7e-13
emb|CBI23583.3|  unnamed protein product [Vitis vinifera]          67.4    3e-11

ALIGNMENTS
>ref|XP_003536253.1| PREDICTED: uncharacterized protein At4g14450, chloroplastic-like
[Glycine max]
Length=95

Score = 70.1 bits (170),  Expect = 5e-14
Identities = 38/74 (51%), Positives = 44/74 (59%), Gaps = 7/74 (9%)
Frame = -3

Query  238  ASLQISPVTEWNVAIPLLSPLVSPTSEFNPTVEIKTSCSGGNKEAGAAEKPVVVMKNWQH  59
+SLQI+   EWNVAIPLLSPL S      P +E+K       +EA   EK  V  K WQH
Sbjct  22   SSLQINRAVEWNVAIPLLSPLASSP----PPMELKPPQEPPQREA---EKVTVSFKKWQH  74

Query  58   PAAPFCYDATPFRP  17
PAAPFCY+  P  P
Sbjct  75   PAAPFCYEPAPMVP  88


>gb|ABK93173.1| unknown [Populus trichocarpa]
Length=105

Score = 67.8 bits (164),  Expect = 5e-13
Identities = 41/80 (51%), Positives = 51/80 (64%), Gaps = 12/80 (15%)
Frame = -3

Query  238  ASLQISPVTE--WNVAIPLLSPLV-SPTSEFNPTVEIKTSC---SGGNKEAGAAEKPVVV  77
ASLQISP +   WN AIPLLSPL+ SPT+     +++K+     S    +    EKPVV
Sbjct  26   ASLQISPASSSSWNAAIPLLSPLITSPTA-----MDMKSRDDPPSPPRIQVTEGEKPVV-  79

Query  76   MKNWQHPAAPFCYDATPFRP  17
K WQHPAAPFCY+  PF+P
Sbjct  80   FKKWQHPAAPFCYELAPFKP  99


>ref|XP_002284531.1| PREDICTED: uncharacterized protein At4g14450, chloroplastic-like
[Vitis vinifera]
Length=104

Score = 67.4 bits (163),  Expect = 7e-13
Identities = 42/80 (53%), Positives = 49/80 (61%), Gaps = 8/80 (10%)
Frame = -3

Query  238  ASLQISPVTEWNVAIPLLSPL-VSPTSE--FNPTVEIKTSCSGGNKEAGAAEKPVVVMKN  68
+SLQISP  +W VAIPLLSPL  SP+S    + T E+K        + G  EKP  V K
Sbjct  27   SSLQISPAADWKVAIPLLSPLATSPSSPKLIDRTAEVKPKEEPRQMKEG--EKP--VFKK  82

Query  67   WQHPAAPFCYDATPF-RPFV  11
WQHPAAPFCY+   F R FV
Sbjct  83   WQHPAAPFCYEPASFVRSFV  102


>emb|CAN71216.1| hypothetical protein VITISV_033484 [Vitis vinifera]
Length=104

Score = 67.4 bits (163),  Expect = 7e-13
Identities = 42/80 (53%), Positives = 49/80 (61%), Gaps = 8/80 (10%)
Frame = -3

Query  238  ASLQISPVTEWNVAIPLLSPL-VSPTSE--FNPTVEIKTSCSGGNKEAGAAEKPVVVMKN  68
+SLQISP  +W VAIPLLSPL  SP+S    + T E+K        + G  EKP  V K
Sbjct  27   SSLQISPAADWKVAIPLLSPLATSPSSPKLIDRTAEVKPKEEPRQTKEG--EKP--VFKK  82

Query  67   WQHPAAPFCYDATPF-RPFV  11
WQHPAAPFCY+   F R FV
Sbjct  83   WQHPAAPFCYEPASFVRSFV  102


>emb|CBI23583.3| unnamed protein product [Vitis vinifera]
Length=1287

Score = 67.4 bits (163),  Expect = 3e-11
Identities = 42/80 (53%), Positives = 49/80 (61%), Gaps = 8/80 (10%)
Frame = -3

Query  238   ASLQISPVTEWNVAIPLLSPL-VSPTSE--FNPTVEIKTSCSGGNKEAGAAEKPVVVMKN  68
+SLQISP  +W VAIPLLSPL  SP+S    + T E+K        + G  EKP  V K
Sbjct  1210  SSLQISPAADWKVAIPLLSPLATSPSSPKLIDRTAEVKPKEEPRQMKEG--EKP--VFKK  1265

Query  67    WQHPAAPFCYDATPF-RPFV  11
WQHPAAPFCY+   F R FV
Sbjct  1266  WQHPAAPFCYEPASFVRSFV  1285


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 93585762
Number of extensions: 1737241
Number of successful extensions: 6813
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 6801
Number of HSP's successfully gapped: 0
Length of query: 240
Length of database: 6150218869
Length adjustment: 51
Effective length of query: 189
Effective length of database: 5236345585
Effective search space: 151854021965
Effective search space used: 151854021965
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE4758N012


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_2255

Length=796


Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|ACB12048.1|  pathogenesis-related protein [Rehmannia glutin...   147    9e-41
gb|ABR10301.1|  pathogen-related protein STH-2 [Salvia miltior...   145    6e-40
ref|XP_002328585.1|  predicted protein [Populus trichocarpa] >...   141    4e-38
ref|XP_002271428.2|  PREDICTED: pathogenesis-related protein S...   139    1e-37
ref|XP_002273790.2|  PREDICTED: major allergen Pru ar 1 [Vitis...   138    4e-37

ALIGNMENTS
>gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa]
Length=154

Score =  147 bits (372),  Expect = 9e-41
Identities = 80/163 (49%), Positives = 113/163 (69%), Gaps = 10/163 (6%)
Frame = +3

Query  87   MAVTKHTQEITVSISAKRMIKAMVTEASTF-LPKITSGAIQSIVVLNGNGGPGSILQTNF  263
M +TKH QE+ + +SAKRM KA+VTE+ +  LP     AI+SI +L+G+G  G+I +TN
Sbjct  1    MGITKHIQELKLRVSAKRMFKALVTESHSIPLPD----AIKSIEILHGDGSAGTIRKTNL  56

Query  264  SDVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKI  443
+D  G+ V   K RI+A+D +   +K+T++EG  LGDKIES+ ++ KFE+ S GGC  KI
Sbjct  57   AD--GSYV---KIRIEAVDIDNQVSKYTVIEGPMLGDKIESIHYEQKFEDSSDGGCVAKI  111

Query  444  LNEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPNECA  572
+ EY+TKGD+ LK+E +K   D+  GFY  +E YL ANPN CA
Sbjct  112  VCEYHTKGDIQLKEEGVKAINDQALGFYTLSEEYLHANPNVCA  154


>gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza]
Length=160

Score =  145 bits (367),  Expect = 6e-40
Identities = 74/163 (45%), Positives = 109/163 (67%), Gaps = 4/163 (2%)
Frame = +3

Query  87   MAVTKHTQEITVSISAKRMIKAMVTEASTFLPKITSGAIQSIVVLNGNG-GPGSILQTNF  263
M V    QE+   IS+ R+ KA+VTE+   +PK T+ +I+SI ++ G+G  PG+I QTNF
Sbjct  1    MGVKSFFQEMKTKISSSRLFKALVTESPEVVPKFTT-SIKSIELIQGSGYAPGAIFQTNF  59

Query  264  SDVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKI  443
+  GA     K R+D +D EK + K+T++EGD LGDK+E + +D+KFE+   GGC VK+
Sbjct  60   PE--GAHFKYMKCRVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKV  117

Query  444  LNEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPNECA  572
+EY+TKG   L DED+K   +++ G Y + E YL+ANP+ CA
Sbjct  118  TSEYHTKGGYELADEDLKGAKEQSLGMYKSCEDYLLANPHVCA  160


>ref|XP_002328585.1| predicted protein [Populus trichocarpa]
gb|EEE76932.1| predicted protein [Populus trichocarpa]
Length=160

Score =  141 bits (355),  Expect = 4e-38
Identities = 68/159 (43%), Positives = 102/159 (64%), Gaps = 2/159 (1%)
Frame = +3

Query  87   MAVTKHTQEITVSISAKRMIKAMVTEASTFLPKITSGAIQSIVVLNGNGGPGSILQTNFS  266
M V  +TQE T  IS  RM KA++ +++  +PK+    ++S+ +++G+GG GSI Q NF+
Sbjct  1    MGVASYTQEFTCPISPARMFKALILDSNNLIPKLLPQIVKSVDLIHGDGGAGSIEQVNFT  60

Query  267  DVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKIL  446
+  G  +   KHRID LD      K+TM+EGD LG+K+ES+ ++V+FE  S GGC  K+
Sbjct  61   E--GTDIKYVKHRIDELDRVNLVCKYTMIEGDSLGEKLESIAYEVRFEVGSDGGCDCKMT  118

Query  447  NEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPN  563
+ Y   GD  LK+E+IK   D+ +G Y   EAYL+ NP+
Sbjct  119  SSYLMLGDFTLKEEEIKAGQDKARGIYKVVEAYLLENPH  157


>ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera]
emb|CBI22930.3| unnamed protein product [Vitis vinifera]
Length=160

Score =  139 bits (351),  Expect = 1e-37
Identities = 68/160 (43%), Positives = 105/160 (66%), Gaps = 2/160 (1%)
Frame = +3

Query  87   MAVTKHTQEITVSISAKRMIKAMVTEASTFLPKITSGAIQSIVVLNGNGGPGSILQTNFS  266
M VT  TQE    +S  RM KA+V ++   +P++   +++SI  + G+GG GSI QTNFS
Sbjct  1    MGVTTFTQEFVTPVSPARMFKALVVDSHILVPRLVPESVKSIEFVEGDGGAGSITQTNFS  60

Query  267  DVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKIL  446
+     K++I+A+D EK   ++T++EG  LGD++ES+V+++KFEE   GGC  K
Sbjct  61   --GDSDCEYLKYKINAVDKEKLECRYTLIEGGVLGDQLESIVYEMKFEESGDGGCICKTR  118

Query  447  NEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPNE  566
+EY+TKG+  +K+E I+E  ++  G Y   EAYL+ANP+E
Sbjct  119  SEYHTKGEFEIKEESIREGKEKAMGVYKLVEAYLLANPDE  158


>ref|XP_002273790.2| PREDICTED: major allergen Pru ar 1 [Vitis vinifera]
emb|CBI22941.3| unnamed protein product [Vitis vinifera]
Length=158

Score =  138 bits (348),  Expect = 4e-37
Identities = 70/159 (44%), Positives = 103/159 (65%), Gaps = 2/159 (1%)
Frame = +3

Query  87   MAVTKHTQEITVSISAKRMIKAMVTEASTFLPKITSGAIQSIVVLNGNGGPGSILQTNFS  266
M V  +T E+T  + A R+ KA++ EA + LPKI   AI+SI  + GNGGPG+I Q NF+
Sbjct  1    MGVVTYTDELTSPVPAPRLFKALILEADSLLPKIVPQAIKSIETVEGNGGPGTIKQLNFA  60

Query  267  DVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKIL  446
+  G+     KHRID LD EK   K+T++EGD L DKIE + +++ FE    GGC  K +
Sbjct  61   E--GSQFKYVKHRIDELDKEKMIYKYTLIEGDALMDKIEYISYEISFEASPDGGCKSKNV  118

Query  447  NEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPN  563
+ Y++K  V +K+E+IK+  ++    + A EAYL+ANP+
Sbjct  119  SVYHSKPGVEIKEEEIKDGKEKAAAVFKAVEAYLLANPD  157


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 374443729
Number of extensions: 7491674
Number of successful extensions: 19377
Number of sequences better than 1e-10: 33
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 19345
Number of HSP's successfully gapped: 33
Length of query: 796
Length of database: 6150218869
Length adjustment: 138
Effective length of query: 658
Effective length of database: 3677385277
Effective search space: 467027930179
Effective search space used: 467027930179
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 175 (72.0 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE4A0UZ016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_2274

Length=795


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003529742.1|  PREDICTED: uncharacterized protein LOC100...   320    6e-105
ref|XP_003533024.1|  PREDICTED: uncharacterized protein LOC100...   318    6e-104
ref|XP_002276490.1|  PREDICTED: uncharacterized protein LOC100...   312    9e-102
ref|XP_002515894.1|  conserved hypothetical protein [Ricinus c...   311    2e-101
ref|NP_196734.1|  Core-2/I-branching beta-1,6-N-acetylglucosam...   310    6e-101

ALIGNMENTS
>ref|XP_003529742.1| PREDICTED: uncharacterized protein LOC100793448 [Glycine max]
Length=387

Score =  320 bits (821),  Expect = 6e-105
Identities = 148/215 (69%), Positives = 178/215 (83%), Gaps = 1/215 (0%)
Frame = +3

Query  3    VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS  182
VFY RQIPS+V  WG MSMCDAERRLLANA+LD+SNE+FIL+SE+CIPL NFS +Y YI+
Sbjct  159  VFYRRQIPSQVAEWGMMSMCDAERRLLANALLDISNEWFILLSESCIPLQNFSIVYLYIA  218

Query  183  GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV  362
SR SFMGAVD  GP GRGRYD  M+P +N+++WRKG QWFE+NR+LA+ IV D+ YYP
Sbjct  219  RSRYSFMGAVDEPGPYGRGRYDGNMAPEINMSDWRKGSQWFEINRELALRIVEDNTYYPK  278

Query  363  FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF  542
KE+C+PH CYVDEHYFQTMLTI +PHLLANRSLT+VDWSRGG HP  FGKD +I    F
Sbjct  279  LKEFCKPHKCYVDEHYFQTMLTINTPHLLANRSLTYVDWSRGGAHPATFGKD-DIKEEFF  337

Query  543  KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLL  647
KKI  ++ C YNN+ +++CF+FARKF+PNALG LL
Sbjct  338  KKILQDQTCLYNNQPSSLCFLFARKFAPNALGPLL  372


>ref|XP_003533024.1| PREDICTED: uncharacterized protein LOC100819579 [Glycine max]
Length=387

Score =  318 bits (814),  Expect = 6e-104
Identities = 147/215 (68%), Positives = 177/215 (82%), Gaps = 1/215 (0%)
Frame = +3

Query  3    VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS  182
VFY RQIPS+V  WG MSMCDAERRLLANA+LD+SNE+FIL+SE+CIPL NFS +Y YI+
Sbjct  159  VFYRRQIPSQVAEWGMMSMCDAERRLLANALLDISNEWFILLSESCIPLQNFSIVYRYIA  218

Query  183  GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV  362
SR SFMGAVD  GP GRGRYD  M+P +N+++WRKG QWFE+ R+LA+ IV D  YYP
Sbjct  219  HSRYSFMGAVDEPGPYGRGRYDGNMAPEINVSDWRKGSQWFEIKRELALRIVEDRTYYPK  278

Query  363  FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF  542
KE+C+PH CYVDEHYFQTMLTI +PHLLANRSLT+VDWSRGG HP  FGKD +I    F
Sbjct  279  LKEFCRPHKCYVDEHYFQTMLTINTPHLLANRSLTYVDWSRGGAHPATFGKD-DIKEEFF  337

Query  543  KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLL  647
KKI  +++C YNN+ +++CF+FARKF+PNALG LL
Sbjct  338  KKILQDQKCLYNNQPSSLCFLFARKFAPNALGPLL  372


>ref|XP_002276490.1| PREDICTED: uncharacterized protein LOC100266878 [Vitis vinifera]
Length=380

Score =  312 bits (799),  Expect = 9e-102
Identities = 148/224 (66%), Positives = 178/224 (79%), Gaps = 2/224 (1%)
Frame = +3

Query  3    VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS  182
VFY RQIPS+V  WG MSMCDAERRLLANA+LD+ NE+FIL+SE+CIPLHNFS +Y Y+S
Sbjct  159  VFYKRQIPSQVVEWGMMSMCDAERRLLANALLDIDNEWFILLSESCIPLHNFSIVYRYLS  218

Query  183  GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV  362
SR SF+GA D   P GRGRY+  ++P VNLT WRKG QWFEVNRKLAIDIV D+ +YP
Sbjct  219  RSRYSFIGAFDEDSPFGRGRYNPNLAPQVNLTEWRKGSQWFEVNRKLAIDIVGDNTFYPR  278

Query  363  FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF  542
FKE+C+P  CYVDEHYFQTMLTI +PHLLANR+ TWVDWSRGG HP  FG+  +I    F
Sbjct  279  FKEFCRPS-CYVDEHYFQTMLTILAPHLLANRTTTWVDWSRGGAHPATFGQ-ADITKEFF  336

Query  543  KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLLRRSFELFGF  674
KKI +   C YNN+ T++CF+FARKF+P+AL  LL  + E+FG+
Sbjct  337  KKIIEGGTCIYNNQPTSLCFLFARKFAPSALEPLLDLASEVFGY  380


>ref|XP_002515894.1| conserved hypothetical protein [Ricinus communis]
gb|EEF46314.1| conserved hypothetical protein [Ricinus communis]
Length=385

Score =  311 bits (797),  Expect = 2e-101
Identities = 147/224 (66%), Positives = 175/224 (78%), Gaps = 2/224 (1%)
Frame = +3

Query  3    VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS  182
VFY RQIPS++  WG MSMCD ERRLLANA+LD+SNE+FIL+SEACIPLHNFS IY YIS
Sbjct  164  VFYRRQIPSQIVEWGRMSMCDGERRLLANALLDISNEWFILLSEACIPLHNFSIIYRYIS  223

Query  183  GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV  362
SR SFMG+ D   P GRGRY+  M P V L  WRKG QWFEVNR+ A++IV D  YYP
Sbjct  224  RSRHSFMGSFDENSPYGRGRYNWNMQPEVTLEQWRKGSQWFEVNRRFAVNIVEDTTYYPK  283

Query  363  FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF  542
F+++CQP  CYVDEHYF TMLTIQ PHLLANR+LTW DWSRGG HP  FGK  +I    F
Sbjct  284  FRDFCQP-ACYVDEHYFPTMLTIQVPHLLANRTLTWTDWSRGGAHPATFGK-ADITEEFF  341

Query  543  KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLLRRSFELFGF  674
K++F+ + C+YNN+ TTVC++FARKF+P+AL  LL  S ++FGF
Sbjct  342  KRMFEGQSCTYNNQPTTVCYLFARKFAPSALEPLLGLSSKVFGF  385


>ref|NP_196734.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family
protein [Arabidopsis thaliana]
emb|CAB87691.1| putative protein [Arabidopsis thaliana]
gb|AAM13183.1| putative protein [Arabidopsis thaliana]
gb|AAP68292.1| At5g11730 [Arabidopsis thaliana]
dbj|BAE99143.1| hypothetical protein [Arabidopsis thaliana]
gb|AED91715.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family
protein [Arabidopsis thaliana]
Length=386

Score =  310 bits (794),  Expect = 6e-101
Identities = 148/224 (66%), Positives = 179/224 (80%), Gaps = 2/224 (1%)
Frame = +3

Query  3    VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS  182
VF+ RQIPS+V  WG MSMCDAE+RLLANA+LDVSNE+F+LVSE+CIPL+NF+TIYSY+S
Sbjct  165  VFHRRQIPSQVAEWGRMSMCDAEKRLLANALLDVSNEWFVLVSESCIPLYNFTTIYSYLS  224

Query  183  GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV  362
S+ SFMGA D  GP GRGRY+  M P V LT WRKG QWFEVNR LA  IV+D +YYP
Sbjct  225  RSKHSFMGAFDDPGPFGRGRYNGNMEPEVPLTKWRKGSQWFEVNRDLAATIVKDTLYYPK  284

Query  363  FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF  542
FKE+C+P  CYVDEHYF TMLTI+ P +LANRSLTWVDWSRGGPHP  FG+  +I    F
Sbjct  285  FKEFCRP-ACYVDEHYFPTMLTIEKPTVLANRSLTWVDWSRGGPHPATFGR-SDITENFF  342

Query  543  KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLLRRSFELFGF  674
KIFD + CSYN R+T++C++FARKF+P+AL  LL  + ++ GF
Sbjct  343  GKIFDGRNCSYNGRNTSMCYLFARKFAPSALEPLLHIAPKILGF  386


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 425742378
Number of extensions: 9273448
Number of successful extensions: 22362
Number of sequences better than 1e-10: 5
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 22349
Number of HSP's successfully gapped: 5
Length of query: 795
Length of database: 6150218869
Length adjustment: 138
Effective length of query: 657
Effective length of database: 3677385277
Effective search space: 467027930179
Effective search space used: 467027930179
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 175 (72.0 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEHB6WZ01N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
18,076,563 sequences; 6,200,364,692 total letters
Query= TrVeIntMedtrGB1_24566

Length=216


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003544026.1|  PREDICTED: uncharacterized protein LOC100...   106    2e-27
ref|XP_003614382.1|  hypothetical protein MTR_5g050970 [Medica...  80.5    6e-16
ref|XP_003614392.1|  hypothetical protein MTR_5g051110 [Medica...  77.8    2e-15
ref|XP_003614384.1|  hypothetical protein MTR_5g051010 [Medica...  77.8    5e-15
ref|XP_003638451.1|  hypothetical protein MTR_132s0010, partia...  77.8    5e-15

ALIGNMENTS
>ref|XP_003544026.1| PREDICTED: uncharacterized protein LOC100801029 [Glycine max]
Length=167

Score =  106 bits (265),  Expect = 2e-27
Identities = 56/71 (79%), Positives = 56/71 (79%), Gaps = 0/71 (0%)
Frame = -3

Query  214  RLQRILPAARWEL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDL  35
RLQRILPAARWEL FKA   A PP    QRHVPLG   PLL VGKRTAGA VASSPDSDL
Sbjct  75   RLQRILPAARWELRFKASRRANPPHEGHQRHVPLGGRGPLLLVGKRTAGARVASSPDSDL  134

Query  34   EVFSHNPTHGS  2
E FSHNPTHGS
Sbjct  135  EAFSHNPTHGS  145


>ref|XP_003614382.1| hypothetical protein MTR_5g050970 [Medicago truncatula]
gb|AES97340.1| hypothetical protein MTR_5g050970 [Medicago truncatula]
Length=1065

Score = 80.5 bits (197),  Expect = 6e-16
Identities = 40/60 (67%), Positives = 43/60 (72%), Gaps = 0/60 (0%)
Frame = -3

Query  181  EL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDLEVFSHNPTHGS  2
+L FK    A PP  + QRHVPLG   PLL VGKRT G  +ASSPDSDLE FSHNPTHGS
Sbjct  25   KLRFKVSRKAHPPYGVHQRHVPLGGQGPLLLVGKRTTGTRIASSPDSDLEAFSHNPTHGS  84


>ref|XP_003614392.1| hypothetical protein MTR_5g051110 [Medicago truncatula]
gb|AES97350.1| hypothetical protein MTR_5g051110 [Medicago truncatula]
Length=407

Score = 77.8 bits (190),  Expect = 2e-15
Identities = 39/60 (65%), Positives = 42/60 (70%), Gaps = 0/60 (0%)
Frame = -3

Query  181  EL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDLEVFSHNPTHGS  2
+L FK    A PP  + QRHVPLG   PLL VGKRT G  +ASSPD DLE FSHNPTHGS
Sbjct  25   KLRFKVSRKAHPPYEVHQRHVPLGGQGPLLLVGKRTTGTRIASSPDFDLEAFSHNPTHGS  84


>ref|XP_003614384.1| hypothetical protein MTR_5g051010 [Medicago truncatula]
gb|AES97342.1| hypothetical protein MTR_5g051010 [Medicago truncatula]
Length=1153

Score = 77.8 bits (190),  Expect = 5e-15
Identities = 39/60 (65%), Positives = 42/60 (70%), Gaps = 0/60 (0%)
Frame = -3

Query  181  EL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDLEVFSHNPTHGS  2
+L FK    A PP  + QRHVPLG   PLL VGKRT G  +ASSPD DLE FSHNPTHGS
Sbjct  25   KLRFKVSRKAHPPYEVHQRHVPLGGQGPLLLVGKRTTGTRIASSPDFDLEAFSHNPTHGS  84


>ref|XP_003638451.1| hypothetical protein MTR_132s0010, partial [Medicago truncatula]
gb|AES85589.1| hypothetical protein MTR_132s0010, partial [Medicago truncatula]
Length=1458

Score = 77.8 bits (190),  Expect = 5e-15
Identities = 39/60 (65%), Positives = 42/60 (70%), Gaps = 0/60 (0%)
Frame = -3

Query  181  EL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDLEVFSHNPTHGS  2
+L FK    A PP  + QRHVPLG   PLL VGKRT G  +ASSPD DLE FSHNPTHGS
Sbjct  68   KLRFKVSRKAHPPYEVHQRHVPLGGQGPLLLVGKRTTGTRIASSPDFDLEAFSHNPTHGS  127


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 229559892
Number of extensions: 4633336
Number of successful extensions: 9639
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 9636
Number of HSP's successfully gapped: 0
Length of query: 216
Length of database: 6150218869
Length adjustment: 44
Effective length of query: 172
Effective length of database: 5361779173
Effective search space: 150129816844
Effective search space used: 150129816844
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE4N93W01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_2665

Length=766


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003538935.1|  PREDICTED: cytochrome P450 81D1-like [Gly...   311    4e-99
emb|CBI33752.3|  unnamed protein product [Vitis vinifera]           251    7e-79
emb|CBI33755.3|  unnamed protein product [Vitis vinifera]           248    1e-78
emb|CBI33749.3|  unnamed protein product [Vitis vinifera]           244    1e-77
ref|XP_002283502.1|  PREDICTED: isoflavone 2'-hydroxylase-like...   253    2e-77

ALIGNMENTS
>ref|XP_003538935.1| PREDICTED: cytochrome P450 81D1-like [Glycine max]
Length=580

Score =  311 bits (797),  Expect = 4e-99
Identities = 147/197 (75%), Positives = 168/197 (85%), Gaps = 2/197 (1%)
Frame = +3

Query  3    ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA  182
ATTMEWA SLLLNHP+ MNKV+ EIDT VGQD+++N  D +KLKYLQ V+ ETLRLYP A
Sbjct  386  ATTMEWAFSLLLNHPKKMNKVKEEIDTYVGQDQMLNGLDTTKLKYLQNVITETLRLYPVA  445

Query  183  PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI  362
PL+LPHESSNDC VCGFDIP+GTMLLVNLW LHRD NLW DP  FVPERF        E+
Sbjct  446  PLLLPHESSNDCKVCGFDIPRGTMLLVNLWTLHRDANLWVDPAMFVPERF--EGEEADEV  503

Query  363  YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL  542
YNMIPFG+GRR+CPGA LAKR +GHA+G+LIQCFEWERIG +EIDM EGIGLTMPK+EPL
Sbjct  504  YNMIPFGIGRRACPGAVLAKRVMGHALGTLIQCFEWERIGHQEIDMTEGIGLTMPKLEPL  563

Query  543  VALCKPRQVMVEVISNI  593
VALC+PRQ M++V+SNI
Sbjct  564  VALCRPRQSMIKVLSNI  580


>emb|CBI33752.3| unnamed protein product [Vitis vinifera]
Length=316

Score =  251 bits (640),  Expect = 7e-79
Identities = 115/196 (59%), Positives = 156/196 (80%), Gaps = 3/196 (2%)
Frame = +3

Query  3    ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA  182
A TMEWA++LLLNHP+ + K +AE+D  VG+D+L+ ESD  KL+YL+ ++ ETLR++P A
Sbjct  124  AATMEWAMTLLLNHPDVLEKAKAELDMHVGKDRLIEESDLPKLRYLRSIISETLRVFPVA  183

Query  183  PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI  362
PL+LPH SS+DC + GFDIP+GT+LLVN+WALHRDP +W+DPT F PERFE GE
Sbjct  184  PLLLPHMSSDDCQIGGFDIPRGTLLLVNVWALHRDPQVWEDPTSFKPERFENGEREN---  240

Query  363  YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL  542
Y ++PFG+GRR+CPGA LA+R +G A+GSLIQC++W++I +  ID  EG GLTMPK++PL
Sbjct  241  YKLVPFGIGRRACPGAGLAQRVVGLALGSLIQCYDWKKISNTAIDTIEGKGLTMPKLQPL  300

Query  543  VALCKPRQVMVEVISN  590
A+CK R+++ EV  N
Sbjct  301  EAMCKAREIINEVHLN  316


>emb|CBI33755.3| unnamed protein product [Vitis vinifera]
Length=242

Score =  248 bits (632),  Expect = 1e-78
Identities = 113/196 (58%), Positives = 148/196 (76%), Gaps = 2/196 (1%)
Frame = +3

Query  3    ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA  182
A TMEWA+SLLLNHP+ + K + E+DTCVGQ++L+ E+D  KL YLQ ++ ET RL PPA
Sbjct  37   AATMEWAMSLLLNHPDVLKKAKVELDTCVGQERLLEEADLPKLHYLQNIISETFRLCPPA  96

Query  183  PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI  362
PL LPH SS +C + GFDIP+ TMLLVN W LHRDP LW DPT F PERF       GE
Sbjct  97   PLWLPHMSSENCQLGGFDIPRDTMLLVNSWTLHRDPKLWDDPTSFKPERF--EGGERGET  154

Query  363  YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL  542
Y ++PFG GRR+CPG+ LA + +G  +GSLIQC+EWERI ++++DM EG GLTMPK+EPL
Sbjct  155  YKLLPFGTGRRACPGSGLANKVVGLTLGSLIQCYEWERISEKKVDMMEGKGLTMPKMEPL  214

Query  543  VALCKPRQVMVEVISN  590
A+C+  +++ +V+ +
Sbjct  215  EAMCRAYEIVKKVLQD  230


>emb|CBI33749.3| unnamed protein product [Vitis vinifera]
Length=227

Score =  244 bits (624),  Expect = 1e-77
Identities = 115/194 (59%), Positives = 151/194 (78%), Gaps = 3/194 (2%)
Frame = +3

Query  3    ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA  182
ATT+EWA+SLLLNHP+ + K RAE+DT VG+D+L  ESD  KL+YL+ ++ ETLRL+P
Sbjct  37   ATTIEWAMSLLLNHPDVLKKARAELDTHVGKDRLTEESDFPKLQYLRSIISETLRLFPAT  96

Query  183  PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI  362
PL++PH SS++C + GFDIP+GT+LLVN WA+HRDP  WKDPT F PERF   E   GE
Sbjct  97   PLLMPHISSDNCQIGGFDIPRGTILLVNAWAIHRDPKSWKDPTSFKPERF---ENEEGEA  153

Query  363  YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL  542
Y ++PFG+GRR+CPGA LA R IG  +G LIQC+E ER  ++E+DM EG G+TMPK+EPL
Sbjct  154  YKLLPFGLGRRACPGAGLANRVIGLTLGLLIQCYELERASEKEVDMAEGKGVTMPKLEPL  213

Query  543  VALCKPRQVMVEVI  584
A+CK R ++ +V+
Sbjct  214  EAMCKARAIIRKVL  227


>ref|XP_002283502.1| PREDICTED: isoflavone 2'-hydroxylase-like [Vitis vinifera]
Length=508

Score =  253 bits (645),  Expect = 2e-77
Identities = 121/197 (61%), Positives = 150/197 (76%), Gaps = 3/197 (2%)
Frame = +3

Query  3    ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA  182
A T+EWA+SLLLNHPE + K R E+DT +G D L++E+D  KL+YLQ ++ E+LRL+P
Sbjct  311  AVTIEWAMSLLLNHPEVLKKARDELDTHIGHDCLIDETDLPKLQYLQSIISESLRLFPST  370

Query  183  PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI  362
PL++PH S+ DC + GFD+P GTMLLVN WALHRDP LW DPT F PERF   E G  E
Sbjct  371  PLLVPHFSTEDCKLRGFDVPGGTMLLVNAWALHRDPKLWNDPTSFKPERF---ETGESET  427

Query  363  YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL  542
Y ++PFGVGRR+CPG  LA R +G  +GSLIQCF+W+R+ ++EIDM EG GLTMPKVEPL
Sbjct  428  YKLLPFGVGRRACPGIGLANRVMGLTLGSLIQCFDWKRVDEKEIDMAEGQGLTMPKVEPL  487

Query  543  VALCKPRQVMVEVISNI  593
A+CK RQVM  V S I
Sbjct  488  EAMCKTRQVMNNVSSKI  504


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 412414645
Number of extensions: 8587968
Number of successful extensions: 23775
Number of sequences better than 1e-10: 973
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 21750
Number of HSP's successfully gapped: 975
Length of query: 766
Length of database: 6150218869
Length adjustment: 137
Effective length of query: 629
Effective length of database: 3695304361
Effective search space: 436045914598
Effective search space used: 436045914598
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 175 (72.0 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE57CWY01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_3020

Length=744


Score     E
Sequences producing significant alignments:                       (Bits)  Value

dbj|BAD83475.2|  ribosomal protein L16 [Nicotiana tabacum]          295    2e-98
ref|YP_005090456.1|  rpl16 gene product (mitochondrion) [Mille...   295    4e-98
ref|YP_005090475.1|  rpl16 gene product (mitochondrion) [Lotus...   292    4e-97
sp|Q95747.3|RM16_ARATH  RecName: Full=60S ribosomal protein L1...   291    1e-96
ref|YP_173411.1|  ribosomal protein L16 [Nicotiana tabacum]         288    9e-96

ALIGNMENTS
>dbj|BAD83475.2| ribosomal protein L16 [Nicotiana tabacum]
Length=158

Score =  295 bits (755),  Expect = 2e-98
Identities = 154/158 (97%), Positives = 155/158 (98%), Gaps = 0/158 (0%)
Frame = +2

Query  5    VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr  184
VSKCGF IVKK+R VLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYR
Sbjct  1    VSKCGFPIVKKKRDVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYR  60

Query  185  aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV  364
AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV
Sbjct  61   AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV  120

Query  365  STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS  478
S GQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS
Sbjct  121  SRGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS  158


>ref|YP_005090456.1| rpl16 gene product (mitochondrion) [Millettia pinnata]
gb|AET62916.1| ribosomal protein L16 (mitochondrion) [Millettia pinnata]
Length=185

Score =  295 bits (755),  Expect = 4e-98
Identities = 151/158 (96%), Positives = 157/158 (99%), Gaps = 0/158 (0%)
Frame = +2

Query  5    VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr  184
VSKCGFHIVKK+R VLYPKRTK+SKYRKGRCSRGCKPDGT+LGFGRYGI+SCRAGRLSYR
Sbjct  28   VSKCGFHIVKKKRDVLYPKRTKFSKYRKGRCSRGCKPDGTKLGFGRYGIQSCRAGRLSYR  87

Query  185  aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV  364
AIEAARRAIIGHFHRAMSGQFR+NGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV
Sbjct  88   AIEAARRAIIGHFHRAMSGQFRKNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV  147

Query  365  STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS  478
STGQ+LFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS
Sbjct  148  STGQVLFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS  185


>ref|YP_005090475.1| rpl16 gene product (mitochondrion) [Lotus japonicus]
gb|AET62935.1| ribosomal protein L16 (mitochondrion) [Lotus japonicus]
Length=185

Score =  292 bits (748),  Expect = 4e-97
Identities = 150/158 (95%), Positives = 156/158 (99%), Gaps = 0/158 (0%)
Frame = +2

Query  5    VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr  184
VSKCGFHIVKK+  VLYPKRTK+SKYRKGRCSRGCKPDGT+LGFGRYGI+SCRAGRLSYR
Sbjct  28   VSKCGFHIVKKKGDVLYPKRTKFSKYRKGRCSRGCKPDGTKLGFGRYGIQSCRAGRLSYR  87

Query  185  aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV  364
AIEAARRAIIGHFHRAMSGQFR+NGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV
Sbjct  88   AIEAARRAIIGHFHRAMSGQFRKNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV  147

Query  365  STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS  478
STGQ+LFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS
Sbjct  148  STGQVLFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS  185


>sp|Q95747.3|RM16_ARATH RecName: Full=60S ribosomal protein L16, mitochondrial
Length=179

Score =  291 bits (744),  Expect = 1e-96
Identities = 150/158 (95%), Positives = 155/158 (98%), Gaps = 0/158 (0%)
Frame = +2

Query  5    VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr  184
VSKCGFHIVKK+  VLYPKRTKYSKYRKGRCSRGCKPDGT+LGFGRYGIKSC+AG LSYR
Sbjct  22   VSKCGFHIVKKKGDVLYPKRTKYSKYRKGRCSRGCKPDGTKLGFGRYGIKSCKAGCLSYR  81

Query  185  aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV  364
AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVFAD+PITGKPTEVRMGRGKGNPTGWIARV
Sbjct  82   AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVFADLPITGKPTEVRMGRGKGNPTGWIARV  141

Query  365  STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS  478
STGQILFEMDGVSL+NARQAATLAAHKLCLSTKFVQWS
Sbjct  142  STGQILFEMDGVSLANARQAATLAAHKLCLSTKFVQWS  179


>ref|YP_173411.1| ribosomal protein L16 [Nicotiana tabacum]
Length=171

Score =  288 bits (738),  Expect = 9e-96
Identities = 151/158 (96%), Positives = 152/158 (96%), Gaps = 0/158 (0%)
Frame = +2

Query  5    VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr  184
VSKCGF IVKK+R VLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYG KSCRAGRLSYR
Sbjct  14   VSKCGFPIVKKKRDVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGTKSCRAGRLSYR  73

Query  185  aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV  364
AIEAARRAIIGHFHRAMSGQFRRNGKIWVRV ADIPITGKPTEVRMGRGKGNPTGWIARV
Sbjct  74   AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVLADIPITGKPTEVRMGRGKGNPTGWIARV  133

Query  365  STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS  478
S GQILFEMDGVSLSNARQAATLAAHKLC STKFVQWS
Sbjct  134  SRGQILFEMDGVSLSNARQAATLAAHKLCSSTKFVQWS  171


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 385964068
Number of extensions: 8213243
Number of successful extensions: 18553
Number of sequences better than 1e-10: 113
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 18400
Number of HSP's successfully gapped: 113
Length of query: 744
Length of database: 6150218869
Length adjustment: 137
Effective length of query: 607
Effective length of database: 3695304361
Effective search space: 410178784071
Effective search space used: 410178784071
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 175 (72.0 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE5E6JX01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_3129

Length=738


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002277851.2|  PREDICTED: uncharacterized protein LOC100...   139    3e-35
ref|XP_002326236.1|  predicted protein [Populus trichocarpa] >...   132    9e-33
ref|XP_002528638.1|  conserved hypothetical protein [Ricinus c...   130    6e-32
emb|CBI30220.3|  unnamed protein product [Vitis vinifera]           124    8e-30
ref|XP_003542565.1|  PREDICTED: uncharacterized protein LOC100...   121    9e-29

ALIGNMENTS
>ref|XP_002277851.2| PREDICTED: uncharacterized protein LOC100259589 [Vitis vinifera]
Length=470

Score =  139 bits (351),  Expect = 3e-35
Identities = 73/128 (57%), Positives = 94/128 (73%), Gaps = 12/128 (9%)
Frame = -1

Query  555  SRKRMKKGLSPS------IEQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQS  394
SRKR +KGL PS      + QL  E+  VLQD TKS  EK+ WMR ++++LEE+RV  Q
Sbjct  335  SRKRARKGLFPSPSPSPLMRQLSSEVMSVLQDGTKSTLEKKQWMRSRMMQLEEQRVSYQC  394

Query  393  RTLELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKELDLLD------H  232
+  ELEKQRLKW+KFS+KKEREME E+L N+R RLENERM L++RQKE++LL+
Sbjct  395  KAFELEKQRLKWVKFSSKKEREMEREKLVNQRKRLENERMALLLRQKEVELLEIHHQQQQ  454

Query  231  QQHSSNRK  208
QQHSSN++
Sbjct  455  QQHSSNKR  462


>ref|XP_002326236.1| predicted protein [Populus trichocarpa]
gb|EEE71906.1| predicted protein [Populus trichocarpa]
Length=483

Score =  132 bits (333),  Expect = 9e-33
Identities = 76/134 (57%), Positives = 97/134 (72%), Gaps = 9/134 (7%)
Frame = -1

Query  558  SSRKRMKKGL----SPSIEQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQSR  391
SSRKR +K +    S  ++QL  E+  VL+D  KS  EK  WM+ KL++LEE++V  Q +
Sbjct  351  SSRKRPRKEVFSATSSLMQQLNGEIMNVLRDGAKSSWEKNQWMKLKLMQLEEQQVNYQCQ  410

Query  390  TLELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKELDLLD--HQQHS-  220
ELEKQRLKW++FS+KKEREME  +L NER RLENERMVL VR+KEL+LLD  HQQ
Sbjct  411  AFELEKQRLKWVRFSSKKEREMERAKLENERKRLENERMVLTVRKKELELLDTTHQQQQL  470

Query  219  -SNRKSVDPSSVTG  181
SN++S DPSS+ G
Sbjct  471  PSNKRS-DPSSIAG  483


>ref|XP_002528638.1| conserved hypothetical protein [Ricinus communis]
gb|EEF33741.1| conserved hypothetical protein [Ricinus communis]
Length=499

Score =  130 bits (327),  Expect = 6e-32
Identities = 70/139 (50%), Positives = 96/139 (69%), Gaps = 14/139 (10%)
Frame = -1

Query  555  SRKRMKKGL----SPSIEQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQSRT  388
++KR + G+    S  ++QL  EL GV+QD  KS  EK+HWM+ + ++LEE+++  Q +
Sbjct  361  AQKRQRTGVHSLSSSLMQQLNSELMGVIQDGAKSPWEKKHWMKLRSMQLEEQQLSYQCQA  420

Query  387  LELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKELDLLD---------  235
ELEKQRLKW+KFS+KKEREME  +L N+R  LE+ERMVL++RQKEL+LLD
Sbjct  421  FELEKQRLKWVKFSSKKEREMEKAKLDNDRRMLESERMVLLIRQKELELLDLQQQQHHHH  480

Query  234  -HQQHSSNRKSVDPSSVTG  181
QQ  S+ K  DPSS+TG
Sbjct  481  HQQQQLSSNKRGDPSSITG  499


>emb|CBI30220.3| unnamed protein product [Vitis vinifera]
Length=410

Score =  124 bits (310),  Expect = 8e-30
Identities = 63/106 (59%), Positives = 79/106 (75%), Gaps = 6/106 (6%)
Frame = -1

Query  549  KRMKKGLSPS------IEQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQSRT  388
KR +KGL PS      + QL  E+  VLQD TKS  EK+ WMR ++++LEE+RV  Q +
Sbjct  253  KRARKGLFPSPSPSPLMRQLSSEVMSVLQDGTKSTLEKKQWMRSRMMQLEEQRVSYQCKA  312

Query  387  LELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKE  250
ELEKQRLKW+KFS+KKEREME E+L N+R RLENERM L++RQKE
Sbjct  313  FELEKQRLKWVKFSSKKEREMEREKLVNQRKRLENERMALLLRQKE  358


>ref|XP_003542565.1| PREDICTED: uncharacterized protein LOC100816661 [Glycine max]
Length=478

Score =  121 bits (303),  Expect = 9e-29
Identities = 63/124 (51%), Positives = 89/124 (72%), Gaps = 7/124 (6%)
Frame = -1

Query  558  SSRKRMKK----GLSPSI-EQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQS  394
S RKR +K     +SP + +QL  E++GV QD  KS  +K+ WMR ++++LEE+++   +
Sbjct  350  SMRKRARKVGGVSMSPQLMQQLSAEVSGVFQDVGKSAWDKKQWMRSRIMQLEEQQISYHT  409

Query  393  RTLELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKELDL--LDHQQHS  220
+  ELEKQRLKW +FS+KKEREME  +L NER RLENERMVL++RQKE +L  L HQQ
Sbjct  410  QAFELEKQRLKWARFSSKKEREMETAKLENERRRLENERMVLLIRQKEFELMSLQHQQQQ  469

Query  219  SNRK  208
+++
Sbjct  470  QHQQ  473


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 321482715
Number of extensions: 6325429
Number of successful extensions: 21451
Number of sequences better than 1e-10: 2
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 21012
Number of HSP's successfully gapped: 2
Length of query: 738
Length of database: 6150218869
Length adjustment: 137
Effective length of query: 601
Effective length of database: 3695304361
Effective search space: 402788175349
Effective search space used: 402788175349
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 175 (72.0 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPDZD6PS016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_331

Length=1109


Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|ABR10301.1|  pathogen-related protein STH-2 [Salvia miltior...   213    2e-64
ref|XP_003632378.1|  PREDICTED: LOW QUALITY PROTEIN: major all...   175    2e-49
ref|XP_002271428.2|  PREDICTED: pathogenesis-related protein S...   167    3e-47
ref|XP_002273815.2|  PREDICTED: pathogenesis-related protein S...   165    3e-46
ref|NP_001237916.1|  uncharacterized protein LOC100527307 [Gly...   164    1e-45

ALIGNMENTS
>gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza]
Length=160

Score =  213 bits (541),  Expect = 2e-64
Identities = 98/160 (61%), Positives = 128/160 (80%), Gaps = 0/160 (0%)
Frame = -2

Query  721  MGVKKFFQELKTKVSPSRLFKALVTESHEVIPKVTPKVKSIETIEGDFGAVGCLRKTTFP  542
MGVK FFQE+KTK+S SRLFKALVTES EV+PK T  +KSIE I+G   A G + +T FP
Sbjct  1    MGVKSFFQEMKTKISSSRLFKALVTESPEVVPKFTTSIKSIELIQGSGYAPGAIFQTNFP  60

Query  541  ESSQIKSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLESE  362
E +  K +  RVD IDH+ ++ KYT++ GD+LGDKLEK+CY++KFE TEDGGC++K+ SE
Sbjct  61   EGAHFKYMKCRVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKVTSE  120

Query  361  YHTKGDFELKDEEVEAGKEQALWLYKACEDYLVAHPDVCA  242
YHTKG +EL DE+++  KEQ+L +YK+CEDYL+A+P VCA
Sbjct  121  YHTKGGYELADEDLKGAKEQSLGMYKSCEDYLLANPHVCA  160


>ref|XP_003632378.1| PREDICTED: LOW QUALITY PROTEIN: major allergen Pru ar 1-like
[Vitis vinifera]
Length=204

Score =  175 bits (443),  Expect = 2e-49
Identities = 78/155 (50%), Positives = 115/155 (74%), Gaps = 3/155 (2%)
Frame = -2

Query  706  FFQELKTKVSPSRLFKALVTESHEVIPKVTPKVKSIETIEGDFGAVGCLRKTTFPESSQI  527
F QE+ T ++P+ +FKAL+ +SH ++P + P +KSIE +EGD G VG +++T FPE S
Sbjct  53   FSQEITTPIAPAIMFKALIVDSHNLVPTLMPSIKSIEFVEGD-GGVGSIKQTNFPEGSHF  111

Query  526  KSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLESEYHTKG  347
K + HR+DAIDHDNY+CKYT++ G+VLGD LE + YE+KFEA+  G  + K+ S YH+K
Sbjct  112  KYLKHRIDAIDHDNYSCKYTLIEGEVLGDTLESISYEVKFEASGSGSSVCKMTSHYHSK-  170

Query  346  DFELKDEEVEAGKEQALWLYKACEDYLVAHPDVCA  242
ELKDE+++ GK++A+ +YK   +YL+A+PD  A
Sbjct  171  -IELKDEDIKTGKDKAMGMYKVVGEYLLANPDAYA  204


>ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera]
emb|CBI22930.3| unnamed protein product [Vitis vinifera]
Length=160

Score =  167 bits (424),  Expect = 3e-47
Identities = 79/158 (50%), Positives = 115/158 (73%), Gaps = 2/158 (1%)
Frame = -2

Query  721  MGVKKFFQELKTKVSPSRLFKALVTESHEVIPKVTPK-VKSIETIEGDFGAVGCLRKTTF  545
MGV  F QE  T VSP+R+FKALV +SH ++P++ P+ VKSIE +EGD GA G + +T F
Sbjct  1    MGVTTFTQEFVTPVSPARMFKALVVDSHILVPRLVPESVKSIEFVEGDGGA-GSITQTNF  59

Query  544  PESSQIKSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLES  365
S  + + ++++A+D +   C+YT++ G VLGD+LE + YE+KFE + DGGC+ K  S
Sbjct  60   SGDSDCEYLKYKINAVDKEKLECRYTLIEGGVLGDQLESIVYEMKFEESGDGGCICKTRS  119

Query  364  EYHTKGDFELKDEEVEAGKEQALWLYKACEDYLVAHPD  251
EYHTKG+FE+K+E +  GKE+A+ +YK  E YL+A+PD
Sbjct  120  EYHTKGEFEIKEESIREGKEKAMGVYKLVEAYLLANPD  157


>ref|XP_002273815.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera]
emb|CAN79554.1| hypothetical protein VITISV_025728 [Vitis vinifera]
emb|CBI22943.3| unnamed protein product [Vitis vinifera]
Length=159

Score =  165 bits (418),  Expect = 3e-46
Identities = 77/161 (48%), Positives = 118/161 (73%), Gaps = 3/161 (2%)
Frame = -2

Query  721  MGVKKFFQELKTKVSPSRLFKALVTESHEVIPKVTPK-VKSIETIEGDFGAVGCLRKTTF  545
MGV  F QE    ++PSR+FKAL+ +S+ +IPK+ P+ ++SI+ ++GD GA G + +  F
Sbjct  1    MGVTSFTQEFTCPIAPSRIFKALILDSNNLIPKLLPQTIRSIDVVQGDGGA-GTIEQVNF  59

Query  544  PESSQIKSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLES  365
E+S +K V H+++ +D +N+ CKY M+ GDVLG++LE + +E+KFEA  DGG + K+ S
Sbjct  60   TEASNLKYVKHQIEELDKENFVCKYRMIEGDVLGEELESIAHEVKFEAA-DGGSICKMAS  118

Query  364  EYHTKGDFELKDEEVEAGKEQALWLYKACEDYLVAHPDVCA  242
EYHTKG FE+K+EE++AGK +A+ +YK  E YL+ +P V A
Sbjct  119  EYHTKGKFEIKEEEIKAGKARAMGIYKVVEAYLLENPHVYA  159


>ref|NP_001237916.1| uncharacterized protein LOC100527307 [Glycine max]
gb|ACU16378.1| unknown [Glycine max]
Length=160

Score =  164 bits (414),  Expect = 1e-45
Identities = 75/161 (47%), Positives = 110/161 (68%), Gaps = 2/161 (1%)
Frame = -2

Query  721  MGVKKFFQELKTKVSPSRLFKALVTESHEVIPKVTPK-VKSIETIEGDFGAVGCLRKTTF  545
MG+  F QE  + V+PS +FKAL+ +S  ++PK+ P+ VK +  I+GD G  G + +  F
Sbjct  1    MGITTFTQEYSSSVAPSPMFKALIVDSRNLLPKLLPQFVKDVNVIQGD-GEAGSIEQVNF  59

Query  544  PESSQIKSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLES  365
E +  K + HR+D +D DN  CKYTM+ GD LGDKLE + YE+KFEAT DGGC+ K+ S
Sbjct  60   NEDNPFKYLKHRIDVLDKDNLVCKYTMIEGDPLGDKLESIGYEVKFEATSDGGCLCKMTS  119

Query  364  EYHTKGDFELKDEEVEAGKEQALWLYKACEDYLVAHPDVCA  242
Y+T G+F++K+EEV+ G+E  + +Y+  E YL+ +P V A
Sbjct  120  NYNTIGEFDVKEEEVKEGRESGIAVYRVVESYLLENPQVYA  160


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 510140310
Number of extensions: 9975518
Number of successful extensions: 26073
Number of sequences better than 1e-10: 45
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 26017
Number of HSP's successfully gapped: 45
Length of query: 1109
Length of database: 6150218869
Length adjustment: 142
Effective length of query: 967
Effective length of database: 3605708941
Effective search space: 818495929607
Effective search space used: 818495929607
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 178 (73.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPDZFM7P01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_337

Length=1105


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003552036.1|  PREDICTED: uncharacterized protein LOC100...   303    2e-95
ref|XP_003519788.1|  PREDICTED: uncharacterized protein LOC100...   300    7e-95
ref|XP_002893280.1|  hypothetical protein ARALYDRAFT_472593 [A...   299    8e-95
ref|XP_003530633.1|  PREDICTED: uncharacterized protein LOC100...   301    1e-94
ref|XP_002315678.1|  predicted protein [Populus trichocarpa] >...   300    2e-94

ALIGNMENTS
>ref|XP_003552036.1| PREDICTED: uncharacterized protein LOC100788950 [Glycine max]
Length=464

Score =  303 bits (775),  Expect = 2e-95
Identities = 156/294 (53%), Positives = 202/294 (69%), Gaps = 13/294 (4%)
Frame = -2

Query  987  ESCPKNIVPIRRITKEDVLRASSVRTFVHKPRV--IIKDSSGNTHEHAIAYVTGGSYYGA  814
ESCP+  +PIRR T+ED+LRA+SVR F  K  +  + +D+SGN HEHAI YVTG  YYGA
Sbjct  172  ESCPEGTIPIRRTTEEDMLRANSVRRFGRKKVINRVRRDTSGNGHEHAIGYVTGDQYYGA  231

Query  813  STAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIEAGWQI---WYGDKYPKLFVFWT  643
+++VWAP V NP EFSLSQ+WV++GSF  ++LN+IEAGWQ+    YGD YP+ F +WT
Sbjct  232  KASINVWAPLVENPYEFSLSQMWVISGSFG-DDLNTIEAGWQVSPELYGDSYPRFFTYWT  290

Query  642  RDAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNNANKHYDISFDIYKSSRTGNWLL  463
DAYQ TGCYNLLC GFVQ +  I  G AISP S Y+     +DIS  I+K  + GNW L
Sbjct  291  TDAYQATGCYNLLCSGFVQTNSKIAIGAAISPTSSYSGGQ--FDISLLIWKDPKHGNWWL  348

Query  462  KVNN-VQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMFGAHTKTEMGSGHFSNEGYGQAA  286
+  + + VGYWP  +FT L   A +IQFGGEIVN+G  G+HT T+MGSGHF+ EG+ +A+
Sbjct  349  EFGSGILVGYWPSFLFTHLGDHASMIQFGGEIVNSGSSGSHTSTQMGSGHFAEEGFAKAS  408

Query  285  CFKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSGRDYNILGNHFYYGGPG  124
F+ +QV    N   PL   +L+V  +   CY+  + G   N  GN+FYYGGPG
Sbjct  409  YFRNMQVVDWDNNLIPLS--NLKVLADHPNCYD--IQGGVNNAWGNYFYYGGPG  458


>ref|XP_003519788.1| PREDICTED: uncharacterized protein LOC100776135 [Glycine max]
Length=417

Score =  300 bits (767),  Expect = 7e-95
Identities = 152/293 (52%), Positives = 205/293 (70%), Gaps = 12/293 (4%)
Frame = -2

Query  987  ESCPKNIVPIRRITKEDVLRASSVRTFVHK-PRVIIKDSSGNTHEHAIAYVTGGSYYGAS  811
ESCP+  +PIRR T++D+LRASSV  F  K  R + +D++ N HEHA+ YV+G  YYGA
Sbjct  126  ESCPEGTIPIRRTTEQDMLRASSVSRFGRKIRRRVRRDTNSNGHEHAVGYVSGEQYYGAK  185

Query  810  TAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIEAGWQIW---YGDKYPKLFVFWTR  640
+++VWAP+V N +EFSLSQ+WV++GSF  ++LN+IEAGWQ+    YGD+YP+ F +WT
Sbjct  186  ASINVWAPRVENQDEFSLSQMWVISGSFG-DDLNTIEAGWQVSPEIYGDRYPRFFTYWTS  244

Query  639  DAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNNANKHYDISFDIYKSSRTGNWLLK  460
DAYQ TGCYNLLC GFVQ +  I  G AISP S Y  A   +DIS  I+K  + GNW L+
Sbjct  245  DAYQATGCYNLLCSGFVQTNNRIAIGAAISPTSSY--AGGQFDISLLIWKDPKHGNWWLE  302

Query  459  VNN-VQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMFGAHTKTEMGSGHFSNEGYGQAAC  283
+ + VGYWP  +FT LR  A ++QFGGEIVN+   G+HT T+MGSGHF++EG+G+A+
Sbjct  303  FGSGILVGYWPSFLFTHLRDHASMVQFGGEIVNSRQSGSHTSTQMGSGHFASEGFGKASY  362

Query  282  FKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSGRDYNILGNHFYYGGPG  124
F+ +QV    N   PL   +L+V  +   CY+  + G   N+ GN+FYYGGPG
Sbjct  363  FRNMQVVDWDNNLVPLS--NLRVLADHPNCYD--IQGGINNVWGNYFYYGGPG  411


>ref|XP_002893280.1| hypothetical protein ARALYDRAFT_472593 [Arabidopsis lyrata subsp.
lyrata]
gb|EFH69539.1| hypothetical protein ARALYDRAFT_472593 [Arabidopsis lyrata subsp.
lyrata]
Length=408

Score =  299 bits (766),  Expect = 8e-95
Identities = 156/316 (49%), Positives = 207/316 (66%), Gaps = 20/316 (6%)
Frame = -2

Query  1032  PEMQKGPTSRNNTIK---------ESCPKNIVPIRRITKEDVLRASSVRTFVHKPRVIIK  880
PEM  G +  N + +         ESCP+  +PIRR T++D+LRASSVR F  K R + +
Sbjct  94    PEMPIGYSQENESYENFQLWSLSGESCPEGTIPIRRTTEQDMLRASSVRRFGRKIRRVRR  153

Query  879   DSSGNTHEHAIAYVTGGSYYGASTAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIE  700
DSS N HEHA+ YV+G  YYGA  +++VW P+V++  EFSLSQIWV+AGSF  ++LN+IE
Sbjct  154   DSSSNGHEHAVGYVSGSQYYGAKASINVWTPRVISQYEFSLSQIWVIAGSF-ADDLNTIE  212

Query  699   AGWQI---WYGDKYPKLFVFWTRDAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNN  529
AGWQI    YGD  P+ F +WT DAYQ TGCYNLLC GFVQ +  I  G AISP S Y
Sbjct  213   AGWQISPELYGDTNPRFFTYWTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYKG  272

Query  528   ANKHYDISFDIYKSSRTGNWLLKV-NNVQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMF  352
+DIS  I+K  + G+W L+  +   VGYWP  +FT LR   +++QFGGEIVNT
Sbjct  273   G--QFDISLLIWKDPKHGHWWLQFGSGTLVGYWPVSLFTHLREHGNMVQFGGEIVNTRPG  330

Query  351   GAHTKTEMGSGHFSNEGYGQAACFKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSG  172
G+HT T+MGSGHF+ EG+G+A+ F+ +Q+    N   P+   +L+V  +   CY+ R  G
Sbjct  331   GSHTSTQMGSGHFAGEGFGKASYFRNLQMVDWDNTLIPIS--NLKVLADHPNCYDIR--G  386

Query  171   RDYNILGNHFYYGGPG  124
+ GN+FYYGGPG
Sbjct  387   GVNRVWGNYFYYGGPG  402


>ref|XP_003530633.1| PREDICTED: uncharacterized protein LOC100792240 [Glycine max]
Length=463

Score =  301 bits (770),  Expect = 1e-94
Identities = 158/312 (51%), Positives = 208/312 (67%), Gaps = 17/312 (5%)
Frame = -2

Query  1029  EMQKGPTSRNNTI----KESCPKNIVPIRRITKEDVLRASSVRTFVHKPRV--IIKDSSG  868
+M  G  S N  +     ESCP+  +PIRR T++D+LRA+SVR F  K  +  + +D+SG
Sbjct  153   QMDDGDLSENFQLWSFSGESCPEGTIPIRRTTEQDMLRATSVRRFGRKKIINRVRRDTSG  212

Query  867   NTHEHAIAYVTGGSYYGASTAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIEAGWQ  688
N HEHAI YVTG  YYG+  +++VWAP V NP EFSLSQ+WV++GSF  ++LN+IEAGWQ
Sbjct  213   NGHEHAIGYVTGDQYYGSKASINVWAPLVENPYEFSLSQMWVISGSFG-DDLNTIEAGWQ  271

Query  687   I---WYGDKYPKLFVFWTRDAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNNANKH  517
+    YGD YP+ F +WT DAYQ TGCYNLLC GFVQ +  I  G AISP S Y+
Sbjct  272   VSPELYGDSYPRFFTYWTTDAYQATGCYNLLCSGFVQTNSKIAIGAAISPTSSYSGGQ--  329

Query  516   YDISFDIYKSSRTGNWLLKVNN-VQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMFGAHT  340
+DIS  I+K  + GNW L+  + + VGYWP  +FT L   A +IQFGGEIVN+G  G+HT
Sbjct  330   FDISLLIWKDPKHGNWWLEFGSGILVGYWPSFLFTHLGDHASMIQFGGEIVNSGSSGSHT  389

Query  339   KTEMGSGHFSNEGYGQAACFKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSGRDYN  160
T+MGSGHF+ EG+ +A+ F+ +QV    N   PL   +L+V  +   CY+  + G   N
Sbjct  390   STQMGSGHFAEEGFAKASYFRNMQVVDWDNNLIPLS--NLKVLADHPNCYD--IQGGVNN  445

Query  159   ILGNHFYYGGPG  124
GN+FYYGGPG
Sbjct  446   AWGNYFYYGGPG  457


>ref|XP_002315678.1| predicted protein [Populus trichocarpa]
gb|EEF01849.1| predicted protein [Populus trichocarpa]
Length=466

Score =  300 bits (768),  Expect = 2e-94
Identities = 154/293 (53%), Positives = 201/293 (69%), Gaps = 12/293 (4%)
Frame = -2

Query  987  ESCPKNIVPIRRITKEDVLRASSVRTFVHK-PRVIIKDSSGNTHEHAIAYVTGGSYYGAS  811
ESCP+  VPIRR T++D+LRASSVR F  K  R + +D++ N HEHA+ YVTG  YYGA
Sbjct  175  ESCPEGTVPIRRTTEQDMLRASSVRRFGRKLRRHVRRDTNSNGHEHAVGYVTGDQYYGAK  234

Query  810  TAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIEAGWQI---WYGDKYPKLFVFWTR  640
+++VWAP+V N  EFSLSQ+WV++GSF  ++LN+IEAGWQ+    YGD YP+ F +WT
Sbjct  235  ASINVWAPRVSNQYEFSLSQMWVISGSFG-DDLNTIEAGWQVSPELYGDNYPRFFTYWTT  293

Query  639  DAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNNANKHYDISFDIYKSSRTGNWLLK  460
DAYQ TGCYNLLC GFVQ +  I  G AISP S Y+     +DIS  ++K  + GNW L+
Sbjct  294  DAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYSGGQ--FDISLLVWKDPKHGNWWLE  351

Query  459  VNN-VQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMFGAHTKTEMGSGHFSNEGYGQAAC  283
N V VGYWP  +FT LR  A ++QFGGEIVN+   G HT T+MGSGHF+ EG+G+A+
Sbjct  352  FGNGVLVGYWPSFLFTHLRDHASMVQFGGEIVNSRPSGFHTSTQMGSGHFAGEGFGKASY  411

Query  282  FKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSGRDYNILGNHFYYGGPG  124
F+ +QV    N   PL   +L+V  +   CY+  + G    + GN+FYYGGPG
Sbjct  412  FRNLQVVDWDNNLIPLS--NLRVLADHPNCYD--IQGGINRVWGNYFYYGGPG  460


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 618672975
Number of extensions: 14276356
Number of successful extensions: 36175
Number of sequences better than 1e-10: 34
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 36001
Number of HSP's successfully gapped: 36
Length of query: 1105
Length of database: 6150218869
Length adjustment: 142
Effective length of query: 963
Effective length of database: 3605708941
Effective search space: 814890220666
Effective search space used: 814890220666
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 177 (72.8 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPDZG0NG01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_376

Length=1089


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003534672.1|  PREDICTED: isoliquiritigenin 2'-O-methylt...   414    2e-140
ref|NP_001242767.1|  uncharacterized protein LOC100793053 [Gly...   414    3e-140
ref|NP_001242792.1|  uncharacterized protein LOC100779492 [Gly...   408    4e-138
ref|XP_003556605.1|  PREDICTED: LOW QUALITY PROTEIN: isoliquir...   360    3e-119
dbj|BAA13683.1|  O-methyltransferase [Glycyrrhiza echinata]         318    6e-103

ALIGNMENTS
>ref|XP_003534672.1| PREDICTED: isoliquiritigenin 2'-O-methyltransferase-like [Glycine
max]
Length=369

Score =  414 bits (1065),  Expect = 2e-140
Identities = 195/286 (68%), Positives = 236/286 (83%), Gaps = 1/286 (0%)
Frame = +3

Query  3    LTVLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSD-GGSLAPLSTLIHRGCNSVW  179
L +LASYSLL CSIRTNEDGKRERVYALS +G YFA D D G SLAPLS+LIHRG + +W
Sbjct  84   LPLLASYSLLNCSIRTNEDGKRERVYALSPVGAYFAFDKDEGSSLAPLSSLIHRGFHDMW  143

Query  180  GDAKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTCNKAMAHSGPLEIKRILQFYKGF  359
D KDAI+DP+  N F++++G   Y Y + N ELND   KA+ H+ PLE+KR L+ YKGF
Sbjct  144  KDVKDAIVDPNNNNHFENVHGIPPYDYMEKNAELNDIFYKAVIHAAPLELKRALKLYKGF  203

Query  360  EGVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPT  539
EGVSTLVDVGGG G+TLK I+ +YPS+KGINFD+P V+Q AP HPG+E + GDMFESVPT
Sbjct  204  EGVSTLVDVGGGAGETLKQILPKYPSMKGINFDLPLVIQKAPPHPGIEQIAGDMFESVPT  263

Query  540  GDAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDN  719
GDAI++K VCHNWADE+C+KFL+N HKALP+HGKVIV +YIIPEVPNPS +SKH C +DN
Sbjct  264  GDAILVKFVCHNWADEDCIKFLRNFHKALPQHGKVIVFEYIIPEVPNPSYISKHTCTLDN  323

Query  720  LMFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK  857
+MFL   G+ERT++EFE+LCK +GFSKFHVA SD+S+  GVMEFYK
Sbjct  324  VMFLAHGGRERTQKEFENLCKSSGFSKFHVASSDISSTLGVMEFYK  369


>ref|NP_001242767.1| uncharacterized protein LOC100793053 [Glycine max]
gb|ACU22842.1| unknown [Glycine max]
Length=370

Score =  414 bits (1064),  Expect = 3e-140
Identities = 199/285 (70%), Positives = 236/285 (83%), Gaps = 0/285 (0%)
Frame = +3

Query  3    LTVLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSDGGSLAPLSTLIHRGCNSVWG  182
L VLASYSLL CSIRTNEDG RER+YALS IGQYFACD+DGGSL PLS+L HRG   V
Sbjct  86   LPVLASYSLLNCSIRTNEDGVRERLYALSPIGQYFACDNDGGSLGPLSSLFHRGYFHVLK  145

Query  183  DAKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTCNKAMAHSGPLEIKRILQFYKGFE  362
D KDAI+DP+  + FQ+++G   YQY KT++ELN   NKA+A +GP  +K +L+ YKGFE
Sbjct  146  DVKDAIMDPNNNDHFQNVHGMPPYQYMKTDEELNKLFNKALAQTGPPAMKMLLKLYKGFE  205

Query  363  GVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPTG  542
VSTLVDVGGGVG+TLK II  YPSIKGINFD+PQV+Q+AP HPG+EHV GDMFESVP G
Sbjct  206  QVSTLVDVGGGVGETLKQIIFDYPSIKGINFDLPQVIQDAPPHPGIEHVEGDMFESVPKG  265

Query  543  DAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDNL  722
DAI+LKLVCHNW DE+CVKFL+NC+KALP+HGKVIV+DYIIPEVP+ S +S   C  D+L
Sbjct  266  DAILLKLVCHNWLDEDCVKFLRNCYKALPQHGKVIVIDYIIPEVPDSSKISMQTCVADSL  325

Query  723  MFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK  857
MFLVT+GKERTE+EFESLC+ +GFS FHVAC D  ++  V+EFYK
Sbjct  326  MFLVTSGKERTEKEFESLCRNSGFSGFHVACRDSPSVLSVVEFYK  370


>ref|NP_001242792.1| uncharacterized protein LOC100779492 [Glycine max]
gb|ACU18726.1| unknown [Glycine max]
Length=357

Score =  408 bits (1048),  Expect = 4e-138
Identities = 197/285 (69%), Positives = 234/285 (82%), Gaps = 0/285 (0%)
Frame = +3

Query  3    LTVLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSDGGSLAPLSTLIHRGCNSVWG  182
L VLASYSLL C IRT EDG RER+YALS IGQYFA D DGGSL PLS+L HRG   V
Sbjct  73   LPVLASYSLLNCFIRTTEDGVRERLYALSPIGQYFASDDDGGSLGPLSSLFHRGYFHVLK  132

Query  183  DAKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTCNKAMAHSGPLEIKRILQFYKGFE  362
D KDAI+DP+  + FQ+++G   YQY KT++ELN   NKA+A +GP  +K +L+ YKGFE
Sbjct  133  DVKDAIMDPNNNDHFQNVHGMPPYQYMKTDEELNKLFNKALAQTGPPAMKMLLKLYKGFE  192

Query  363  GVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPTG  542
VSTLVDVGGGVG+TLK II +YPSIKGINFD+PQVVQ+AP +PG+EHV GDMFESVP G
Sbjct  193  QVSTLVDVGGGVGETLKQIIFEYPSIKGINFDLPQVVQDAPPYPGIEHVEGDMFESVPKG  252

Query  543  DAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDNL  722
DAI+LKLVCHNW DE+CVKFL+NCHKALP+HGKVIV+DYIIPEVP+ S +S   C  D+L
Sbjct  253  DAILLKLVCHNWLDEDCVKFLRNCHKALPQHGKVIVIDYIIPEVPDSSKISMQTCVADSL  312

Query  723  MFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK  857
MFLVT+GKERTE+EFESLC+ +GFS+FHVAC D  ++  V+EFYK
Sbjct  313  MFLVTSGKERTEKEFESLCRNSGFSRFHVACRDSPSVLSVIEFYK  357


>ref|XP_003556605.1| PREDICTED: LOW QUALITY PROTEIN: isoliquiritigenin 2'-O-methyltransferase-like
[Glycine max]
Length=352

Score =  360 bits (923),  Expect = 3e-119
Identities = 182/285 (64%), Positives = 220/285 (77%), Gaps = 6/285 (2%)
Frame = +3

Query  9    VLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSD-GGSLAPLSTLIHRGCNSVWGD  185
+LASYSLL CSIRTNEDGKRERVYALS +GQYFA D D G SLAPLSTLIHRG +
Sbjct  72   LLASYSLLNCSIRTNEDGKRERVYALSPVGQYFAFDKDEGNSLAPLSTLIHRGFHDFI--  129

Query  186  AKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTC-NKAMAHSGPLEIKRILQFYKGFE  362
KDAI+DP+  N F+ ++G   Y Y + N ELND    KA     PLE+KR L+ Y GFE
Sbjct  130  EKDAIVDPNNNNHFEYVHGIPPYDYMEKNAELNDIFFYKARILDAPLELKRALKLYIGFE  189

Query  363  GVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPTG  542
VS LVDVGGGVG+TLK ++ +YPS+KGINFD+PQV+Q AP H G+EH+ GDMFESVPTG
Sbjct  190  RVSILVDVGGGVGETLKQLLPKYPSMKGINFDLPQVIQKAPPHQGIEHIEGDMFESVPTG  249

Query  543  DAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDNL  722
D I++K VCH+WADE+ +KFL+NCHKAL +HGKV+V +YIIPEVPNP  +SKH C +DN+
Sbjct  250  DVILMKFVCHSWADEDGIKFLRNCHKALLQHGKVVVFEYIIPEVPNPRYISKHTCTLDNV  309

Query  723  MFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK  857
MFL    +ERT+ EFE+L +  GFSKF VA SD+S+  GVMEFYK
Sbjct  310  MFLAQGRRERTQGEFENLXE--GFSKFDVASSDISSTLGVMEFYK  352


>dbj|BAA13683.1| O-methyltransferase [Glycyrrhiza echinata]
Length=367

Score =  318 bits (816),  Expect = 6e-103
Identities = 156/286 (55%), Positives = 215/286 (75%), Gaps = 8/286 (3%)
Frame = +3

Query  3    LTVLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSDGGSLAPLST-LIHRGCNSVW  179
L +LASYS+LTC+ R+ E     RVY LS +G+Y   D   G LA  +T L +    +VW
Sbjct  89   LRLLASYSVLTCATRSTE-----RVYGLSQVGKYLVPDGSRGYLASFTTFLCYPALMNVW  143

Query  180  GDAKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTCNKAMAHSGPLEIKRILQFYKGF  359
+ K+A++D D+ ++F+ ++G S Y+Y +T+ ++N   NK+MA     E+KRILQ YKGF
Sbjct  144  LNFKEAVVDEDI-DLFKKLHGVSKYEYMETDPKMNHIFNKSMADVCATEMKRILQIYKGF  202

Query  360  EGVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPT  539
EG+STLVDVGGG G+ LK+IIS+YP IKGINFD+PQV++NAP  PG+E VGGDMF SVP
Sbjct  203  EGISTLVDVGGGNGQNLKMIISKYPLIKGINFDLPQVIENAPPIPGIELVGGDMFASVPQ  262

Query  540  GDAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDN  719
GDA++LK VCHNW+DE+C++FL NCHKAL  +GKVIV+++I+PE P P+  S+ A  +DN
Sbjct  263  GDAMILKAVCHNWSDEKCLEFLSNCHKALSPNGKVIVVEFILPEEPEPTEESQLASTLDN  322

Query  720  LMFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK  857
+MF+   G+ERT++++E++CK AGFSKF VAC   S++ GVMEFYK
Sbjct  323  IMFITVGGRERTQKQYENMCKLAGFSKFQVACRAFSSL-GVMEFYK  367


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 572842297
Number of extensions: 12400418
Number of successful extensions: 33187
Number of sequences better than 1e-10: 13
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 33091
Number of HSP's successfully gapped: 13
Length of query: 1089
Length of database: 6150218869
Length adjustment: 142
Effective length of query: 947
Effective length of database: 3605708941
Effective search space: 796861675961
Effective search space used: 796861675961
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 177 (72.8 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE5FBTY016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_4028

Length=689


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003555129.1|  PREDICTED: LOB domain-containing protein ...  95.5    2e-20
ref|XP_003541285.1|  PREDICTED: LOB domain-containing protein ...  92.8    2e-19
ref|XP_002276899.1|  PREDICTED: LOB domain-containing protein ...  92.4    2e-19
emb|CAN77368.1|  hypothetical protein VITISV_011159 [Vitis vin...  92.4    2e-19
ref|XP_002283795.1|  PREDICTED: LOB domain-containing protein ...  91.7    5e-19

ALIGNMENTS
>ref|XP_003555129.1| PREDICTED: LOB domain-containing protein 41-like [Glycine max]
Length=293

Score = 95.5 bits (236),  Expect = 2e-20
Identities = 80/200 (40%), Positives = 105/200 (53%), Gaps = 19/200 (10%)
Frame = -1

Query  689  QLCQDAVEAVLKGDPITQMESDVAKNG--PPLKACDIRHINKEDNLSGSNELRRVRTRCR  516
QLCQ AVEA+LKG+PIT + S+ A NG  PPLKA DIRH++K++N   +NE  + +TR R
Sbjct  94   QLCQAAVEAILKGEPITPITSEAAANGRAPPLKAYDIRHVSKDEN--SANETPKAKTRSR  151

Query  515  FKRSGA---KSKRSR-VWEGPVKESVNEKEVTRspshesslshqsEMAEPVVERASQLSE  348
F R+G+   K K S+    G V     E E     +   S       A  +VE  S+ SE
Sbjct  152  FNRTGSTLIKPKASKGTPTGFVPIEPVEPETANRTTSHESGLSHLSEAAVMVEGESKESE  211

Query  347  IVGSAETS----AEQESVG--VEAALNSDIEIELDLTLGLEPLRSC-----VKPKEVKPS  201
S ETS     E ESV    +    S  EI L+LTLG EP+        +K +++  S
Sbjct  212  SDVSVETSNLFHEEPESVAKTSDRTGESGNEIGLELTLGFEPVSRVHHVVPMKKRKIIAS  271

Query  200  HSDEPVCEDGLWKMELGLDY  141
S     E    KMELGL+Y
Sbjct  272  KSYGDSAEKDSCKMELGLEY  291


>ref|XP_003541285.1| PREDICTED: LOB domain-containing protein 41-like [Glycine max]
Length=288

Score = 92.8 bits (229),  Expect = 2e-19
Identities = 76/196 (39%), Positives = 102/196 (52%), Gaps = 16/196 (8%)
Frame = -1

Query  689  QLCQDAVEAVLKGDPITQMESDVAKNG--PPLKACDIRHINKEDNLSGSNELRRVRTRCR  516
QLCQ AVEAVLKG+PIT + S+ A NG  PPLKA DIRH++K+ N   +NE  + +TR R
Sbjct  94   QLCQAAVEAVLKGEPITPITSEAAANGRAPPLKAYDIRHVSKDQN--SANETPKTKTRSR  151

Query  515  FKRSGAKSKRSRVWEGPVKESVNEKEVTRspshesslshqsEMAEPVVERASQLSEIVGS  336
FKR+     + +  +G     V E E+    +   S       A  +VE  S+ SE V S
Sbjct  152  FKRTSGTLIKPKASKGTGFVPV-EPEMANRTASHESGLSHLSEATAMVEGESKESESVVS  210

Query  335  AETS----AEQESVG--VEAALNSDIEIELDLTLGLEPLRSC-----VKPKEVKPSHSDE  189
+TS     E E V    +    S  EI L+LTLG EP+        +K +++    S
Sbjct  211  MDTSNLIHEEPEWVAKTSDGTGESGNEIGLELTLGFEPVSRLHHVVPMKKRKIIELKSCG  270

Query  188  PVCEDGLWKMELGLDY  141
E    KMELGL+Y
Sbjct  271  DSAEKDSCKMELGLEY  286


>ref|XP_002276899.1| PREDICTED: LOB domain-containing protein 41 [Vitis vinifera]
Length=272

Score = 92.4 bits (228),  Expect = 2e-19
Identities = 76/193 (39%), Positives = 104/193 (54%), Gaps = 26/193 (13%)
Frame = -1

Query  689  QLCQDAVEAVLKGDPITQM--ESDVAKNGPPLKACDIRHINKEDNLSG-SNELRRVRTRC  519
QLCQ AVEAVL G PI Q+  ES V+   PPLKA DIRH++K++N +G S+EL +V++R
Sbjct  94   QLCQSAVEAVLSGAPIMQISAESAVSSMSPPLKAGDIRHVSKDENSAGSSHELHKVKSRG  153

Query  518  RFKRSGAKSKRSRVWEGPVKESVNEKEVTRspshesslshqsEMAEPVVERASQLSEIVG  339
RFKRS  K  R+RV      ES  E + +                +  + R S++S   G
Sbjct  154  RFKRSSGK-PRARV------ESAAEFDESAGVILSRCY-------KSELSRDSRVSH-PG  198

Query  338  SAETSAEQESVGVEAALNSDI-------EIELDLTLGLEPLRSCVKPKEVKPSHSDEPVC  180
S E S E +S+ VE    S +       ++EL+LTLGLEP+    K   V     +
Sbjct  199  SRE-SGEADSMSVETVEASLVKPTRDGSDVELELTLGLEPIPKVQKACAVVREKEEMAGS  257

Query  179  EDGLWKMELGLDY  141
ED   K+EL L+Y
Sbjct  258  EDDTCKVELALEY  270


>emb|CAN77368.1| hypothetical protein VITISV_011159 [Vitis vinifera]
Length=272

Score = 92.4 bits (228),  Expect = 2e-19
Identities = 76/193 (39%), Positives = 104/193 (54%), Gaps = 26/193 (13%)
Frame = -1

Query  689  QLCQDAVEAVLKGDPITQM--ESDVAKNGPPLKACDIRHINKEDNLSG-SNELRRVRTRC  519
QLCQ AVEAVL G PI Q+  ES V+   PPLKA DIRH++K++N +G S+EL +V++R
Sbjct  94   QLCQSAVEAVLSGAPIMQISAESAVSSMSPPLKAGDIRHVSKDENSAGSSHELHKVKSRG  153

Query  518  RFKRSGAKSKRSRVWEGPVKESVNEKEVTRspshesslshqsEMAEPVVERASQLSEIVG  339
RFKRS  K  R+RV      ES  E + +                +  + R S++S   G
Sbjct  154  RFKRSSGK-PRARV------ESAAEFDESAGVILSRCY-------KSELSRDSRVSH-PG  198

Query  338  SAETSAEQESVGVEAALNSDI-------EIELDLTLGLEPLRSCVKPKEVKPSHSDEPVC  180
S E S E +S+ VE    S +       ++EL+LTLGLEP+    K   V     +
Sbjct  199  SRE-SGEADSMSVETVEASLVKPTRDGSDVELELTLGLEPIPKVQKACAVVREKEEMAGS  257

Query  179  EDGLWKMELGLDY  141
ED   K+EL L+Y
Sbjct  258  EDDTCKVELALEY  270


>ref|XP_002283795.1| PREDICTED: LOB domain-containing protein 41 [Vitis vinifera]
emb|CAN72182.1| hypothetical protein VITISV_004355 [Vitis vinifera]
Length=283

Score = 91.7 bits (226),  Expect = 5e-19
Identities = 70/167 (42%), Positives = 97/167 (58%), Gaps = 16/167 (10%)
Frame = -1

Query  686  LCQDAVEAVLKGDPITQMESDVAK--NGPPLKACDIRHINKEDNLSGSNELRRVRTRCRF  513
LCQ AVEAVLKG+PITQ+ S+ A    GPPLKA DIRH++K++N S +N+L R + RCRF
Sbjct  95   LCQAAVEAVLKGEPITQITSEAAATGQGPPLKAYDIRHVSKDEN-SAANDLHRAKNRCRF  153

Query  512  KRSGAKSKR-SRVWEGPVKES---VNEKEVT-RspshesslshqsEMAEP-VVERASQLS  351
KRSG ++K  ++     V ES   +N   +        S  S  S  +EP  VE  S+ +
Sbjct  154  KRSGVRAKSGNKAGTNSVAESKQGMNRGSLADELNRSTSHDSTVSHPSEPHTVEGESRET  213

Query  350  EIVGSAETS-------AEQESVGVEAALNSDIEIELDLTLGLEPLRS  231
+ S ET+       AEQ+ V        +  +EL+LTLGL P++S
Sbjct  214  VSMVSVETAEPSFLFRAEQKRVQKPEKREENRGLELELTLGLGPIQS  260


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 335060043
Number of extensions: 7155018
Number of successful extensions: 22451
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 22399
Number of HSP's successfully gapped: 0
Length of query: 689
Length of database: 6150218869
Length adjustment: 136
Effective length of query: 553
Effective length of database: 3713223445
Effective search space: 345329780385
Effective search space used: 345329780385
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 174 (71.6 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE0M6BB016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_416

Length=1072


Score     E
Sequences producing significant alignments:                       (Bits)  Value

emb|CAN73038.1|  hypothetical protein VITISV_044348 [Vitis vin...   161    6e-44
ref|XP_003597035.1|  Cysteine-rich repeat secretory protein [M...   160    1e-43
dbj|BAJ34612.1|  unnamed protein product [Thellungiella haloph...   157    5e-42
ref|XP_003543377.1|  PREDICTED: cysteine-rich repeat secretory...   156    5e-42
ref|XP_002327834.1|  predicted protein [Populus trichocarpa] >...   155    2e-41

ALIGNMENTS
>emb|CAN73038.1| hypothetical protein VITISV_044348 [Vitis vinifera]
Length=243

Score =  161 bits (408),  Expect = 6e-44
Identities = 109/251 (43%), Positives = 139/251 (55%), Gaps = 15/251 (6%)
Frame = +1

Query  49   MAYSISIRAFSLIISLILIPTALSQQSDLLNKVCIATSKNYTQNSTFSNNLKNLLDDLVQ  228
M +S SI    L++SL+L     +  +D L   C    +NYT N  FS+NL  L   L
Sbjct  1    MYFSKSIPLCFLLLSLLL---QAAIGADPLFHFCFIP-ENYTANDPFSSNLNELTTLLSS  56

Query  229  KTPHQIHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC-YTKSAIVM  405
K P     F    TGQG D+  GLALCRGDVS   C  C+ +A  ++  RC Y K AI+
Sbjct  57   KVPPTG--FGLASTGQGQDSVNGLALCRGDVSSRDCKTCVTEATKELHNRCPYNKGAIIW  114

Query  406  YDYCQVKYSNENFFGKIDHTNDNVLINVYNV-SANQTKQFKDASIGLLNKLSNQAAHSKS  582
YD C   YSN  FFG+ID+ N      VYNV SA+    F   +  LL+ LSN+A  S
Sbjct  115  YDNCLFNYSNVKFFGEIDNKNK---FYVYNVQSADDPTSFNVKTKELLSSLSNKAYGSPM  171

Query  583  LFAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSVGARVFGS  762
LFA G+ + +  E++ +YG+ QCT+DLS+  CKK L D V +LP Y D     G RV G
Sbjct  172  LFATGELVLE--ESMKLYGLAQCTRDLSSLDCKKSLDDAVSELPSYCDG--KRGGRVVGG  227

Query  763  SCTVRYEEYGF  795
SC VRYE Y F
Sbjct  228  SCNVRYELYPF  238


>ref|XP_003597035.1| Cysteine-rich repeat secretory protein [Medicago truncatula]
gb|AES67286.1| Cysteine-rich repeat secretory protein [Medicago truncatula]
Length=246

Score =  160 bits (406),  Expect = 1e-43
Identities = 101/252 (40%), Positives = 144/252 (57%), Gaps = 14/252 (6%)
Frame = +1

Query  46   SMAYSISIRAFSLIISLILIPTALSQQSDLLNKVCIATSKNYTQNSTFSNNLKNLLDDLV  225
S  ++I + +F+ +    LI T L   +D L  +C +TS+N+T +S + +NLK L++ L+
Sbjct  4    SKLFTIILFSFTFVF---LIQTTLG--TDPLFHIC-STSENFTAHSPYESNLKTLINSLI  57

Query  226  QKTPHQ-IHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC-YTKSAI  399
KTP       S + T       YGLALCRGDVS   C  C+  A  +I   C Y K AI
Sbjct  58   YKTPSTGFGSGSIDLTQYQNQKAYGLALCRGDVSTSECKTCVSQATKEILNVCPYKKGAI  117

Query  400  VMYDYCQVKYSNENFFGKIDHTNDNVLINVYNVSANQTKQFKDASIGLLNKLSNQAAHSK  579
+ YD C  KY + +FFGKID+TN   L+NV NVS     +F + +  LL+ L+N+A+ +
Sbjct  118  IWYDNCMFKYLDNDFFGKIDNTNKFALLNVQNVS--DPIKFNNMTNDLLSFLANEASMNH  175

Query  580  SLFAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSVGARVFG  759
L+A G+    + E V  YG+ QCT+D+S+  CKKCL   + +LP   D     G RV G
Sbjct  176  KLYATGELKIGESERV--YGLTQCTRDISSVDCKKCLDGAISELPNCCDG--KKGGRVVG  231

Query  760  SSCTVRYEEYGF  795
SC +RYE Y F
Sbjct  232  GSCNIRYEIYPF  243


>dbj|BAJ34612.1| unnamed protein product [Thellungiella halophila]
Length=255

Score =  157 bits (396),  Expect = 5e-42
Identities = 98/250 (39%), Positives = 133/250 (53%), Gaps = 14/250 (6%)
Frame = +1

Query  61   ISIRAFSLIISLILIPTALSQQSD--LLNKVCIATSKNYTQNSTFSNNLKNLLDDLVQKT  234
+S+   +L I L+ I +  SQ  +   L   C     N+T  S + +NL +L   +  +
Sbjct  11   VSVSVLALAIQLLFIQSVSSQSQNNAFLYHKCSDIEGNFTSKSPYESNLDSLFRRISYRV  70

Query  235  PHQIHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC-YTKSAIVMYD  411
P     F+    G  PD   GLALCRGD S   C  CL  A  ++ QRC   K+ I+ YD
Sbjct  71   PSSG--FAASSAGNSPDNVNGLALCRGDASSSDCGSCLATAIPELRQRCPNNKAGIIWYD  128

Query  412  YCQVKYSNENFFGKIDHTNDNVLINVYNVSANQTKQFKDASIGLLNKLSNQA--AHSKSL  585
C VKYS+ NFFGKID+ N   L NV NVS      F   +  LL +L+ +A    ++ L
Sbjct  129  NCLVKYSSTNFFGKIDYENRFYLYNVNNVS--DPASFNTQTKALLTELTQKATTGDNQKL  186

Query  586  FAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSVGARVFGSS  765
FA G+K  +KK+   +YG+VQCT+DL  +SCK CL  ++ +LP   D     G RV G S
Sbjct  187  FATGEKNLEKKK---LYGLVQCTRDLRRESCKACLDGIIGELPNCCDG--KEGGRVVGGS  241

Query  766  CTVRYEEYGF  795
C  RYE Y F
Sbjct  242  CNFRYEIYPF  251


>ref|XP_003543377.1| PREDICTED: cysteine-rich repeat secretory protein 38 [Glycine
max]
Length=244

Score =  156 bits (395),  Expect = 5e-42
Identities = 100/258 (39%), Positives = 142/258 (55%), Gaps = 24/258 (9%)
Frame = +1

Query  43   MSMAYSISIRAFSLIISLILIPTALSQQSDLLNKVCIATSKNYTQNSTFSNNLKNLLDDL  222
MS +   +   FSL ++L+L     S  +D L   C + S+N+T NS + +NLK L++ L
Sbjct  1    MSSSKLFTTFLFSLNLALLL---QTSFGADPLFHFC-SNSENFTANSPYESNLKTLINSL  56

Query  223  VQKTPH------QIHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC-  381
+ KTP        +  + N+K        Y LALCRGDVS   C  C+ +A  +I  RC
Sbjct  57   IYKTPSTGFGVGSVGQYQNQKA-------YALALCRGDVSASECKTCVSEAPKEILSRCP  109

Query  382  YTKSAIVMYDYCQVKYSNENFFGKIDHTNDNVLINVYNVSANQTKQFKDASIGLLNKLSN  561
Y K AI+ YDYC  KY + +F GKID+TN   + N+ NVS   T  F   +  LL++L+
Sbjct  110  YNKGAIIWYDYCMFKYLDTDFLGKIDNTNKFYMWNLKNVSDPAT--FNYNTRDLLSQLAQ  167

Query  562  QAAHSKSLFAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSV  741
+A  +  L+A G+   +  E  ++YG+ QCT+DLS+  CKKCL D + +LP   D
Sbjct  168  KAYVNNKLYATGEAKLENSE--TLYGLTQCTRDLSSSDCKKCLDDAINELPNCCDG--KE  223

Query  742  GARVFGSSCTVRYEEYGF  795
G RV   SC  RYE Y F
Sbjct  224  GGRVVSGSCNFRYEIYPF  241


>ref|XP_002327834.1| predicted protein [Populus trichocarpa]
gb|EEE75622.1| predicted protein [Populus trichocarpa]
Length=242

Score =  155 bits (391),  Expect = 2e-41
Identities = 96/251 (38%), Positives = 137/251 (55%), Gaps = 17/251 (7%)
Frame = +1

Query  46   SMAYSISIRAFSLIISLILIPTALSQQSDLLNKVCIATSKNYTQNSTFSNNLKNLLDDLV  225
+ A+S+ +  FSL++  +             N    ++ +N+T N  + +NL  L   L
Sbjct  5    NFAFSLCLITFSLLLHTVFGAGP--------NFHLCSSPENFTANGPYESNLNKLTSYLY  56

Query  226  QKTPHQIHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC-YTKSAIV  402
K P     F     G  PD TYGLALCRGDVS   C  C+ +A  +I +RC Y K+AI+
Sbjct  57   YKAPPTG--FGMGSRGHTPDQTYGLALCRGDVSTSDCKTCVVEASSEIRKRCPYNKAAII  114

Query  403  MYDYCQVKYSNENFFGKIDHTNDNVLINVYNVSANQTKQFKDASIGLLNKLSNQAAHSKS  582
YD C +KYSN  FFG+ID+ N   + NV+ VS  +   F + +  LL++L+N+A  +
Sbjct  115  WYDNCLLKYSNNAFFGQIDNGNKFYMWNVHVVS--EPAPFNEKTKELLSQLANEAQATPK  172

Query  583  LFAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSVGARVFGS  762
LFA G++   K  +  +YG+VQCT DLS+  CKKCL  ++ +LPI  D     G RV
Sbjct  173  LFATGERELGK--STKLYGLVQCTGDLSSAVCKKCLDGIIGELPICCDG--KQGGRVVSG  228

Query  763  SCTVRYEEYGF  795
SC   YE Y F
Sbjct  229  SCNFIYEIYPF  239


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 544842042
Number of extensions: 11412868
Number of successful extensions: 28347
Number of sequences better than 1e-10: 65
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 28143
Number of HSP's successfully gapped: 66
Length of query: 1072
Length of database: 6150218869
Length adjustment: 142
Effective length of query: 930
Effective length of database: 3605708941
Effective search space: 775227422315
Effective search space used: 775227422315
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 177 (72.8 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE5H41901N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_4555

Length=665


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|NP_001235512.1|  disease resistance protein [Glycine max] ...   237    6e-70
ref|XP_003530247.1|  PREDICTED: LOW QUALITY PROTEIN: protein B...   235    2e-68
ref|XP_003551773.1|  PREDICTED: LRR receptor-like serine/threo...   233    3e-67
ref|XP_002324588.1|  predicted protein [Populus trichocarpa] >...   192    2e-57
ref|XP_002324600.1|  predicted protein [Populus trichocarpa] >...   191    3e-54

ALIGNMENTS
>ref|NP_001235512.1| disease resistance protein [Glycine max]
gb|ACM89590.1| disease resistance protein [Glycine max]
Length=771

Score =  237 bits (605),  Expect = 6e-70
Identities = 127/173 (73%), Positives = 137/173 (79%), Gaps = 3/173 (2%)
Frame = +2

Query  2    RALNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSG  181
RALNLS+N LTGK+P TFS L Q ESLDLSFNML+ QIPPQLS L  LEVFSVAHNNLSG
Sbjct  597  RALNLSHNDLTGKIPVTFSLLAQTESLDLSFNMLNSQIPPQLSMLTSLEVFSVAHNNLSG  656

Query  182  ATPEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYV  361
TP++KGQ STFDESSYEGN FLCGPPLPKSCNP    P  +PN  N DGDND  +DMYV
Sbjct  657  PTPDFKGQFSTFDESSYEGNPFLCGPPLPKSCNP---PPTIIPNDSNTDGDNDSLLDMYV  713

Query  362  FRVSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNLYKFF  520
F VSF V+Y S +LV    LYINPYWRQAWFYY+ LV MNCYYFI DNL K F
Sbjct  714  FCVSFAVSYISTLLVTAAALYINPYWRQAWFYYMELVSMNCYYFIKDNLSKVF  766


>ref|XP_003530247.1| PREDICTED: LOW QUALITY PROTEIN: protein BRASSINOSTEROID INSENSITIVE
1-like [Glycine max]
Length=936

Score =  235 bits (599),  Expect = 2e-68
Identities = 124/173 (72%), Positives = 135/173 (78%), Gaps = 3/173 (2%)
Frame = +2

Query  2    RALNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSG  181
R LNLS+N LTG++PATFS+LVQ ESLDLSFNML+GQIPPQL+ L  LEVFSVAHNNLSG
Sbjct  762  RTLNLSHNDLTGQIPATFSHLVQTESLDLSFNMLNGQIPPQLTMLTSLEVFSVAHNNLSG  821

Query  182  ATPEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYV  361
TPE+K Q STFDESSYEGN FLCG PLPKSCNP    P  +PN  N DG  D  VDMY
Sbjct  822  PTPEFKEQFSTFDESSYEGNPFLCGLPLPKSCNP---PPTVIPNDSNTDGHYDTLVDMYF  878

Query  362  FRVSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNLYKFF  520
F VSFVV+YTS +LV    LYINPYWR AWFYY+ L  MNCYYFIVDN  K F
Sbjct  879  FCVSFVVSYTSALLVTAAALYINPYWRHAWFYYMELASMNCYYFIVDNCSKVF  931


>ref|XP_003551773.1| PREDICTED: LRR receptor-like serine/threonine-protein kinase
GSO2-like [Glycine max]
Length=1133

Score =  233 bits (593),  Expect = 3e-67
Identities = 118/171 (69%), Positives = 134/171 (78%), Gaps = 3/171 (2%)
Frame = +2

Query  2     RALNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSG  181
RALNLS+N L G++PATFSNLVQ ESLDLSFN LSGQIPPQLS L  LEVFSVAHNNLSG
Sbjct  879   RALNLSHNDLIGQIPATFSNLVQTESLDLSFNKLSGQIPPQLSKLTSLEVFSVAHNNLSG  938

Query  182   ATPEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYV  361
TPEWKGQ STF+ SSYEGN FLCGPPL KSCNP    P+ +PN  +   D+   VDMYV
Sbjct  939   TTPEWKGQFSTFENSSYEGNPFLCGPPLSKSCNP---PPSIIPNDSHTHVDDGSLVDMYV  995

Query  362   FRVSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNLYK  514
F VSF V++++ +L   I LYINPY R+AWFYY+ LVC NCYYFIVD+  K
Sbjct  996   FYVSFAVSFSAALLATAIALYINPYCRRAWFYYMELVCSNCYYFIVDSFSK  1046


>ref|XP_002324588.1| predicted protein [Populus trichocarpa]
gb|EEF03153.1| predicted protein [Populus trichocarpa]
Length=243

Score =  192 bits (488),  Expect = 2e-57
Identities = 95/169 (56%), Positives = 120/169 (71%), Gaps = 1/169 (1%)
Frame = +2

Query  2    RALNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSG  181
+ LNLS+N LTG +P TFSNL ++E+LDLS+N L+G+IPPQL  L++L  FSVAHNNLSG
Sbjct  63   KLLNLSHNSLTGPIPPTFSNLKEIETLDLSYNNLNGEIPPQLLDLNFLSAFSVAHNNLSG  122

Query  182  ATPEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYV  361
TP+   Q STF++S YEGN  LCGPPL K+C      P+ LP    +  + +  +DM
Sbjct  123  KTPKMVAQFSTFNKSCYEGNPLLCGPPLAKNCT-GAIPPSPLPRSQTHKKEENGVIDMEA  181

Query  362  FRVSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNL  508
F V+F VAY  ++L I  VLYINP WRQAWFY+IG    NCYYF+VDNL
Sbjct  182  FYVTFSVAYIMVLLAIGAVLYINPQWRQAWFYFIGESINNCYYFLVDNL  230


>ref|XP_002324600.1| predicted protein [Populus trichocarpa]
gb|EEF03165.1| predicted protein [Populus trichocarpa]
Length=534

Score =  191 bits (486),  Expect = 3e-54
Identities = 95/167 (57%), Positives = 119/167 (71%), Gaps = 1/167 (1%)
Frame = +2

Query  8    LNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSGAT  187
LNLS+N LTG +P TFSNL ++E+LDLS+N L+G+IPPQL  L++L  FSVAHNNLSG T
Sbjct  356  LNLSHNSLTGPIPPTFSNLKEIETLDLSYNNLNGEIPPQLLDLNFLSAFSVAHNNLSGKT  415

Query  188  PEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYVFR  367
PE   Q STF++S YEGN  LCGPPL K+C      P+ +P    +  + +  +DM  F
Sbjct  416  PEMVAQFSTFNKSCYEGNLLLCGPPLAKNCT-GAIPPSPVPRSQTHKKEENGVIDMEAFY  474

Query  368  VSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNL  508
V+F VAY  ++L I  VLYINP WRQAWFY+IG    NCYYF+VDNL
Sbjct  475  VTFSVAYIIVLLAIGAVLYINPQWRQAWFYFIGESINNCYYFLVDNL  521


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 361315866
Number of extensions: 7764389
Number of successful extensions: 30031
Number of sequences better than 1e-10: 119
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 28965
Number of HSP's successfully gapped: 120
Length of query: 665
Length of database: 6150218869
Length adjustment: 135
Effective length of query: 530
Effective length of database: 3731142529
Effective search space: 320878257494
Effective search space used: 320878257494
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 174 (71.6 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE5YZ9E01N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_4931

Length=650


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002298184.1|  predicted protein [Populus trichocarpa] >...  85.9    4e-18
ref|NP_001150703.1|  chemocyanin precursor [Zea mays] >gb|ACG4...  81.6    2e-16
dbj|BAJ95771.1|  predicted protein [Hordeum vulgare subsp. vul...  81.3    2e-16
gb|ACG40792.1|  chemocyanin precursor [Zea mays] >gb|ACR37830....  80.9    3e-16
gb|ACG29532.1|  chemocyanin precursor [Zea mays]                   80.5    4e-16

ALIGNMENTS
>ref|XP_002298184.1| predicted protein [Populus trichocarpa]
gb|EEE82989.1| predicted protein [Populus trichocarpa]
Length=125

Score = 85.9 bits (211),  Expect = 4e-18
Identities = 50/136 (37%), Positives = 74/136 (54%), Gaps = 13/136 (10%)
Frame = -3

Query  648  MSGKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNK  469
+ G+G+ +VA   + + +L+LHFDMA++  Y VG   G   W   +   +  P  K+
Sbjct  2    VQGRGSAMVATVAVMLCMLLLHFDMAHAATYTVGGPGG---WTFNV---SGWPKGKSFKA  55

Query  468  GDTLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESR  289
GD L F Y+   HNVV V   GY++C   +G+  Y T+G + I + KG N+FIC+
Sbjct  56   GDILVFNYSTAAHNVVAVNKAGYSSCTSPRGAKVY-TSGKDQIKLVKGQNFFICSF----  110

Query  288  GGNKCSNYGMKIAVKA  241
C + GMKIAV A
Sbjct  111  -AGHCQS-GMKIAVNA  124


>ref|NP_001150703.1| chemocyanin precursor [Zea mays]
gb|ACG40061.1| chemocyanin precursor [Zea mays]
Length=132

Score = 81.6 bits (200),  Expect = 2e-16
Identities = 52/134 (39%), Positives = 68/134 (51%), Gaps = 13/134 (10%)
Frame = -3

Query  642  GKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNKGD  463
G+G+   AA  L ++ ++LH ++A S  Y VG   G   W+         P  K    GD
Sbjct  11   GRGSGAAAALALVLLCVLLHGELAESAVYTVGDRGG---WS---FNTANWPKGKRFRAGD  64

Query  462  TLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESRGG  283
L F YN   HNVV V   GY +C   KG     TTGN+ +++K+G NYFIC+ P
Sbjct  65   VLAFRYNAKAHNVVPVSAAGYKSCSAPKGVRAL-TTGNDRVTLKRGTNYFICSFP-----  118

Query  282  NKCSNYGMKIAVKA  241
C   GMKIAV A
Sbjct  119  GHC-QAGMKIAVTA  131


>dbj|BAJ95771.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length=121

Score = 81.3 bits (199),  Expect = 2e-16
Identities = 49/134 (37%), Positives = 67/134 (50%), Gaps = 17/134 (13%)
Frame = -3

Query  642  GKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNKGD  463
G G    A   L ++ ++LH + A SK Y VG    S  W          P  K    GD
Sbjct  4    GSGGSATAVLALVLLCVLLHGEFAESKVYTVGWAVSSGGW----------PRGKRFRAGD  53

Query  462  TLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESRGG  283
L F Y +  HNVV V   GY +C  ++GS TY  +G++ +++ +G NYFIC+VP
Sbjct  54   VLLFKYGRGAHNVVAVNAAGYKSCSAARGSRTY-NSGSDRVTLSRGTNYFICSVP-----  107

Query  282  NKCSNYGMKIAVKA  241
C   GMK+AV A
Sbjct  108  GHC-QAGMKMAVTA  120


>gb|ACG40792.1| chemocyanin precursor [Zea mays]
gb|ACR37830.1| unknown [Zea mays]
Length=130

Score = 80.9 bits (198),  Expect = 3e-16
Identities = 52/135 (39%), Positives = 68/135 (50%), Gaps = 13/135 (10%)
Frame = -3

Query  645  SGKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNKG  466
S +G+   AA  L ++ ++LH ++A S  Y VG   G   W+         P  K    G
Sbjct  8    SARGSGAAAALALVLLCVLLHGELAESAVYTVGDRGG---WS---FNTANWPKGKRFRAG  61

Query  465  DTLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESRG  286
D L F YN   HNVV V   GY +C   KG     TTGN+ +++K+G NYFIC+ P
Sbjct  62   DVLAFRYNAKAHNVVPVSAAGYKSCSAPKGVRAL-TTGNDRVTLKRGANYFICSFP----  116

Query  285  GNKCSNYGMKIAVKA  241
C   GMKIAV A
Sbjct  117  -GHC-QAGMKIAVTA  129


>gb|ACG29532.1| chemocyanin precursor [Zea mays]
Length=129

Score = 80.5 bits (197),  Expect = 4e-16
Identities = 49/134 (37%), Positives = 71/134 (53%), Gaps = 13/134 (10%)
Frame = -3

Query  642  GKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNKGD  463
G GA+ +AAA   ++ ++LH  +A S  + VG   G   W+      T   + K    GD
Sbjct  8    GSGAVALAAAAAVLLCVLLHAHVAESAVFTVGDRGG---WSFSTGTWT---NGKRFKAGD  61

Query  462  TLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESRGG  283
L F Y+   HNVV V   GY  C   +G+  Y T+GN+ +++ +G NYFIC++P
Sbjct  62   VLVFKYDSTAHNVVAVNAAGYKGCSAPRGAKVY-TSGNDRVTLARGTNYFICSIP-----  115

Query  282  NKCSNYGMKIAVKA  241
C + GMKIAV A
Sbjct  116  GHCQS-GMKIAVTA  128


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 278294648
Number of extensions: 5229159
Number of successful extensions: 11404
Number of sequences better than 1e-10: 1
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 11386
Number of HSP's successfully gapped: 1
Length of query: 650
Length of database: 6150218869
Length adjustment: 135
Effective length of query: 515
Effective length of database: 3731142529
Effective search space: 302222544849
Effective search space used: 302222544849
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 174 (71.6 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE6FV6M016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters

Query= TrVeIntMedtrGB1_5067

Length=644


No significant similarity found. For reasons why, click here.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 294853118 Number of extensions: 5561555 Number of successful extensions: 12950 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 12938 Number of HSP's successfully gapped: 0 Length of query: 644 Length of database: 6150218869 Length adjustment: 135 Effective length of query: 509 Effective length of database: 3731142529 Effective search space: 294760259791 Effective search space used: 294760259791 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 174 (71.6 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362


A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE6P1NR013


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_5537

Length=626


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003593357.1|  Pathogenesis-related protein 1a [Medicago...   302    2e-99
ref|XP_003593358.1|  Pathogenesis-related protein 1a [Medicago...   278    2e-92
ref|XP_003593453.1|  Pathogenesis-related protein 1a [Medicago...   276    2e-91
ref|XP_003606006.1|  Maturase K [Medicago truncatula] >gb|AES8...   271    4e-82
ref|XP_003593359.1|  Pathogenesis-related protein 1a [Medicago...   240    6e-78

ALIGNMENTS
>ref|XP_003593357.1| Pathogenesis-related protein 1a [Medicago truncatula]
gb|AES63608.1| Pathogenesis-related protein 1a [Medicago truncatula]
Length=338

Score =  302 bits (774),  Expect = 2e-99
Identities = 141/148 (95%), Positives = 145/148 (98%), Gaps = 1/148 (1%)
Frame = -3

Query  624  GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH  445
GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH
Sbjct  11   GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH  70

Query  444  SGGRYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQRVG  265
SGGRYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRN++R+G
Sbjct  71   SGGRYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNTKRIG  130

Query  264  CAKVRCDNNRGTFITCNYDPPGNYVGEK  181
CAKVRC NN GTFI CNYDPPGNYVG+K
Sbjct  131  CAKVRC-NNGGTFIICNYDPPGNYVGQK  157


>ref|XP_003593358.1| Pathogenesis-related protein 1a [Medicago truncatula]
gb|AES63609.1| Pathogenesis-related protein 1a [Medicago truncatula]
Length=162

Score =  278 bits (711),  Expect = 2e-92
Identities = 126/152 (83%), Positives = 142/152 (93%), Gaps = 2/152 (1%)
Frame = -3

Query  624  GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH  445
GL +I+ HVAHAQNSQ+DYVN+HN+ARRQVGVAN+VWDN +A+ AQ+YAN R+GDC+L H
Sbjct  11   GLSLIMVHVAHAQNSQSDYVNAHNDARRQVGVANIVWDNTVASFAQDYANQRKGDCQLIH  70

Query  444  SGG--RYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQR  271
SGG  RYGENLA S+GD+SG+DAV+LWVNEK DY+YNSNTCASGKVCGHYTQVVWRNSQR
Sbjct  71   SGGGGRYGENLAWSSGDMSGSDAVKLWVNEKADYDYNSNTCASGKVCGHYTQVVWRNSQR  130

Query  270  VGCAKVRCDNNRGTFITCNYDPPGNYVGEKPY  175
VGCAKVRCDNNRGTFITCNYDPPGNYVGEKPY
Sbjct  131  VGCAKVRCDNNRGTFITCNYDPPGNYVGEKPY  162


>ref|XP_003593453.1| Pathogenesis-related protein 1a [Medicago truncatula]
gb|AES63704.1| Pathogenesis-related protein 1a [Medicago truncatula]
Length=181

Score =  276 bits (706),  Expect = 2e-91
Identities = 124/151 (82%), Positives = 141/151 (93%), Gaps = 2/151 (1%)
Frame = -3

Query  621  LFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTHS  442
L ++IGHVA+AQNS+ADYVN+HN+ARRQVGV ++VWDN +A+ AQ+YAN R+GDC+L HS
Sbjct  12   LLLVIGHVANAQNSRADYVNAHNDARRQVGVGDIVWDNTVASFAQDYANQRKGDCQLIHS  71

Query  441  GG--RYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQRV  268
GG  RYGENLA S+GD+SG+DAV+LWVNEK DYNYNSNTCASGKVCGHYTQVVWRNSQRV
Sbjct  72   GGGGRYGENLAWSSGDMSGSDAVKLWVNEKADYNYNSNTCASGKVCGHYTQVVWRNSQRV  131

Query  267  GCAKVRCDNNRGTFITCNYDPPGNYVGEKPY  175
GCAKVRCDNNRGTFITCNYDPPGNYVGEKPY
Sbjct  132  GCAKVRCDNNRGTFITCNYDPPGNYVGEKPY  162


>ref|XP_003606006.1| Maturase K [Medicago truncatula]
gb|AES88203.1| Maturase K [Medicago truncatula]
Length=855

Score =  271 bits (692),  Expect = 4e-82
Identities = 122/149 (82%), Positives = 139/149 (93%), Gaps = 2/149 (1%)
Frame = -3

Query  621  LFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTHS  442
L ++IGHVA+AQNS+ADYVN+HN+ARRQVGV ++VWDN +A+ AQ+YAN R+GDC+L HS
Sbjct  342  LLLVIGHVANAQNSRADYVNAHNDARRQVGVGDIVWDNTVASFAQDYANQRKGDCQLIHS  401

Query  441  GG--RYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQRV  268
GG  RYGENLA S+GD+SG+DAV+LWVNEK DYNYNSNTCASGKVCGHYTQVVWRNSQRV
Sbjct  402  GGGGRYGENLAWSSGDMSGSDAVKLWVNEKADYNYNSNTCASGKVCGHYTQVVWRNSQRV  461

Query  267  GCAKVRCDNNRGTFITCNYDPPGNYVGEK  181
GCAKVRCDNNRGTFITCNYDPPGNYVGEK
Sbjct  462  GCAKVRCDNNRGTFITCNYDPPGNYVGEK  490


>ref|XP_003593359.1| Pathogenesis-related protein 1a [Medicago truncatula]
gb|AES63610.1| Pathogenesis-related protein 1a [Medicago truncatula]
Length=138

Score =  240 bits (613),  Expect = 6e-78
Identities = 115/150 (77%), Positives = 124/150 (83%), Gaps = 22/150 (15%)
Frame = -3

Query  624  GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH  445
GL +IIGHVAHAQ+SQADYVN+HNEAR +VGV                     GDC+L H
Sbjct  11   GLLLIIGHVAHAQDSQADYVNAHNEARSEVGV---------------------GDCQLIH  49

Query  444  SGGRYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQRVG  265
SGGRYGENLAGSTGDLSG+DAV+LWVNEK DY+YNSNTCASGKVCGHYTQVVWRNSQRVG
Sbjct  50   SGGRYGENLAGSTGDLSGSDAVKLWVNEKADYDYNSNTCASGKVCGHYTQVVWRNSQRVG  109

Query  264  CAKVRCDNNRGTFITCNYDPPGNYVGEKPY  175
CAKVRCDNNRGTFITCNYDPPGN+ GEKPY
Sbjct  110  CAKVRCDNNRGTFITCNYDPPGNF-GEKPY  138


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 6905539149
Number of extensions: 154640758
Number of successful extensions: 389294
Number of sequences better than 1e-10: 1563
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 385289
Number of HSP's successfully gapped: 1575
Length of query: 626
Length of database: 6150218869
Length adjustment: 134
Effective length of query: 492
Effective length of database: 3749061613
Effective search space: 277430559362
Effective search space used: 277430559362
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 173 (71.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE6TT4M01N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_5544

Length=626


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002326231.1|  predicted protein [Populus trichocarpa] >...   235    5e-73
ref|XP_002528628.1|  zinc finger protein, putative [Ricinus co...   232    6e-72
ref|XP_002284087.2|  PREDICTED: GDSL esterase/lipase At5g03820...   229    1e-70
emb|CBI30208.3|  unnamed protein product [Vitis vinifera]           229    2e-70
ref|XP_002527431.1|  zinc finger protein, putative [Ricinus co...   222    8e-68

ALIGNMENTS
>ref|XP_002326231.1| predicted protein [Populus trichocarpa]
gb|EEE71901.1| predicted protein [Populus trichocarpa]
Length=351

Score =  235 bits (599),  Expect = 5e-73
Identities = 112/150 (75%), Positives = 126/150 (84%), Gaps = 0/150 (0%)
Frame = -2

Query  625  KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT  446
++L  LG  RIGVTSLPP GCLPAAITLFG GSN CV  LN DA++FN+KLN+TSQ L
Sbjct  202  QNLYGLGARRIGVTSLPPTGCLPAAITLFGAGSNQCVESLNQDAILFNDKLNSTSQGLVQ  261

Query  445  KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ  266
KL  LKLVVFDIY PLLD++ KPSD+GF ESRR CCGTGTLETS LCN RS+GTCSNAT+
Sbjct  262  KLPGLKLVVFDIYQPLLDMIRKPSDNGFFESRRACCGTGTLETSVLCNDRSVGTCSNATE  321

Query  265  YVFWDGFHPSESANEKLAQTLLEQGFDLIS  176
YVFWDGFHPSE+AN+ LA  LL+QGFDLIS
Sbjct  322  YVFWDGFHPSEAANQVLAGDLLQQGFDLIS  351


>ref|XP_002528628.1| zinc finger protein, putative [Ricinus communis]
gb|EEF33731.1| zinc finger protein, putative [Ricinus communis]
Length=352

Score =  232 bits (592),  Expect = 6e-72
Identities = 111/150 (74%), Positives = 125/150 (83%), Gaps = 0/150 (0%)
Frame = -2

Query  625  KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT  446
++L  LG  RIGVT LPP GCLPAAITLFG GSN CV RLN DA+ FNNKLN+TSQSL +
Sbjct  203  QNLYQLGARRIGVTGLPPTGCLPAAITLFGAGSNQCVERLNRDAISFNNKLNSTSQSLVS  262

Query  445  KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ  266
L  LKLVVFDIY PLLD++ KP+D+GF E+RR CCGTGTLETS LCN RS+GTCS+ATQ
Sbjct  263  NLPGLKLVVFDIYQPLLDMILKPTDNGFFEARRACCGTGTLETSVLCNARSLGTCSDATQ  322

Query  265  YVFWDGFHPSESANEKLAQTLLEQGFDLIS  176
YVFWDGFHPSE+AN+ LA  LL QGFDLIS
Sbjct  323  YVFWDGFHPSEAANKVLAGDLLAQGFDLIS  352


>ref|XP_002284087.2| PREDICTED: GDSL esterase/lipase At5g03820-like [Vitis vinifera]
Length=351

Score =  229 bits (583),  Expect = 1e-70
Identities = 108/149 (72%), Positives = 127/149 (85%), Gaps = 0/149 (0%)
Frame = -2

Query  625  KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT  446
++L  LGV +IGVT+LPP GCLPAAITLF  GSN CV+RLN DA+ FN+KLN TSQ L+
Sbjct  202  QNLYGLGVRKIGVTTLPPTGCLPAAITLFSSGSNQCVARLNQDAINFNSKLNITSQVLQN  261

Query  445  KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ  266
KL  LKLVVFDIY PLL+L+TKP+D+GF ESR+ CCGTGT+ETS LCN RS+GTCSNA+Q
Sbjct  262  KLPGLKLVVFDIYQPLLNLITKPTDNGFFESRKACCGTGTIETSLLCNARSVGTCSNASQ  321

Query  265  YVFWDGFHPSESANEKLAQTLLEQGFDLI  179
YVFWDGFHPSESAN+ LA +LLEQG +LI
Sbjct  322  YVFWDGFHPSESANQLLAGSLLEQGINLI  350


>emb|CBI30208.3| unnamed protein product [Vitis vinifera]
Length=363

Score =  229 bits (583),  Expect = 2e-70
Identities = 108/149 (72%), Positives = 127/149 (85%), Gaps = 0/149 (0%)
Frame = -2

Query  625  KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT  446
++L  LGV +IGVT+LPP GCLPAAITLF  GSN CV+RLN DA+ FN+KLN TSQ L+
Sbjct  202  QNLYGLGVRKIGVTTLPPTGCLPAAITLFSSGSNQCVARLNQDAINFNSKLNITSQVLQN  261

Query  445  KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ  266
KL  LKLVVFDIY PLL+L+TKP+D+GF ESR+ CCGTGT+ETS LCN RS+GTCSNA+Q
Sbjct  262  KLPGLKLVVFDIYQPLLNLITKPTDNGFFESRKACCGTGTIETSLLCNARSVGTCSNASQ  321

Query  265  YVFWDGFHPSESANEKLAQTLLEQGFDLI  179
YVFWDGFHPSESAN+ LA +LLEQG +LI
Sbjct  322  YVFWDGFHPSESANQLLAGSLLEQGINLI  350


>ref|XP_002527431.1| zinc finger protein, putative [Ricinus communis]
gb|EEF34923.1| zinc finger protein, putative [Ricinus communis]
Length=359

Score =  222 bits (565),  Expect = 8e-68
Identities = 105/149 (70%), Positives = 123/149 (83%), Gaps = 0/149 (0%)
Frame = -2

Query  625  KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT  446
K+L +LG  +IGVT+LPPLGCLPAAIT+FG  SN+CV+ LN D+V FNNKLNATSQSL+
Sbjct  210  KNLYNLGARKIGVTTLPPLGCLPAAITIFGSDSNDCVANLNQDSVSFNNKLNATSQSLRN  269

Query  445  KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ  266
KL  LKLVVFDIY PL D+VTKPSD+GF E+RR CCGTG LE+S LCN +SIGTC NA++
Sbjct  270  KLSGLKLVVFDIYQPLYDIVTKPSDNGFVEARRACCGTGLLESSILCNSKSIGTCKNASE  329

Query  265  YVFWDGFHPSESANEKLAQTLLEQGFDLI  179
YVFWDGFHPSE+AN+ LA  LL  G  LI
Sbjct  330  YVFWDGFHPSEAANKILADDLLTSGISLI  358


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 323187050
Number of extensions: 6813078
Number of successful extensions: 18085
Number of sequences better than 1e-10: 65
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 17933
Number of HSP's successfully gapped: 67
Length of query: 626
Length of database: 6150218869
Length adjustment: 134
Effective length of query: 492
Effective length of database: 3749061613
Effective search space: 277430559362
Effective search space used: 277430559362
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 173 (71.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE7PCVP012


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_5779

Length=617


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002520279.1|  ATP binding protein, putative [Ricinus co...  71.6    2e-11

ALIGNMENTS
>ref|XP_002520279.1| ATP binding protein, putative [Ricinus communis]
gb|EEF42065.1| ATP binding protein, putative [Ricinus communis]
Length=1433

Score = 71.6 bits (174),  Expect = 2e-11
Identities = 32/58 (55%), Positives = 41/58 (71%), Gaps = 0/58 (0%)
Frame = -1

Query  605  VLREGSLEQIKGMGELVRRCLRLQSEERPTMNEVAMELEGFRKISRHPWAVQEEVDEE  432
+L EG++EQIK +  L +RCLR++ EERPTM EVAMELEG R + +HPW   E    E
Sbjct  653  ILNEGNIEQIKEVSSLAKRCLRVKGEERPTMKEVAMELEGLRLMVKHPWVNNESNSSE  710


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 244073953
Number of extensions: 4381041
Number of successful extensions: 11480
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 11478
Number of HSP's successfully gapped: 0
Length of query: 617
Length of database: 6150218869
Length adjustment: 134
Effective length of query: 483
Effective length of database: 3749061613
Effective search space: 266183374523
Effective search space used: 266183374523
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 173 (71.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE84NHW016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_5876

Length=613


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002281329.1|  PREDICTED: uncharacterized protein LOC100...   119    2e-30
emb|CAN66257.1|  hypothetical protein VITISV_001236 [Vitis vin...   119    2e-30
ref|NP_200876.1|  uncharacterized protein [Arabidopsis thalian...   118    5e-30
ref|XP_003597242.1|  hypothetical protein MTR_2g094410 [Medica...   117    6e-30
ref|XP_003543482.1|  PREDICTED: uncharacterized protein LOC100...   114    1e-28

ALIGNMENTS
>ref|XP_002281329.1| PREDICTED: uncharacterized protein LOC100258555 isoform 1 [Vitis
vinifera]
ref|XP_003632611.1| PREDICTED: uncharacterized protein LOC100258555 isoform 2 [Vitis
vinifera]
Length=163

Score =  119 bits (298),  Expect = 2e-30
Identities = 61/85 (72%), Positives = 66/85 (78%), Gaps = 1/85 (1%)
Frame = -3

Query  611  LPVNVPDWSKILRGEYTPRACEWGDGYESDDWGDDGND-KIPPHEYLARTRVASLSVHEG  435
LPVN+PDWSKILR +Y     +  D    D   DD +D +IPPHEYLARTRVAS SVHEG
Sbjct  79   LPVNIPDWSKILRDDYRLSQRKESDEDVDDVEEDDDHDSRIPPHEYLARTRVASFSVHEG  138

Query  434  IGRTLKGRDLSRVRNAIWKKTGFED  360
IGRTLKGRDLSRVRNAIWKK GFED
Sbjct  139  IGRTLKGRDLSRVRNAIWKKVGFED  163


>emb|CAN66257.1| hypothetical protein VITISV_001236 [Vitis vinifera]
Length=163

Score =  119 bits (298),  Expect = 2e-30
Identities = 61/85 (72%), Positives = 66/85 (78%), Gaps = 1/85 (1%)
Frame = -3

Query  611  LPVNVPDWSKILRGEYTPRACEWGDGYESDDWGDDGND-KIPPHEYLARTRVASLSVHEG  435
LPVN+PDWSKILR +Y     +  D    D   DD +D +IPPHEYLARTRVAS SVHEG
Sbjct  79   LPVNIPDWSKILRDDYRLSQRKESDEDVDDVEEDDDHDSRIPPHEYLARTRVASFSVHEG  138

Query  434  IGRTLKGRDLSRVRNAIWKKTGFED  360
IGRTLKGRDLSRVRNAIWKK GFED
Sbjct  139  IGRTLKGRDLSRVRNAIWKKVGFED  163


>ref|NP_200876.1| uncharacterized protein [Arabidopsis thaliana]
dbj|BAB09841.1| unnamed protein product [Arabidopsis thaliana]
gb|AAL36227.1| unknown protein [Arabidopsis thaliana]
gb|AAM14170.1| unknown protein [Arabidopsis thaliana]
gb|AED97366.1| uncharacterized protein [Arabidopsis thaliana]
Length=163

Score =  118 bits (295),  Expect = 5e-30
Identities = 58/85 (68%), Positives = 68/85 (80%), Gaps = 1/85 (1%)
Frame = -3

Query  611  LPVNVPDWSKILRGEYTP-RACEWGDGYESDDWGDDGNDKIPPHEYLARTRVASLSVHEG  435
LPVNVPDWSKILRGEY   R     D  + DD  +DG D +PPHE+LA+TR+AS SVHEG
Sbjct  79   LPVNVPDWSKILRGEYRDNRRRSIEDNDDDDDDNEDGGDWLPPHEFLAKTRMASFSVHEG  138

Query  434  IGRTLKGRDLSRVRNAIWKKTGFED  360
+GRTLKGRDLSRVRNAI++K GF+D
Sbjct  139  VGRTLKGRDLSRVRNAIFEKFGFQD  163


>ref|XP_003597242.1| hypothetical protein MTR_2g094410 [Medicago truncatula]
gb|AES67493.1| hypothetical protein MTR_2g094410 [Medicago truncatula]
Length=156

Score =  117 bits (294),  Expect = 6e-30
Identities = 58/84 (69%), Positives = 66/84 (79%), Gaps = 2/84 (2%)
Frame = -3

Query  611  LPVNVPDWSKILRGEYTPRACEWGDGYESDDWGDDGNDKIPPHEYLARTRVASLSVHEGI  432
LPVNVPDWSKIL  +Y        D  + +D GDD  +KIPPHE+LARTR+AS SVHEG+
Sbjct  75   LPVNVPDWSKILGEDYRHNRRRNYDDVDEEDEGDD--EKIPPHEFLARTRMASFSVHEGV  132

Query  431  GRTLKGRDLSRVRNAIWKKTGFED  360
GRTLKGRDLSRVRNAIW KTGF+D
Sbjct  133  GRTLKGRDLSRVRNAIWAKTGFQD  156


>ref|XP_003543482.1| PREDICTED: uncharacterized protein LOC100814909 [Glycine max]
Length=154

Score =  114 bits (285),  Expect = 1e-28
Identities = 56/85 (66%), Positives = 66/85 (78%), Gaps = 1/85 (1%)
Frame = -3

Query  611  LPVNVPDWSKILRGEY-TPRACEWGDGYESDDWGDDGNDKIPPHEYLARTRVASLSVHEG  435
LPVNVPDWSKIL  EY   +   + D  E+    +DG  ++PPHE+LARTR+AS SVHEG
Sbjct  70   LPVNVPDWSKILGDEYGRNQRRNYDDDDEARSDEEDGVGRVPPHEFLARTRIASFSVHEG  129

Query  434  IGRTLKGRDLSRVRNAIWKKTGFED  360
+GRTLKGRDLSRVRNAIW KTGF+D
Sbjct  130  VGRTLKGRDLSRVRNAIWAKTGFQD  154


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 246135601
Number of extensions: 4540019
Number of successful extensions: 9641
Number of sequences better than 1e-10: 3
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 9632
Number of HSP's successfully gapped: 3
Length of query: 613
Length of database: 6150218869
Length adjustment: 134
Effective length of query: 479
Effective length of database: 3749061613
Effective search space: 262434312910
Effective search space used: 262434312910
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 173 (71.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE8KNMW01N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_5883

Length=613


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002304535.1|  predicted protein [Populus trichocarpa] >...   192    1e-58
ref|NP_176598.1|  Disease resistance-responsive (dirigent-like...   172    1e-50
ref|XP_002887917.1|  disease resistance-responsive family prot...   170    5e-50
ref|XP_002276477.1|  PREDICTED: disease resistance response pr...   167    6e-49
gb|ACC91257.1|  disease resistance-responsive family protein [...   167    9e-49

ALIGNMENTS
>ref|XP_002304535.1| predicted protein [Populus trichocarpa]
gb|EEE79514.1| predicted protein [Populus trichocarpa]
Length=162

Score =  192 bits (487),  Expect = 1e-58
Identities = 103/147 (70%), Positives = 119/147 (81%), Gaps = 3/147 (2%)
Frame = -3

Query  608  VLYLHDILFNGTDQanatsaattnsTRLALNPSGHFGAIVVFNDPVTKDNRLQSEPVAQA  429
VLY HD LFNGTD ANATSAA T+ T+L    +  FG +VVF+DP+TKDN L S PVA+A
Sbjct  19   VLYYHDTLFNGTDVANATSAAATDPTKLG---NFKFGMLVVFDDPMTKDNHLLSRPVARA  75

Query  428  QGFYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFFMARG  249
QGFY Y+KK    AWFAFTL+FNST++KGTLNIMGAD+M  +TRDFSVVGGTGDFFMARG
Sbjct  76   QGFYFYDKKSTYTAWFAFTLIFNSTKHKGTLNIMGADLMTEETRDFSVVGGTGDFFMARG  135

Query  248  ICTVSTDVLQGGFYFRLILDIKLYECY  168
I T+ TD  QG +YFRL +DIKLYECY
Sbjct  136  IATIHTDTFQGDYYFRLKMDIKLYECY  162


>ref|NP_176598.1| Disease resistance-responsive (dirigent-like protein) family
protein [Arabidopsis thaliana]
gb|AAF24580.1|AC007764_22 F22C12.8 [Arabidopsis thaliana]
gb|AAQ65109.1| At1g64160 [Arabidopsis thaliana]
dbj|BAD43018.1| putative dirigent protein [Arabidopsis thaliana]
dbj|BAD44501.1| putative dirigent protein [Arabidopsis thaliana]
gb|AEE34203.1| Disease resistance-responsive (dirigent-like protein) family
protein [Arabidopsis thaliana]
Length=182

Score =  172 bits (435),  Expect = 1e-50
Identities = 86/151 (57%), Positives = 106/151 (70%), Gaps = 11/151 (7%)
Frame = -3

Query  608  VLYLHDILFNGTDQanatsaattnsTRLALNPSG----HFGAIVVFNDPVTKDNRLQSEP  441
VLY HDI+F   D           ++    NP G     FG +V+F+DP+T D   QSEP
Sbjct  39   VLYYHDIMFGVDD-------VQNATSAAVTNPPGLGNFKFGKLVIFDDPMTIDKNFQSEP  91

Query  440  VAQAQGFYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFF  261
VA+AQGFY Y+ K + NAWFA+TLVFNST++KGTLNIMGAD+M  ++RD SVVGGTGDFF
Sbjct  92   VARAQGFYFYDMKNDYNAWFAYTLVFNSTQHKGTLNIMGADLMMVQSRDLSVVGGTGDFF  151

Query  260  MARGICTVSTDVLQGGFYFRLILDIKLYECY  168
M+RGI T  TD  +G  YFR+ +DIKLYECY
Sbjct  152  MSRGIVTFETDTFEGAKYFRVKMDIKLYECY  182


>ref|XP_002887917.1| disease resistance-responsive family protein [Arabidopsis lyrata
subsp. lyrata]
gb|EFH64176.1| disease resistance-responsive family protein [Arabidopsis lyrata
subsp. lyrata]
Length=182

Score =  170 bits (431),  Expect = 5e-50
Identities = 85/151 (56%), Positives = 106/151 (70%), Gaps = 11/151 (7%)
Frame = -3

Query  608  VLYLHDILFNGTDQanatsaattnsTRLALNPSG----HFGAIVVFNDPVTKDNRLQSEP  441
VLY HDI+F   D           ++    NP G     FG +V+F+DP+T D   QSEP
Sbjct  39   VLYYHDIMFGVDD-------VQNATSAAITNPPGLGNFKFGKLVIFDDPMTIDKNFQSEP  91

Query  440  VAQAQGFYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFF  261
VA+AQGFY Y+ K + NAWFA+TLVFNST++KGTLNIMGAD+M  ++RD SVVGGTGDFF
Sbjct  92   VARAQGFYFYDMKNDYNAWFAYTLVFNSTQHKGTLNIMGADLMMVQSRDLSVVGGTGDFF  151

Query  260  MARGICTVSTDVLQGGFYFRLILDIKLYECY  168
M+RGI T  TD  +G  YFR+ +DIKLY+CY
Sbjct  152  MSRGIVTFETDTFEGAKYFRVKMDIKLYDCY  182


>ref|XP_002276477.1| PREDICTED: disease resistance response protein 206 [Vitis vinifera]
Length=186

Score =  167 bits (424),  Expect = 6e-49
Identities = 85/150 (57%), Positives = 106/150 (71%), Gaps = 3/150 (2%)
Frame = -3

Query  608  VLYLHDILFNGTDQanatsaattnsT---RLALNPSGHFGAIVVFNDPVTKDNRLQSEPV  438
V Y HDI++NG +  NAT+A         +  L    HFG +VVF+DP+T DN L S PV
Sbjct  37   VFYFHDIIYNGKNSKNATAAIVGAPAWGNKTILGGKNHFGDLVVFDDPITLDNNLHSTPV  96

Query  437  AQAQGFYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFFM  258
+AQGFYIY+KK+   AW  F+ VFNSTE+KG++N  GAD +  KTRD SVVGGTGDFFM
Sbjct  97   GRAQGFYIYDKKDVFTAWLGFSFVFNSTEHKGSINFAGADPLMNKTRDISVVGGTGDFFM  156

Query  257  ARGICTVSTDVLQGGFYFRLILDIKLYECY  168
ARGI T++TD  +G  YFRL +DIKLYEC+
Sbjct  157  ARGIATLTTDAFEGEVYFRLCVDIKLYECW  186


>gb|ACC91257.1| disease resistance-responsive family protein [Capsella rubella]
Length=187

Score =  167 bits (423),  Expect = 9e-49
Identities = 90/145 (62%), Positives = 108/145 (74%), Gaps = 3/145 (2%)
Frame = -3

Query  602  YLHDILFNGTDQanatsaattnsTRLALNPSGHFGAIVVFNDPVTKDNRLQSEPVAQAQG  423
Y HDIL++G + ANATSAA  +   L    +  FG  V+F+DP+T D    SEPVA+AQG
Sbjct  46   YFHDILYDGDNVANATSAAIVSPPGLG---NFKFGKFVIFDDPITMDKNYLSEPVARAQG  102

Query  422  FYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFFMARGIC  243
FY Y+ K + NAWF +TLVFNST++KGTLNIMGAD+M   TRD SVVGGTGDFFMARGI
Sbjct  103  FYFYDMKMDFNAWFCYTLVFNSTQHKGTLNIMGADLMMEPTRDLSVVGGTGDFFMARGIA  162

Query  242  TVSTDVLQGGFYFRLILDIKLYECY  168
T  TD+ QG  YFR+ +DIKLYECY
Sbjct  163  TFVTDLFQGAKYFRVKMDIKLYECY  187


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 364805134
Number of extensions: 8583160
Number of successful extensions: 22093
Number of sequences better than 1e-10: 27
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 22019
Number of HSP's successfully gapped: 27
Length of query: 613
Length of database: 6150218869
Length adjustment: 134
Effective length of query: 479
Effective length of database: 3749061613
Effective search space: 262434312910
Effective search space used: 262434312910
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 173 (71.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPDWXTR301N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_6

Length=2060


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002522914.1|  Aspartic proteinase nepenthesin-1 precurs...   258    6e-75
emb|CBI34137.3|  unnamed protein product [Vitis vinifera]           246    3e-74
ref|XP_003523200.1|  PREDICTED: aspartic proteinase nepenthesi...   256    4e-74
ref|XP_002300215.1|  predicted protein [Populus trichocarpa] >...   254    1e-73
ref|XP_002307860.1|  ZIP transporter [Populus trichocarpa] >gb...   246    8e-72

ALIGNMENTS
>ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length=442

Score =  258 bits (659),  Expect = 6e-75
Identities = 172/413 (42%), Positives = 225/413 (54%), Gaps = 36/413 (9%)
Frame = +1

Query  100   GFMATLSHVDQGKSLTKTELLRHAIQRRTMRVEMPQAVMDQLASPLPIKAPVSGYNGEFS  279
GF  TL HVD  K+LTK + ++H I+R   R+E   A++   +S   I +PV   NGEF
Sbjct  42    GFRITLKHVDSDKNLTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINSPVLSGNGEFL  101

Query  280   MTLAIGTPPINHSLIVDTDTDLIW-----INQC-DNHSP--------SNTKLSCSSNLCK  417
M LAIGTPP  +S I+DT +DLIW       QC D  SP        S +KLSCSS LCK
Sbjct  102   MNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCK  161

Query  418   NAGTCSKKNTCQYKYQYTSNKTVQVDLATETFTFGK-NVLKLGFSCGVSLEG---VVAKG  585
S  ++C+Y Y Y    + Q  +ATETFTFGK ++  +GF CG   EG       G
Sbjct  162   ALPQSSCSDSCEYLYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSG  221

Query  586   AVGLGHGPLSLVSQMNAQNFSMCLPSYDNPNNTGFVLIGSTIK---TRAKMITASLIKTN  756
VGLG GPLSLVSQ+    FS CL S D+   T  +L+GS      T A + T  LI+
Sbjct  222   LVGLGRGPLSLVSQLKEAKFSYCLTSIDD-TKTSTLLMGSLASVNGTSAAIRTTPLIQNP  280

Query  757   FFPSKYYINVTGISVGKVRLNIDSSSFSINANDGTGGMAIDTGSSVTYFEQSIFDKVIKA  936
PS YY+++ GISVG  RL I  S+F +  +DGTGG+ ID+G+++TY E+S FD V K
Sbjct  281   LQPSFYYLSLEGISVGGTRLPIKESTFQLQ-DDGTGGLIIDSGTTITYLEESAFDLVKKE  339

Query  937   FKTQTRHPTSIDPATSLPICFPTPPN-SKNAYPYITFHLGGPNGPSKLVLSIGNTYREYD  1113
F +Q   P     AT L +C+  P + S+   P +  H  G    + L L  G  Y   D
Sbjct  340   FTSQMGLPVDNSGATGLELCYNLPSDTSELEVPKLVLHFTG----ADLELP-GENYMIAD  394

Query  1114  NYVPGKNKTCLAFKAAPPSDRGMSTLGNIQLQDTLVTFDLSKNTVAFLQKKCG  1272
+        CLA      S  GMS  GN+Q Q+  V+ DL K T++FL   CG
Sbjct  395   S---SMGVICLAM----GSSGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNCG  440


>emb|CBI34137.3| unnamed protein product [Vitis vinifera]
Length=165

Score =  246 bits (629),  Expect = 3e-74
Identities = 119/158 (75%), Positives = 140/158 (89%), Gaps = 0/158 (0%)
Frame = -3

Query  1974  GETELLRHRIISQVLELGIIVHSVIIGIALGASQNPKTIRPLIAALTFHQFFEGIGLGGC  1795
G  EL+RHR+ISQVLELGI+VHSVIIGI+LGAS++PKTI+PL+AALTFHQFFEG+GLGGC
Sbjct  2     GSAELIRHRVISQVLELGIVVHSVIIGISLGASESPKTIKPLVAALTFHQFFEGMGLGGC  61

Query  1794  IAQARFNTRAVVMMAFFFSITTPSGIAIGIGISRFYSETSRNALIIEGVFNSASAGILIY  1615
I QA+F  RA  +MA FFS+TTP GIAIGIGIS  Y E S  ALI+EG+FN+ASAGIL+Y
Sbjct  62    IVQAKFKLRAAAIMALFFSLTTPVGIAIGIGISNVYDENSSTALIVEGIFNAASAGILVY  121

Query  1614  MALVDLLAADFMSPKMQNNGKLQLLANASLLIGAGCMS  1501
MALVDLLAADFM+P+MQ NG+LQ+ AN SLL+GAGCMS
Sbjct  122   MALVDLLAADFMNPRMQGNGRLQVGANISLLVGAGCMS  159


>ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length=453

Score =  256 bits (654),  Expect = 4e-74
Identities = 162/423 (38%), Positives = 228/423 (54%), Gaps = 48/423 (11%)
Frame = +1

Query  94    TTGFMATLSHVDQGKSLTKTELLRHAIQRRTMRVEMPQAVM---DQLASPLPIKAPVSGY  264
T GF   L HVD GK+LTK E ++H I+R   R++   A++     L S   ++AP+
Sbjct  45    TKGFRVMLRHVDSGKNLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAG  104

Query  265   NGEFSMTLAIGTPPINHSLIVDTDTDLIWIN-----QC---------DNHSPSNTKLSCS  402
NGE+ M LAIGTPP+++  ++DT +DLIW       QC            S S +K+SC
Sbjct  105   NGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCG  164

Query  403   SNLCKNAGTCSKKNTCQYKYQYTSNKTVQVDLATETFTFGKN-----VLKLGFSCGVSLE  567
S+LC    + +  + C+Y Y Y      Q  LATETFTFGK+     V  +GF CG   E
Sbjct  165   SSLCSAVPSSTCSDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNE  224

Query  568   G---VVAKGAVGLGHGPLSLVSQMNAQNFSMCLPSYDNPNNTGFVLIGST--IKTRAKMI  732
G     A G VGLG GPLSLVSQ+    FS CL   D+   +  +L+GS   +K   +++
Sbjct  225   GDGFEQASGLVGLGRGPLSLVSQLKEPRFSYCLTPMDDTKES-ILLLGSLGKVKDAKEVV  283

Query  733   TASLIKTNFFPSKYYINVTGISVGKVRLNIDSSSFSINANDGTGGMAIDTGSSVTYFEQS  912
T  L+K    PS YY+++ GISVG  RL+I+ S+F +  +DG GG+ ID+G+++TY EQ
Sbjct  284   TTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVG-DDGNGGVIIDSGTTITYIEQK  342

Query  913   IFDKVIKAFKTQTRHPTSIDPATSLPICFPTPPNSKNA-YPYITFHLGGPNGPSKLVLSI  1089
F+ + K F +QT+ P     +T L +CF  P  S     P I FH  G
Sbjct  343   AFEALKKEFISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKG-----------  391

Query  1090  GNTYREYDNYVPGKNK---TCLAFKAAPPSDRGMSTLGNIQLQDTLVTFDLSKNTVAFLQ  1260
G+     +NY+ G +     CLA  A+     GMS  GN+Q Q+ LV  DL K T++F+
Sbjct  392   GDLELPAENYMIGDSNLGVACLAMGAS----SGMSIFGNVQQQNILVNHDLEKETISFVP  447

Query  1261  KKC  1269
C
Sbjct  448   TSC  450


>ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length=439

Score =  254 bits (650),  Expect = 1e-73
Identities = 166/412 (40%), Positives = 220/412 (53%), Gaps = 36/412 (9%)
Frame = +1

Query  100   GFMATLSHVDQGKSLTKTELLRHAIQRRTMRVEMPQAVMDQLASPLPIKAPVSGYNGEFS  279
GF   L HVD GK+LTK E +RH ++R   R++  QA+    +S   I+APV   NGEF
Sbjct  39    GFRVRLKHVDSGKNLTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLPGNGEFL  98

Query  280   MTLAIGTPPINHSLIVDTDTDLIW-----INQC---------DNHSPSNTKLSCSSNLCK  417
M LAIGTPP  +S I+DT +DLIW       QC            S S +KLSCSS LC+
Sbjct  99    MKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCE  158

Query  418   NAGTCSKKNTCQYKYQYTSNKTVQVDLATETFTFGK-NVLKLGFSCGVSLEG---VVAKG  585
S  N C+Y Y Y    + Q  LA+ET TFGK +V  + F CG   EG       G
Sbjct  159   ALPQSSCNNGCEYLYSYGDYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAG  218

Query  586   AVGLGHGPLSLVSQMNAQNFSMCLPSYDNPNNTGFVLIGSTIKTRAK---MITASLIKTN  756
VGLG GPLSLVSQ+    FS CL + D+   T  +L+GS     A    + T  LI +
Sbjct  219   LVGLGRGPLSLVSQLKEPKFSYCLTTVDD-TKTSTLLMGSLASVNASSSAIKTTPLIHSP  277

Query  757   FFPSKYYINVTGISVGKVRLNIDSSSFSINANDGTGGMAIDTGSSVTYFEQSIFDKVIKA  936
PS YY+++ GISVG  RL I  S+FS+  +DG+GG+ ID+G+++TY E+S F+ V K
Sbjct  278   AHPSFYYLSLEGISVGDTRLPIKKSTFSLQ-DDGSGGLIIDSGTTITYLEESAFNLVAKE  336

Query  937   FKTQTRHPTSIDPATSLPICFPTPPNSKN-AYPYITFHLGGPNGPSKLVLSIGNTYREYD  1113
F  +   P     +T L +CF  P  S N   P + FH  G            +     +
Sbjct  337   FTAKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDG-----------ADLELPAE  385

Query  1114  NYVPGKNKTCLAFKAAPPSDRGMSTLGNIQLQDTLVTFDLSKNTVAFLQKKC  1269
NY+ G +   +A   A  S  GMS  GN+Q Q+ LV  DL K T++FL  +C
Sbjct  386   NYMIGDSSMGVAC-LAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC  436


>ref|XP_002307860.1| ZIP transporter [Populus trichocarpa]
gb|EEE91383.1| ZIP transporter [Populus trichocarpa]
Length=343

Score =  246 bits (629),  Expect = 8e-72
Identities = 118/160 (74%), Positives = 144/160 (90%), Gaps = 0/160 (0%)
Frame = -3

Query  1980  GSGETELLRHRIISQVLELGIIVHSVIIGIALGASQNPKTIRPLIAALTFHQFFEGIGLG  1801
GSG ++L+RHR+I+QVLELGI+VHSVIIG++LGAS +PKTIRPL+AAL+FHQFFEG+GLG
Sbjct  178   GSGPSQLIRHRVITQVLELGIVVHSVIIGVSLGASGSPKTIRPLVAALSFHQFFEGMGLG  237

Query  1800  GCIAQARFNTRAVVMMAFFFSITTPSGIAIGIGISRFYSETSRNALIIEGVFNSASAGIL  1621
GCI QA+F T+ +V+MA FFS+TTP GIAIG+GIS  Y+E+S NALI+EG+FN+ASAGIL
Sbjct  238   GCITQAKFKTKTIVIMALFFSLTTPVGIAIGLGISNVYNESSPNALIVEGIFNAASAGIL  297

Query  1620  IYMALVDLLAADFMSPKMQNNGKLQLLANASLLIGAGCMS  1501
IYMALVDLLAADFM PK+Q+NG LQ   N SLL+GAGCMS
Sbjct  298   IYMALVDLLAADFMHPKVQSNGALQFGVNVSLLLGAGCMS  337


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 1137216252
Number of extensions: 25999694
Number of successful extensions: 64215
Number of sequences better than 1e-10: 85
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 63937
Number of HSP's successfully gapped: 85
Length of query: 2060
Length of database: 6150218869
Length adjustment: 148
Effective length of query: 1912
Effective length of database: 3498194437
Effective search space: 1882028607106
Effective search space used: 1882028607106
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 181 (74.3 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE8X1XU013


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_6242

Length=601


Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|ACG60665.1|  basic helix-loop-helix protein [Nicotiana taba...   166    8e-48
ref|XP_002518568.1|  DNA binding protein, putative [Ricinus co...   163    1e-46
emb|CBI28976.3|  unnamed protein product [Vitis vinifera]           162    2e-46
ref|XP_002274829.2|  PREDICTED: transcription factor ILR3-like...   162    3e-46
ref|XP_002316706.1|  predicted protein [Populus trichocarpa] >...   158    1e-44

ALIGNMENTS
>gb|ACG60665.1| basic helix-loop-helix protein [Nicotiana tabacum]
Length=233

Score =  166 bits (420),  Expect = 8e-48
Identities = 82/134 (61%), Positives = 99/134 (74%), Gaps = 0/134 (0%)
Frame = -3

Query  599  LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM  420
LLE GRP K DK+ +L D +RMV QLR EAQKLK+SN +LQ K+ ELK EKNELRDEK
Sbjct  100  LLEPGRPPKTDKSAILVDAVRMVTQLRGEAQKLKDSNLNLQEKIKELKAEKNELRDEKQK  159

Query  419  LKAKKEKLEHQVKELNARPQISPHHLPPMHSPYAHVVGSKFVPVIGFSGVPMWQMMPPTS  240
LKA+KEKLE Q+K  NA+P   P  +P   +P+  V GSK VP++ + GV MWQ MPP +
Sbjct  160  LKAEKEKLEQQLKTTNAQPGFLPPAIPAAFAPHGQVPGSKLVPIMSYPGVAMWQFMPPAA  219

Query  239  VDTSKDHTLRPPVA  198
VDTS+DH LRPPVA
Sbjct  220  VDTSQDHVLRPPVA  233


>ref|XP_002518568.1| DNA binding protein, putative [Ricinus communis]
gb|EEF43955.1| DNA binding protein, putative [Ricinus communis]
Length=229

Score =  163 bits (412),  Expect = 1e-46
Identities = 82/137 (60%), Positives = 106/137 (77%), Gaps = 4/137 (3%)
Frame = -3

Query  599  LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM  420
LL+ GRP K+DK+V+L+D ++MV QLR EAQKLKESN++LQ KVNELKVEKNELRDEK
Sbjct  94   LLDPGRPPKMDKSVILADAMKMVNQLRAEAQKLKESNENLQEKVNELKVEKNELRDEKQR  153

Query  419  LKAKKEKLEHQVKELNARPQISPHHLPPMHSPY---AHVVGSKFVPVIGFSGVPMWQMMP  249
LK +KE +E QV  L+A  +  P HLP + +P+   + V+GSK VP++G+ GVPMWQ+MP
Sbjct  154  LKTEKESIERQVNALSASARFLP-HLPAIPAPFSSPSQVIGSKLVPIVGYPGVPMWQLMP  212

Query  248  PTSVDTSKDHTLRPPVA  198
P +VDTS+D  LR P A
Sbjct  213  PATVDTSQDPVLRSPAA  229


>emb|CBI28976.3| unnamed protein product [Vitis vinifera]
Length=223

Score =  162 bits (410),  Expect = 2e-46
Identities = 82/136 (60%), Positives = 97/136 (71%), Gaps = 2/136 (1%)
Frame = -3

Query  599  LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM  420
+LE GRP K DK  +LSD +RMV QLR EAQKLKESN DLQ K+ ELK EKNELRDEK
Sbjct  88   ILEPGRPPKTDKAAILSDAVRMVTQLRSEAQKLKESNGDLQEKIKELKAEKNELRDEKQR  147

Query  419  LKAKKEKLEHQVKELNARPQISPH--HLPPMHSPYAHVVGSKFVPVIGFSGVPMWQMMPP  246
LKA+KEKLE QVK ++A+P   PH   +P   +      G+K +P IG+  V MWQ MPP
Sbjct  148  LKAEKEKLEQQVKAISAQPGFLPHPSAMPAAFAAQGRAPGNKLMPFIGYPSVAMWQFMPP  207

Query  245  TSVDTSKDHTLRPPVA  198
+VDTS+DH LRPPVA
Sbjct  208  AAVDTSQDHVLRPPVA  223


>ref|XP_002274829.2| PREDICTED: transcription factor ILR3-like [Vitis vinifera]
Length=240

Score =  162 bits (410),  Expect = 3e-46
Identities = 82/136 (60%), Positives = 97/136 (71%), Gaps = 2/136 (1%)
Frame = -3

Query  599  LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM  420
+LE GRP K DK  +LSD +RMV QLR EAQKLKESN DLQ K+ ELK EKNELRDEK
Sbjct  105  ILEPGRPPKTDKAAILSDAVRMVTQLRSEAQKLKESNGDLQEKIKELKAEKNELRDEKQR  164

Query  419  LKAKKEKLEHQVKELNARPQISPH--HLPPMHSPYAHVVGSKFVPVIGFSGVPMWQMMPP  246
LKA+KEKLE QVK ++A+P   PH   +P   +      G+K +P IG+  V MWQ MPP
Sbjct  165  LKAEKEKLEQQVKAISAQPGFLPHPSAMPAAFAAQGRAPGNKLMPFIGYPSVAMWQFMPP  224

Query  245  TSVDTSKDHTLRPPVA  198
+VDTS+DH LRPPVA
Sbjct  225  AAVDTSQDHVLRPPVA  240


>ref|XP_002316706.1| predicted protein [Populus trichocarpa]
gb|EEE97318.1| predicted protein [Populus trichocarpa]
Length=243

Score =  158 bits (400),  Expect = 1e-44
Identities = 80/136 (59%), Positives = 96/136 (71%), Gaps = 2/136 (1%)
Frame = -3

Query  599  LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM  420
LL+ GRP K+DK+ +L D  RMV QLR+E+QKLKESN  LQ K++ELK EKNELRDEK
Sbjct  108  LLDPGRPPKVDKSAILVDAARMVTQLRDESQKLKESNVSLQEKIDELKAEKNELRDEKQR  167

Query  419  LKAKKEKLEHQVKELNARPQISPH--HLPPMHSPYAHVVGSKFVPVIGFSGVPMWQMMPP  246
LK +KE LE QVK L+  P   PH   +P   S    VVGSK +P +G+ G+ MWQ MPP
Sbjct  168  LKTEKENLERQVKALSTPPNFLPHPSAIPAPFSAPGQVVGSKLMPFVGYPGISMWQFMPP  227

Query  245  TSVDTSKDHTLRPPVA  198
VDTS+DH LRPPVA
Sbjct  228  AVVDTSQDHVLRPPVA  243


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 249484877
Number of extensions: 4467113
Number of successful extensions: 21439
Number of sequences better than 1e-10: 5
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 21287
Number of HSP's successfully gapped: 5
Length of query: 601
Length of database: 6150218869
Length adjustment: 133
Effective length of query: 468
Effective length of database: 3766980697
Effective search space: 252387706699
Effective search space used: 252387706699
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 173 (71.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE8ZT8Z012


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters

Query= TrVeIntMedtrGB1_6381

Length=596


No significant similarity found. For reasons why, click here.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 279756301 Number of extensions: 5600591 Number of successful extensions: 12703 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 12576 Number of HSP's successfully gapped: 0 Length of query: 596 Length of database: 6150218869 Length adjustment: 133 Effective length of query: 463 Effective length of database: 3766980697 Effective search space: 244853745305 Effective search space used: 244853745305 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362


A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE9BP3001N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_6390

Length=596


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003627280.1|  Germin-like protein [Medicago truncatula]...   231    2e-73
ref|XP_003627279.1|  Germin-like protein [Medicago truncatula]...   219    8e-69
ref|XP_003627277.1|  Germin-like protein [Medicago truncatula]...   219    8e-69
emb|CAI56441.1|  germin-like protein [Cicer arietinum]              204    3e-63
ref|XP_003528531.1|  PREDICTED: auxin-binding protein ABP19a-l...   203    1e-62

ALIGNMENTS
>ref|XP_003627280.1| Germin-like protein [Medicago truncatula]
gb|AET01756.1| Germin-like protein [Medicago truncatula]
Length=209

Score =  231 bits (589),  Expect = 2e-73
Identities = 123/124 (99%), Positives = 124/124 (100%), Gaps = 0/124 (0%)
Frame = +1

Query  1    ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM  180
ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM
Sbjct  86   ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM  145

Query  181  LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF  360
LHFQVNSGKGKATAFLTFSSANPGAQLLDLLLFSNNLPSELVAQTTFLDLAQV+KLKARF
Sbjct  146  LHFQVNSGKGKATAFLTFSSANPGAQLLDLLLFSNNLPSELVAQTTFLDLAQVKKLKARF  205

Query  361  GGRG  372
GGRG
Sbjct  206  GGRG  209


>ref|XP_003627279.1| Germin-like protein [Medicago truncatula]
gb|AET01755.1| Germin-like protein [Medicago truncatula]
Length=209

Score =  219 bits (558),  Expect = 8e-69
Identities = 116/124 (94%), Positives = 121/124 (98%), Gaps = 0/124 (0%)
Frame = +1

Query  1    ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM  180
ISAARLDIA+ GSIPMHTHPGATELLI+V GEITAGFLT T+VYSKTLKPGDLMVFPQGM
Sbjct  86   ISAARLDIAENGSIPMHTHPGATELLIIVQGEITAGFLTPTSVYSKTLKPGDLMVFPQGM  145

Query  181  LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF  360
LHFQ+N+GKGKATAFLTFSSANPGAQLLDLLLFSNNLPSELVAQTTFLDLAQVQKLKARF
Sbjct  146  LHFQINTGKGKATAFLTFSSANPGAQLLDLLLFSNNLPSELVAQTTFLDLAQVQKLKARF  205

Query  361  GGRG  372
GGRG
Sbjct  206  GGRG  209


>ref|XP_003627277.1| Germin-like protein [Medicago truncatula]
gb|AET01753.1| Germin-like protein [Medicago truncatula]
Length=209

Score =  219 bits (558),  Expect = 8e-69
Identities = 116/124 (94%), Positives = 121/124 (98%), Gaps = 0/124 (0%)
Frame = +1

Query  1    ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM  180
ISAARLDIA+ GSIPMHTHPGATELLI+V GEITAGFLT TAVYSKTLKPGDLMVFPQGM
Sbjct  86   ISAARLDIAENGSIPMHTHPGATELLIIVQGEITAGFLTPTAVYSKTLKPGDLMVFPQGM  145

Query  181  LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF  360
LHFQ+N+GKGKATAFLTFSSANPGAQLLDLLLFSNNLPS+LVAQTTFLDLAQVQKLKARF
Sbjct  146  LHFQINTGKGKATAFLTFSSANPGAQLLDLLLFSNNLPSQLVAQTTFLDLAQVQKLKARF  205

Query  361  GGRG  372
GGRG
Sbjct  206  GGRG  209


>emb|CAI56441.1| germin-like protein [Cicer arietinum]
Length=185

Score =  204 bits (519),  Expect = 3e-63
Identities = 108/124 (87%), Positives = 118/124 (95%), Gaps = 0/124 (0%)
Frame = +1

Query  1    ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM  180
ISAARLDIA+GGSIPMHTHPGATELLIMV GEITAGF+TT+AVYSK LK GD+MVFPQGM
Sbjct  62   ISAARLDIAEGGSIPMHTHPGATELLIMVKGEITAGFMTTSAVYSKKLKVGDVMVFPQGM  121

Query  181  LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF  360
LHFQVNSGKG+ATAFL+FSSANPGAQLLDLLLF+NNL S+ VAQTTFLD+AQV+KLK RF
Sbjct  122  LHFQVNSGKGEATAFLSFSSANPGAQLLDLLLFANNLSSDFVAQTTFLDVAQVKKLKTRF  181

Query  361  GGRG  372
GGRG
Sbjct  182  GGRG  185


>ref|XP_003528531.1| PREDICTED: auxin-binding protein ABP19a-like [Glycine max]
Length=208

Score =  203 bits (517),  Expect = 1e-62
Identities = 105/124 (85%), Positives = 115/124 (93%), Gaps = 0/124 (0%)
Frame = +1

Query  1    ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM  180
+S ARLDIAKGGSIPMHTHP ATELLIMV G+ITAGF+T TA+Y+KTLKPGD+MVFPQG
Sbjct  85   VSVARLDIAKGGSIPMHTHPAATELLIMVEGQITAGFMTPTALYTKTLKPGDIMVFPQGQ  144

Query  181  LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF  360
LHFQVNSG GKATAFL FSSANPGAQLLDLLLF N LPS+LVAQTTFLD+AQV+K+KARF
Sbjct  145  LHFQVNSGNGKATAFLAFSSANPGAQLLDLLLFGNTLPSDLVAQTTFLDVAQVKKVKARF  204

Query  361  GGRG  372
GGRG
Sbjct  205  GGRG  208


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 314077135
Number of extensions: 6648021
Number of successful extensions: 11798
Number of sequences better than 1e-10: 77
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 11710
Number of HSP's successfully gapped: 77
Length of query: 596
Length of database: 6150218869
Length adjustment: 133
Effective length of query: 463
Effective length of database: 3766980697
Effective search space: 244853745305
Effective search space used: 244853745305
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 173 (71.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE0RJN301N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_646

Length=1000


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003519788.1|  PREDICTED: uncharacterized protein LOC100...   270    5e-84
ref|XP_003517602.1|  PREDICTED: uncharacterized protein LOC100...   270    3e-83
ref|NP_974121.1|  uncharacterized protein [Arabidopsis thalian...   268    3e-83
ref|NP_177212.2|  uncharacterized protein [Arabidopsis thalian...   268    1e-82
ref|XP_002893280.1|  hypothetical protein ARALYDRAFT_472593 [A...   264    9e-82

ALIGNMENTS
>ref|XP_003519788.1| PREDICTED: uncharacterized protein LOC100776135 [Glycine max]
Length=417

Score =  270 bits (691),  Expect = 5e-84
Identities = 143/278 (51%), Positives = 188/278 (68%), Gaps = 27/278 (10%)
Frame = -2

Query  999  QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGV-VKKESDGYTHERAIAYMEG  823
Q+W MSGESCPE T+PIRR TE+D+LRASSV  FGRK+   V+++++   HE A+ Y+ G
Sbjct  119  QLWTMSGESCPEGTIPIRRTTEQDMLRASSVSRFGRKIRRRVRRDTNSNGHEHAVGYVSG  178

Query  822  KKYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ------------  679
++YYGA A+++VW P+V N++EFSLSQ+WV +GSF   DLNTIEAGWQ
Sbjct  179  EQYYGAKASINVWAPRVENQDEFSLSQMWVISGSFGD-DLNTIEAGWQVSPEIYGDRYPR  237

Query  678  ------RDAYQKTGCYNLNCQGFVQKSTTIVLGEAIR-TTQYGGQLKELPFDIYKDSISG  520
DAYQ TGCYNL C GFVQ +  I +G AI  T+ Y G   ++   I+KD   G
Sbjct  238  FFTYWTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPTSSYAGGQFDISLLIWKDPKHG  297

Query  519  NWWLKVNN-VPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEG  343
NWWL+  + + VGYWPS LFT+L+   A  +QFGGEI+N + SG+HT TQMGSGHF +EG
Sbjct  298  NWWLEFGSGILVGYWPSFLFTHLR-DHASMVQFGGEIVNSRQSGSHTSTQMGSGHFASEG  356

Query  342  YGKAAYFKQIQVYSSTNSKNP-ANLLELATSYDNQKCY  232
+GKA+YF+ +QV    N+  P +NL  LA   D+  CY
Sbjct  357  FGKASYFRNMQVVDWDNNLVPLSNLRVLA---DHPNCY  391


>ref|XP_003517602.1| PREDICTED: uncharacterized protein LOC100776639 [Glycine max]
Length=471

Score =  270 bits (690),  Expect = 3e-83
Identities = 142/278 (51%), Positives = 188/278 (68%), Gaps = 27/278 (10%)
Frame = -2

Query  999  QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGV-VKKESDGYTHERAIAYMEG  823
Q+W MSGESCPE T+PIRR TE+D+LRASSV  FGRK+   V+++++   HE A+ Y+ G
Sbjct  173  QLWTMSGESCPEGTIPIRRTTEQDMLRASSVSRFGRKIRRRVRRDTNSNGHEHAVGYVSG  232

Query  822  KKYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ------------  679
++YYGA A+++VW P+V N++EFSLSQ+WV +GSF   DLNTIE+GWQ
Sbjct  233  EQYYGAKASINVWAPRVANQDEFSLSQMWVISGSFGD-DLNTIESGWQVSPELYGDRYPR  291

Query  678  ------RDAYQKTGCYNLNCQGFVQKSTTIVLGEAIR-TTQYGGQLKELPFDIYKDSISG  520
DAYQ TGCYNL C GFVQ +  I +G AI  T+ Y G   ++   I+KD   G
Sbjct  292  FFTYWTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPTSSYAGGQFDISLLIWKDPKHG  351

Query  519  NWWLKVNN-VPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEG  343
NWWL+  + + VGYWPS LFT+L+   A  +QFGGEI+N + SG+HT TQMGSGHF +EG
Sbjct  352  NWWLEFGSGILVGYWPSFLFTHLR-DHASMVQFGGEIVNSRQSGSHTSTQMGSGHFASEG  410

Query  342  YGKAAYFKQIQVYSSTNSKNP-ANLLELATSYDNQKCY  232
+GKA+YF+ +QV    N+  P +NL  LA   D+  CY
Sbjct  411  FGKASYFRNMQVVDWDNNLVPLSNLRVLA---DHPNCY  445


>ref|NP_974121.1| uncharacterized protein [Arabidopsis thaliana]
gb|AAG52324.1|AC011663_3 unknown protein; 106914-104701 [Arabidopsis thaliana]
gb|AAG52474.1|AC010796_13 unknown protein; 47588-49801 [Arabidopsis thaliana]
gb|AEE35077.1| uncharacterized protein [Arabidopsis thaliana]
Length=410

Score =  268 bits (685),  Expect = 3e-83
Identities = 141/285 (49%), Positives = 188/285 (66%), Gaps = 26/285 (9%)
Frame = -2

Query  999  QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGVVKKESDGYTHERAIAYMEGK  820
Q+W +SGESCPE T+PIRR TE+D+LRASSV+ FGRK+  VK++S    HE A+ Y+ G+
Sbjct  113  QLWSLSGESCPEGTIPIRRTTEQDMLRASSVQRFGRKIRRVKRDSTNNGHEHAVGYVTGR  172

Query  819  KYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ-------------  679
+YYGA A+++VW P+V ++ EFSLSQIWV AGSF   DLNTIEAGWQ
Sbjct  173  QYYGAKASINVWSPRVTSQYEFSLSQIWVIAGSFTH-DLNTIEAGWQISPELYGDTYPRF  231

Query  678  -----RDAYQKTGCYNLNCQGFVQKSTTIVLGEAI--RTTQYGGQLKELPFDIYKDSISG  520
DAY+ TGCYNL C GFVQ +  I +G AI  R++  GGQ  ++   I+KD   G
Sbjct  232  FTYWTSDAYRTTGCYNLLCSGFVQTNRRIAIGAAISPRSSYKGGQF-DISLLIWKDPKHG  290

Query  519  NWWLKV-NNVPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEG  343
+WWL+  +   VGYWP+ LFT+LK      +QFGGEI+N +  G+HT TQMGSGHF  EG
Sbjct  291  HWWLQFGSGALVGYWPAFLFTHLK-QHGSMVQFGGEIVNNRPGGSHTTTQMGSGHFAGEG  349

Query  342  YGKAAYFKQIQVYSSTNSKNPANLLELATSYDNQKCYFSGFGKDR  208
+GKA+YF+ +Q+    N+  PA+ L++   + N  CY    G +R
Sbjct  350  FGKASYFRNLQIVDWDNTLIPASNLKILADHPN--CYDIRGGTNR  392


>ref|NP_177212.2| uncharacterized protein [Arabidopsis thaliana]
gb|AEE35078.1| uncharacterized protein [Arabidopsis thaliana]
Length=465

Score =  268 bits (685),  Expect = 1e-82
Identities = 141/285 (49%), Positives = 188/285 (66%), Gaps = 26/285 (9%)
Frame = -2

Query  999  QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGVVKKESDGYTHERAIAYMEGK  820
Q+W +SGESCPE T+PIRR TE+D+LRASSV+ FGRK+  VK++S    HE A+ Y+ G+
Sbjct  168  QLWSLSGESCPEGTIPIRRTTEQDMLRASSVQRFGRKIRRVKRDSTNNGHEHAVGYVTGR  227

Query  819  KYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ-------------  679
+YYGA A+++VW P+V ++ EFSLSQIWV AGSF   DLNTIEAGWQ
Sbjct  228  QYYGAKASINVWSPRVTSQYEFSLSQIWVIAGSFTH-DLNTIEAGWQISPELYGDTYPRF  286

Query  678  -----RDAYQKTGCYNLNCQGFVQKSTTIVLGEAI--RTTQYGGQLKELPFDIYKDSISG  520
DAY+ TGCYNL C GFVQ +  I +G AI  R++  GGQ  ++   I+KD   G
Sbjct  287  FTYWTSDAYRTTGCYNLLCSGFVQTNRRIAIGAAISPRSSYKGGQF-DISLLIWKDPKHG  345

Query  519  NWWLKV-NNVPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEG  343
+WWL+  +   VGYWP+ LFT+LK      +QFGGEI+N +  G+HT TQMGSGHF  EG
Sbjct  346  HWWLQFGSGALVGYWPAFLFTHLK-QHGSMVQFGGEIVNNRPGGSHTTTQMGSGHFAGEG  404

Query  342  YGKAAYFKQIQVYSSTNSKNPANLLELATSYDNQKCYFSGFGKDR  208
+GKA+YF+ +Q+    N+  PA+ L++   + N  CY    G +R
Sbjct  405  FGKASYFRNLQIVDWDNTLIPASNLKILADHPN--CYDIRGGTNR  447


>ref|XP_002893280.1| hypothetical protein ARALYDRAFT_472593 [Arabidopsis lyrata subsp.
lyrata]
gb|EFH69539.1| hypothetical protein ARALYDRAFT_472593 [Arabidopsis lyrata subsp.
lyrata]
Length=408

Score =  264 bits (675),  Expect = 9e-82
Identities = 136/276 (49%), Positives = 181/276 (66%), Gaps = 24/276 (9%)
Frame = -2

Query  999  QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGVVKKESDGYTHERAIAYMEGK  820
Q+W +SGESCPE T+PIRR TE+D+LRASSVR FGRK+  V+++S    HE A+ Y+ G
Sbjct  111  QLWSLSGESCPEGTIPIRRTTEQDMLRASSVRRFGRKIRRVRRDSSSNGHEHAVGYVSGS  170

Query  819  KYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ-------------  679
+YYGA A+++VW P+V+++ EFSLSQIWV AGSF   DLNTIEAGWQ
Sbjct  171  QYYGAKASINVWTPRVISQYEFSLSQIWVIAGSF-ADDLNTIEAGWQISPELYGDTNPRF  229

Query  678  -----RDAYQKTGCYNLNCQGFVQKSTTIVLGEAIR-TTQYGGQLKELPFDIYKDSISGN  517
DAYQ TGCYNL C GFVQ +  I +G AI   + Y G   ++   I+KD   G+
Sbjct  230  FTYWTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYKGGQFDISLLIWKDPKHGH  289

Query  516  WWLKV-NNVPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEGY  340
WWL+  +   VGYWP  LFT+L+    + +QFGGEI+N +  G+HT TQMGSGHF  EG+
Sbjct  290  WWLQFGSGTLVGYWPVSLFTHLR-EHGNMVQFGGEIVNTRPGGSHTSTQMGSGHFAGEGF  348

Query  339  GKAAYFKQIQVYSSTNSKNPANLLELATSYDNQKCY  232
GKA+YF+ +Q+    N+  P + L++   + N  CY
Sbjct  349  GKASYFRNLQMVDWDNTLIPISNLKVLADHPN--CY  382


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 1,855,251,573
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 535716129
Number of extensions: 12055677
Number of successful extensions: 27009
Number of sequences better than 1e-10: 37
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 26862
Number of HSP's successfully gapped: 38
Length of query: 1000
Length of database: 6150218869
Length adjustment: 141
Effective length of query: 859
Effective length of database: 3623628025
Effective search space: 695736580800
Effective search space used: 695736580800
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 177 (72.8 bits)






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE9VTXP01N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_6809

Length=581


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002271428.2|  PREDICTED: pathogenesis-related protein S...   145    1e-40
ref|XP_002328585.1|  predicted protein [Populus trichocarpa] >...   144    3e-40
gb|ACB12048.1|  pathogenesis-related protein [Rehmannia glutin...   141    4e-39
gb|ABR10301.1|  pathogen-related protein STH-2 [Salvia miltior...   135    7e-37
gb|AAS00053.1|  Mal d 1-like [Malus x domestica] >emb|CBL94148...   134    3e-36

ALIGNMENTS
>ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera]
emb|CBI22930.3| unnamed protein product [Vitis vinifera]
Length=160

Score =  145 bits (365),  Expect = 1e-40
Identities = 71/160 (44%), Positives = 110/160 (69%), Gaps = 2/160 (1%)
Frame = -2

Query  502  MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNGGPGTIIQTNFS  323
M VT  +QE    VSP RM KALV D+   +P+++  +++SI  + G+GG G+I QTNFS
Sbjct  1    MGVTTFTQEFVTPVSPARMFKALVVDSHILVPRLVPESVKSIEFVEGDGGAGSITQTNFS  60

Query  322  DVAGAPVPTAKHRIDALDAEKGTTKFTMIEGAYLGEKIESVVYDVKFEEAGNGGCTIKVT  143
+     K++I+A+D EK   ++T+IEG  LG+++ES+VY++KFEE+G+GGC  K
Sbjct  61   --GDSDCEYLKYKINAVDKEKLECRYTLIEGGVLGDQLESIVYEMKFEESGDGGCICKTR  118

Query  142  NEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIANLNE  23
+EY++KG+  +K+E I+E  ++A G Y   EAYL+AN +E
Sbjct  119  SEYHTKGEFEIKEESIREGKEKAMGVYKLVEAYLLANPDE  158


>ref|XP_002328585.1| predicted protein [Populus trichocarpa]
gb|EEE76932.1| predicted protein [Populus trichocarpa]
Length=160

Score =  144 bits (363),  Expect = 3e-40
Identities = 70/157 (45%), Positives = 102/157 (65%), Gaps = 2/157 (1%)
Frame = -2

Query  502  MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNGGPGTIIQTNFS  323
M V  ++QE T  +SP RM KAL+ D++  +PK++   ++S+ ++ G+GG G+I Q NF+
Sbjct  1    MGVASYTQEFTCPISPARMFKALILDSNNLIPKLLPQIVKSVDLIHGDGGAGSIEQVNFT  60

Query  322  DVAGAPVPTAKHRIDALDAEKGTTKFTMIEGAYLGEKIESVVYDVKFEEAGNGGCTIKVT  143
+  G  +   KHRID LD      K+TMIEG  LGEK+ES+ Y+V+FE   +GGC  K+T
Sbjct  61   E--GTDIKYVKHRIDELDRVNLVCKYTMIEGDSLGEKLESIAYEVRFEVGSDGGCDCKMT  118

Query  142  NEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIAN  32
+ Y   GD  LK+E+IK   D+A+G Y   EAYL+ N
Sbjct  119  SSYLMLGDFTLKEEEIKAGQDKARGIYKVVEAYLLEN  155


>gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa]
Length=154

Score =  141 bits (355),  Expect = 4e-39
Identities = 78/162 (48%), Positives = 111/162 (69%), Gaps = 8/162 (5%)
Frame = -2

Query  502  MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNGGPGTIIQTNFS  323
M +TKH QE+ + VS KRM KALVT++ + +P  +   I+SI +L G+G  GTI +TN +
Sbjct  1    MGITKHIQELKLRVSAKRMFKALVTESHS-IP--LPDAIKSIEILHGDGSAGTIRKTNLA  57

Query  322  DVAGAPVPTAKHRIDALDAEKGTTKFTMIEGAYLGEKIESVVYDVKFEEAGNGGCTIKVT  143
D  G+ V   K RI+A+D +   +K+T+IEG  LG+KIES+ Y+ KFE++ +GGC  K+
Sbjct  58   D--GSYV---KIRIEAVDIDNQVSKYTVIEGPMLGDKIESIHYEQKFEDSSDGGCVAKIV  112

Query  142  NEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIANLNECA  17
EY++KGD  LK+E +K I D+A GFY  +E YL AN N CA
Sbjct  113  CEYHTKGDIQLKEEGVKAINDQALGFYTLSEEYLHANPNVCA  154


>gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza]
Length=160

Score =  135 bits (340),  Expect = 7e-37
Identities = 71/163 (44%), Positives = 106/163 (65%), Gaps = 4/163 (2%)
Frame = -2

Query  502  MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNG-GPGTIIQTNF  326
M V    QE+   +S  R+ KALVT++   +PK  + +I+SI ++ G+G  PG I QTNF
Sbjct  1    MGVKSFFQEMKTKISSSRLFKALVTESPEVVPKFTT-SIKSIELIQGSGYAPGAIFQTNF  59

Query  325  SDVAGAPVPTAKHRIDALDAEKGTTKFTMIEGAYLGEKIESVVYDVKFEEAGNGGCTIKV  146
+  GA     K R+D +D EK + K+T+IEG  LG+K+E + YD+KFE+  +GGC +KV
Sbjct  60   PE--GAHFKYMKCRVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKV  117

Query  145  TNEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIANLNECA  17
T+EY++KG   L DED+K   +++ G Y + E YL+AN + CA
Sbjct  118  TSEYHTKGGYELADEDLKGAKEQSLGMYKSCEDYLLANPHVCA  160


>gb|AAS00053.1| Mal d 1-like [Malus x domestica]
emb|CBL94148.1| putative Mal d 1.11 isoallergen [Malus x domestica]
Length=163

Score =  134 bits (336),  Expect = 3e-36
Identities = 75/160 (47%), Positives = 101/160 (63%), Gaps = 5/160 (3%)
Frame = -2

Query  502  MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNGGPGTIIQTNFS  323
M VTK SQ+    V+P+RM  AL+ DA    PK+M  +I+SI  LSG+G  GTI Q NF+
Sbjct  1    MGVTKISQKFVTQVTPQRMFNALILDAHNICPKLMFSSIKSIEFLSGSGEVGTIKQINFT  60

Query  322  DVAGAPVPTAKHRIDALDAEKGTTKFTMIEGA---YLGEKIESVVYDVKFEEAGNGGCTI  152
+ +  P+  AKHRIDALD E  +  +T IE     +L +K+E + YDVKFE  G GGC
Sbjct  61   EAS--PMKYAKHRIDALDKEALSCTYTFIESDATDHLLDKLEYITYDVKFEGYGRGGCIC  118

Query  151  KVTNEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIAN  32
+T+ Y +K D  +K+EDI+   DRA G Y   EAYL+A+
Sbjct  119  HLTSTYKAKDDIQIKEEDIELGKDRAIGMYEVLEAYLMAH  158


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 327883953
Number of extensions: 7719891
Number of successful extensions: 21928
Number of sequences better than 1e-10: 27
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 21866
Number of HSP's successfully gapped: 28
Length of query: 581
Length of database: 6150218869
Length adjustment: 133
Effective length of query: 448
Effective length of database: 3766980697
Effective search space: 226018841820
Effective search space used: 226018841820
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 173 (71.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEA40KK016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_7456

Length=562


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002313837.1|  predicted protein [Populus trichocarpa] >...   122    1e-31
ref|XP_002885981.1|  hypothetical protein ARALYDRAFT_480428 [A...   111    1e-26
ref|XP_002521848.1|  hypothetical protein RCOM_0774340 [Ricinu...   108    1e-26
ref|XP_002281892.2|  PREDICTED: uncharacterized protein LOC100...   107    9e-26
ref|XP_002281852.1|  PREDICTED: uncharacterized protein LOC100...   107    9e-26

ALIGNMENTS
>ref|XP_002313837.1| predicted protein [Populus trichocarpa]
gb|EEE87792.1| predicted protein [Populus trichocarpa]
Length=170

Score =  122 bits (305),  Expect = 1e-31
Identities = 61/123 (50%), Positives = 77/123 (63%), Gaps = 10/123 (8%)
Frame = -1

Query  529  PGRRNVTQKPKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKY-------TKAHSV  371
P R+N T  P  + VG +  W FG++YA W  K  PF+  DTLVFKY       T  HSV
Sbjct  41   PCRQNSTAAPNKIVVGGSQNWTFGINYADWALKNGPFYFNDTLVFKYDPPSDTNTHPHSV  100

Query  370  YLLPDLDSYTKCDIAKGKLIGA-TNAGSKGFKYTLKA-QKSYFACGEGGNGLHCNQGNMK  197
YLLP+L S+ KCD+++ KL+ + T  G  GF++ LK+ Q  YFACG GG G HCN G MK
Sbjct  101  YLLPNLWSFLKCDLSRAKLVASETQGGGDGFEFVLKSWQPHYFACG-GGAGFHCNNGTMK  159

Query  196  FIV  188
F V
Sbjct  160  FFV  162


>ref|XP_002885981.1| hypothetical protein ARALYDRAFT_480428 [Arabidopsis lyrata subsp.
lyrata]
gb|EFH62240.1| hypothetical protein ARALYDRAFT_480428 [Arabidopsis lyrata subsp.
lyrata]
Length=261

Score =  111 bits (277),  Expect = 1e-26
Identities = 55/111 (50%), Positives = 72/111 (65%), Gaps = 7/111 (6%)
Frame = -1

Query  502  PKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKYTK----AHSVYLLPDLDSYTKC  335
P+ + VG    WN+GV+YA W +K +PF L D LVFKY       HSVYLLP+  SY KC
Sbjct  145  PRKIIVGGDKEWNYGVNYAEWASKTAPFFLNDILVFKYNPPAPFTHSVYLLPNPSSYEKC  204

Query  334  DIAKGKLIGATNAGS-KGFKYTLKAQKSYF-ACGEGGNGLHCNQGNMKFIV  188
D+ KGK+I +   G+ KGF++ LK  + Y+ +CGE  +G HCN G MKF V
Sbjct  205  DVKKGKMIASPKQGAGKGFEFVLKQMRPYYISCGE-HDGAHCNNGTMKFTV  254


>ref|XP_002521848.1| hypothetical protein RCOM_0774340 [Ricinus communis]
gb|EEF40484.1| hypothetical protein RCOM_0774340 [Ricinus communis]
Length=138

Score =  108 bits (269),  Expect = 1e-26
Identities = 55/117 (47%), Positives = 70/117 (60%), Gaps = 10/117 (9%)
Frame = -1

Query  511  TQKPKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKY-------TKAHSVYLLPDL  353
T+ PK + VG +  W FG DY  W  + SPF++ DTLVFKY       T  HSVYLLP+L
Sbjct  17   TRTPKKIVVGGSAKWTFGFDYTDWAFRNSPFYVNDTLVFKYKLPKDNSTHPHSVYLLPNL  76

Query  352  DSYTKCDIAKG-KLIGATNAGSKGFKYTLKAQKS-YFACGEGGNGLHCNQGNMKFIV  188
S+  C++ K  K+      G KGF++ L   K  YFACG GG+G+HC  G MKF V
Sbjct  77   SSFVTCNLTKAVKVADGKQGGGKGFRFVLNKWKPYYFACG-GGDGIHCGLGQMKFYV  132


>ref|XP_002281892.2| PREDICTED: uncharacterized protein LOC100255503 [Vitis vinifera]
emb|CBI32457.3| unnamed protein product [Vitis vinifera]
Length=181

Score =  107 bits (266),  Expect = 9e-26
Identities = 57/126 (45%), Positives = 72/126 (57%), Gaps = 9/126 (7%)
Frame = -1

Query  541  PPAAPGRRNVTQKPKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKY------TKA  380
P   P   N T+ PK   VG +  W +G +Y  W  K  PF++ DTLVFKY      T
Sbjct  48   PRNNPFHPNNTRPPKKFIVGGSERWRYGFNYTDWALKNGPFYINDTLVFKYDPPNSTTFP  107

Query  379  HSVYLLPDLDSYTKCDIAKGKLIG-ATNAGSKGFKYTLK-AQKSYFACGEGGNGLHCNQG  206
HSVYLLP+  S+  CD+++ K +      GSKGF++ LK     YFACGE  NGLHC +G
Sbjct  108  HSVYLLPNFGSFLTCDLSRAKQVATVAQGGSKGFEFVLKNLWPHYFACGE-HNGLHCKEG  166

Query  205  NMKFIV  188
MKF V
Sbjct  167  MMKFSV  172


>ref|XP_002281852.1| PREDICTED: uncharacterized protein LOC100248603 [Vitis vinifera]
emb|CBI32459.3| unnamed protein product [Vitis vinifera]
Length=181

Score =  107 bits (266),  Expect = 9e-26
Identities = 57/126 (45%), Positives = 72/126 (57%), Gaps = 9/126 (7%)
Frame = -1

Query  541  PPAAPGRRNVTQKPKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKY------TKA  380
P   P   N T+ PK   VG +  W +G +Y  W  K  PF++ DTLVFKY      T
Sbjct  48   PRNNPFHPNNTRPPKKFIVGGSERWRYGFNYTDWALKNGPFYINDTLVFKYDPPNSTTFP  107

Query  379  HSVYLLPDLDSYTKCDIAKGKLIG-ATNAGSKGFKYTLK-AQKSYFACGEGGNGLHCNQG  206
HSVYLLP+  S+  CD+++ K +      GSKGF++ LK     YFACGE  NGLHC +G
Sbjct  108  HSVYLLPNFGSFLTCDLSRAKQVATVAQGGSKGFEFVLKNLWPHYFACGE-HNGLHCKEG  166

Query  205  NMKFIV  188
MKF V
Sbjct  167  MMKFSV  172


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 263332965
Number of extensions: 5551168
Number of successful extensions: 11337
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 11323
Number of HSP's successfully gapped: 0
Length of query: 562
Length of database: 6150218869
Length adjustment: 132
Effective length of query: 430
Effective length of database: 3784899781
Effective search space: 208169487955
Effective search space used: 208169487955
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 172 (70.9 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEA6VHH01N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_7457

Length=562


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003634199.1|  PREDICTED: uncharacterized GPI-anchored p...   141    2e-37
emb|CBI15549.3|  unnamed protein product [Vitis vinifera]           141    5e-37
emb|CAN82280.1|  hypothetical protein VITISV_044064 [Vitis vin...   139    1e-36
ref|XP_003525199.1|  PREDICTED: uncharacterized GPI-anchored p...   125    4e-31
gb|ACU18611.1|  unknown [Glycine max]                               123    1e-30

ALIGNMENTS
>ref|XP_003634199.1| PREDICTED: uncharacterized GPI-anchored protein At4g28100-like
[Vitis vinifera]
Length=327

Score =  141 bits (355),  Expect = 2e-37
Identities = 82/139 (59%), Positives = 102/139 (73%), Gaps = 7/139 (5%)
Frame = +2

Query  2    EAVKRLEKDCFNhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKMHNQD  181
E+VK+LE+DC +     GLGGCSKCLN+LY L  DK+  S+  E  +       KMHN+D
Sbjct  181  ESVKKLERDCLSTNGFPGLGGCSKCLNTLYLLGKDKTGNSSKLEARSR------KMHNRD  234

Query  182  CEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSEINSQ  361
CE+MGLTWLL KNR+AYIHTVSAVLR++M++ D  SDP  C+LNSDGMPLAVDS+EI+S
Sbjct  235  CELMGLTWLLAKNRTAYIHTVSAVLRAIMMSTD-GSDPLSCTLNSDGMPLAVDSAEISSH  293

Query  362  SLSTVLRFYNHMLPISLLF  418
S ST L F  H+  +SL F
Sbjct  294  SSSTSLHFSIHLCILSLSF  312


>emb|CBI15549.3| unnamed protein product [Vitis vinifera]
Length=374

Score =  141 bits (355),  Expect = 5e-37
Identities = 82/139 (59%), Positives = 102/139 (73%), Gaps = 7/139 (5%)
Frame = +2

Query  2    EAVKRLEKDCFNhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKMHNQD  181
E+VK+LE+DC +     GLGGCSKCLN+LY L  DK+  S+  E  +       KMHN+D
Sbjct  181  ESVKKLERDCLSTNGFPGLGGCSKCLNTLYLLGKDKTGNSSKLEARSR------KMHNRD  234

Query  182  CEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSEINSQ  361
CE+MGLTWLL KNR+AYIHTVSAVLR++M++ D  SDP  C+LNSDGMPLAVDS+EI+S
Sbjct  235  CELMGLTWLLAKNRTAYIHTVSAVLRAIMMSTD-GSDPLSCTLNSDGMPLAVDSAEISSH  293

Query  362  SLSTVLRFYNHMLPISLLF  418
S ST L F  H+  +SL F
Sbjct  294  SSSTSLHFSIHLCILSLSF  312


>emb|CAN82280.1| hypothetical protein VITISV_044064 [Vitis vinifera]
Length=328

Score =  139 bits (350),  Expect = 1e-36
Identities = 81/139 (58%), Positives = 101/139 (73%), Gaps = 7/139 (5%)
Frame = +2

Query  2    EAVKRLEKDCFNhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKMHNQD  181
E+VK+LE+DC +     GLGGCSKCLN+LY L  DK+  S+  E  +       KMHN+D
Sbjct  182  ESVKKLERDCLSTNGFPGLGGCSKCLNTLYLLGKDKTGNSSKLEARSR------KMHNRD  235

Query  182  CEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSEINSQ  361
CE+MGLTWLL KNR+ YIHTVSAVLR++M++ D  SDP  C+LNSDGMPLAVDS+EI+S
Sbjct  236  CELMGLTWLLAKNRTXYIHTVSAVLRAIMMSTD-GSDPLSCTLNSDGMPLAVDSAEISSH  294

Query  362  SLSTVLRFYNHMLPISLLF  418
S ST L F  H+  +SL F
Sbjct  295  SSSTSLHFSIHLCILSLSF  313


>ref|XP_003525199.1| PREDICTED: uncharacterized GPI-anchored protein At4g28100-like
[Glycine max]
Length=360

Score =  125 bits (313),  Expect = 4e-31
Identities = 76/131 (58%), Positives = 97/131 (74%), Gaps = 12/131 (9%)
Frame = +2

Query  2    EAVKRLEKDCF----NhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKM  169
++VKRLE+DCF    N     GLGGCSKCL+SLYSL  + S  S + +R+        K+
Sbjct  201  QSVKRLERDCFSSSTNVNKFPGLGGCSKCLHSLYSLRKNSSNSSKSEDRTT-------KI  253

Query  170  HNQDCEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSE  349
HN+DCE+MGLTWLL KNR+AYIHTVS VLR+LM++  + SDP  C+LNSDGMPLAVDSSE
Sbjct  254  HNKDCELMGLTWLLAKNRTAYIHTVSGVLRALMLS-TEGSDPQSCTLNSDGMPLAVDSSE  312

Query  350  INSQSLSTVLR  382
++ +S ST L+
Sbjct  313  MSDESSSTNLQ  323


>gb|ACU18611.1| unknown [Glycine max]
Length=360

Score =  123 bits (309),  Expect = 1e-30
Identities = 76/131 (58%), Positives = 96/131 (73%), Gaps = 12/131 (9%)
Frame = +2

Query  2    EAVKRLEKDCF----NhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKM  169
++VKRLE DCF    N     GLGGCSKCL+SLYSL  + S  S + +R+        K+
Sbjct  201  QSVKRLEGDCFSSGTNVNKFPGLGGCSKCLHSLYSLRKNSSNSSKSEDRTT-------KI  253

Query  170  HNQDCEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSE  349
HN+DCE+MGLTWLL KNR+AYIHTVS VLR+LM++  + SDP  C+LNSDGMPLAVDSSE
Sbjct  254  HNKDCELMGLTWLLAKNRTAYIHTVSGVLRALMLS-TEGSDPQSCTLNSDGMPLAVDSSE  312

Query  350  INSQSLSTVLR  382
++ +S ST L+
Sbjct  313  MSDESSSTNLQ  323


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 244898668
Number of extensions: 4517584
Number of successful extensions: 10596
Number of sequences better than 1e-10: 6
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 10583
Number of HSP's successfully gapped: 6
Length of query: 562
Length of database: 6150218869
Length adjustment: 132
Effective length of query: 430
Effective length of database: 3784899781
Effective search space: 208169487955
Effective search space used: 208169487955
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 172 (70.9 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEAKT7901N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_7825

Length=552


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002298184.1|  predicted protein [Populus trichocarpa] >...  66.2    3e-11
ref|XP_003560817.1|  PREDICTED: chemocyanin-like [Brachypodium...  66.2    3e-11
gb|ACG29532.1|  chemocyanin precursor [Zea mays]                   65.5    5e-11
ref|NP_001149596.1|  chemocyanin precursor [Zea mays] >gb|ACG3...  65.1    7e-11
gb|AEC10985.1|  basic blue protein [Camellia sinensis]             64.7    9e-11

ALIGNMENTS
>ref|XP_002298184.1| predicted protein [Populus trichocarpa]
gb|EEE82989.1| predicted protein [Populus trichocarpa]
Length=125

Score = 66.2 bits (160),  Expect = 3e-11
Identities = 34/60 (57%), Positives = 39/60 (65%), Gaps = 1/60 (2%)
Frame = -3

Query  334  KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT  155
K   AGD L F Y+ AA +NVV VNKAGY +C   +  K YTSG D I L KG NFFIC+
Sbjct  51   KSFKAGDILVFNYSTAA-HNVVAVNKAGYSSCTSPRGAKVYTSGKDQIKLVKGQNFFICS  109


>ref|XP_003560817.1| PREDICTED: chemocyanin-like [Brachypodium distachyon]
Length=130

Score = 66.2 bits (160),  Expect = 3e-11
Identities = 30/60 (50%), Positives = 42/60 (70%), Gaps = 1/60 (2%)
Frame = -3

Query  334  KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT  155
K   AGD LQF Y + A +NVV VN AGY +C+  +  K Y+SGND++ L++G N+FIC+
Sbjct  56   KRFRAGDVLQFKYGRGA-HNVVAVNAAGYKSCSAPRGAKVYSSGNDSVKLSRGTNYFICS  114


>gb|ACG29532.1| chemocyanin precursor [Zea mays]
Length=129

Score = 65.5 bits (158),  Expect = 5e-11
Identities = 31/60 (52%), Positives = 39/60 (65%), Gaps = 1/60 (2%)
Frame = -3

Query  334  KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT  155
K   AGD L F Y+  A +NVV VN AGY  C+  +  K YTSGND +TL +G N+FIC+
Sbjct  55   KRFKAGDVLVFKYDSTA-HNVVAVNAAGYKGCSAPRGAKVYTSGNDRVTLARGTNYFICS  113


>ref|NP_001149596.1| chemocyanin precursor [Zea mays]
gb|ACG35999.1| chemocyanin precursor [Zea mays]
Length=129

Score = 65.1 bits (157),  Expect = 7e-11
Identities = 31/60 (52%), Positives = 39/60 (65%), Gaps = 1/60 (2%)
Frame = -3

Query  334  KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT  155
K   AGD L F Y+  A +NVV VN AGY  C+  +  K YTSGND +TL +G N+FIC+
Sbjct  55   KRFKAGDVLVFKYDSTA-HNVVVVNAAGYKGCSAPRGAKVYTSGNDRVTLARGTNYFICS  113


>gb|AEC10985.1| basic blue protein [Camellia sinensis]
Length=122

Score = 64.7 bits (156),  Expect = 9e-11
Identities = 32/60 (53%), Positives = 38/60 (63%), Gaps = 1/60 (2%)
Frame = -3

Query  334  KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT  155
K   AGD L F YN A  +NVV VNKAGYD+C      + ++SG D I L KG NFFIC+
Sbjct  48   KRFRAGDILAFNYN-AQAHNVVSVNKAGYDSCKAPAGARVFSSGKDQIKLVKGQNFFICS  106


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 255915547
Number of extensions: 5362215
Number of successful extensions: 12058
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 12044
Number of HSP's successfully gapped: 0
Length of query: 552
Length of database: 6150218869
Length adjustment: 132
Effective length of query: 420
Effective length of database: 3784899781
Effective search space: 196814788612
Effective search space used: 196814788612
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 172 (70.9 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEB3BST013


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_7997

Length=547


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003523208.1|  PREDICTED: small nuclear ribonucleoprotei...   165    1e-49
ref|XP_003542280.1|  PREDICTED: small nuclear ribonucleoprotei...   165    2e-49
ref|XP_003597961.1|  Small nuclear ribonucleoprotein E [Medica...   164    4e-49
ref|XP_002522808.1|  Small nuclear ribonucleoprotein E, putati...   164    4e-49
ref|XP_002331664.1|  predicted protein [Populus trichocarpa] >...   164    4e-49

ALIGNMENTS
>ref|XP_003523208.1| PREDICTED: small nuclear ribonucleoprotein E-like [Glycine max]
ref|XP_003526883.1| PREDICTED: small nuclear ribonucleoprotein E [Glycine max]
Length=88

Score =  165 bits (418),  Expect = 1e-49
Identities = 83/86 (97%), Positives = 85/86 (99%), Gaps = 0/86 (0%)
Frame = +1

Query  1    ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN  180
ASTKVQR+MTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLD+AEEVN
Sbjct  2    ASTKVQRVMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDDAEEVN  61

Query  181  VKKKSRKQLGRILLKGDNITLMMNTG  258
VKKKSRK LGRILLKGDNITLMMNTG
Sbjct  62   VKKKSRKTLGRILLKGDNITLMMNTG  87


>ref|XP_003542280.1| PREDICTED: small nuclear ribonucleoprotein E-like [Glycine max]
gb|ACU14300.1| unknown [Glycine max]
Length=88

Score =  165 bits (417),  Expect = 2e-49
Identities = 82/86 (95%), Positives = 85/86 (99%), Gaps = 0/86 (0%)
Frame = +1

Query  1    ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN  180
ASTKVQR+MTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLD+AEEVN
Sbjct  2    ASTKVQRVMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDDAEEVN  61

Query  181  VKKKSRKQLGRILLKGDNITLMMNTG  258
+KKKSRK LGRILLKGDNITLMMNTG
Sbjct  62   IKKKSRKTLGRILLKGDNITLMMNTG  87


>ref|XP_003597961.1| Small nuclear ribonucleoprotein E [Medicago truncatula]
ref|XP_003615807.1| Small nuclear ribonucleoprotein E [Medicago truncatula]
gb|AES68212.1| Small nuclear ribonucleoprotein E [Medicago truncatula]
gb|AES98765.1| Small nuclear ribonucleoprotein E [Medicago truncatula]
Length=88

Score =  164 bits (415),  Expect = 4e-49
Identities = 82/86 (95%), Positives = 85/86 (99%), Gaps = 0/86 (0%)
Frame = +1

Query  1    ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN  180
ASTKVQR+MTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLD+AEEVN
Sbjct  2    ASTKVQRVMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDDAEEVN  61

Query  181  VKKKSRKQLGRILLKGDNITLMMNTG  258
VKKKS+K LGRILLKGDNITLMMNTG
Sbjct  62   VKKKSKKTLGRILLKGDNITLMMNTG  87


>ref|XP_002522808.1| Small nuclear ribonucleoprotein E, putative [Ricinus communis]
gb|EEF39659.1| Small nuclear ribonucleoprotein E, putative [Ricinus communis]
Length=88

Score =  164 bits (415),  Expect = 4e-49
Identities = 83/86 (97%), Positives = 84/86 (98%), Gaps = 0/86 (0%)
Frame = +1

Query  1    ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN  180
ASTKVQRIMTQPINLIFRFLQSKARIQ WLFEQKDLRIEGRIIGFDEYMNLVLD+AEEVN
Sbjct  2    ASTKVQRIMTQPINLIFRFLQSKARIQFWLFEQKDLRIEGRIIGFDEYMNLVLDDAEEVN  61

Query  181  VKKKSRKQLGRILLKGDNITLMMNTG  258
VKKKSRK LGRILLKGDNITLMMNTG
Sbjct  62   VKKKSRKTLGRILLKGDNITLMMNTG  87


>ref|XP_002331664.1| predicted protein [Populus trichocarpa]
ref|XP_002325079.1| predicted protein [Populus trichocarpa]
gb|ABK93602.1| unknown [Populus trichocarpa]
gb|EEF03644.1| predicted protein [Populus trichocarpa]
gb|EEF11214.1| predicted protein [Populus trichocarpa]
Length=88

Score =  164 bits (415),  Expect = 4e-49
Identities = 82/86 (95%), Positives = 85/86 (99%), Gaps = 0/86 (0%)
Frame = +1

Query  1    ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN  180
ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVL++AEEVN
Sbjct  2    ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLEDAEEVN  61

Query  181  VKKKSRKQLGRILLKGDNITLMMNTG  258
+KKKSRK LGRILLKGDNITLMMNTG
Sbjct  62   IKKKSRKSLGRILLKGDNITLMMNTG  87


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 231300735
Number of extensions: 4031322
Number of successful extensions: 9114
Number of sequences better than 1e-10: 11
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 9099
Number of HSP's successfully gapped: 11
Length of query: 547
Length of database: 6150218869
Length adjustment: 132
Effective length of query: 415
Effective length of database: 3784899781
Effective search space: 189244989050
Effective search space used: 189244989050
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 172 (70.9 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPE1825Y01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_837

Length=960


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|NP_566697.1|  cysteine-rich repeat secretory protein 38 [A...   122    2e-29
dbj|BAJ34612.1|  unnamed protein product [Thellungiella haloph...   119    2e-28
ref|XP_002883339.1|  hypothetical protein ARALYDRAFT_479720 [A...   118    4e-28
ref|XP_002274668.1|  PREDICTED: cysteine-rich repeat secretory...   117    8e-28
ref|XP_002327925.1|  predicted protein [Populus trichocarpa] >...   115    4e-27

ALIGNMENTS
>ref|NP_566697.1| cysteine-rich repeat secretory protein 38 [Arabidopsis thaliana]
sp|Q9LRJ9.1|CRR38_ARATH RecName: Full=Cysteine-rich repeat secretory protein 38; Flags:
Precursor
dbj|BAB01391.1| unnamed protein product [Arabidopsis thaliana]
gb|AAK59407.1| unknown protein [Arabidopsis thaliana]
gb|ABD91491.1| At3g22060 [Arabidopsis thaliana]
gb|AEE76585.1| cysteine-rich repeat secretory protein 38 [Arabidopsis thaliana]
Length=252

Score =  122 bits (305),  Expect = 2e-29
Identities = 80/222 (36%), Positives = 114/222 (51%), Gaps = 13/222 (6%)
Frame = +1

Query  109  YNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVTFSNTSYGKSPDQVYGLALCHADT  288
+ CS   G +  +S Y+SNL  L + L+  +      F+ +S G +P+ V GLALC  D
Sbjct  37   HKCSDIEGSFTSKSLYESNLNNLFSQLSYKVPSTG--FAASSTGNTPNNVNGLALCRGDA  94

Query  289  YSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTL--VMDV  462
S +C  C+      +   C N KA I W D C VKYS+ NFFG++D +N+  L  V +V
Sbjct  95   SSSDCRSCLETAIPELRQRCPNNKAGIVWYDNCLVKYSSTNFFGKIDFENRFYLYNVKNV  154

Query  463  HDISYsltkslttttfkllsdlkNKATANNNHMIMFARGTAVGRQDRFVIFGLVQCSGDL  642
D S     +  + T  LL++L  KAT  +N  +       +G+     ++GLVQC+ DL
Sbjct  155  SDPS-----TFNSQTKALLTELTKKATTRDNQKLFATGEKNIGKNK---LYGLVQCTRDL  206

Query  643  SKYNCGQCLNYTIGQLPKNLTSL-GMKAVTGSCVVRYDSYHF  765
C  CLN  IG+LP       G + V GSC  RY+ Y F
Sbjct  207  KSITCKACLNGIIGELPNCCDGKEGGRVVGGSCNFRYEIYPF  248


>dbj|BAJ34612.1| unnamed protein product [Thellungiella halophila]
Length=255

Score =  119 bits (298),  Expect = 2e-28
Identities = 82/220 (37%), Positives = 119/220 (54%), Gaps = 9/220 (4%)
Frame = +1

Query  109  YNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVTFSNTSYGKSPDQVYGLALCHADT  288
+ CS   G +  +S Y+SNL  L   ++  +  +   F+ +S G SPD V GLALC  D
Sbjct  40   HKCSDIEGNFTSKSPYESNLDSLFRRISYRVPSSG--FAASSAGNSPDNVNGLALCRGDA  97

Query  289  YSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTLVMDVHD  468
S +C  C+      +   C N KA I W D C VKYS+ NFFG++D +N+  L  +V++
Sbjct  98   SSSDCGSCLATAIPELRQRCPNNKAGIIWYDNCLVKYSSTNFFGKIDYENRFYL-YNVNN  156

Query  469  ISYsltkslttttfkllsdlkNKATANNNHMIMFARGTAVGRQDRFVIFGLVQCSGDLSK  648
+S     S  T T  LL++L  KAT  +N   +FA G      ++  ++GLVQC+ DL +
Sbjct  157  VS--DPASFNTQTKALLTELTQKATTGDNQK-LFATGEK--NLEKKKLYGLVQCTRDLRR  211

Query  649  YNCGQCLNYTIGQLPKNLTSL-GMKAVTGSCVVRYDSYHF  765
+C  CL+  IG+LP       G + V GSC  RY+ Y F
Sbjct  212  ESCKACLDGIIGELPNCCDGKEGGRVVGGSCNFRYEIYPF  251


>ref|XP_002883339.1| hypothetical protein ARALYDRAFT_479720 [Arabidopsis lyrata subsp.
lyrata]
gb|EFH59598.1| hypothetical protein ARALYDRAFT_479720 [Arabidopsis lyrata subsp.
lyrata]
Length=252

Score =  118 bits (296),  Expect = 4e-28
Identities = 80/220 (36%), Positives = 113/220 (51%), Gaps = 9/220 (4%)
Frame = +1

Query  109  YNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVTFSNTSYGKSPDQVYGLALCHADT  288
+ CS   G +  +S Y+SNL  L   L+  +      F+ +S G +PD V GLALC  D
Sbjct  37   HKCSDIEGSFTSKSPYESNLNNLFPQLSYKVPSTG--FATSSAGITPDNVNGLALCRGDA  94

Query  289  YSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTLVMDVHD  468
S +C  C+      I   C + KA I W D C VKYS+ NFFG++D +N+  L  +V++
Sbjct  95   SSSDCSSCLATAIPEIRQRCPSNKAGIIWYDNCLVKYSSTNFFGKIDFENRFYL-YNVNN  153

Query  469  ISYsltkslttttfkllsdlkNKATANNNHMIMFARGTAVGRQDRFVIFGLVQCSGDLSK  648
+S     +  T T  LL+ L  KAT  +N  +       +G +    ++GLVQC+ DL
Sbjct  154  VS--DPSTFNTQTKALLTKLTKKATTGDNQKLFATGEKNIGMKK---LYGLVQCTRDLKS  208

Query  649  YNCGQCLNYTIGQLPKNLTSL-GMKAVTGSCVVRYDSYHF  765
C  CLN  IG+LP       G + V GSC  RY+ Y F
Sbjct  209  EACKACLNGIIGELPNCCDGKEGGRVVGGSCNFRYEIYPF  248


>ref|XP_002274668.1| PREDICTED: cysteine-rich repeat secretory protein 38-like [Vitis
vinifera]
Length=242

Score =  117 bits (293),  Expect = 8e-28
Identities = 79/224 (35%), Positives = 116/224 (52%), Gaps = 10/224 (4%)
Frame = +1

Query  97   TKPQYNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVTFSNTSYGKSPDQVYGLALC  276
T P Y+  S+S  +    TY++NL KL+  L   L      F   S G++ DQV GLALC
Sbjct  25   TDPLYHFCSSSQKFIDNGTYETNLNKLMGYLY--LAAPPTGFRKGSVGENNDQVNGLALC  82

Query  277  HADTYSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTLVM  456
D  + +C  C+   +  I   C + KA I W DYC +KYSN NFFG+VD  N +  +
Sbjct  83   RGDVSNTDCKACITESSSEIRKRCPDNKAAIIWYDYCLLKYSNVNFFGQVDHQN-MFYMW  141

Query  457  DVHDISYsltkslttttfkllsdlkNKATANNNHMIMFARGTAVGRQDRFVIFGLVQCSG  636
+++++S        +   K    L N A    N M M+A G  +  ++   ++GL QC+
Sbjct  142  NLNNVS-----DPDSFNQKTKELLSNLAQQAFNAMKMYATG-ELELEESEKLYGLTQCTR  195

Query  637  DLSKYNCGQCLNYTIGQLPKNLTSL-GMKAVTGSCVVRYDSYHF  765
DLS  +C +CL+  I +LP       G + V GSC +RY+ Y F
Sbjct  196  DLSSSDCKKCLDDAISKLPNCCDGKEGGRVVGGSCNIRYEIYPF  239


>ref|XP_002327925.1| predicted protein [Populus trichocarpa]
gb|EEE75713.1| predicted protein [Populus trichocarpa]
Length=242

Score =  115 bits (288),  Expect = 4e-27
Identities = 81/224 (36%), Positives = 112/224 (50%), Gaps = 14/224 (6%)
Frame = +1

Query  103  PQYNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVT-FSNTSYGKSPDQVYGLALCH  279
P ++  S+   +     Y+SNL KL + L    Y    T F   S G +PDQ YGLALC
Sbjct  27   PNFHLCSSPENFTANGPYESNLKKLTSYL---YYKAPPTGFGMGSRGHTPDQTYGLALCR  83

Query  280  ADTYSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTLVMD  459
D  + +C  CV   +  I   C   KA I W D C +KYSN  FFG+VDT NK   + +
Sbjct  84   GDVSTSDCKTCVFEASSEIRKRCPYNKAAIIWYDNCLLKYSNTGFFGQVDTGNKF-YMWN  142

Query  460  VHDISYsltkslttttfkllsdlkNKATANNNHMIMFARG-TAVGRQDRFVIFGLVQCSG  636
VH +S     +  T         + +AT       +FA G   +G+  +  ++GLVQC+G
Sbjct  143  VHVVSKPAPFNKKTKELLSQLANEAQATPK-----LFATGERELGKSTK--LYGLVQCTG  195

Query  637  DLSKYNCGQCLNYTIGQLPKNLT-SLGMKAVTGSCVVRYDSYHF  765
DLS   C +CL+  IG+LP       G + V+GSC   Y+ Y F
Sbjct  196  DLSSAVCKKCLDGIIGELPSCCDGKQGGRVVSGSCNFIYELYPF  239


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 510557841
Number of extensions: 10815912
Number of successful extensions: 24913
Number of sequences better than 1e-10: 14
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 24856
Number of HSP's successfully gapped: 18
Length of query: 960
Length of database: 6150218869
Length adjustment: 140
Effective length of query: 820
Effective length of database: 3641547109
Effective search space: 655478479620
Effective search space used: 655478479620
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 177 (72.8 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPDY3X4S01N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_84

Length=1352


Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|AAQ09999.1|  putative fructokinase 2 [Petunia integrifolia ...   407    1e-136
gb|AAQ10000.1|  putative fructokinase 2 [Petunia integrifolia ...   407    1e-136
ref|XP_002533363.1|  fructokinase, putative [Ricinus communis]...   405    6e-136
gb|AAA80675.1|  fructokinase [Beta vulgaris]                        398    3e-133
emb|CAD31714.1|  fructokinase-like protein [Cicer arietinum]        393    9e-133

ALIGNMENTS
>gb|AAQ09999.1| putative fructokinase 2 [Petunia integrifolia subsp. inflata]
Length=328

Score =  407 bits (1045),  Expect = 1e-136
Identities = 198/234 (85%), Positives = 216/234 (92%), Gaps = 0/234 (0%)
Frame = -2

Query  1351  DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP  1172
D GARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNL +IRSAK+FHYGSISLIVEP
Sbjct  91    DTGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLDVIRSAKIFHYGSISLIVEP  150

Query  1171  CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL  992
CRSAHLKAM+ AK AGALLSYDPNLRLPLWPSA+ A++QI SIW+KAD+IKVSDNELEFL
Sbjct  151   CRSAHLKAMEVAKEAGALLSYDPNLRLPLWPSAEEARKQIKSIWDKADVIKVSDNELEFL  210

Query  991   TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV  812
TGS KIDDE+AMSLWHP L+LLLVTLGEKGCRYY KNF G ++G+HVKTVDTTGAGDSFV
Sbjct  211   TGSDKIDDESAMSLWHPNLKLLLVTLGEKGCRYYTKNFHGGVEGFHVKTVDTTGAGDSFV  270

Query  811   GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSIL  650
GALL KIVDD SI+ DEARLKEVL +ACACGAITTTKKGAIPALP  S+ L++L
Sbjct  271   GALLTKIVDDQSILEDEARLKEVLTFACACGAITTTKKGAIPALPTESEALTLL  324


>gb|AAQ10000.1| putative fructokinase 2 [Petunia integrifolia subsp. inflata]
Length=328

Score =  407 bits (1045),  Expect = 1e-136
Identities = 198/234 (85%), Positives = 216/234 (92%), Gaps = 0/234 (0%)
Frame = -2

Query  1351  DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP  1172
D GARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNL +IRSAK+FHYGSISLIVEP
Sbjct  91    DTGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLDVIRSAKIFHYGSISLIVEP  150

Query  1171  CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL  992
CRSAHLKAM+ AK AGALLSYDPNLRLPLWPSA+ A++QI SIW+KAD+IKVSDNELEFL
Sbjct  151   CRSAHLKAMEVAKEAGALLSYDPNLRLPLWPSAEEARKQIKSIWDKADVIKVSDNELEFL  210

Query  991   TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV  812
TGS KIDDE+AMSLWHP L+LLLVTLGEKGCRYY KNF G ++G+HVKTVDTTGAGDSFV
Sbjct  211   TGSDKIDDESAMSLWHPNLKLLLVTLGEKGCRYYTKNFHGGVEGFHVKTVDTTGAGDSFV  270

Query  811   GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSIL  650
GALL KIVDD SI+ DEARLKEVL +ACACGAITTTKKGAIPALP  S+ L++L
Sbjct  271   GALLTKIVDDQSILEDEARLKEVLTFACACGAITTTKKGAIPALPTESEALTLL  324


>ref|XP_002533363.1| fructokinase, putative [Ricinus communis]
gb|EEF29025.1| fructokinase, putative [Ricinus communis]
Length=330

Score =  405 bits (1040),  Expect = 6e-136
Identities = 197/237 (83%), Positives = 220/237 (93%), Gaps = 0/237 (0%)
Frame = -2

Query  1351  DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP  1172
DKGARTALAFVTLRADGEREFMFYRNPSADMLLTP+ELNL +IRSAK+FHYGSISLIVEP
Sbjct  93    DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPEELNLEVIRSAKIFHYGSISLIVEP  152

Query  1171  CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL  992
CRSAHLKAM+EAK+AGALLSYDPNLRLPLWPSA+YA+EQIMSIW+KADIIKVSD ELEFL
Sbjct  153   CRSAHLKAMEEAKNAGALLSYDPNLRLPLWPSAEYAREQIMSIWDKADIIKVSDVELEFL  212

Query  991   TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV  812
TGS KIDDE+A+SLWHP L+LLLVTLGE GCRYY KNF GS+D +HVKTVDTTGAGDSFV
Sbjct  213   TGSDKIDDESALSLWHPNLKLLLVTLGENGCRYYTKNFHGSVDAFHVKTVDTTGAGDSFV  272

Query  811   GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSILNGA  641
GALL KIVDD S++ +E RL+EVL++A ACGAITTTKKGAIPALP  +DVLS++  +
Sbjct  273   GALLCKIVDDLSVLEEEPRLREVLRFANACGAITTTKKGAIPALPTEADVLSLMKAS  329


>gb|AAA80675.1| fructokinase [Beta vulgaris]
Length=331

Score =  398 bits (1022),  Expect = 3e-133
Identities = 194/234 (83%), Positives = 215/234 (92%), Gaps = 0/234 (0%)
Frame = -2

Query  1351  DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP  1172
DKGARTALAFVTL++DGEREFMFYRNPSADMLLTPDELNL +IRSAKVFHYGSI LIVEP
Sbjct  93    DKGARTALAFVTLKSDGEREFMFYRNPSADMLLTPDELNLDLIRSAKVFHYGSIRLIVEP  152

Query  1171  CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL  992
CRSAHLKAM+EAK AGALLSYDPNLRLPLWPSA+ A+EQIMSIW+KA++IKVSDNELEFL
Sbjct  153   CRSAHLKAMEEAKKAGALLSYDPNLRLPLWPSAEEAREQIMSIWDKAEVIKVSDNELEFL  212

Query  991   TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV  812
TG+S IDD  AMSLWHP L+LLLVTLG++GCRYY KNF+GS+DG+ V  VDTTGAGDSFV
Sbjct  213   TGNSTIDDATAMSLWHPNLKLLLVTLGDQGCRYYTKNFKGSLDGFKVNAVDTTGAGDSFV  272

Query  811   GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSIL  650
GALL KIVDDHSII DE+RLKEVLK+A ACGAITTTKKGAIPALP  +D L ++
Sbjct  273   GALLNKIVDDHSIIEDESRLKEVLKFANACGAITTTKKGAIPALPTVADALELI  326


>emb|CAD31714.1| fructokinase-like protein [Cicer arietinum]
Length=238

Score =  393 bits (1010),  Expect = 9e-133
Identities = 193/236 (82%), Positives = 213/236 (90%), Gaps = 0/236 (0%)
Frame = -2

Query  1351  DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP  1172
DKGARTALAFVTLRADGEREFMFYRNPSADMLLTP++LNL +IRSAKVFHYGSISLIVEP
Sbjct  2     DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPEDLNLELIRSAKVFHYGSISLIVEP  61

Query  1171  CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL  992
CRSAHLKAM+ AK AG LLSYDPNLRLPLWPS + A+ QI+SIW+KAD+IKVSD ELEFL
Sbjct  62    CRSAHLKAMEVAKDAGCLLSYDPNLRLPLWPSPEEARNQILSIWDKADLIKVSDVELEFL  121

Query  991   TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV  812
TGS KIDD +A+SLWHP L+LLLVTLGE G RYY KNF GS+D +HV TVDTTGAGDSFV
Sbjct  122   TGSDKIDDASALSLWHPNLKLLLVTLGENGSRYYTKNFHGSVDAFHVNTVDTTGAGDSFV  181

Query  811   GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSILNG  644
GALLGKIVDD SI+ DEARL+EVLK+A ACGAITTTKKGAIPALP  +DVLS++ G
Sbjct  182   GALLGKIVDDQSILEDEARLREVLKFANACGAITTTKKGAIPALPTEADVLSLIKG  237


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 1,855,251,573
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 766044481
Number of extensions: 17406778
Number of successful extensions: 41200
Number of sequences better than 1e-10: 135
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 40779
Number of HSP's successfully gapped: 144
Length of query: 1352
Length of database: 6150218869
Length adjustment: 144
Effective length of query: 1208
Effective length of database: 3569870773
Effective search space: 1092380456538
Effective search space used: 1092380456538
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 179 (73.6 bits)






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEBB8R7016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_8780

Length=525


Score     E
Sequences producing significant alignments:                       (Bits)  Value

emb|CAJ34815.1|  amino acid permease [Plantago major]               183    4e-56
ref|XP_002327101.1|  lysine/histidine transporter [Populus tri...   181    3e-51
ref|XP_002510286.1|  amino acid transporter, putative [Ricinus...   181    4e-51
ref|XP_002284114.1|  PREDICTED: lysine histidine transporter-l...   180    8e-51
ref|XP_003545851.1|  PREDICTED: lysine histidine transporter-l...   178    4e-50

ALIGNMENTS
>emb|CAJ34815.1| amino acid permease [Plantago major]
Length=136

Score =  183 bits (465),  Expect = 4e-56
Identities = 100/110 (91%), Positives = 106/110 (96%), Gaps = 0/110 (0%)
Frame = +1

Query  1    ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW  180
A YTSRTNRPCSIWVRSGFRVFYGF+SLLIGVA PFLSSLAGLLGGLTLPVTFAYPCFMW
Sbjct  26   AGYTSRTNRPCSIWVRSGFRVFYGFISLLIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW  85

Query  181  VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKP  330
VLIK+P KY+FNWYFNW+LGWLG+AFS AFS+GGIWSMVNSGLKLKFFKP
Sbjct  86   VLIKKPTKYTFNWYFNWILGWLGIAFSLAFSIGGIWSMVNSGLKLKFFKP  135


>ref|XP_002327101.1| lysine/histidine transporter [Populus trichocarpa]
gb|EEE73851.1| lysine/histidine transporter [Populus trichocarpa]
Length=521

Score =  181 bits (459),  Expect = 3e-51
Identities = 98/110 (89%), Positives = 105/110 (95%), Gaps = 0/110 (0%)
Frame = +1

Query  1    ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW  180
ASYT+RTNRPCSIWVRSGFRVFYGF+S  IGVA PFLSSLAGLLGGLTLPVTFAYPCFMW
Sbjct  410  ASYTTRTNRPCSIWVRSGFRVFYGFISFFIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW  469

Query  181  VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKP  330
VLIK+P KYSFNWYFNW+LGWLG+AFS AFS+GG+WSMVNSGLKLKFFKP
Sbjct  470  VLIKKPSKYSFNWYFNWILGWLGIAFSLAFSIGGVWSMVNSGLKLKFFKP  519


>ref|XP_002510286.1| amino acid transporter, putative [Ricinus communis]
gb|EEF52473.1| amino acid transporter, putative [Ricinus communis]
Length=521

Score =  181 bits (458),  Expect = 4e-51
Identities = 98/110 (89%), Positives = 104/110 (95%), Gaps = 0/110 (0%)
Frame = +1

Query  1    ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW  180
A YTSRTNRPCSIWVRSGFRVFYGF+S  IGVA PFLSSLAGLLGGLTLPVTFAYPCFMW
Sbjct  410  AGYTSRTNRPCSIWVRSGFRVFYGFISFFIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW  469

Query  181  VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKP  330
VLIKRP KYSFNWYFNW+LGWLG+AFS AFS+GG+WSMVNSGL+LKFFKP
Sbjct  470  VLIKRPSKYSFNWYFNWILGWLGIAFSLAFSIGGVWSMVNSGLRLKFFKP  519


>ref|XP_002284114.1| PREDICTED: lysine histidine transporter-like 8 [Vitis vinifera]
emb|CBI19587.3| unnamed protein product [Vitis vinifera]
Length=514

Score =  180 bits (456),  Expect = 8e-51
Identities = 97/111 (87%), Positives = 105/111 (95%), Gaps = 0/111 (0%)
Frame = +1

Query  1    ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW  180
A YTSRTNRPCSIWVRSGFRVFYGF+S  IGVA PFLSSLAGLLGGLTLPVTFAYPCFMW
Sbjct  404  AGYTSRTNRPCSIWVRSGFRVFYGFISFFIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW  463

Query  181  VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKPA  333
VLIK+P K+SFNWYFNW+LGWLG+AFS AFS+GG+WSMVNSGLKLKFFKP+
Sbjct  464  VLIKKPTKFSFNWYFNWILGWLGIAFSLAFSIGGVWSMVNSGLKLKFFKPS  514


>ref|XP_003545851.1| PREDICTED: lysine histidine transporter-like 8-like [Glycine
max]
Length=516

Score =  178 bits (451),  Expect = 4e-50
Identities = 99/110 (90%), Positives = 103/110 (94%), Gaps = 0/110 (0%)
Frame = +1

Query  1    ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW  180
A YTSRTNRPCSIWVRSGFRVFYGFVS  IGVA PFLSSLAGLLGGLTLPVTFAYPCFMW
Sbjct  406  AGYTSRTNRPCSIWVRSGFRVFYGFVSFFIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW  465

Query  181  VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKP  330
VLIK+P KYSFNWYFNW+LGWLGVAFS AFS+GGIWS+VN GLKLKFFKP
Sbjct  466  VLIKQPPKYSFNWYFNWILGWLGVAFSLAFSIGGIWSIVNDGLKLKFFKP  515


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 262786732
Number of extensions: 5527508
Number of successful extensions: 15061
Number of sequences better than 1e-10: 1
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 15038
Number of HSP's successfully gapped: 1
Length of query: 525
Length of database: 6150218869
Length adjustment: 131
Effective length of query: 394
Effective length of database: 3802818865
Effective search space: 167324030060
Effective search space used: 167324030060
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 172 (70.9 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPEBE7W201N


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_8801

Length=524


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003634857.1|  PREDICTED: uncharacterized protein LOC100...   103    5e-25
ref|XP_002265064.1|  PREDICTED: uncharacterized protein LOC100...   102    2e-24
ref|XP_002316639.1|  predicted protein [Populus trichocarpa] >...  95.5    8e-22
ref|NP_001237779.1|  uncharacterized protein LOC100527043 [Gly...  92.0    2e-20
ref|XP_003542773.1|  PREDICTED: uncharacterized protein LOC100...  90.5    6e-20

ALIGNMENTS
>ref|XP_003634857.1| PREDICTED: uncharacterized protein LOC100854976 [Vitis vinifera]
Length=155

Score =  103 bits (258),  Expect = 5e-25
Identities = 53/98 (54%), Positives = 71/98 (72%), Gaps = 6/98 (6%)
Frame = -1

Query  524  AIFEKLLRTTSTDISPGGGRRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVMPG  345
AIFE+ LR T   +SP  GR  R D+E+Q    + F+ KLG+PSPK++V AR VSV+MPG
Sbjct  52   AIFERFLRPTPPSLSPATGR-DRGDVESQ----MGFNGKLGHPSPKMTVYARGVSVLMPG  106

Query  344  NKLPTFIANPAPVPCPPDRMPWPSHQQQTNLPDPHSSS  231
+PTFIA+PAPVPCPPDR+PWP  ++  +  +P S+S
Sbjct  107  EDIPTFIAHPAPVPCPPDRIPWPL-KEHNSTANPSSNS  143


>ref|XP_002265064.1| PREDICTED: uncharacterized protein LOC100241762 [Vitis vinifera]
Length=155

Score =  102 bits (254),  Expect = 2e-24
Identities = 50/83 (60%), Positives = 63/83 (76%), Gaps = 5/83 (6%)
Frame = -1

Query  524  AIFEKLLRTTSTDISPGGGRRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVMPG  345
AIFE+ LR T   +SP  GR  R D+E+Q    + F+ KLG+PSPK++V AR VSV+MPG
Sbjct  52   AIFERFLRPTPPSLSPATGR-DRGDVESQ----MGFNGKLGHPSPKMTVYARGVSVLMPG  106

Query  344  NKLPTFIANPAPVPCPPDRMPWP  276
+PTFIA+PAPVPCPPDR+PWP
Sbjct  107  EDIPTFIAHPAPVPCPPDRIPWP  129


>ref|XP_002316639.1| predicted protein [Populus trichocarpa]
gb|EEE97251.1| predicted protein [Populus trichocarpa]
Length=157

Score = 95.5 bits (236),  Expect = 8e-22
Identities = 52/95 (55%), Positives = 66/95 (69%), Gaps = 12/95 (13%)
Frame = -1

Query  524  AIFEKLLRTTSTDISPGGGRRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVMPG  345
AIFE+ L  TS       GR G  D+E+Q     RF+SKLG+PSPK++V A  VSV+MPG
Sbjct  56   AIFERFLGPTS-------GRGGHGDLESQ----TRFNSKLGHPSPKMTVYANGVSVLMPG  104

Query  344  NKLPTFIANPAPVPCPPDRMPWPSHQQQTN-LPDP  243
+ +PTFIA PAPVPCPP+R  +P +QQ  N LP+P
Sbjct  105  DNIPTFIALPAPVPCPPERPSYPHNQQHINQLPNP  139


>ref|NP_001237779.1| uncharacterized protein LOC100527043 [Glycine max]
gb|ACU16082.1| unknown [Glycine max]
Length=155

Score = 92.0 bits (227),  Expect = 2e-20
Identities = 45/89 (51%), Positives = 58/89 (65%), Gaps = 6/89 (7%)
Frame = -1

Query  524  AIFEKLLRTTSTDISPGGGRRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVMPG  345
AIFE+ L+ TS  I P GGR  R   +        F+ KLG+PSPK+S+ A  VSV+MPG
Sbjct  58   AIFERFLKPTSPPILPSGGRNRRRSSQMD------FNGKLGHPSPKMSLYASWVSVLMPG  111

Query  344  NKLPTFIANPAPVPCPPDRMPWPSHQQQT  258
+  P+FIA+P P PC P+R+ WPSHQ  T
Sbjct  112  DATPSFIAHPVPAPCCPERISWPSHQHST  140


>ref|XP_003542773.1| PREDICTED: uncharacterized protein LOC100794476 [Glycine max]
Length=155

Score = 90.5 bits (223),  Expect = 6e-20
Identities = 50/100 (50%), Positives = 65/100 (65%), Gaps = 8/100 (8%)
Frame = -1

Query  524  AIFEKLLRTTSTDISPGGG--RRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVM  351
AIFE+ LR TS  +SP     RR   D+EAQ    + F  KL + SPK+SV A  VSV+M
Sbjct  53   AIFERYLRPTSPPLSPSATTRRRSPSDVEAQ----IGFSGKLAHASPKMSVYASGVSVLM  108

Query  350  PGNKLPTFIANPAPVPCPPDRMPWPSHQQQTNLPDPHSSS  231
PG+++PTFIA+PA  PC P+R+ WPSHQ    LP   S++
Sbjct  109  PGDEIPTFIAHPA--PCYPERISWPSHQHNNTLPCSSSNT  146


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 303724249
Number of extensions: 7275648
Number of successful extensions: 22166
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 21999
Number of HSP's successfully gapped: 0
Length of query: 524
Length of database: 6150218869
Length adjustment: 131
Effective length of query: 393
Effective length of database: 3802818865
Effective search space: 163521211195
Effective search space used: 163521211195
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPECAW31016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_9545

Length=504


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003632587.1|  PREDICTED: regulatory-associated protein ...   243    3e-71
ref|XP_003632588.1|  PREDICTED: regulatory-associated protein ...   243    3e-71
ref|XP_003533671.1|  PREDICTED: regulatory-associated protein ...   240    3e-70
ref|XP_003623550.1|  Regulatory-associated protein of mTOR [Me...   240    3e-70
ref|XP_003551595.1|  PREDICTED: regulatory-associated protein ...   238    1e-69

ALIGNMENTS
>ref|XP_003632587.1| PREDICTED: regulatory-associated protein of TOR 1-like isoform
1 [Vitis vinifera]
emb|CBI18073.3| unnamed protein product [Vitis vinifera]
Length=1363

Score =  243 bits (619),  Expect = 3e-71
Identities = 118/150 (79%), Positives = 133/150 (89%), Gaps = 0/150 (0%)
Frame = +2

Query  2     FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR  181
F DG V+L+D+RTPEMLV   +PH Q VERVVGIGFQPGL+PAK+VSASQAG+IQFLD+R
Sbjct  1208  FVDGSVKLFDVRTPEMLVCAARPHTQRVERVVGIGFQPGLDPAKIVSASQAGDIQFLDVR  1267

Query  182   FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK  361
YLTI AHRGSLTALA+HRHAP+IASGSAKQ+IK+FNLEG  LGTIR+YPTFMAQK
Sbjct  1268  NGNCAYLTIDAHRGSLTALAIHRHAPLIASGSAKQIIKVFNLEGSQLGTIRFYPTFMAQK  1327

Query  362   IGSVSCLTFHPYQVMLGAGAADACVSIYAD  451
IGSV+CLTFHPYQV+L AGAADA VSIYAD
Sbjct  1328  IGSVNCLTFHPYQVLLAAGAADALVSIYAD  1357


>ref|XP_003632588.1| PREDICTED: regulatory-associated protein of TOR 1-like isoform
2 [Vitis vinifera]
Length=1370

Score =  243 bits (619),  Expect = 3e-71
Identities = 118/150 (79%), Positives = 133/150 (89%), Gaps = 0/150 (0%)
Frame = +2

Query  2     FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR  181
F DG V+L+D+RTPEMLV   +PH Q VERVVGIGFQPGL+PAK+VSASQAG+IQFLD+R
Sbjct  1215  FVDGSVKLFDVRTPEMLVCAARPHTQRVERVVGIGFQPGLDPAKIVSASQAGDIQFLDVR  1274

Query  182   FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK  361
YLTI AHRGSLTALA+HRHAP+IASGSAKQ+IK+FNLEG  LGTIR+YPTFMAQK
Sbjct  1275  NGNCAYLTIDAHRGSLTALAIHRHAPLIASGSAKQIIKVFNLEGSQLGTIRFYPTFMAQK  1334

Query  362   IGSVSCLTFHPYQVMLGAGAADACVSIYAD  451
IGSV+CLTFHPYQV+L AGAADA VSIYAD
Sbjct  1335  IGSVNCLTFHPYQVLLAAGAADALVSIYAD  1364


>ref|XP_003533671.1| PREDICTED: regulatory-associated protein of TOR 1-like [Glycine
max]
Length=1373

Score =  240 bits (612),  Expect = 3e-70
Identities = 118/150 (79%), Positives = 130/150 (87%), Gaps = 0/150 (0%)
Frame = +2

Query  2     FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR  181
F DG VRLYD+RTP+MLV   +PH Q VE+VVGIGFQPGL+  K+VSASQAG+IQFLD+R
Sbjct  1218  FIDGSVRLYDVRTPDMLVCGLRPHTQRVEKVVGIGFQPGLDQGKIVSASQAGDIQFLDIR  1277

Query  182   FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK  361
YLTI AHRGSLTALAVHRHAPIIASGSAKQ+IK+F+LEGD LGTIRYYPT MAQK
Sbjct  1278  NHSSAYLTIEAHRGSLTALAVHRHAPIIASGSAKQLIKVFSLEGDQLGTIRYYPTLMAQK  1337

Query  362   IGSVSCLTFHPYQVMLGAGAADACVSIYAD  451
IGSVSCL FHPYQV+L AGAADACV IYAD
Sbjct  1338  IGSVSCLNFHPYQVLLAAGAADACVCIYAD  1367


>ref|XP_003623550.1| Regulatory-associated protein of mTOR [Medicago truncatula]
gb|AES79768.1| Regulatory-associated protein of mTOR [Medicago truncatula]
Length=1430

Score =  240 bits (612),  Expect = 3e-70
Identities = 118/150 (79%), Positives = 131/150 (87%), Gaps = 0/150 (0%)
Frame = +2

Query  2     FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR  181
F DG VRLYD RTPEMLV   +PH Q VE+V+GIGFQPGL+P K+VSASQAG+IQFLD+R
Sbjct  1275  FVDGSVRLYDARTPEMLVCGLRPHTQRVEKVMGIGFQPGLDPGKLVSASQAGDIQFLDIR  1334

Query  182   FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK  361
YLTI AHRGSLTALAVHRHAPIIASGSAKQ+IK+F+LEGD LGTIRYYPT MAQK
Sbjct  1335  NHSSAYLTIEAHRGSLTALAVHRHAPIIASGSAKQLIKVFSLEGDQLGTIRYYPTLMAQK  1394

Query  362   IGSVSCLTFHPYQVMLGAGAADACVSIYAD  451
IGSVSCL+FHPYQ++L AGAADACV IYAD
Sbjct  1395  IGSVSCLSFHPYQLLLAAGAADACVCIYAD  1424


>ref|XP_003551595.1| PREDICTED: regulatory-associated protein of TOR 1-like [Glycine
max]
Length=1365

Score =  238 bits (607),  Expect = 1e-69
Identities = 117/150 (78%), Positives = 129/150 (86%), Gaps = 0/150 (0%)
Frame = +2

Query  2     FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR  181
F DG VRLYD+RTP+MLV   +PH Q VE+VVGIGFQPGL+  K+VSASQAG+IQFLD+R
Sbjct  1210  FVDGSVRLYDVRTPDMLVCGLRPHTQRVEKVVGIGFQPGLDQGKIVSASQAGDIQFLDIR  1269

Query  182   FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK  361
YLTI AHRGSLTALAVHRHAPIIASGSAKQ IK+F+LEGD LGTI+YYPT MAQK
Sbjct  1270  NHSSAYLTIEAHRGSLTALAVHRHAPIIASGSAKQFIKVFSLEGDQLGTIKYYPTLMAQK  1329

Query  362   IGSVSCLTFHPYQVMLGAGAADACVSIYAD  451
IGSVSCL FHPYQV+L AGAADACV IYAD
Sbjct  1330  IGSVSCLNFHPYQVLLAAGAADACVCIYAD  1359


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 274557214
Number of extensions: 5904993
Number of successful extensions: 14931
Number of sequences better than 1e-10: 8
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 14866
Number of HSP's successfully gapped: 8
Length of query: 504
Length of database: 6150218869
Length adjustment: 128
Effective length of query: 376
Effective length of database: 3856576117
Effective search space: 154263044680
Effective search space used: 154263044680
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPECJ6WG01S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_9620

Length=502


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_003544765.1|  PREDICTED: ubiquitin-fold modifier 1-like...   170    1e-51
ref|XP_003523461.1|  PREDICTED: ubiquitin-fold modifier 1-like...   168    5e-51
ref|XP_002285835.1|  PREDICTED: ubiquitin-fold modifier 1 [Vit...   168    6e-51
ref|XP_002519920.1|  Ubiquitin-fold modifier 1 precursor, puta...   167    2e-50
ref|XP_003614786.1|  Ubiquitin-fold modifier [Medicago truncat...   167    2e-50

ALIGNMENTS
>ref|XP_003544765.1| PREDICTED: ubiquitin-fold modifier 1-like [Glycine max]
Length=94

Score =  170 bits (430),  Expect = 1e-51
Identities = 84/85 (99%), Positives = 85/85 (100%), Gaps = 0/85 (0%)
Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  4    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  63

Query  182  QSAGNVFLKHGSELRLIPRDRVGSY  256
QSAGNVFLKHGSELRLIPRDRVG+Y
Sbjct  64   QSAGNVFLKHGSELRLIPRDRVGAY  88


>ref|XP_003523461.1| PREDICTED: ubiquitin-fold modifier 1-like [Glycine max]
ref|XP_003526612.1| PREDICTED: ubiquitin-fold modifier 1 [Glycine max]
gb|ACU14017.1| unknown [Glycine max]
Length=88

Score =  168 bits (426),  Expect = 5e-51
Identities = 83/85 (98%), Positives = 85/85 (100%), Gaps = 0/85 (0%)
Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  4    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  63

Query  182  QSAGNVFLKHGSELRLIPRDRVGSY  256
QSAGNVFLKHGSELRLIPRDRVG++
Sbjct  64   QSAGNVFLKHGSELRLIPRDRVGAF  88


>ref|XP_002285835.1| PREDICTED: ubiquitin-fold modifier 1 [Vitis vinifera]
emb|CBI19064.3| unnamed protein product [Vitis vinifera]
Length=97

Score =  168 bits (426),  Expect = 6e-51
Identities = 83/85 (98%), Positives = 85/85 (100%), Gaps = 0/85 (0%)
Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  3    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  62

Query  182  QSAGNVFLKHGSELRLIPRDRVGSY  256
QSAGNVFLKHGSELRLIPRDRVG++
Sbjct  63   QSAGNVFLKHGSELRLIPRDRVGAF  87


>ref|XP_002519920.1| Ubiquitin-fold modifier 1 precursor, putative [Ricinus communis]
gb|EEF42524.1| Ubiquitin-fold modifier 1 precursor, putative [Ricinus communis]
Length=92

Score =  167 bits (423),  Expect = 2e-50
Identities = 83/84 (99%), Positives = 84/84 (100%), Gaps = 0/84 (0%)
Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  7    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  66

Query  182  QSAGNVFLKHGSELRLIPRDRVGS  253
QSAGNVFLKHGSELRLIPRDRVG+
Sbjct  67   QSAGNVFLKHGSELRLIPRDRVGA  90


>ref|XP_003614786.1| Ubiquitin-fold modifier [Medicago truncatula]
gb|AES97744.1| Ubiquitin-fold modifier [Medicago truncatula]
Length=95

Score =  167 bits (423),  Expect = 2e-50
Identities = 83/84 (99%), Positives = 84/84 (100%), Gaps = 0/84 (0%)
Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  5    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  64

Query  182  QSAGNVFLKHGSELRLIPRDRVGS  253
QSAGNVFLKHGSELRLIPRDRVG+
Sbjct  65   QSAGNVFLKHGSELRLIPRDRVGA  88


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 237055771
Number of extensions: 4771997
Number of successful extensions: 10348
Number of sequences better than 1e-10: 10
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 10345
Number of HSP's successfully gapped: 10
Length of query: 502
Length of database: 6150218869
Length adjustment: 128
Effective length of query: 374
Effective length of database: 3856576117
Effective search space: 150406468563
Effective search space used: 150406468563
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362






A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


RID: UPECMGTD016


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_9746

Length=499


Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002327206.1|  predicted protein [Populus trichocarpa] >...   125    1e-33
ref|XP_002326029.1|  predicted protein [Populus trichocarpa] >...   125    3e-33
ref|XP_002279275.1|  PREDICTED: glucan endo-1,3-beta-glucosida...   124    7e-33
ref|XP_002525141.1|  hydrolase, hydrolyzing O-glycosyl compoun...   121    1e-31
ref|XP_002878188.1|  predicted protein [Arabidopsis lyrata sub...   115    3e-29

ALIGNMENTS
>ref|XP_002327206.1| predicted protein [Populus trichocarpa]
gb|EEE74011.1| predicted protein [Populus trichocarpa]
Length=143

Score =  125 bits (315),  Expect = 1e-33
Identities = 57/81 (70%), Positives = 66/81 (81%), Gaps = 0/81 (0%)
Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
KNNA D  LQ AI+WACG G  +CGPIQ+ G CYD  D+QRTAS+AFNDYYL+NG +DD
Sbjct  7    KNNAADQALQEAINWACGQGGANCGPIQQGGACYDSNDIQRTASWAFNDYYLKNGLTDDA  66

Query  181  CNFSNTAALTSIDPSHDKCKF  243
C FSNTAALTS++PS DKCKF
Sbjct  67   CYFSNTAALTSLNPSFDKCKF  87


>ref|XP_002326029.1| predicted protein [Populus trichocarpa]
gb|ABK94511.1| unknown [Populus trichocarpa]
gb|EEF00411.1| predicted protein [Populus trichocarpa]
Length=175

Score =  125 bits (314),  Expect = 3e-33
Identities = 57/81 (70%), Positives = 66/81 (81%), Gaps = 0/81 (0%)
Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
KNNA D  LQ +IDWACG G  +CGPIQ+ G CYD  D+QRTAS+AFNDYYL+NG +DD
Sbjct  40   KNNAADQALQESIDWACGPGGANCGPIQQGGPCYDSSDVQRTASWAFNDYYLKNGLTDDA  99

Query  181  CNFSNTAALTSIDPSHDKCKF  243
C FSNTAALTS++PS DKCKF
Sbjct  100  CYFSNTAALTSLNPSFDKCKF  120


>ref|XP_002279275.1| PREDICTED: glucan endo-1,3-beta-glucosidase-like protein 1 [Vitis
vinifera]
emb|CBI21947.3| unnamed protein product [Vitis vinifera]
Length=174

Score =  124 bits (312),  Expect = 7e-33
Identities = 55/81 (68%), Positives = 65/81 (80%), Gaps = 0/81 (0%)
Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
KNNAD+  LQTA+DWACG G  DC PIQ+ G CYD  DLQ+TAS+AFNDYYL++G SDD+
Sbjct  40   KNNADNPALQTALDWACGPGGADCSPIQQGGPCYDSQDLQKTASFAFNDYYLKHGLSDDS  99

Query  181  CNFSNTAALTSIDPSHDKCKF  243
C F NTAALTS++PS   CKF
Sbjct  100  CGFDNTAALTSLNPSFGNCKF  120


>ref|XP_002525141.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
gb|EEF37268.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
Length=175

Score =  121 bits (303),  Expect = 1e-31
Identities = 55/81 (68%), Positives = 64/81 (79%), Gaps = 0/81 (0%)
Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
KNNADD  LQ+AIDWACG G  +C PIQ+ G CYD  D+Q TAS+AFNDYYL+NG +DD
Sbjct  37   KNNADDQSLQSAIDWACGPGGANCSPIQQGGPCYDPNDIQTTASWAFNDYYLKNGLTDDA  96

Query  181  CNFSNTAALTSIDPSHDKCKF  243
C FSNTAA TS++PSH  CKF
Sbjct  97   CFFSNTAAPTSLNPSHGNCKF  117


>ref|XP_002878188.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gb|EFH54447.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length=173

Score =  115 bits (287),  Expect = 3e-29
Identities = 49/81 (60%), Positives = 63/81 (78%), Gaps = 0/81 (0%)
Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
KNNA+D+ LQTAI+WACG G  DCGPIQ+ G C D  D+Q+ AS+ FN+YYL+NG  D+
Sbjct  38   KNNAEDSSLQTAIEWACGQGGADCGPIQQGGPCNDPTDVQKMASFVFNNYYLKNGEEDEA  97

Query  181  CNFSNTAALTSIDPSHDKCKF  243
CNF+N AALTS++PS   CK+
Sbjct  98   CNFNNNAALTSLNPSQGTCKY  118


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date:  Apr 23, 2012  4:44 PM
Number of letters in database: 6,150,218,869
Number of sequences in database:  17,919,084

Lambda      K        H
0.318    0.134    0.401
Gapped
Lambda      K        H
0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 240022639
Number of extensions: 4743736
Number of successful extensions: 10768
Number of sequences better than 1e-10: 12
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 10745
Number of HSP's successfully gapped: 12
Length of query: 499
Length of database: 6150218869
Length adjustment: 127
Effective length of query: 372
Effective length of database: 3874495201
Effective search space: 151105312839
Effective search space used: 151105312839
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362