Table 1.
Motif No. | Consensus sequences |
1 | KCGDQAQLSCCNKATYAQDVTDIDEFILAGTLKNLIGGGSG{T, S}EGLGLF{D, N}Q |
2 | {D, G}L{V, G}{G, N}Q{K, S}C{K, S}{Q, A}{Q, N}{I, T}{V, A}CCQN{S, N}{P{F, S}{D, N}{G, A} |
3 | {S, Q}{Q, C}{C, S}{N, Q}{T, G}{G, Q}{T, S}{L, V, A}{Q, K}CCNS |
4 | VQS{A, S}S{S, D, N}PX{V, A}{A, G}{G, L}LLGLLG{I, V}V{L, V}G |
5 | L{V, I}{G, N}LTC{S, T}PI{S, T}V |
6 | SX{T, V}A{A, L}VLALAA{A, L}{A, L}{A, V}{A, V}AXPXPX |
6 motifs are selected to search the NCBI nr database. All of these 6 motifs exist in at least 1 of the returned sequences. This raised the possibility of retrieving the positive signal as well as the noise. Pattern and domain analysis were conducted to compensate this side effect, which will be described later.