Figure 2. Comparison of preference profile for Aac-Sse-Acc joint features between T3S and non-T3S sequences.
(A) and (C): Total number of non-zero distributed joint features at each position for T3S or non-T3S sequences. Full set of joint features include 120 different elements. The ratio of data size between T3S and non-T3S proteins is ∼1∶2 in (A) and 1∶1 in (C). (B) and (D): Cumulative frequency of the most enriched 10 (T3S-10 or non-T3S-10) or 20 (T3S-20 or non-T3S-20) joint features in T3S or non-T3S sequences. The ratio of data size between T3S and non-T3S proteins was about 1∶2 in (B) and 1∶1 in (D). Only the first 50 positions at the N-terminal end of T3S and non-T3S sequences were included for analysis.
