Table 2. Original features and results of selected features.
Category | Abbreviation | Description | Selected in GN | Selected in GP | Selected in Full | Tool |
---|---|---|---|---|---|---|
Intrinsic feature | Gene size | Length of genes | * | * | ||
strand | Negative or positive strand | * | ||||
protein size | Length of amino acids | * | ||||
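Codon bias | T3s | Silent base composition: frequency of T at synonymous third codon positions | * | * | * | CodonW [20] |
C3s | Silent base composition: frequency of C at synonymous third codon positions | * | * | |||
A3s | Silent base composition: frequency of A at synonymous third codon positions | * | * | |||
G3s | Silent base composition: frequency of G at synonymous third codon positions | * | * | * | ||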
CAI | Codon Adaptation Index | * | * | * | ||
CBI | Codon Bias Index | * | ||||
Fop | Frequency of Optimal codons | |||||
Nc | The effective number of codons | * | * | |||
GC3s | G+C content at the 3rd position of synonymous codons | |||||
GC | G+C content of the gene | * | * | |||
L_sym | Number of synonymous codons | * | ||||
Gravy | Hydropathicity of protein | * | * | |||
Aromo | The frequency of aromatic amino acids | |||||
Amino acid usage | Amino acid | A, R, D, C, Q, H, I, N, L, K, M, F, P, S, T, W, Y, V | * | |||
Amino acid | R, D, C, E, H, L, G, N, K, F, P, S, T, M, V | * | ||||
Amino acid | A, R, C, Q, D, H, I, G, N, L, K, M, F, P, S, T, W, V, Y | * | ||||
Rare_aa_ratio | The frequencies of rare amino acids | * | ||||
Close_aa_ratio | The number of codons that are one third-base mutation removed from a stop codon | |||||
Physicochemical properties | M_weight | Molecular weight | Pepstats [21] | |||
I_Point | Isoelectric Point | * | * | |||
Tiny | Number of moles of the amino acids (A+C+G+S+T) | * | * | * | ||
Small | Number of moles of the amino acids (A+B+C+D+G+N+P+S+T+V) | |||||
Aliphatic | Number of moles of the amino acids (A+I+L+V) | * | * | |||
Aromatic | Number of moles of the amino acids (F+H+W+Y) | * | * | * | ||
Non-polar | Number of moles of the amino acids (A+C+F+G+I+L+M+P+V+W+Y) | * | * | |||
Polar | Number of moles of the amino acids (D+E+H+K+N+Q+R+S+T+Z) | * | * | |||
Charged | Number of moles of the amino acids (B+D+E+H+K+R+Z) | * | * | |||
Basic | Number of moles of the amino acids (H+K+R) | * | * | |||
Acidic | Number of moles of the amino acids (B+D+E+Z) | * | * | |||
Transmembrane helices | ExpAA | The number of transmembrane amino acids | * | TMHMM3 [22] | ||
First60 | The number of transmembrane amino acids in the first 60 residues | * | * | * | ||
PredHel | The final prediction of transmembrane helices | |||||
Subcellular localization | Cytom | Cytoplasmic Membrane Score | PSORTb v3.0 [23] | |||
Extra | Extracellular Score | * | * | * | ||
OuterM | Outer Membrane Score | |||||
Peri | Periplasmic Score | * | ||||
Cyto | Cytoplasmic Score | * | * | * | ||
Cellw | Cell wall Score | * | ||||
Loc_s | Final Score | * | * | * | ||
Hurst exponent | Hurst | The Hurst exponent | * | * | R package [24] | |
Total features (dimension) | 37 | 38 | 40 |
An asterisk (*) indicates a selected feature. A feature selected in two or three of the sets (GN, GP, Full) should be considered significantly associated with essentiality.
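
The consensus rule in the table note can be sketched in a few lines of Python. This is only an illustration, not the authors' implementation: the function name `consensus_features`, the `min_sets` parameter, and the placeholder entries `feature_x` and `feature_y` are assumptions introduced here. Only T3s, which carries three asterisks in the table, is taken from the source.

```python
from collections import Counter


def consensus_features(selections, min_sets=2):
    """Return the features selected in at least `min_sets` of the feature sets.

    `selections` maps a set name (e.g. "GN", "GP", "Full") to the collection
    of feature names selected from that set.
    """
    # Count each feature once per set, even if it is listed twice in one set.
    counts = Counter(f for chosen in selections.values() for f in set(chosen))
    return sorted(f for f, n in counts.items() if n >= min_sets)


# Toy input: T3s is selected in all three sets (as in the table);
# feature_x and feature_y are hypothetical placeholders.
selections = {
    "GN": {"T3s", "feature_x"},
    "GP": {"T3s", "feature_x", "feature_y"},
    "Full": {"T3s"},
}

print(consensus_features(selections))  # ['T3s', 'feature_x']
```

Raising `min_sets` to 3 would keep only the features selected in every set, a stricter reading than the "two or three" rule stated in the note.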