Table 2. Average Shannon and Approximate entropy (ApEn) of real and random coding sequences (CDS), and percentage of CDS where the entropy of the real sequence is less than expected by chance across the six species.
Yeast | C. elegans | A. thaliana | Chicken | Chimp | Human | |
Number of CDS | 6,413 | 27,974 | 32,936 | 18,536 | 30,973 | 56,323 |
Entropy of real CDS | ||||||
Shannon | 1.954 | 1.970 | 1.976 | 1.971 | 1.968 | 1.965 |
ApEn | 1.300 | 1.294 | 1.303 | 1.290 | 1.287 | 1.276 |
Entropy of random CDSA | ||||||
Shannon | 1.979 | 1.983 | 1.986 | 1.986 | 1.986 | 1.984 |
ApEn | 1.331 | 1.336 | 1.337 | 1.339 | 1.340 | 1.330 |
% Observed<Random | ||||||
Shannon | 93.44 | 78.22 | 84.64 | 79.97 | 82.10 | 82.28 |
ApEn | 97.47 | 99.12 | 98.61 | 99.28 | 99.37 | 98.75 |
% with Significant (P<0.01) Differential Entropy | ||||||
Shannon | 82.55 | 65.28 | 62.01 | 60.86 | 65.18 | 63.43 |
ApEn | 61.02 | 77.66 | 68.54 | 83.67 | 85.03 | 79.82 |
For every CDS, we generated 20 random sequences that encode the identical amino acid sequence to compute average entropies.