Table 1. Lengths of homo-repeats whose frequencies in real proteomes have a 10-fold difference from theoretical estimates.
Taxon | C | W | M | H | Y | N | K | F | D | P | Q | I | T | E | S | R | V | G | A | L | N* | N** |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Metazoa | 4 | 5 | 5 | 4 | 5 | 5 | 5 | 5 | 5 | 5 | 5 | 6 | 5 | 5 | 5 | 5 | 6 | 5 | 5 | 6 | 424 | 17 |
Viridiplantae | 4 | 4 | 5 | 4 | 5 | 5 | 5 | 5 | 5 | 4 | 4 | 6 | 5 | 5 | 5 | 5 | 6 | 5 | 5 | 6 | 108 | 5 |
Stramenopiles | 4 | 4 | 5 | 4 | – | 4 | 4 | 5 | 5 | 4 | 4 | – | 5 | 5 | 5 | 5 | 7 | 5 | 6 | 7 | 12 | 1 |
Choanoflagellida | 4 | 4 | 4 | 5 | 5 | 5 | 4 | 5 | 5 | 5 | 5 | 6 | 5 | 5 | 6 | 6 | 7 | 6 | 6 | 7 | 9 | 1 |
Euglenozoa | 4 | 4 | 5 | 4 | 5 | 4 | 4 | 4 | 5 | 5 | 4 | 5 | 5 | 5 | 5 | 6 | 7 | 5 | 5 | 6 | 44 | 4 |
Alveolata | 5 | 4 | 5 | 5 | 6 | 6 | 6 | 5 | 5 | 4 | 5 | 8 | 5 | 5 | 5 | 4 | – | 4 | 5 | 7 | 50 | 6 |
Amoebozoa | 4 | 4 | 4 | 4 | 5 | 5 | 5 | 5 | 5 | 4 | 4 | 8 | 5 | 5 | 5 | 5 | 5 | 4 | 4 | 7 | 25 | 2 |
Diplomonadida | – | – | – | – | – | 6 | – | – | 6 | 5 | 5 | – | 6 | 6 | 7 | 6 | 5 | 6 | 7 | – | 17 | 3 |
Fungi | 4 | 5 | 5 | 4 | 5 | 5 | 5 | 5 | 5 | 5 | 4 | 7 | 5 | 5 | 6 | 5 | 6 | 5 | 6 | 8 | 551 | 58 |
Bacteria | 5 | – | – | 5 | – | 6 | 5 | 6 | 6 | 5 | 5 | – | 6 | 7 | 6 | 6 | 8 | 6 | 7 | 9 | 210 | 25 |
N* is the number of proteins (×10⁴); N** is the number of proteomes.