Skip to main content
. 2014 May 14;9(5):e96910. doi: 10.1371/journal.pone.0096910

Figure 1. E. coli O157 amino acid dictionaries.

Figure 1

Over- and underrepresentation of repetitive amino acid words is plotted for E. coli O157 as the residual difference between Observed and Expected counts of each word (from 2 to 12 mers). (a) Word counts of the non-redundant (cdhit 95%), protein-coding genes of the native E. coli O157 genome (n = 555753 repeated amino acid words); (b) Word counts after randomizing the amino acid sequence of the non-redundant, protein-coding genes of E. coli O157 (n = 433566 repeated amino acid words).