Figure 4.
Spacer Duplication Correlates with Start Codon Enrichment. Spacers were grouped by duplication frequency, and the proportion in each bin containing an in-frame ATG was plotted. For example, the asterisk marks the 19-duplication bin, where 60% of spacers contained in-frame ATGs. Only 30 bp of each spacer could contribute an in-frame ATG due to overlap with the repeat. Assuming random composition, the probability that a 30 bp sequence contains an in-frame ATG with no downstream in-frame stop codon is 12% (dashed line; Supplementary Table S3).
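
The 12% baseline can be reproduced with a short calculation. The sketch below is one plausible reading of "random composition", not necessarily the method behind Supplementary Table S3. It assumes uniform base frequencies (25% each), treats the 30 bp as 10 complete in-frame codons, and counts a stop codon only if it lies within those 30 bp. The function names are hypothetical. Under these assumptions, a sequence qualifies exactly when, reading from the 3' end, an ATG is encountered before any stop codon.

```python
from fractions import Fraction
from itertools import product

# Assumed codon probabilities under uniform base composition (not from the source).
P_ATG = Fraction(1, 64)     # one start codon: ATG
P_STOP = Fraction(3, 64)    # three stop codons: TAA, TAG, TGA
P_OTHER = 1 - P_ATG - P_STOP


def p_open_atg(n_codons: int = 10) -> Fraction:
    """Probability that at least one in-frame ATG has no in-frame stop downstream.

    The 3'-most ATG must be followed only by k non-ATG, non-stop codons
    (k = 0 .. n_codons - 1); codons upstream of it are unconstrained.
    """
    return sum(P_OTHER**k * P_ATG for k in range(n_codons))


def p_open_atg_enumerated(n_codons: int) -> Fraction:
    """Brute-force check over codon classes: M = ATG, * = stop, x = other."""
    weights = {"M": P_ATG, "*": P_STOP, "x": P_OTHER}
    total = Fraction(0)
    for classes in product("M*x", repeat=n_codons):
        seq = "".join(classes)
        last_atg = seq.rfind("M")
        # If any ATG is free of downstream stops, the 3'-most ATG is too.
        if last_atg != -1 and "*" not in seq[last_atg + 1:]:
            prob = Fraction(1)
            for c in classes:
                prob *= weights[c]
            total += prob
    return total


if __name__ == "__main__":
    print(round(float(p_open_atg(10)), 3))  # 0.119, i.e. about 12%
```

The closed form simplifies to (1/4)(1 - (15/16)^10), which is approximately 0.119 and agrees with the 12% dashed line. Skewed base composition or a different framing of the 30 bp window would shift this value.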
