Figure 2.

Pictographic representation of a hypothetical de novo ORF in Saccharomyces cerevisiae. (a) A combination of conserved synteny and phylostratigraphy is used to identify the homologous region of interest (highlighted in yellow) in the closely related species. This region of interest can be used to identify enabling mutations across the lineage that led to the de novo ORF in the focal species (S. cerevisiae in this case). The enabling mutations can include but are not limited to a gain of the start codon (green star), loss of premature stop codon (gray star), insertion–deletion and/or a frameshift (pink star) and a gain of stop codon (red star). Figure inspired by Vakirlis and McLysaght (2019). (b) A hypothetical example of enabling mutations that occurred along the lineage to result in a de novo ORF in the focal genome. Changes highlighted within boxes are possible enablers. Identification of one or more of such mutations (example gain of the start codon) are needed to provide convincing evidence of de novo ORF emergence. ORF, open reading frame; S. cer, Saccharomyces cerevisiae; S. mik, Saccharomyces mikatae; S. par, Saccharomyces paradoxus.