Fig. 1.
Gene content feature performance. The performance of the 4 gene content features in the 10 KB training dataset. a Gene density represented by number of genes per 1 KB. b Median operon length is a representative measure of strand switching frequency. An operon is defined as as a set of closely linked genes on the same strand. c Percentage of overlapping peptides measured as a percentage of all predicted genes. Viruses that have a lysogeny phase are known overlap genes for different life cycles. d Median amino acid length as viral peptides are commonly shorter than bacterial peptides
