The log-log plot of outer capsid volume as a function of genome length for viruses infecting different hosts
The size of each data point indicates the number of genes, and the shape indicates the genetic material (filled circle: dsDNA viruses; empty circle: RNA viruses; filled triangle: ssDNA viruses). A power law is fitted to the entire dataset and appears as a straight line on the log-log plot. A linear regression fit of the form to the data, where , , and , gives and (p value < 2.2 × 10⁻¹⁶ and R² = 0.67). All logarithms are to base 10. Formulas for calculating capsid volume are described in STAR Methods, and the data are available in Table S1. See also Figures S1 and S2 and Table S2.
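The fitting procedure described in the caption (a power law estimated by linear regression on base-10 logarithms) can be sketched as follows. This is a minimal illustration and not the authors' analysis code: the function name `fit_power_law`, the synthetic data, and the exponent and prefactor used to generate it are all assumptions chosen for demonstration, and they do not reproduce the fitted values reported for this figure.

```python
import numpy as np


def fit_power_law(genome_length, capsid_volume):
    """Fit V = A * L**b by least squares on log10-transformed data.

    Returns (slope, intercept, r_squared), where slope estimates b and
    intercept estimates log10(A). Hypothetical helper for illustration.
    """
    x = np.log10(np.asarray(genome_length, dtype=float))
    y = np.log10(np.asarray(capsid_volume, dtype=float))

    # A power law is a straight line in log-log space: y = slope * x + intercept.
    slope, intercept = np.polyfit(x, y, 1)

    # Coefficient of determination for the straight-line fit.
    residuals = y - (slope * x + intercept)
    r_squared = 1.0 - np.sum(residuals**2) / np.sum((y - y.mean()) ** 2)
    return slope, intercept, r_squared


# Synthetic data only: the exponent 1.5 and intercept -2.0 are arbitrary
# assumptions, not values taken from the paper or from Table S1.
rng = np.random.default_rng(0)
length = 10 ** rng.uniform(3, 6, size=200)
noise = rng.normal(0.0, 0.3, size=200)
volume = 10 ** (1.5 * np.log10(length) - 2.0 + noise)

slope, intercept, r2 = fit_power_law(length, volume)
print(f"slope = {slope:.2f}, intercept = {intercept:.2f}, R^2 = {r2:.2f}")
```

Fitting in log space treats the scatter as multiplicative, which matches the straight-line appearance of a power law on a log-log plot. To apply the sketch to real measurements, the synthetic arrays would be replaced with the genome lengths and capsid volumes from Table S1.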