Skip to main content
. 2019 Jun 1;30(12):1359–1368. doi: 10.1091/mbc.E15-02-0076

TABLE 1:

Matching types of data appropriately with commonly used statistical tests (Crawley, 2013; Whitlock and Schluter, 2014).

Response type Treatment type Statistical tests Typical null ­hypothesis Assumptions Functions in Excel and R Examples
Continuous numerical (e.g., reaction rate) Binary (e.g., wild type vs. mutant) t test Means equal for two treatments Randomized treatment; normally distributed response; equal variance of response for two treatment groups Excel: TTEST(controlrate, ­genotype, tails = 2, type = 2) R: t.test(rate∼genotype, var.equal = TRUE) Zhuravlev et al., 2017; Figure 1C
Continuous numerical (e.g., reaction rate) Categorial (e.g. wild type vs mutant 1 vs mutant 2) ANOVA followed by Tukey-Kramer post-hoc test Means equal across treatments Randomized treatment; normally distributed response; equal variance of response between treatment groups Excel: Analysis ToolPak add-in for single factor ANOVA required if treatment has more than two levels R: aov(rate∼genotype) TukeyHSD() Plooster et al., 2017; Figure 1D
Continuous numerical (e.g., reaction rate) Continuous numerical (e.g., drug concentrations) Linear regression followed by t test on regression coefficients or ­correlation test Coefficients equal to zero; correlation ­coefficient equal to zero Randomized treatment; linear relationship between treatment and response; treatment and response bivariate normally distributed; normally distributed residuals Excel: LINEST(rate, drug, const = TRUE, stats = TRUE) R: lm(rate∼drug) summary() Spencer et al., 2017; Figure 4
Continuous numerical (e.g., reaction rate) More than one treatment: Continuous numerical (e.g., drug concentrations) plus categorical (e.g., wild type vs. mutant 1 vs. mutant 2) Analysis of ­covariance (­ANCOVA) Coefficients equal to zero Randomized treatment; numerical treatment and response bivariate normally distributed; equal variance of response between treatment groups; linear relationship between numerical treatment and response; normally distributed residuals Excel: LINEST(rate, treatments, const = TRUE, stats = TRUE) R: lm(rate∼drug + genotype) summary() Nowotarski et al., 2014; Figures 3 and 4
Categorical (e.g., cell cycle stage) Categorical (e.g., wild type vs. mutant 1 genotype vs. mutant 2 genotype) Chi-square or G contingency test; binomial test; Fisher’s exact test Proportions ­between ­response ­categories are equal between treatments Randomized treatment; all expected counts are one or more and no more than 20% of expected counts less than five; binomial: response or treatment sample sizes fixed; Fisher’s: response and treatment sample sizes fixed Excel: CHISQ.TEST(actual_range,expected_range) R: chisq.test(stage, genotype) Bartolini et al., 2016; Figures 2F and 5B
Binary categorical (e.g., alive or dead) Continuous numerical (e.g., drug concentrations) Logistic regression; generalized linear model Slope or ­intercept equal to zero Randomized treatment; linear relationship between treatment and log odds of one response Excel: Not available R: glm(alive∼drug, family = binomial) summary() Atay and Skotheim, 2014; Figure 2
Binary categorical (e.g., alive or dead) More than one treatment: Categorical (e.g., wild type vs. mutant 1 vs. mutant 2) plus categorical (e.g., day 1 vs. day 2 vs. day 3) Logistic regression; generalized linear model Slope or ­intercept equal to zero Randomized treatment Excel: Not available R: glm(alive∼genotype + day, family = binomial) summary() Kumfer et al., 2010; Table 1