Skip to main content
. 2021 Feb 17;7:539573. doi: 10.3389/fvets.2020.539573

Table 4.

Formulae to estimate a sample size for each of the common observational epidemiological study designs.

Study design Formula Arguments
Cross-sectional nr+1r(λ-1)2π2 [z1-(α/2) (r+1)pc(1- pc)+ z1-β λπ(1- λπ)+rπ(1- π)]2 n = the number of subjects in the sample.
r = the anticipated number of subjects in the exposed group divided by the anticipated number of subjects in the unexposed group.
λ = the expected prevalence ratio.
π = the expected prevalence of the outcome among the non-exposed.
z1−(α/2) = value from the standard normal curve corresponding to the desired level of confidence. Use
z1−(α/2) = 1.96 for 95% (two-sided) confidence.
pc = the common prevalence over exposed and unexposed groups.z1−β = value from the standard normal curve corresponding to the desired study power.
Use z1−β = −0.84 for 80% power.
Case-control pc*= p0r+1 (rλ1+(λ-1)p0+1) n (r+1)(1+(λ-1)p0)2rp02(p0-1)2(λ-1)2 [z1-(α/2) (r+1)pc*(1- pc*)+ z1-β λp0(1-p0)[1+(λ-1)p0]2+rp0(1-p0)]2 n = the number of subjects in the sample.p0 = the expected prevalence of exposure among the controls.
r = anticipated number of subjects in the control group divided by the anticipated number of subjects in the case group.
λ = the expected odds ratio.
z1−(α/2) = value from the standard normal curve corresponding to the desired level of confidence. Use
z1−(α/2) = 1.96 for 95% (two-sided) confidence. z1−β = value from the standard normal curve corresponding to the desired study power.
Use z1−β = 0.84 for 80% power.
Cohort, count data nr+1r(λ-1)2π2 [z1-(α/2) (r+1)pc(1- pc)+ z1-β λπ(1- λπ)+rπ(1- π)]2 n = the number of subjects in the sample.
r = the anticipated number of subjects in the exposed group divided by the anticipated number of subjects in the unexposed group.
λ = the expected incidence risk ratio.
π = the expected prevalence of the outcome among the non-exposed.
z1−(α/2) = value from the standard normal curve corresponding to the desired level of confidence. Use
z1−(α/2) = 1.96 for 95% (two-sided) confidence.
pc = the common prevalence over exposed and unexposed groups.z1−β = value from the standard normal curve corresponding to the desired study power.
Use z1−β = 0.84 for 80% power.
Cohort, time at risk λ0= λ03FTλ0 FT-1+ exp(-λ0FT) λ1= λ103FTλ1 FT-1+ exp(-λ1FT) λ¯= λ¯3 FTλ¯ FT-1+ exp(-λ¯FT) nA = rnB nA(z1-(α/2) (1+r)λ¯+ z1- β (r × λ1+ λ0)) 2r(λ1- λ0)2 nA = the number of subjects in the sample.
λ0 = the expected incidence rate among the unexposed.λ1 = the expected incidence rate among the exposed.λ¯=(λ0+ λ1) / 2
FT = the expected follow-up period for the study.
r = anticipated number of subjects in the exposed group divided by the anticipated number of subjects in the unexposed group.
z1−(α/2) = value from the standard normal curve corresponding to the desired level of confidence. Use
z1−(α/2) = 1.96 for 95% (two-sided) confidence.
z1−β = value from the standard normal curve corresponding to the desired study power.
Use z1−β = 0.84 for 80% power.