Error in sample size formula

Desmond Dedalus Campbell

doi:10.4103/0974-1208.138878

letter

. 2014 Apr-Jun;7(2):155–156. doi: 10.4103/0974-1208.138878

Error in sample size formula

Desmond Dedalus Campbell ^1,^2,^✉

PMCID: PMC4150146 PMID: 25191033

Sir,

Re: Suresh K, Chandrashekara S. Sample size estimation and power analysis for clinical research studies. J Hum Reprod Sci 2012;5:7-13.

Although informative and useful Suresh and Chandrashekara's article on sample size estimation and power analysis contains a serious error (Suresh and Chandrashekara, 2012). In the section titled “sample size estimation with two means” they state the minimum required sample size for detecting a mean difference between two groups is:

graphic file with name JHRS-7-155-g001.jpg

Where

α is the false positive rate

β is the false negative rate

N is the sample size required to detect an inter-group mean difference of d with specified α and power of 1−β

σ² is the variance in each group (both groups having the same variance)

r is the ratio of size (n₁ and n₂) of the two groups, that is, r = n₁/n₂

Z is the standard normal distribution deviate, note this is the absolute of the z-score, as in (Suresh and Chandrashekara, 2012) Tables 2 and 3.

The formula as stated cannot be correct as relabeling of the two groups results in different values of N.

Example:

If n₁ = 100 and n₂ = 200 then r = 1/2 and Inline graphic

Swapping the two groups around:

Then n₁ = 200 and n₂ = 100 then r = 2 and Inline graphic

The N calculated for the first case is twice that of the second; they should be identical.

In fact, the formula given is the formula for n₂, which I prove thus.

In the ‘Sample Size estimation with two means’ case, the z-score of the test statistic d is related to the required false positive rate and power by[1]

graphic file with name JHRS-7-155-g004.jpg

Where the standard error of d is

graphic file with name JHRS-7-155-g005.jpg

Substituting the expression for stdErr (d) into the first equation and rearranging gives:

graphic file with name JHRS-7-155-g006.jpg

The formula for N is then

N=n₁ + n₂ = rn₂ + n₂ =(r+1)n₂

graphic file with name JHRS-7-155-g007.jpg

With the new formula group labels can be swapped without changing the value calculate for N.

The original erroneous formula could result in studies seriously underestimating their required sample size. For instance, the required sample size (as calculated by the current formula) is half that truly required, given equal numbers in the two groups. I therefore draw this error to your attention. The illustrative examples that follow the formula presentation are also in error.

REFERENCE

1.Wittes J. Sample size calculations for randomized controlled trials. Epidemiol Rev. 2002;24:39–53. doi: 10.1093/epirev/24.1.39. [DOI] [PubMed] [Google Scholar]

[ref1] 1.Wittes J. Sample size calculations for randomized controlled trials. Epidemiol Rev. 2002;24:39–53. doi: 10.1093/epirev/24.1.39. [DOI] [PubMed] [Google Scholar]

PERMALINK

Error in sample size formula

Desmond Dedalus Campbell

REFERENCE

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Error in sample size formula

Desmond Dedalus Campbell

REFERENCE

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases