Skip to main content
. 2023 Oct 16;20:28. doi: 10.3352/jeehp.2023.20.28

Table 1.

Comparative analysis between GPT-3.5 and GPT-4 for certain domains in biostatistics

Domain Version of ChatGPT Correct in the first attempt Correct in the second attempt Correct in the third attempt Incorrect after more than 3 attempts
95% Confidence interval V 3.5 Y
V 4 Y
Binomial distribution V 3.5 Y
V 4 Y
Categorical data V 3.5 Y
V 4 Y
Cross-sectional study V 3.5 Y
V 4 Y
The chi-square test V 3.5 Y
V 4 Y
Measuring reliability V 3.5 Y
V 4 Y
One-way analysis of variance V 3.5 Y
V 4 Y
Probability: properties V 3.5 Y
V 4 Y
Sample size calculation V 3.5 Y
V 4 Y
T-test for paired data V 3.5 Y
V 4 Y

Y, yes.