2023 Oct 16;20:28. doi: 10.3352/jeehp.2023.20.28

Table 1.

Comparison of GPT-3.5 and GPT-4 performance across selected domains in biostatistics

Domain	Version of ChatGPT	Correct on the first attempt	Correct on the second attempt	Correct on the third attempt	Incorrect after more than 3 attempts
95% Confidence interval	V 3.5			Y
	V 4	Y
Binomial distribution	V 3.5			Y
	V 4		Y
Categorical data	V 3.5	Y
	V 4	Y
Cross-sectional study	V 3.5	Y
	V 4	Y
Chi-square test	V 3.5				Y
	V 4		Y
Measuring reliability	V 3.5	Y
	V 4	Y
One-way analysis of variance	V 3.5				Y
	V 4			Y
Probability: properties	V 3.5	Y
	V 4	Y
Sample size calculation	V 3.5				Y
	V 4			Y
T-test for paired data	V 3.5	Y
	V 4	Y

Y, yes.