Skip to main content
. 2024 Nov 26;121(49):e2414955121. doi: 10.1073/pnas.2414955121

Fig. 4.

Fig. 4.

Course Performance by Course Size. Average course performance of GPT-4 with the majority vote strategy stratified by the course size, measured by the number of enrolled students. GPT-4 successfully answers questions for assessments in some of the largest courses by enrollment, amplifying the potential impact of assessment vulnerability. Error bars represent 95% CIs using the nonparametric bootstrap with 1,000 resamples.