2024 Apr 16;19(4):e0301854. doi: 10.1371/journal.pone.0301854

Fig 5. Individual model scores compared to average scores for the history and physical-only dataset.

There was a poor correlation between the individual model scores and the average ChatGPT-4 score, consistent with wide variation between the models.