Skip to main content
[Preprint]. 2023 Jul 24:2023.07.16.23292743. [Version 2] doi: 10.1101/2023.07.16.23292743

Figure 2: Internal Concordance by Accuracy Subgroup among SCORE Questions.

Figure 2:

Legend: SCORE questions were presented to ChatGPT via two formats: open-ended and multiple-choice. ChatGPT’s outputs to open-ended SCORE questions were assessed for internal concordance by accuracy subgroup. A total of 167 SCORE questions were presented to the ChatGPT interface. Concordance was nearly 100% (79/80) for accurate responses. Internally discordant responses were more frequently observed for inaccurate responses (33%, 31/75).