Skip to main content
. 2023 Oct 10;26(11):108163. doi: 10.1016/j.isci.2023.108163

Table 3.

Comprehensiveness assessment for all LLM-Chatbot responses that received a 'good' accuracy rating

LLMa Response comprehensiveness
n Mean (SD) Median
ChatGPT-3.5 22 4.6 (0.3) 4.5
ChatGPT-4.0 33 4.6 (0.4) 4.7
Google Bard 15 4.7 (0.2) 4.7
a

Based on majority consensus across the three graders.