. 2023 Oct 10;26(11):108163. doi: 10.1016/j.isci.2023.108163

Table 3.

Comprehensiveness assessment for all LLM-Chatbot responses that received a 'good' accuracy rating

LLM^a	Response comprehensiveness
LLM^a	n	Mean (SD)	Median
ChatGPT-3.5	22	4.6 (0.3)	4.5
ChatGPT-4.0	33	4.6 (0.4)	4.7
Google Bard	15	4.7 (0.2)	4.7

Based on majority consensus across the three graders.