Skip to main content
. 2023 Sep 30;13(10):1457. doi: 10.3390/jpm13101457

Table 1.

Validity of provided references from ChatGPT, Bing Chat and Bard AI.

ChatGPT-3.5
(n = 199)
Bing Chat
(n = 158)
Bard
(n = 112)
p-Value
Accurate 76 (38.2%) * 47 (29.8%) ** 3 (2.7%) *,** <0.001
Inaccurate 82 (41.2%) * 77 (48.7%) ** 26 (23.2%) *,** <0.001
Fabricated 32 (16.1%) * 21 (13.3%) ** 71 (63.4%) *,** <0.001
Incomplete 9 (4.5%) 13 (8.2%) 12 (10.7%) 0.11

* Significant difference between ChatGPT-3.5 and Bard p < 0.05. ** Significant difference between Bing Chat and Bard p < 0.05.