2024 Mar 12;38(5):2522–2532. doi: 10.1007/s00464-024-10720-2

Table 2.

Comparison of reading grade levels between institutional, initial LLM and simplified LLM responses to bariatric surgery frequently asked questions

| Source | Grade range | Gunning Fog Score | Flesch–Kincaid Grade Level | Coleman–Liau Index | SMOG Index | Automated Readability Index | Linsear–Write Formula |
|---|---|---|---|---|---|---|---|
| Institutions | 10th grade to college sophomore | 14.2 (4.4) | 11.0 (3.8) | 11.1 (3.3) | 10.4 (3.0) | 10.6 (4.7) | 12.0 (5.5) |
| GPT-3.5 Initial | College freshman to college graduate | 18.1 (2.7)* | 13.6 (2.3)* | 14.2 (1.8)* | 13.0 (1.8)* | 13.8 (2.7)* | 14.7 (3.4)* |
| GPT-3.5 Simplified | 10th grade to college freshman | 13.4 (2.6) | 9.6 (2.0)* | 11.6 (1.6) | 9.9 (1.7) | 9.7 (2.3) | 10.1 (2.8)* |
| GPT-4 Initial | 12th grade to college senior | 15.6 (2.6)* | 11.8 (2.0) | 12.4 (1.6)* | 11.5 (1.7)* | 11.7 (2.4) | 12.8 (3.3) |
| GPT-4 Simplified | 6th grade to 9th grade | 9.4 (1.9)* | 6.2 (1.5)* | 8.0 (1.4)* | 7.0 (1.2)* | 5.8 (1.9)* | 7.1 (2.1)* |
| Bard Initial | 9th grade to college freshman | 13.3 (2.7) | 9.8 (2.6)* | 9.5 (1.6)* | 9.9 (2.0) | 9.2 (2.9)* | 11.4 (4.1) |
| Bard Simplified | 8th grade to 12th grade | 12.1 (2.6)* | 8.5 (2.4)* | 8.8 (1.4)* | 9.0 (2.0)* | 7.8 (2.6)* | 9.9 (3.5)* |

All values are presented as mean (standard deviation)

LLM: large language model

*p < 0.05 when compared to institutional scores
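
As a rough illustration of how a cell of the form mean (standard deviation) can be produced from per-response readability scores, the Python sketch below applies the standard Flesch–Kincaid Grade Level formula to word, sentence, and syllable counts and then summarizes the results. It is not the procedure used to build this table: the function names, the input counts, and the use of the sample standard deviation are all assumptions made for the example.

```python
from statistics import mean, stdev


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Flesch-Kincaid Grade Level computed from raw text counts."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def mean_sd_cell(scores: list[float]) -> str:
    """Format scores as 'mean (SD)' with one decimal place.

    Assumption: sample standard deviation (n - 1 denominator).
    """
    return f"{mean(scores):.1f} ({stdev(scores):.1f})"


# Hypothetical (words, sentences, syllables) counts for three responses.
# These are made-up illustration values, not data from the study.
hypothetical_counts = [(120, 8, 180), (95, 5, 160), (150, 10, 210)]
grades = [flesch_kincaid_grade(w, s, y) for w, s, y in hypothetical_counts]
print(mean_sd_cell(grades))
```

Longer sentences and more syllables per word both raise the grade level, which matches the direction of the differences between the initial and simplified rows in the table.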