Skip to main content
. 2024 May 30;15:8. doi: 10.1186/s13326-024-00306-1

Table 4.

Comparison of version 3.5 and 4 of ChatGPT language models with varying runs and threshold settings, illustrating the impact on low and high accuracy metrics

LM Run Threshold Low acc. High acc.
v-3.5 3 20% 0.64 0.73
3 50% 0.79 0.82
5 20% 0.64 0.79
5 50% 0.78 0.84
v-4 3 20% 0.63 0.72
3 50% 0.8 0.82
5 20% 0.63 0.73
5 50% 0.8 0.81