Table 5.
Demonstration of ChatGPT-3.5’s ability to self-correct when prompted
Topic | Question | Summed score: initial | Summed score: self-corrected | Consensus-based rating: initial | Consensus-based rating: self-corrected |
---|---|---|---|---|---|
Metamorphopsia | | 4 | 6 | Poor | Borderline |
Red Eye | | 6 | 8 | Poor^a | Good |
^a Where consensus on the final accuracy rating was not reached (i.e., each grader provided a different rating), the lowest score (‘poor’) was assigned.