Skip to main content
. 2024 May 1;7:106. doi: 10.1038/s41746-024-01079-8

Fig. 3. The performance of ChatGPT is measured by the confusion matrixes for the key attributes of interest on Test Data.

Fig. 3

For meaningful evaluation, the cases with uncertain values, such as “Not Available”, “Not Specified”, “Cannot be determined”, “Unknown”, et al. in reference and prediction have been removed. a Primary tumor features (pT), b regional lymph node involvement (pN), c overall tumor stage, and d histological diagnosis.