. 2024 Mar 18;16(3):e56402. doi: 10.7759/cureus.56402

Table 2. The accuracy of GPT-3.5, GPT-4, and GPT-4V for the 33rd-35th Japanese Board of Orthopaedic Surgery Examination (JBOSE).

The accuracy was defined as the percentage of correct answers. All questions include both text-based and image-based questions.

	JBOSE	33rd	34th	35th
	JBOSE	Number of correct answers/total questions, accuracy (%)
GPT-4	All questions	59/98, (60)	54/99, (55)	59/97, (61)
	Text-based questions	49/77, (64)	46/73, (63)	47/64, (73)
	Image-based questions without image	10/21, (48)	8/26, (31)	12/33, (36)
GPT-4V	Image-based questions with image	8/21, (38)	9/26, (35)	13/33, (39)
GPT-3.5	All questions	27/98, (28)	32/99, (32)	29/97, (30)
	Text-based questions	20/77, (26)	25/73, (34)	23/64, (36)
	Image-based questions without image	7/21, (33)	7/26, (27)	6/33, (18)