. 2024 Apr 29;3:e46875. doi: 10.2196/46875

Table 5.

Ordering of symptom checkers and physicians (denoted as MD₁, MD₂, and MD₃) from best-performing to worst-performing symptom checkers and physicians.

Metrics	Descending order (best to worst)	Symptom checkers			Doctors
		Values, range (%)	Values, SD (%)	Values, range (%)		Values, SD (%)
M1%	MD₃, Avey, MD₂, Ada, MD₁, K Health, Buoy, WebMD, and Babylon	65.3	21	22.8		9
M3%	MD₃, Avey, Ada, MD₂, MD₁, WebMD, Buoy, K Health, and Babylon	84.8	27	26.2		11
M5%	Avey, MD₃, Ada, MD₂, MD₁, WebMD, K Health, Buoy, and Babylon	87.2	27	25.8		11
Average recall	Avey, Ada, MD₃, WebMD, MD₁ and MD₂ (a tie), K Health, Buoy, and Babylon	70.9	22	16.1		8
Average precision	MD₃, MD₂, MD₁, Ada, Avey, K Health, Buoy, WebMD, and Babylon	40.6	13	19.5		8
Average F₁-measure	MD₃, Avey, MD₂, Ada, MD₁, K Health, Buoy and WebMD (a tie), and Babylon	32.9	16	15.3		6
Average NDCG^a	Avey, MD₃, Ada, MD₂, MD₁, WebMD, K Health, Buoy, and Babylon	74.2	23	21.3		9

^aNDCG: Normalized Discounted Cumulative Gain.