. 2026 Feb 11;5:e84322. doi: 10.2196/84322

Table 3.

Adjusted Pearson residuals of large language model decisions.

Models and affiliation level			Q1^a (adjusted Pearson residuals)		Q2^b (adjusted Pearson residuals)		Q3^c (adjusted Pearson residuals)		Q4^d (adjusted Pearson residuals)		Q1 and Q2 (adjusted Pearson residuals)		Q3 and Q4 (adjusted Pearson residuals)
Llama 3.3-70B
	None	−1.063		0.8343		0.4576		−1.0017		−0.2288		−0.5440
	High tier	11.2757		−0.8985		−0.5720		0.5008		0.3773		−0.0712
	Low tier	−0.2126		0.0642		0.1144		0.5008		−0.1484		0.6152
Mistral-7B
	None	−0.0649		−1.0999		1.4668		0		−1.1648		1.4668
	High tier	0.1298		0.8105		−1.1734		0		0.9403		−1.1734
	Low tier	−0.0649		0.2895		−0.2934		0		0.2246		−0.2934
Gemma 2-9B
	None	0		−0.2619		−1.4974		2.8717 ^e		−0.2619		1.3743
	High tier	0		0.5237		0.5240		−1.6231		0.5237		−1.0991
	Low tier	0		−0.2619		0.9732		−1.2486		−0.2619		−0.2753
DeepSeek r1-distill Qwen-14B
	None	0.5377		−0.9462		0.9841		−0.3647		−0.4086		0.6195
	High tier	−1.0753		0.8280		−0.7526		0.3647		−0.2474		−0.3879
	Low tier	0.5377		0.1183		−0.2316		0		0.6559		−0.2316
Qwen 2.5-7B
	None	2.1374		−1.7008		1.2394		−1.0017		0.4366		0.2377
	High tier	−0.7125		1.7008		−1.7352		2.0033		0.9884		0.2682
	Low tier	−1.4249		0		0.4958		−1.0017		−1.4249		−0.5059

^aQ1: quartile 1.

^bQ2: quartile 2.

^cQ3: quartile 3.

^dQ4: quartile 4.

^eItalicization indicates statistically significant residuals.