Table 2.
Author (year) | Clinical objective | How was fairness evaluated? | Was racial bias identified? | How was the AIa model biased? | Was racial bias mitigated? | Protected class |
Abubakar et al (2020) [29] | Identification of images of burns vs healthy skin | Accuracy | Yes | Poor accuracy of models trained on a Caucasian data set and validated on an African data set and vice versa | Yes | Dark-skinned patients, light-skinned patients |
Allen et al (2020) [30] | Intensive care unit (ICU) mortality prediction | Equal opportunity difference (FNRb disparity) | N/Ac | N/A | Yes | Non-White patients |
Briggs and Hollmén (2020) [31] | Prediction of future health care expenditures of individual patients | Balanced accuracy, statistical parity, disparate impact, average odds, equal opportunity | N/A | N/A | Yes | Black patients |
Burlina et al (2021) [32] | Diagnosis of diabetic retinopathy from fundus photography | Accuracy | Yes | Lower diagnostic accuracy in darker-skinned individuals compared to lighter-skinned individuals | Yes | Dark-skinned patients |
Chen et al (2019) [33] | ICU mortality prediction, psychiatric readmission prediction | Error rate (0-1 loss) | Yes | Differences in error rates in ICU mortality between racial groups | No | Non-White patients |
Gianattasio et al (2020) [34] | Dementia status classification | Sensitivity, specificity, accuracy | Yes | Existing algorithms varying in sensitivity and specificity between race/ethnicity groups | Yes | Hispanic, non-Hispanic Black patients |
Noseworthy et al (2020) [35] | Prediction of left ventricular ejection fraction ≤35% from the electrocardiogram (ECG) | AUROCd | No | N/A | No | Non-White patients |
Obermeyer et al (2019) [36] | Prediction of future health care expenditures of individual patients | Calibration | Yes | Black patients with a higher burden than White patients at the same algorithmic risk score | Yes | Black patients |
Park et al (2021) [37] | Prediction of postpartum depression and postpartum mental health service utilization | Disparate impact, equal opportunity difference (TPRe disparity) | Yes | Black women with a worse health status than White women at the same predicted risk level | Yes | Black patients |
Seyyed-Kalantari et al (2021) [38] | Diagnostic label prediction from chest X-rays | Equal opportunity difference (TPR disparity) | Yes | Greater TPR disparity in Hispanic patients | No | Non-White patients |
Thompson et al (2021) [39] | Identification of opioid misuse from clinical notes | Equal opportunity difference (FNR disparity) | Yes | Greater FNR in the Black subgroup than in the White subgroup | Yes | Black patients |
Wissel et al (2019) [40] | Assignment of surgical candidacy score for patients with epilepsy using clinical notes | Regression analysis of the impact of the race variable on the candidacy score | No | N/A | No | Non-White patients |
aAI: artificial intelligence.
bFNR: false-negative rate.
cN/A: not applicable.
dAUROC: area under the receiver operating characteristic curve.
eTPR: true-positive rate.