Skip to main content
Alzheimer's & Dementia logoLink to Alzheimer's & Dementia
. 2026 Jan 8;21(Suppl 2):e105856. doi: 10.1002/alz70856_105856

Addressing AI Bias in MRI‐Based Dementia Classification Among Hispanic Individuals

Ngoc‐Huynh Ho 1,, Sokratis Charisis 1, Sachintha Ransara Brandigampala 2, Di Wang 3, Susan R Heckbert 4, David M Martinez 1, Timothy M Hughes 5, Nicolas Honnorat 1, Derek B Archer 6,7,8, Timothy J Hohman 6,9,10,11, Sudha Seshadri 12, Christos Davatzikos 13, Mohamad Habes 14
PMCID: PMC12782832

Abstract

Background

MRI based dementia classification using diagnostic tools trained predominantly on non‐Hispanic White (NHW) participants often fail to generalize to other groups such as Hispanic, leading to biased outcomes. This study investigates disparities in dementia classification between NHW and Hispanic populations and evaluates bias mitigation techniques to improve diagnostic fairness.

Method

MRI‐based features without harmonization were extracted using MUSE for 2,541 NHW and 249 Hispanic participants from the National Alzheimer's Coordinating Center for classifying normally cognitive and demented patients, with dementia prevalences of 35.97% and 45.78%, respectively. XGBoost classifiers were trained and assessed using balanced accuracy (BA) and fairness metrics, including false positive rate (FPR) and false negative rate (FNR) disparities. Bias mitigation techniques such as correlation removal (CR) and Kernel Mean Matching (KMM) were implemented. SHAP maps illustrated population‐specific brain regions contributing to dementia predictions.

Result

KMM demonstrated the lowest disparity in BA between NHW and Hispanic populations (Figure 1), with differences of 1.5% and 6% when trained on Hispanic and NHW datasets, respectively. CR slightly enhanced overall model accuracy when trained on the combined dataset, achieving a 0.8% improvement over the baseline, and reduced disparities to 4%. KMM minimized FPR differences, achieving 0.8% when trained on NHW and 8.3% when trained on Hispanic data, and recorded the lowest FNR disparity of 11.2% for NHW training (Figure 2). CR achieved the smallest FPR differences when trained on all groups and the lowest FNR disparity when trained on Hispanic participants. SHAP visualizations (Figure 3) revealed that the model identified population‐specific structural brain changes related to dementia, such as hippocampal and temporal lobe importance in NHW and occipital lobe contributions in Hispanic participants. These findings emphasize the risk of biased predictions when models are trained on a single population and tested on another.

Conclusion

This study highlights the need for inclusive training datasets and robust bias mitigation strategies to address disparities in MRI based dementia classification across NHW and Hispanic populations. Techniques such as KMM and CR effectively reduced performance gaps, though balancing fairness with accuracy remains a challenge. The findings encourage robust strategies to enhance fairness in AI‐based healthcare diagnostics.


graphic file with name ALZ-21-e105856-g002.jpg

graphic file with name ALZ-21-e105856-g003.jpg

graphic file with name ALZ-21-e105856-g001.jpg


Articles from Alzheimer's & Dementia are provided here courtesy of Wiley

RESOURCES