Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

medRxiv logoLink to medRxiv
[Preprint]. 2025 Sep 5:2025.09.03.25334761. [Version 1] doi: 10.1101/2025.09.03.25334761

ci-fGBD: Cluster-Integrated Fast Generalized Bruhat Decomposition for Multimodal Data Clustering in Alzheimer's Disease.

Lokendra S Thakur, Gurpreet Bharj, Lokesh Sangabattula, Bushra Malik
PMCID: PMC12424880  PMID: 40950451

Abstract

Multimodal biomedical datasets, such as those from neurodegenerative disease cohorts, present significant challenges in stratifying heterogeneous patient populations due to missing values, high dimensionality, and modality-specific biases. Traditional clustering methods often require extensive preprocessing and fail to integrate heterogeneous data types effectively. We introduce ci-fGBD(Cluster-Integrated Fast Generalized Bruhat Decomposition), a novel matrix factorization and clustering framework that natively operates on block-structured, multimodal datasets. ci-fGBD extends the classical Bruhat decomposition by jointly learning latent representations and patient clusters while automatically harmonizing contributions across diverse modalities, including neuroimaging, cognitive assessments, genomics, wearable sensors, and environmental exposures. Benchmarking against standard methods on real datasets demonstrates that ci-fGBD consistently identifies clinically meaningful subgroups, capturing subtle biological, cognitive, and demographic heterogeneity in Alzheimer disease cohorts with superior interpretability and robustness.

Full Text Availability

The license terms selected by the author(s) for this preprint version do not permit archiving in PMC. The full text is available from the preprint server.


Articles from medRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES