Abstract
Purpose of Review.
Risk stratification for chronic kidney is becoming increasingly important as a clinical tool for both treatment and prevention measures. The goal of this review is to identify how machine learning tools contribute and facilitate risk stratification in the clinical setting.
Recent Findings.
The two key machine learning paradigms to predictively stratify kidney disease risk are genomics-based and electronic health record-based approaches. These methods can provide both quantitative information such as relative risk and qualitative information like characterizing risk by subphenotype.
Summary.
The four key methods to stratify chronic kidney disease risk are genomics, multi-omics, supervised, and unsupervised machine learning methods. Polygenic risk scores utilize whole genome sequencing data to generate an individual’s relative risk compared to the population. Multi-omic methods integrate information from multiple biomarkers to generate trajectories and prognostic different outcomes. Supervised machine learning methods can directly utilize the growing compendia of electronic health records such as lab results and notes to generate direct risk predictions, while unsupervised machine learning methods can cluster individuals with chronic kidney disease into subphenotypes with differing approaches to care.
Keywords: Machine Learning, Chronic Kidney Disease, Genomics, Multi-omics, Subphenotype
Introduction
Risk stratification for kidney disease has become globally imperative to improve health equity (1). Early identification of kidney disease can prove a useful tool for the treatment of perioperative and postoperative acute kidney injury (2,3), as well as chronic kidney disease (4).However, current measures of kidney function and structure are inadequate especially in the early stages of the disease. Fortunately, increasing multi-omics and health record data has permitted clinicians to stratify individuals prior to the deterioration of kidney function. However, modern machine learning methods are needed to appropriately leverage this wealth of multi-modal data.
There are two classes of machine learning methods that are currently being used to a priori stratify risk of kidney disease. The first involves genomics and multi-omics - utilizing genome-wide association studies (GWAS’s) as well as transcriptome and proteomic studies to generate an individual’s relative risk for developing a kidney disease. The second involves training machine learning methods on electronic health record data to both directly calculate risk, and as importantly, identify sub-phenotypes of chronic kidney disease. This review will cover the basics of generating a polygenic risk score, as well as the machine learning methods for direct stratification and sub-phenotyping (Figure 1). Finally, we will discuss the limitations of current studies as well as the future directions for applying machine learning into risk stratification for kidney disease.
Figure 1.

Four distinct approaches to machine learning for chronic kidney disease.
Biological Approaches for Risk Stratification
The creation of the large biobanks of genomic, transcriptomic, and multi-omic data has prompted a new wave of innovative approaches towards interpretable risk stratification. Here, we show two approaches using genomic data - data generated from whole genome sequencing of large populations (500,000+), and multi-omic data - data generated via whole genome sequencing in conjunction with RNA sequencing and protein analysis from smaller populations (< 100,000).
Polygenic Risk Scores for Risk Stratification
As whole genome sequencing (WGS) data becomes increasingly available for individuals with kidney disease, the ability to generate accurate and precise prognosticators of kidney disease has also improved. Some rare kidney diseases, like polycystic kidney disease, exhibit mendelian inheritance patterns, and therefore can be diagnosed via the identification of disease-causing variants (5). However, the genetic architecture of common phenotypes like chronic kidney disease is attributed to a non-linear mixture of rare variants such as the APOL1 G1 and G2 variants and common variants identified in genome-wide association studies (6).
The estimated contribution of common genetic variants to the expression of chronic kidney disease or the heritability is 39% (7). Genome-wide association studies work by running a linear regression on a given phenotype, with the independent variable representing each sequenced single-nucleotide polymorphism. These studies calculate a coefficient for each sequenced single-nucleotide polymorphism, and these coefficients can be aggregated for a single individual to generate a relative risk for an individual of a given phenotype (8).
This methodology, however, faces some issues that machine learning is helping resolve. Because genome-wide association studies are primarily conducted in European populations, modeling risk in individuals with European ancestry tend to be more accurate because of better accounting for linkage-disequilibrium patterns (9). Novel methods in multi-ethnic polygenic risk scoring has significantly improved polygenic risk scoring in individuals of admixed ancestry by linearly combining coefficients obtained from large samples of European populations and from smaller samples of individuals of diverse ancestry (10). Multi-ethnic polygenic risk for chronic kidney disease is powerful enough to stratify individuals by incident chronic kidney disease, end stage kidney disease, kidney failure and acute kidney injury (11). We anticipate that computational improvements in multi-ethnic polygenic risk as well as the increasing sample size of individuals with admixed ancestry through improving diversity in research will only further improve risk stratification via polygenic risk scoring (12).
Multi-omic Approaches to Risk Stratification
The heterogeneity of kidney disease presentation, treatment and etiology has strongly demonstrated the need for multi-omic approaches to the stratification of kidney disease risk (13). Trends towards growing data quantity and availability has lent itself to improvements in understanding the biological underpinnings of kidney disease. For example, the multi-omic database - a repository consisting of genes, proteins, microRNA, and metabolites - in the context of chronic kidney disease has been expanding since 2017 (14). Preliminary qualitative analysis of the tissue proteome have revealed insights into the biological underpinnings of kidney function during diabetic kidney disease (15).
Multi-omic machine learning approaches have shown the potential to prognosticate outcomes in chronic kidney disease (16). For example, methods like multi-PLIER utilizes differential network enrichment to generate a set of ‘latent’ variables used in downstream prognostication tasks. Integrative analysis of prognostic biomarkers from multi-omics panels help discriminate different trajectories of kidney function (17). Nevertheless, we anticipate that with growing databases, more state-of-the-art methods can be utilized to maximize risk prognostication. For example, graph convolutional neural networks have been used to create predictive models from protein-protein interaction graphs in the field of oncology (18). Future studies may hope to translate this into the complex protein interactions of kidney disease.
Machine Learning
The promise of machine learning in health care is to develop rich, hierarchical models that can accurately and interpretably provide novel clinical insights into the diagnosis and prognosis of disease pathology. The breath of readily available electronic health record data, especially for chronic kidney disease, has yielded important results for patients and clinicians alike.
Direct Risk Prediction
Direct risk prediction involves a class of algorithms called supervised machine learning algorithms, which utilize training data to predict an outcome. In the case for most of the following algorithms, biomarkers, electronic health records, and metabolites are all used as input, and trained to generate a direct prediction of chronic kidney disease or acute kidney failure.
Convolutional Neural Networks, a type of network that accounts for spatial relations within data, were used to develop a model to predict chronic kidney disease with a cohort of 2 million prospective individuals (19). Feedforward neural networks predicted the risk of end stage renal disease in patients with diabetic kidney disease from clinical trial data (20). Publicly available data from the UCI repository permits an empirical approach to model comparison for chronic kidney disease (21).
More nuanced approaches to machine learning for risk stratification - targeting subclinical markers of kidney disease have improved models and yielded insights into the etiology of kidney disease, as well as the associated comorbidities. Using structural data, text data and longitudinal data from electronic medical records from 64,059 individuals, natural language processing models (NLP) could detect the progression of diabetic kidney disease (22). The inclusion of metabolite biomarkers in a machine learning algorithm revealed that 13 metabolites out of 120 were identified to be significantly associated with the development of chronic kidney disease over the span of five years. (23). Machine learning also shows promise in kidney transplantation, with a systematic analysis of currently deployed models demonstrating high sensitivity and specificity for predicting graft failure. (24). An ensemble method applied to 671 cardiac surgery patients predicted the risk of acute kidney injury via preoperative biochemical data, preoperative medical and intraoperative time-series like hemodynamic changes. A study with 1,146 patients demonstrated AI-enabled prognosis can provide insight into the rapid decline of kidney function in diabetic patients (25). Computer assisted diagnosis of chronic kidney disease demonstrates potential for utilization in developing countries to improve screening in low-income and hard-to-reach settings (26).
However, most risk prediction algorithms are trained on patients from a single institution, which limits their generalizability. (27). For example, an external validation for 11 distinct machine learning algorithms to predict kidney failure demonstrated that most of the algorithms perform well in the short term, but perform poorly with horizons longer than 5 years on external populations (28). Institutions are primarily constrained by data-sharing issues, and therefore train algorithms within their own patient database, which limits external validity as well as generalizability. Federated learning presents a potential solution to this problem by allowing institutions to share the weights of a model rather than the patient data to train the model (29). Application of federated learning to acute kidney failure is relatively novel, but has demonstrated improved outcomes in the context of COVID-19 patients (30).
Risk Stratification by Clustering
One challenge that remains in diagnosing and prognosing kidney diseases is that they exhibit a heterogeneity in both expression and etiology (31). This makes risk stratification especially difficult, as individual risk is contingent on the subset of etiologies contributing to an individual’s phenotype. Unsupervised machine learning methods, machine learning methods that find similarity within the data - clustering, or outlier like anomaly detection, can and are be used to sub-phenotype kidney disease. A deep autoencoder is one method that can compress high-dimensional spaces into low-dimensional spaces, and these low-dimensional spaces are very tractable with methods like K-means clustering. One study demonstrated that this method can be used to compress electronic health record data from 4001 sepsis-associated kidney injury patients to generate four distinct sub-phenotypes with varying prognosis (32). Hierarchical clustering, a method that generates a set of hierarchical groups using similarity metrics, demonstrated four distinct subphenotypes of acute kidney injury with varying ages, ICU admission rates and comorbidities amongst clusters when trained upon distinct molecular and clinical patient signatures (33). Similarly, memory networks, a variant of the long-short term memory network, were trained on longitudinal and unstructured electronic health record data to produce patient representations that clustered into three distinct sub-phenotypes (34). This work is beginning to be applied translationally. For example, two different subphenotypes with different biomarker signatures in an AKI patient subset with 794 discovery and 425 validation patients were identified via latent factor analysis. Moreover, these patients respond differently to vasopressin therapy, which demonstrates the translational potential for sub-phenotyping (35).
Nevertheless, external validation remains a significant challenge for kidney machine learning models. First, as noted above, different datasets demonstrated different numbers of clusters in acute kidney failure as well as different characteristics in each cluster. External validation in unsupervised learning depends on metrics that are not currently tested in machine learning models. Several metrics exist for validating unsupervised machine learning models like cohesion, separation and mutual information (36). More rigorous validation of sub-phenotyping algorithms using these metrics on external datasets are necessary before integration of unsupervised cluster-based machine learning methods into the standard of care.
Conclusions
Machine learning for risk stratification in the context of kidney disease has shown great promise to translate complex presentations into direct risk predictions, as well as to disentangle the heterogeneity of etiologies of kidney disease into distinct sub-phenotypes or clusters. With growing amounts of data, the field of nephrology is positioned to optimally utilize machine learning algorithms to provide novel insights into diagnosis, prognosis and treatment of kidney disease. Funding for cross-site collaborations and federated learning approaches should be increased to further improve the generalizability and interoperability of machine learning models. To be successful, machine learning interventions in kidney disease require a team of engaged stakeholders and a systematic approach from ideation to implementation (37). While the ideation stage shows significant promise, we hope more nephrologists will take charge in implementing these ideas into clinical practice.
Key Points.
Risk stratification for kidney disease is becoming an increasingly important part of prevention and treatment.
Common genetic variants contribute significantly to the risk of kidney disease, leading to an interest in polygenic risk scores.
There is an under-representation of individuals of non-European ancestry in published GWAS studies; more studies in diverse populations are indicated.
EHR data has been demonstrated to be useful in predicting the risk of progression in diabetic kidney disease, graft failure in renal transplantation and identification of subphenotypes of acute kidney injury that differ in response to treatments.
Rigorous external validation of discoveries based on EHR data has been lacking; thus increased funding for inter-institutional collaborations is necessary to help bridge this gap.
Acknowledgements
We acknowledge Ms. Jill Gregory for assistance with the figures
Funding sources.
G.N.N is supported by R01DK127139 and R01HL155915
Footnotes
Conflicts of Interest.
G.N.N. has received consulting fees from AstraZeneca, Reata, BioVie, Qiming Capital, Daiichi Sankyo, Variant Bio and GLG Consulting; has received financial compensation as a scientific board member and advisor to Renalytix; and owns equity in Renalytix, Verici Dx and Pensieve Health as a cofounder
References
- 1.Shlipak MG, Tummalapalli SL, Boulware LE, et al. The case for early identification and intervention of chronic kidney disease: conclusions from a Kidney Disease: Improving Global Outcomes (KDIGO) Controversies Conference. Kidney Int. 2021. Jan;99(1):34–47. [DOI] [PubMed] [Google Scholar]
- 2.Lei VJ, Luong T, Shan E, et al. Risk Stratification for Postoperative Acute Kidney Injury in Major Noncardiac Surgery Using Preoperative and Intraoperative Data . JAMA Netw Open. 2019. Dec 2;2(12):e1916921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Romagnoli S, Ricci Z, Ronco C. Perioperative Acute Kidney Injury: Prevention, Early Recognition, and Supportive Measures. Nephron. 2018. Jun 26;140(2):105–10. [DOI] [PubMed] [Google Scholar]
- 4.Vassalotti JA, Boucree SC. Integrating CKD Into US Primary Care: Bridging the Knowledge and Implementation Gaps. Kidney International Reports [Internet]. 2022. Feb 4; Available from: https://www.sciencedirect.com/science/article/pii/S2468024922010920 [DOI] [PMC free article] [PubMed]
- 5.Mallawaarachchi AC, Lundie B, Hort Y, et al. Genomic diagnostics in polycystic kidney disease: an assessment of real-world use of whole-genome sequencing. Eur J Hum Genet. 2021. May;29(5):760–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Pollak MR, Friedman DJ. The Genetic Architecture of Kidney Disease. Clin J Am Soc Nephrol. 2020. Feb 7;15(2):268–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Wuttke M, Li Y, Li M, et al. A catalog of genetic loci associated with kidney function from analyses of a million individuals. Nat Genet. 2019. Jun;51(6):957–72. This paper describes the most recent use of genomic data to generate a genome-wide association study for chronic kidney disease.
- 8.Choi SW, Mak TSH, O’Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc. 2020. Sep;15(9):2759–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Liu L, Kiryluk K. Genome-wide polygenic risk predictors for kidney disease. Nat Rev Nephrol. 2018. Dec;14(12):723–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Márquez-Luna C, Loh PR, South Asian Type 2 Diabetes (SAT2D) Consortium, et al. Multiethnic polygenic risk scores improve risk prediction in diverse populations. Genet Epidemiol. 2017. Dec;41(8):811–23. Polygenic risk scores generally do not work very well in diverse populations that are affected by chronic kidney disease, but this paper generates a method to improve risk prediction across different ancestries.
- 11.Yu Z, Jin J, Tin A, et al. Polygenic Risk Scores for Kidney Function and Their Associations with Circulating Proteome, and Incident Kidney Diseases. J Am Soc Nephrol [Internet]. 2021. Sep 21; Available from: 10.1681/ASN.2020111599 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lin BM, Nadkarni GN, Tao R, et al. Genetics of Chronic Kidney Disease Stages Across Ancestries: The PAGE Study. Front Genet. 2019. May 24;10:494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Eddy S, Mariani LH, Kretzler M. Integrated multi-omics approaches to improve classification of chronic kidney disease. Nat Rev Nephrol. 2020. Nov;16(11):657–68. [DOI] [PubMed] [Google Scholar]
- 14.Fernandes M, Husi H. Establishment of a integrative multi-omics expression database CKDdb in the context of chronic kidney disease (CKD). Sci Rep. 2017. Jan 12;7:40367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Rinschen MM, Saez-Rodriguez J. The tissue proteome in the multi-omic landscape of kidney disease. Nat Rev Nephrol. 2021. Mar;17(3):205–19. [DOI] [PubMed] [Google Scholar]
- 16.Ma J, Karnovsky A, Afshinnia F, et al. Differential network enrichment analysis reveals novel lipid pathways in chronic kidney disease. Bioinformatics. 2019. Sep 15;35(18):3441–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Kammer M, Heinzel A, Willency JA, et al. Integrative analysis of prognostic biomarkers derived from multiomics panels helps discrimination of chronic kidney disease trajectories in people with type 2 diabetes. Kidney Int. 2019. Dec;96(6):1381–8. This paper utilizes multiple modes of biological data to understand underlying etiology of chronic kidney disease and generate patient-specific prognostications.
- 18.Schulte-Sasse R, Budach S, Hnisz D, et al. Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms. Nature Machine Intelligence. 2021. Apr 12;3(6):513–26. [Google Scholar]
- 19.Krishnamurthy S, Ks K, Dovgan E, et al. Machine Learning Prediction Models for Chronic Kidney Disease Using National Health Insurance Claim Data in Taiwan. Healthcare (Basel) [Internet]. 2021. May 7;9(5). Available from: 10.3390/healthcare9050546 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Belur Nagaraj S, Pena MJ, Ju W, et al. Machine-learning-based early prediction of end-stage renal disease in patients with diabetic kidney disease using clinical trials data. Diabetes Obes Metab. 2020. Dec;22(12):2479–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Khan B, Naseem R, Muhammad F, et al. An Empirical Evaluation of Machine Learning Techniques for Chronic Kidney Disease Prophecy. IEEE Access. undefined 2020;8:55012–22. [Google Scholar]
- 22.Makino M, Yoshimoto R, Ono M, et al. Artificial intelligence predicts the progression of diabetic kidney disease using big data machine learning. Sci Rep. 2019. Aug 14;9(1):11862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Huang J, Huth C, Covic M, et al. Machine Learning Approaches Reveal Metabolic Signatures of Incident Chronic Kidney Disease in Individuals With Prediabetes and Type 2 Diabetes . Diabetes. 2020. Dec;69(12):2756–65. *This paper utilized deep learning on metabolic data to reveal that sphingomyelin C18:1 and phosphatidylcholine diacyl C38:0 are candidate biomarkers of incident CKD specifically in hyperglycemic individuals. This paper is important because these biomarkers can be subsequently used to stratify individuals with T2D.
- 24.Senanayake S, White N, Graves N, et al. Machine learning in predicting graft failure following kidney transplantation: A systematic review of published predictive models. Int J Med Inform. 2019. Oct;130:103957. [DOI] [PubMed] [Google Scholar]
- 25.Chan L, Nadkarni GN, Fleming F, et al. 499-P: Clinical Validation of an AI-Enabled Prognostic Test (KidneyintelX) to Accurately Predict Rapid Kidney Function Decline in Patients with Type 2 Diabetic Kidney Disease. Diabetes [Internet]. 2020. Jun 1 [cited 2022 Feb 17];69(Supplement_1). Available from: https://diabetesjournals.org/diabetes/article/69/Supplement_1/499-P/57937 [Google Scholar]
- 26.Sobrinho A, Queiroz ACMDS, Dias Da Silva L, et al. Computer-Aided Diagnosis of Chronic Kidney Disease in Developing Countries: A Comparative Analysis of Machine Learning Techniques. IEEE Access. undefined 2020;8:25407–19. [Google Scholar]
- 27.Song X, Yu ASL, Kellum JA, et al. Cross-site transportability of an explainable artificial intelligence model for acute kidney injury prediction. Nat Commun. 2020. Nov 9;11(1):5668. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Ramspek CL, Evans M, Wanner C, et al. Kidney Failure Prediction Models: A Comprehensive External Validation Study in Patients with Advanced CKD. J Am Soc Nephrol. 2021. May 3;32(5):1174–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bonawitz K, Eichner H, Grieskamp W, et al. Towards federated learning at scale: System design. Proceedings of Machine Learning and Systems. 2019. Apr 15;1:374–88. [Google Scholar]
- 30.Jaladanki SK, Vaid A, Sawant AS, et al. Development of a federated learning approach to predict acute kidney injury in adult hospitalized patients with COVID-19 in New York City. medRxiv [Internet]. 2021. Jul 28; Available from: 10.1101/2021.07.25.21261105 [DOI] [Google Scholar]
- 31.Caplin B, Yang CW, Anand S, et al. The International Society of Nephrology’s International Consortium of Collaborators on Chronic Kidney Disease of Unknown Etiology: report of the working group on approaches to population-level detection strategies and recommendations for a minimum dataset. Kidney Int. 2019. Jan;95(1):4–10. [DOI] [PubMed] [Google Scholar]
- 32.Chaudhary K, Vaid A, Duffy Á, et al. Utilization of Deep Learning for Subphenotype Identification in Sepsis-Associated Acute Kidney Injury. Clin J Am Soc Nephrol. 2020. Nov 6;15(11):1557–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Vasquez-Rios G, Oh W, Lee S, et al. Molecular and clinical signatures in Acute Kidney Injury define distinct subphenotypes that associate with death, kidney, and cardiovascular events [Internet]. bioRxiv. 2021. Available from: 10.1101/2021.12.14.21267738 [DOI] [Google Scholar]
- 34. Xu Z, Chou J, Zhang XS, et al. Identifying sub-phenotypes of acute kidney injury using structured and unstructured electronic health record data with memory networks. J Biomed Inform. 2020. Feb;102:103361. This paper utilizes electronic health record data to generate sub-phenotypes for individuals and cluster acute kidney injury by underlying etiology.
- 35.Bhatraju PK, Zelnick LR, Herting J, et al. Identification of Acute Kidney Injury Subphenotypes with Differing Molecular Signatures and Responses to Vasopressin Therapy. Am J Respir Crit Care Med. 2019. Apr 1;199(7):863–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Palacio-Niño JO, Berzal F. Evaluation Metrics for Unsupervised Learning Algorithms [Internet]. arXiv [cs.LG]. 2019. Available from: http://arxiv.org/abs/1905.05667 [Google Scholar]
- 37.Wiens J, Saria S, Sendak M, et al. Do no harm: a roadmap for responsible machine learning for health care. Nat Med. 2019. Sep;25(9):1337–40. [DOI] [PubMed] [Google Scholar]
