Abstract
The emergence of genome-wide association studies (GWAS) has identified genetic traits and polymorphisms that are associated with the progression of nonalcoholic fatty liver disease (NAFLD). Phospholipase domain-containing 3 and transmembrane 6 superfamily member 2 are genes commonly associated with NAFLD phenotypes. However, there are fewer studies and replicability in lesser-known genes such as LYPLAL1 and glucokinase regulator (GCKR). With the advent of artificial intelligence (AI) in clinical genetics, studies have utilized AI algorithms to identify phenotypes through electronic health records and utilize convolution neural networks to improve the accuracy of variant identification, predict the deleterious effects of variants, and conduct phenotype-to-genotype mapping. Natural language processing (NLP) and machine-learning (ML) algorithms are popular tools in GWAS studies and connect electronic health record phenotypes to genetic diagnoses using a combination of international classification disease (ICD)-based approaches. However, there are still limitations to machine-learning - and NLP-based models, such as the lack of replicability in larger cohorts and underpowered sample sizes, which prevent the accurate prediction of genetic variants that may increase the risk of NAFLD and its progression to advanced-stage liver fibrosis. This may be largely due to the lack of understanding of the clinical consequence in the majority of pathogenic variants. Though the concept of evolution-based AI models and evolutionary algorithms is relatively new, combining current international classification disease -based NLP models with phylogenetic and evolutionary data can improve prediction accuracy and create valuable connections between variants and their pathogenicity in NAFLD. Such developments can improve risk stratification within clinical genetics and research while overcoming limitations in GWAS studies that prevent community-wide interpretations.
Keywords: artificial intelligence, genetics, machine-learning, natural language processing, nonalcoholic fatty liver disease
Introduction
Novel genome-wide association studies have extensively studied the genetic risk variants in nonalcoholic fattyl disease (NAFLD).
Applications of artificial intelligence can improve NAFLD phenotyping from electronic health record data.
The clinical consequences of most pathogenic genetic variants are still undetermined.
Evolution-based artificial intelligence algorithms can improve genetic risk stratifications in NAFLD.
Nonalcoholic fatty liver disease (NAFLD), one of the most common diseases in the world, is characterized by adipose tissue deposition in the liver1. NAFLD is often associated with a sedentary lifestyle, a high-fat diet, and obesity; as a result, the most common ‘cure’ for NAFLD is weight loss. Scientific evidence suggests that NAFLD can contribute to insulin resistance and dyslipidemia and has been correlated with the development of diabetes and the metabolic syndrome1. Most people with NAFLD suffer from the less complicated form, steatosis, which is simply defined by the accumulation of fat in the liver. A certain percentage of patients may suffer from Nonalcoholic steatohepatitis (NASH), the more progressive form of NAFLD, which can result in liver cirrhosis, and the accumulation of scar tissue can eventually render the liver damaged beyond repair. If NAFLD and NASH were left untreated, extensive cirrhotic and fibrotic changes can eventually progress to hepatocellular carcinoma (HCC), treatment for which remains difficult. Amongst patients with both NASH and cirrhosis, the risk of developing HCC is high (12.8%) over a time period of 3 years2.
While the development of HCC is multifactorial, increased iron absorption in NASH, insulin resistance, and certain genetic polymorphisms are observed to have an important role in its pathophysiology3,4. Improvements in the prognosis of end-stage liver cancer requires robust identification strategies across NAFLD and NASH patients earlier in their care. As a result, the heritability of NAFLD has emerged to become a popular area of clinical research and has been extensively studied in genome-wide association studies (GWAS). Genome studies have linked various Single-Nucleotide Polymorphisms on genes such as Phospholipase domain-containing 3 (PNPLA3) and MBOAT7 that are associated with hepatic steatosis, fibrotic changes of the liver, and the acceleration of HCC development in patients with metabolic syndrome (i.e. insulin resistance, hypertension, obesity, etc.)5. Evolution-based artificial intelligence (AI) has emerged as a new technological model, which aims to improve variant identification of NAFLD-associated genes. However, the lack of risk stratification algorithms that could apply to international communities makes it challenging to accurately predict NAFLD-associated variants across ancestries. This review highlights the promising role of evolution-based AI in genome-wide studies to stratify genetic risk variants in NAFLD progression, improve risk stratification, and influence therapeutic implications.
The emergence of GWAS in NAFLD
Several environmental and genetic factors play a significant role in the heritability of NAFLD, ranging from 22 to 50%6.
GWAS is a research strategy to discover genetic variations statistically related to disease risk. In this approach, the genomes of numerous individuals are scanned to identify genetic variants that are more prevalent in people with the disease or traits than those without the disease. These genomic variants are often used to locate neighboring variants that are directly responsible for the disease or trait once they have been found. Across GWAS studies investigating NAFLD severity, the condition is often identified based on visceral fat content and histological fibrosis severity to accurately represent steatosis in populations across clinical databases7,8. With consistent improvements in the definition of NAFLD and NASH, GWASs have greatly improved our understanding on its etiologies and pathophysiology in at risk patients with metabolic phenotypes9. There have been several studies linking nonsynonymous single-nucleotide polymorphisms in PNPLA3 and, more recently, Transmembrane 6 Superfamily member 2 (rs58542926), which was originally associated with the nearby neurocan (NCAN) gene7,8,10. Following subsequent investigations, it has been determined that both genetic relationships are associated with clinically significant outcomes, including steatohepatitis grade, cirrhosis or hepatic fibrosis stage, and, in the case of PNPLA3, hepatocellular carcinoma in NAFLD patients11–13.
While variations in or near LYPLAL1 and PNPLA3 do not affect serum lipid levels, those in or near neurocan (NCAN), glucokinase regulator (GCKR), and PPP1R3B do. Glycemic characteristics are also impacted by variations close to GCKR and PPP1R3B. Thus, it demonstrates the genetic basis of NAFLD and increases the variety of common genetic variations associated with this characteristic7–10. Recent studies using exome sequencing have correlated elevated alanine amino transferase (ALT) with an HSD17B13 SNP (rs6834314) in a general patient population and further demonstrated that this polymorphism is linked to NAFLD. This is widely supported by two more investigations14,15. In addition to the PNPLA3 effect, the GWAS suggests that IL17RA and other biologically significant genes play a significant role in the severity of NAFLD16.
Commonly reported gene variants that shape liver phenotypes in NAFLD
A case-only GWAS using standardized liver enzymes as a quantitative phenotype found a strong association between the PNPLA3 risk allele and higher ALT levels. The highest effect was found for ALT at rs738409, with a P-value of 4.68 10 717. Both AST and ALT were found to share a new impact at 2p22 close to the Xanthine Dehydrogenase (XDH) gene, which are expressed in the liver and involved in the oxidative metabolism of purines. This enzyme catalyzes the conversion of xanthine to uric acid and hypoxanthine to xanthine. Therefore, XDH-produced uric acid and reactive oxygen species may result in inflammation and oxidative stress. Recent research has demonstrated a correlation between the serum level of XDH and blood metabolic markers associated with obesity, including triglycerides, cholesterol, and glucose18. The transmembrane heparan sulphate proteoglycan Syndecan-1 (CD138, SDC1), which is abundantly expressed in the liver, also exerts metabolic effects and is suggested to cause a strong impact on liver enzyme levels; previous studies have corroborated with this hypothesis given that NAFLD patients exhibit higher serum levels of syndecan-119. In addition to these genes, recent reports have also suggested that HSD17B13 is relevant to NAFLD through several genetic polymorphisms linked to lower risks of steatosis15,20.
Contrary to PNPLA3 and LYPLAL1, variants near NCAN (which encodes for an adhesion molecule), PPP1R3B (which encodes for a protein that regulates glycogen breakdown), and GCKR (which, by inhibiting glucokinase, regulates glucose storage/disposal and provides substrates for de novo lipogenesis) are linked to distinct alterations in serum and liver lipids as glycemic traits21,22. Certain polymorphisms in PNPLA3 and LYPLAL1 are known to induce hepatic steatosis without the presence of common metabolic characteristics such as insulin resistance and obesity in patient groups. These distinct correlation patterns suggest that hepatic steatosis is potentially regulated by various metabolic pathways, and is not just limited to lipid dysregulation or oxidative stress to the liver. Addressing genetic variation in the etiology of hepatic steatosis may create new avenues for individualized treatments, but there is a growing need to improve the accuracy and efficiency of these identification studies with modern technology.
The influence of AI in clinical genetics research
AI has wide-ranging applications in clinical practice and stimulates human intelligence in computer-based networks. In the broad scheme of medicine, AI has been commonly known to maintain electronic health records (EHR) and assist patient consults, while machine-learning has been implemented to facilitate surgical procedures and assist in radiographic diagnostics23. In clinical genomics, AI computer systems can replicate applications requiring human intelligence through deep learning algorithms, thus improving image-based diagnostics, EHR phenotyping, and genetic variant classifications. Examples of these algorithms include computer vision (image acquisition), time-series analyses, automatic speech recognition, and Natural Language Processing (NLP)24.
Convolution neural networks (CNN) drive the use of these AI algorithms, which are computational systems that consist of artificial neurons that take input data based on their trained interpretations. CNN allows for deep learning by interpreting features (i.e. diagnostics, imaging, procedure codes, and sequencing data) from complex datasets to drive genomic research. Though reports have found certain AI algorithms are limited and impractical compared with human intelligence, there have been major developments and improvements in CNN-based algorithms for genomics analyses. A review led by Dias in 2019 best summarizes adaptations that address the limitations of current AI-algorithm including deep variant caller algorithms to improve the accuracy of variant identification in a genome, combined annotation dependent depletion approach algorithms to predict deletions of variants. The use of primate AI to identify known pathogenic variants using cross-species analyses, and other CNN-based applications that would improve phenotype-to-genotype mapping and vice-versa 24. A summary of AI applications to genetics research is shown in Figure 1.
Strengths and Limitations to NLP and ML Algorithms in NAFLD variant identification
NLP is one of the most commonly utilized AI-based algorithms that can extract important clinical information from EHRs, such as biopsies or images, and provide a molecular diagnosis with a combination of genotypic and phenotypic information. In particular, AI-based NLP approaches are often used to connect EHR phenotypes to genetic diagnoses through hierarchical statistical models and are often used for phenotype-to-genotype diagnostics. By detecting certain terms from the natural human language, NLP-based algorithms can computationally extract documents from phenotypic inputs and derive genetic variants of conditions. In NAFLD, the use of NLP in GWAS studies has risen in popularity and use, as NLP has shown an accuracy in detecting image-proven and biopsy-proven steatosis. A recent study led by Van Vleck supported these findings. NLP performed better in identifying NAFLD based on radiology notes and fibrosis scores from biopsies compared to traditional approaches using ICD-based approaches25. Given the propensity of NLP use in studies, multiple ML algorithms have been employed to identify the severity of NAFLD and its risk of progression to advanced disease such as cirrhosis. The Path AI research platform established in Boston trained a deep neural network to interpret over 600 liver biopsies and showed high concordance with liver histopathological interpretations made by trained pathologists. Such algorithms have also shown wide applications in identifying steatohepatitis from imaging, analyzing liver biomarkers such as ALT or gamma glutamyl transferase, and fibrosis staging26,27. As a result, AI has bridged several gaps in the knowledge of NAFLD progression, especially in risk stratification.
Despite major steps in identifying NAFLD variants underlying disease pathogenicity and progression, limitations of ML and NLP-based algorithms have been overlooked and may limit the strength of studies and there is still more room for growth in efficient algorithms that would stratify the risk of developing advanced liver disease based on genetic variants in studies. Current developments in AI models may be efficient, but the clinical consequences of over 98% of pathogenic variants in disease-associated genes have not yet been defined28–30. This may be the root cause of common limitations in NAFLD GWAS studies, such as underpowered sample sizes and the lack of replication due to the unknown association of certain variants31. As a result, most GWAS studies are limited to small cohorts that may not accurately represent the risks detected in populations. Data from EHRs have the potential for errors in lab measures and clinical diagnoses despite studies adjusting for confounding factors. This may also affect the clinical accuracy of quantitative liver enzyme tests that are not entirely specific to NAFLD, thus increasing the need for larger sample sizes and replication31.
Given that the NAFLD population is at a higher risk of developing HCC, it is crucial to identify variants that may increase the risk of irreversible advanced-stage liver cirrhosis32. Certain proposed models include genotype-to-phenotype predictions, which can improve risk stratification of NAFLD progression to HCC. However, current models are not equipped to predict the risk of developing HCC based on genotypes associated with liver phenotypes and do not represent causal relationships between risk factors. The accurate prediction of phenotypes from genotypes poses a huge problem, as the effects of genetic mutations are highly variable and can pose a barrier in personalized risk stratification for NAFLD.
Evolution-based AI models in detecting genetic variants and applicability to NAFLD GWAS
Evolution-based AI models are a topic of interest, as traditional state-of-the-art models rely on known disease labels such as ICD codes, which are subject to inaccuracies. By combining genomic information with evolutionary and phylogenetic data (seen in Fig. 2), trained algorithms can determine the propensity of disease-causing genes in their progression to disease-associated carcinomas and improve risk stratification within clinical genetics. One example of an evolution-based AI model is the evolutionary model of variant effect (EVE), a project developed in collaboration with the University of Oxford, Harvard, and the Broad Institute to determine genetic variants of significance related to a genetic condition33. By predicting the pathogenicity of over 36 million variants in over 3000 disease-associated genes, it was suggested that evolutionary information across species and ancestries provide better evidence on single variants tied with clinical phenotypes compared to traditional models. Experimental results of EVE disseminated the function of variants in BRCA1, TP53, MSH2, and PTEN with known disease links, accurately predicting the function of variants34.
Though EVE is currently the only evolution-based AI model created, evolutionary algorithms can supplement the function of pre-existing ML models and optimize its function in interpreting the risk of disease. In a study conducted by Ordikani, an evolutionary ML algorithm utilizes a ‘genetic algorithm’, an optimizer that is trained to evolve each proposed solution to reach the optimal outcome that can determine the risk of cardiovascular disease amongst patients in the Isfahan Cohort35. The effort in estimating the occurrence of cardiovascular disease used a risk assessment model called the eXplainable Persian Atherosclerotic cardiovascular disease Risk Stratification, and the algorithm was proven to have higher prediction accuracy than traditional chart models, even in the absence of quantitative tests. The study opened up the possibility of utilizing Darwin’s theory of evolution to conduct risk prediction tests that may overpower the functionalities of statistical models currently being used in genetic algorithms. Multilevel selection genetic algorithms is also an example of using evolutionary theory to use different reproduction models that compare the behavior of multiple variants through fitness functions. A comparison of variants can adjust to the diversity within a population between generations and can be utilized for different conditions36.
We propose that evolution-based AI models can improve the current limitations in NAFLD GWAS studies. Given the variability in hepatic phenotypes within communities and risk factors that increase the risk of progression of NAFLD, AI models, and evolutionary algorithms can provide better associations of risk factors in genotype-to-phenotype studies and improve risk-stratification models in liver diseases. These models can shed insight on whether certain genetic variants in identified NAFLD patients increase the risk of advanced-stage liver disease and subsequent cancer, given that there is a lack of literature surrounding the genetic risk of HCC development. In future studies, newer models can provide therapeutic avenues in personalized treatment within communities and better insight into rare variants with unknown pathogenicity that traditional methods do not address.
Conclusion
As NAFLD patients remain at higher risk of developing HCC, it is imperative to uncover the pathogenicity of rare variants and improve early disease management. However, the current AI models are not capable of predicting the risk of developing HCC based on genotypes associated with liver phenotypes, hindering personalized risk stratification for NAFLD. AI and ML, with the combination of evolutionary data, hold great promise in optimizing GWAS studies, improving current limitations, and augmenting subsequent identification of genetic variants correlated with an increased risk of developing the NAFLD phenotype. Therefore, we propose that future studies consider evolution-based AI models, which can create valuable connections between genetic variants and the risk of diseases progressing to advanced-stage conditions. Further translational research exploring the utility of evolution-based AI models in GWAS studies can provide valuable evidence about its applications to NAFLD and its influence on patient outcomes.
Ethical approval
NA.
Consent
NA.
Sources of funding
Authors have no funding to declare.
Author contribution statement
All authors under the guidance of H.H.: substantial contribution to the conception and design of the work; all authors under the guidance of H.H. and W.A.A.: drafting the work and critical revision. All the authors read and approved the final version of the manuscript.
Conflicts of interest disclosure
Authors have no conflicts of interest to declare.
Research registration unique identification number (UIN)
Name of the registry: NA.
Unique Identifying number or registration ID: NA.
Hyperlink to your specific registration (must be publicly accessible and will be checked): NA.
Guarantor
Mohammad Mehedi Hasan, Department of Biochemistry and Molecular Biology, Faculty of Life Science, Mawlana Bhashani Science and Technology University, Tangail, Bangladesh. e-mail: mehedi.bmb.mbstu@gmail.com (MMH)
Data availability statement
No data available.
Provence and peer review
Not commissioned, externally peer-reviewed.
Acknowledgments
The authors would like to thank contributing authors from the Intercontinental Omni-Research in Medicine Collaborative (ICORMed) Research Group.
Footnotes
Sponsorships or competing interests that may be relevant to content are disclosed at the end of this article.
Published online 8 May 2023
Contributor Information
Helen Huang, Email: helenh2001@gmail.com.
Wireko A. Awuah, Email: andyvans36@yahoo.com.
Tulika Garg, Email: tulika0611@gmail.com.
Jyi Cheng Ng, Email: jyicheng_97@hotmail.com.
Aashna Mehta, Email: aashna.m19@gmail.com.
Krishna Ramamoorthy, Email: krishnaramamoorthy2003@gmail.com.
Jacob Kalmanovich, Email: Kalmanovich.jacob1@gmail.com.
Mohammad M. Hasan, Email: mehedi.bmb.mbstu@gmail.com.
References
- 1. Benedict M, Zhang X. Non-alcoholic fatty liver disease: an expanded review. World J Hepatol 2017;9:715–732. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. White DL, Kanwal F, El-Serag HB. Association between nonalcoholic fatty liver disease and risk for hepatocellular cancer, based on systematic review. Clin Gastroenterol Hepatol 2012;10:1342–1359.e2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Margini C, Dufour JF. The story of HCC in NAFLD: from epidemiology, across pathogenesis, to prevention and treatment. Liver Int 2016;36:317–324. [DOI] [PubMed] [Google Scholar]
- 4. Farazi PA, DePinho RA. Hepatocellular carcinoma pathogenesis: from genes to environment. Nat Rev Cancer 2006;6:674–687. [DOI] [PubMed] [Google Scholar]
- 5. Donati B, Dongiovanni P, Romeo S, et al. MBOAT7 rs641738 variant and hepatocellular carcinoma in non-cirrhotic individuals. Sci Rep 2017;7:4492. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Sookoian S, Pirola CJ. Genetic predisposition in nonalcoholic fatty liver disease. Clin Mol Hepatol 2017;23:1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Romeo S, Kozlitina J, Xing C, et al. Genetic variation in PNPLA3 confers susceptibility to nonalcoholic fatty liver disease. Nat Genet 2008;40:1461–1465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Speliotes EK, Yerges-Armstrong LM, Wu J, et al. GOLD Consortium. Genome-wide association analysis identifies variants associated with nonalcoholic fatty liver disease that have distinct effects on metabolic traits. PLoS Genet 2011;7:e1001324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Anstee QM, Day CP. The genetics of NAFLD. Nat Rev Gastroenterol Hepatol 2013;10:645–655. [DOI] [PubMed] [Google Scholar]
- 10. Kozlitina J, Smagris E, Stender S, et al. Exome-wide association study identifies a TM6SF2 variant that confers susceptibility to nonalcoholic fatty liver disease. Nat Genet 2014;46:352–356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Valenti L, Al-Serri A, Daly AK, et al. Homozygosity for the patatin-like phospholipase-3/adiponutrin I148M polymorphism influences liver fibrosis in patients with nonalcoholic fatty liver disease. Hepatology 2010;51:1209–1217. [DOI] [PubMed] [Google Scholar]
- 12. Liu YL, Reeves HL, Burt AD, et al. TM6SF2 rs58542926 influences hepatic fibrosis progression in patients with non-alcoholic fatty liver disease. Nat Commun 2014;5:4309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Liu YL, Patman GL, Leathart JB, et al. Carriage of the PNPLA3 rs738409 C >G polymorphism confers an increased risk of non-alcoholic fatty liver disease associated hepatocellular carcinoma. J Hepatol 2014;61:75–81. [DOI] [PubMed] [Google Scholar]
- 14. Ma Y, Belyaeva OV, Brown PM, et al. (for the Nonalcoholic Steatohepatitis Clinical Research Network). , 17-Beta Hydroxysteroid Dehydrogenase 13 is a hepatic retinol dehydrogenase associated with histological features of nonalcoholic fatty liver disease. Hepatology 2019;69:1504–1519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Pirola CJ, Garaycoechea M, Flichman D, et al. Splice variant rs72613567 prevents worst histologic outcomes in patients with nonalcoholic fatty liver disease. J Lipid Res 2019;60:176–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Namjou B, Lingren T, Huang Y, et al. GWAS and enrichment analyses of non-alcoholic fatty liver disease identify new trait-associated genes and pathways across eMERGE Network. BMC Med 2019;17:135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Li Q, Qu HQ, Rentfro AR, et al. PNPLA3 polymorphisms and liver aminotransferase levels in a Mexican American population. Clin Invest Med 2012;35:E237–E245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Battelli MG, Bortolotti M, Polito L, et al. The role of xanthine oxidoreductase and uric acid in metabolic syndrome. Biochim Biophys Acta Mol Basis Dis 2018;1864:2557–2565. [DOI] [PubMed] [Google Scholar]
- 19. Yilmaz Y, Eren F, Colak Y, et al. Hepatic expression and serum levels of syndecan 1 (CD138) in patients with nonalcoholic fatty liver disease. Scand J Gastroenterol 2012;47:1488–1493. [DOI] [PubMed] [Google Scholar]
- 20. Abul-Husn NS, Cheng X, Li AH, et al. A Protein-Truncating HSD17B13. Variant and protection from chronic liver disease. N Engl J Med 2018;378:1096–1106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Rauch U, Feng K, Zhou XH. Neurocan: a brain chondroitin sulfate proteoglycan. Cell Mol Life Sci 2001;58. 1842–1856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Agius L. Glucokinase and molecular aspects of liver glycogen metabolism. Biochem J 2018;414:1–18. [DOI] [PubMed] [Google Scholar]
- 23. Amisha F, Malik P, Pathania M, et al. Overview of artificial intelligence in medicine. J Family Med Prim Care 2019;8:2328–2331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Dias R, Torkamani A. Artificial intelligence in clinical and genomic diagnostics. Genome Med 2019;11:70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Van Vleck TT, Chan L, Coca SG, et al. Augmented intelligence with natural language processing applied to electronic health records for identifying patients with non-alcoholic fatty liver disease at risk for disease progression,. Int J Med Inform 2019;129:334–341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Aggarwal P, Alkhouri N. Artificial intelligence in nonalcoholic fatty liver disease: a new frontier in diagnosis and treatment. Clin Liver Dis (Hoboken) 2021;17:392–397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Decharatanachart P, Chaiteerakij R, Tiyarattanachai T, et al. Application of artificial intelligence in non-alcoholic fatty liver disease and liver fibrosis: a systematic review and meta-analysis. Therap Adv Gastroenterol 2021;14:17562848211062807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Van Hout CV, Tachmazidou I, Backman JD, et al. Exome sequencing and characterization of 49,960 individuals in the UK Biobank. Nature 2020;586:749–756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Karczewski KJ, Francioli LC, Tiao G, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature . 2020;581:434–443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Landrum MJ, Kattman BL. ClinVar at five years: delivering on the promise. Hum Mutat 2018;39:1623–1630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Fairfield CJ, Drake TM, Pius R, et al. Genome-WIDE ASSOCIATION STUDY of NAFLD USING ELECTRONIC HEALTH RECORDS. Hepatol Commun 2022;6:297–308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Kanwal F, Kramer JR, Mapakshi S, et al. Risk of hepatocellular cancer in patients with non-alcoholic fatty liver disease. Gastroenterology 2018;155:1828–1837.e2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Frazer J, Notin P, Dias M, et al. Disease variant prediction with deep generative models of evolutionary data. Nature . 2021;599:91–95. [DOI] [PubMed] [Google Scholar]
- 34. Iftikhar S, Fatima K, Rehman A, et al. An evolution based hybrid approach for heart diseases classification and associated risk factors identification. Biomed Res 2017;28:3451–3455. [Google Scholar]
- 35. Ordikhani M, Saniee Abadeh M, Prugger C, et al. An evolutionary machine learning algorithm for cardiovascular disease risk prediction. PLoS One 2022;17:e0271723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Sobey AJ, Grudniewski PA. Re-inspiring the genetic algorithm with multi-level selection theory: multi-level selection genetic algorithm. Bioinspir Biomim 2018;13:056007. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
No data available.