ABSTRACT
The prevalence of metabolic dysfunction‐associated steatotic liver disease (MASLD) has increased exponentially over the past three decades, in parallel with the global rise in obesity and type 2 diabetes. It is currently the most common cause of liver‐related morbidity and mortality. Although obesity has been identified as a key factor in the increased prevalence of MASLD, individual differences in susceptibility are significantly influenced by genetic factors. PNPLA3 I148M (rs738409 C>G) is the variant with the greatest impact on the risk of developing progressive MASLD and likely other forms of steatotic liver disease. This variant is prevalent across the globe, with the risk allele (G) frequency exhibiting considerable variation. Here, we review the contribution of PNPLA3 I148M to global burden and regional differences in MASLD prevalence, focusing on recent evidence emerging from population‐based sequencing studies and prevalence assessments. We calculated the population attributable fraction (PAF) as a means of quantifying the impact of the variant on MASLD. Furthermore, we employ quantitative trait locus (QTL) analysis to ascertain the associations between rs738409 and a range of phenotypic traits. This analysis suggests that these QTLs may underpin pleiotropic effects on extrahepatic traits. Finally, we outline potential avenues for further research and identify key areas for investigation in future studies.
Keywords: genetics, HSD17B13, MASH, MASLD, PNPLA3, rs6006460, rs738409
Abbreviations
- ALD
alcohol‐related liver disease
- BMI
body mass index
- CLD
chronic liver disease
- edQTL
RNA editing quantitative trait locus
- eQTL
expression quantitative trait locus
- GWAS
genome‐wide association study
- HCC
hepatocellular carcinoma
- HCV
hepatitis C virus
- lncQTL
lncRNA expression quantitative trait locus
- MASH
metabolic dysfunction‐associated steatohepatitis
- MASLD
metabolic dysfunction‐associated steatotic liver disease
- Mb
megabase
- MENA
Middle East and North Africa
- MetALD
metabolic and alcohol‐related/associated liver disease
- mQTL
methylation quantitative trait locus
- PAF
population attributable fraction
- PNPLA3
patatin‐like phospholipase domain‐containing 3
- QTL
quantitative trait locus
- sICAM‐1
soluble intercellular adhesion molecule‐1
- SLD
steatotic liver disease
- SNP
single nucleotide polymorphism
- sQTL
splicing quantitative trait locus
- T2D
type 2 diabetes
- tuQTL
transcript usage quantitative trait locus
Summary.
PNPLA3 I148M is the most common and impactful genetic risk factor for progressive MASLD (including MASH, cirrhosis and HCC).
PNPLA3 I148M risk allele frequency varies across the globe and is lowest in Africa and highest in Latin America.
PNPLA3 I148M accounts for a substantial proportion of chronic liver disease and cirrhosis cases, and the population attributable fraction varies with allele frequency.
Obesity, diet and other environmental factors amplify the effects of PNPLA3 I148M on MASLD and may further increase the global impact of PNPLA3 I148M.
The presence of QTLs associated with rs738409 may explain pleiotropic effects on extrahepatic traits.
1. Introduction
Metabolic dysfunction‐associated steatotic liver disease (MASLD) has reached epidemic proportions over the past three decades, following the global rise in obesity and type 2 diabetes (T2D), and is rapidly becoming the most common cause of liver‐related morbidity and mortality [1]. Recent estimates indicate that MASLD affects over 30% of adults worldwide [1] and over 55% of individuals with obesity and T2D [2, 3]. If the current trends in obesity and metabolic syndrome continue, the burden of liver disease is projected to increase even further [4]. Although obesity has played a cardinal role in the epidemic rise of MASLD, individual differences in susceptibility are strongly influenced by genetic factors, with heritability estimates ranging between 20% and 70% [5]. In addition, MASLD prevalence varies dramatically between geographical regions and racial and ethnic groups [6]. A 2004 investigation in a multiethnic population‐based cohort from Texas, USA, found that Hispanics had nearly twofold higher prevalence of hepatic steatosis (45%) compared to Blacks (24%) and roughly 1.4‐fold higher prevalence compared to Whites (33%), despite the fact that Blacks and Hispanics had an equally elevated burden of obesity and T2D compared to Whites [7]. This observation was consistent with earlier findings from hospital‐based cohorts that showed an underrepresentation of Blacks among patients with cryptogenic cirrhosis (attributed to MASLD) [8, 9] and prompted the search for genetic underpinnings of steatotic liver disease (SLD).
In 2008, the first genome‐wide association study (GWAS) of hepatic steatosis performed in a multiethnic cohort from Dallas, TX, found a common nonsynonymous variant, rs738409 C>G, encoding p.I148M, in a previously unknown gene, patatin‐like phospholipase domain‐containing 3 (PNPLA3), that was strongly associated with hepatic fat content, assessed quantitatively by 1H‐MR spectroscopy [10]. The variant was common in all ancestry groups represented in the study (overall G allele frequency 23%), but had the highest frequency in Hispanics (49%), followed by Whites (23%) and Blacks (14%), mirroring the prevalence of hepatic steatosis in the three groups. Together with another missense variant identified in the same gene (rs6006460 G>T, p.S453I), which is specific to Blacks, PNPLA3 I148M was shown to explain up to 70% of interethnic differences in hepatic fat content in the study cohort [10].
Numerous studies performed since showed that PNPLA3 I148M is associated with a full spectrum of SLD, including hepatic steatosis, steatohepatitis or MASH (metabolic dysfunction‐associated steatohepatitis), fibrosis, cirrhosis, hepatocellular carcinoma (HCC), and liver‐related mortality, in patients with MASLD, alcohol‐related SLD (MetALD and ALD depending on the severity of alcohol consumption), hepatitis C and mixed disease etiologies [11, 12, 13, 14, 15, 16, 17, 18]. The association between PNPLA3 I148M and SLD has been replicated in various geographical regions and in paediatric populations as well as in adults (reviewed in [19]).
Unlike many other variants discovered in GWAS, that had modest disease odds ratios in the range of 1.1–1.2 per allele, PNPLA3 I148M has had an unusually large effect size, with odds ratios of 1.5–3 per allele for progressive forms of MASLD (including MASH, cirrhosis and HCC), and proportionally higher odds ratios among GG homozygotes, consistent with an additive genetic model for quantitative traits and multiplicative model for dichotomous outcomes [10, 11, 19]. This large effect size has led some authors to compare the impact of PNPLA3 I148M to alleles causing monogenic (Mendelian) disorders [20]. After 15 years of GWAS, which included increasingly large sample sizes, PNPLA3 I148M remains the most common and impactful genetic risk factor for SLD. Given both its high frequency and large effect size, I148M was estimated to have the highest population attributable fraction (PAF) for chronic liver disease (CLD) and cirrhosis in European populations, compared to other GWAS‐identified variants [21]. Here we review the contribution of PNPLA3 I148M to global burden and regional differences in MASLD, focusing on recent evidence emerging from population‐based sequencing studies and prevalence assessments.
2. Regional Variation in Prevalence of MASLD and Frequency of PNPLA3 I148M (rs738409)
PNPLA3 I148M is common in all parts of the world, but the risk (G) allele frequency varies widely across the globe (Figure 1). Data from large‐scale genome sequencing projects (including the 1000 Genomes Project [29] and gnomAD [30]) indicate that the allele frequency is lowest in Sub‐Saharan Africa (12%), followed by Europe (23%), South Asia (24%–30%), East Asia (35%–45%), and is highest in Central and South America (~50%). Substantial variation has also been observed across the Americas, a region with a complex genetic structure shaped by historical admixture events among indigenous Native American, European, and African populations as a result of European colonisation, migration and transatlantic slave trade [31]. Accordingly, the lowest frequency of PNPLA3 I148M has been reported in populations with a higher proportion of African ancestry (e.g., Dominican Republic, 25%) and highest in populations with a greater proportion of indigenous Native American ancestry (e.g., Mexico, 52%). Notably, some studies report that G allele is the major (more common) allele in Peru (72%) [29] and Guatemala (69%) [32].
FIGURE 1.

Worldwide allele frequency distribution of PNPLA3 rs738409 I148M. Allele frequencies are based on data from the 1000 Genomes Project and gnomAD database, downloaded from NCBI dbSNP: https://www.ncbi.nlm.nih.gov/snp/rs738409/. Estimates of SLD prevalence derived from recent meta‐analyses and population‐based studies [1, 22, 23, 24, 25, 26, 27, 28].
These differences broadly parallel differences in worldwide prevalence of MASLD (Figure 1) [1, 22, 23, 24]. According to the most recent meta‐analyses, the highest prevalence of MASLD was estimated to be in Latin America (44.4%), followed by Middle East and North Africa (MENA) (39.4%) [23], South Asia (33.8%), South‐East Asia (33.1%), North America (31.2%), East Asia (29.7%), Asia Pacific 28.0% and Western Europe 25.1% [1]. The data for Sub‐Saharan Africa are more sparse, but previous meta‐analyses estimated MASLD prevalence to be around 13.5% [25]. Several studies have also reported that the prevalence of MASLD varies among Hispanic/Latino populations of different origin. For example, in the US, the prevalence of MASLD was reported to be highest among Hispanics of Mexican origin and lowest among those of Cuban, Dominican and Puerto Rican origin [26, 27, 33]. Substantial variation in prevalence of MASLD has also been reported among countries in Central and South America [28]. Differences in the frequency of the PNPLA3 148 M allele likely contribute at least partially to this variation in prevalence, although one study revealed that the proportion of Native American ancestry was an independent risk factor for MASLD in Hispanic/Latino adults, even after accounting for PNPLA3 genotype [34]. Furthermore, variation in the prevalence of obesity, extent of urbanisation, dietary composition and lifestyle habits likely also contribute to regional differences in MASLD prevalence.
One commonly used measure to quantify the impact of genetic variants on disease is the PAF or the fraction of disease cases that would not have occurred if the risk allele had been removed from the population. The PAF is a function of the effect size (relative risk of disease conferred by the variant) and the allele frequency (i.e., the proportion of individuals with the risk genotype(s)) [35]. We estimated the impact of PNPLA3 I148M on the global and regional prevalence of MASLD, using previously reported odds ratios [19] and allele frequencies. Data on the effect size of PNPLA3 I148M in different regions are sparse. However, most studies which performed ancestry‐stratified analyses found concordant effects of PNPLA3 I148M on MASLD across ancestries [36, 37]. While some studies reported significant heterogeneity in the effects of I148M on MASLD and cirrhosis across ancestries [37, 38], this may have been due to the small sample size for some ancestry groups [37] or heterogeneous characteristics of study cohorts, from which subjects of European and non‐European ancestries were derived [38]. Therefore, we used a range of odds ratios, similar to those reported in previous studies of European subjects [11, 19, 39].
Assuming a similar effect size of I148M on MASLD across different ancestries and regions, the variant would be expected to account for a much larger fraction of cases in Latin America vs. Africa due to regional variation in allele frequency (Table S1). For example, assuming an allelic odds ratio of 1.9 [19], PNPLA3 I148M accounted for 18.7% of MASLD cases in Africa, whereas it accounted for > 50% MASLD cases in Latin America. Assuming a larger odds ratio of 2.5 (as reported in some previous studies), the corresponding PAF increased to 28% in Africa and 67% in Latin America. These differences are a consequence of a higher total number of individuals with at‐risk genotypes among Latin American populations, but also a higher fraction of risk‐allele homozygotes, who have higher relative odds of MASLD than heterozygotes. Although high PAF estimates should be interpreted with caution [35], they illustrate the contribution of the PNPLA3 I148M allele to regional differences in the MASLD burden.
Data on the prevalence of MASH are less abundant because accurate diagnosis usually requires liver biopsy or accurate estimates of liver inflammation and fibrosis. Nevertheless, estimates from meta‐analyses suggest that similar regional differences exist in the prevalence of MASH, with the highest prevalence reported in Latin America (7.1%), followed by MENA (5.85%), South Asia (5.4%), South‐East Asia (5.3%), North America (5.0%), East Asia (4.8%), Asia Pacific (4.5%) and Western Europe (4.0%). These estimates were based on the average progression rates and MASH rates among patients with steatosis [4]. However, given the fact that PNPLA3 I148M is associated with MASH and fibrosis, one might expect to see an even greater burden of MASH and cirrhosis in countries with a higher prevalence of I148M.
It should be noted that in addition to I148M, several other coding PNPLA3 variants have been identified that may contribute to global MASLD prevalence. One of these is the missense variant rs6006460 G>T, encoding S453I, which was shown to be associated with a lower prevalence of hepatic steatosis, independent of I148M [10, 37, 40]. In contrast to the I148M variant, S453I is common in African‐ancestry populations (allele frequency 10%–11%) but virtually absent in other ancestry groups (Figure S1). The two variants appear to act in an additive fashion, with no documented interaction between rs738409 and rs6006460 [10]. Nonetheless, given a similar frequency of S453I and I148M among African populations, it is possible that the protective S453I variant may contribute to the lower overall prevalence of MASLD in African‐ancestry individuals. Another variant is PNPLA3 rs2294918 G>A (encoding E434K), which was shown to influence the risk of MASLD by altering the expression of PNPLA3 and interacting with the I148M variant in one study [41]. In particular, the risk G (434E) allele was shown to increase the expression of both 148I and 148M proteins (without changing protein activity), while the 434K allele reduced the expression of 148M and 148I, thus dampening the deleterious effect of 148M and the protective effect of 148I [41]. Unlike I148M and S453I variants, the risk rs2294918‐G allele frequency is highest in Africa (86%) and Asia (85%) and lowest in Europe (59%) [30], which may also contribute to the impact of I148M on MASLD burden in these regions, although more data are needed to evaluate the magnitude of its effect.
3. What Factors Affect the Impact of I148M?
While available data indicate that the effect sizes of PNPLA3 I148M on MASLD are similar across different geographical regions, the risks have been shown to be amplified by obesity and other metabolic factors (as discussed elsewhere in this issue). For example, one study found that the odds ratio of cirrhosis for GG homozygotes versus CC homozygotes was increased from 2.4 among lean individuals (those with BMI < 25 kg/m2) to 5.8 in the obese (those with BMI > 35 kg/m2) [42]. Thus, the contribution of PNPLA3 I148M to the global burden of SLD (including MASLD and MetALD) may also depend on the prevalence of obesity and other metabolic risk factors in different world regions. For example, this may account for the exceptionally high prevalence of MASLD in the MENA region, which has reported some of the highest prevalence rates for obesity and T2D [23]. This may also account for the lowest reported prevalence of MASLD in Peru (12.5%), which has the highest known frequency of PNPLA3 I148M, but where only 15% of the population is obese [28]. Given the projected trends in obesity rates, especially in the developing world, one may therefore expect that the impact of PNPLA3 I148M will be even greater in populations with a higher burden of obesity [4].
It is noteworthy that the majority of genetic studies, including GWAS, employed an additive model of inheritance. While this is a reasonable assumption for complex traits, this approach neglects possible synergistic effects, such as the genotype‐by‐environment interactions (G × E) described above, and genotype‐by‐genotype interactions (epistasis), which can sometimes manifest as non‐linear (or non‐additive) effects. A number of intriguing examples of these effects have been documented. For example, Lazo et al. identified evidence of an interaction between the presence of the PNPLA3 G allele and the association between moderate alcohol consumption and hepatic steatosis, demonstrating that the effect of genotype depended on level of alcohol consumption [43]. Furthermore, Cherubini et al. identified a robust interaction between the PNPLA3 p.I148M variant and female sex, providing evidence that epistasis contributes to the genetic architecture of common traits such as MASLD [44].
In addition, there have been interesting studies focusing on gene‐diet interaction effect(s), for example, a recent study assessing a gene‐diet interaction between rs738409, nutrient intake and severity of liver histology [45]. Vilar‐Gomez et al. [45] showed that the PNPLA3 rs738409 G‐allele may modulate the effect of specific dietary nutrients on the risk of fibrosis in patients with NAFLD. Other human studies have investigated the direct effect of PNPLA3 rs738409 on the development of liver fibrosis in relation to liver histological features. In particular, a large proportion of the indirect effect of rs738409 on fibrosis severity was reported to be mediated by portal inflammation [46].
Recent studies have highlighted the influence of genetic variants, including variants influencing risk and protection against MASLD histological severity, including rs738409, on liver microbial DNA composition [47]. For example, Pirola et al. [47] found that members of the Gammaproteobacteria class were significantly enriched in carriers of the rs738409, including the genera Enterobacter.
Additionally, intriguing interactions have been observed between PNPLA3 rs738409 and a loss‐of‐function variant in HSD17B13 (rs72613567), which is linked to a decreased susceptibility to chronic liver disease and the progression from steatosis to steatohepatitis. For example, the rs72613567:TA variant mitigated liver injury associated with the risk‐increasing PNPLA3 I148M allele and resulted in an unstable and truncated protein with reduced enzymatic activity in a sample of 46 544 participants in the DiscovEHR study [48]. Furthermore, the analysis of a potential PNPLA3‐HSD17B13 interaction assessed in 1153 non‐Hispanic whites with biopsy‐proven nonalcoholic fatty liver disease enrolled in the nonalcoholic steatohepatitis. Clinical Research Network studies indicated that the protective impact of the HSD17B13 rs72613567 TA‐allele on the likelihood of developing steatohepatitis and fibrosis may be constrained to specific demographic subgroups, including individuals aged 45 years and above, females, those with class 2 obesity or diabetes, and those with the PNPLA3 rs738409 CC genotype [49]. In Japanese patients with MASLD, carriage of the HSD17B13 rs6834314‐G allele (a variant in linkage disequilibrium with rs72613567) had the effect of moderating the impact of the PNPLA3 rs738409 GG genotype on the development of advanced hepatic fibrosis [50]. Finally, a study focused on chronic hepatitis C showed that carriage of the combination PNPLA3 minor allele and HSD17B13 major allele may represent a risk factor for HCC among HCV‐infected patients [51].
4. Molecular Impact and Variant Functionality
The elucidation of the mechanism linking genetic variants to diseases has been facilitated by the widespread application of expression quantitative trait loci (eQTLs) analysis. This analysis has enabled the identification of causative variants that colocalise with GWAS variants, thus providing explanations for some of the observed relationships.
The analysis of gene expression regulation using eQTLs can be addressed by exploring cis‐eQTLs or trans‐eQTLs, which are classified depending on their genomic location [52]. Cis‐eQTLs affect local genes, whereas trans‐eQTLs affect distant genes, even on different chromosomes.
Although eQTLs have been extensively employed for the identification of disease‐associated genetic variants, they only account for 20%–50% of disease association variants. Cis‐eQTLs, where gene expression levels are influenced by a gene‐proximal single nucleotide polymorphism (SNP) located within 1 megabase (Mb), have been widely employed for this purpose. Nevertheless, cis‐eQTLs typically account for a relatively small proportion of disease heritability, which suggests the existence of additional pathways to disease. Trans‐eQTLs typically exhibit reduced effect sizes in comparison to cis‐eQTLs, necessitating greater statistical power to detect them effectively. This is because the SNP in question is typically situated at a greater genetic distance (exceeding 5 megabases) from the gene in question or on a different chromosome [53].
Furthermore, quantitative trait loci (QTLs) can influence not only gene expression but also gene splicing. Splicing quantitative trait loci (sQTLs) are significantly enriched in functional elements of the genome [52]. Collectively, genetic variants that influence gene expression offer a potential molecular rationale for their observed effects.
We conducted a quantitative trait locus (QTL) analysis to identify associations between rs738409 and various phenotypic traits. This analysis employed a range of resources, including FIVEx [54], an interactive eQTL/sQTL browser accessible at https://fivex.sph.umich.edu, and the QTLbase, which compiles and curates genome‐wide summary statistics for human molecular traits across > 70 tissue/cell types. Figure 2 shows significant cis‐eQTLs associated with the rs738409 (22:43928847_C/G), including tissue, sample size, effect, and p‐value. A comprehensive list of QTLs is shown in Table S2. Of note, there are two significantly associated blood mQTLs (methylation quantitative trait locus), five sQTL (splicing quantitative trait locus) one located in PNPLA3 and four in SAMM50, six blood pQTLS (protein quantitative trait locus) in nearby loci (P17405 [SMPD1]; Q02083 [NAAA]; P53365 [ARFIP2]; P07686 [HEXB]; P00326 [ADH1C]; and Q9NZK5 [ADA2]), two edQTL (RNA editing quantitative trait locus) in the liver, one lncQTL (lncRNA expression quantitative trait locus) in thyroid gland and nine stem cell‐iPSC tuQTL (transcript usage quantitative trait locus). The rs6006460, on the other hand, only has two blood‐monocyte eQTLs in SAMM50 and one mQTL in brain‐prefrontal cortex (Table S2).
FIGURE 2.

cis‐eQTLs associated with variant: rs738409 (22:43928847_C/G) and various phenotypic traits. This analysis employed a range of resources, including FIVEx [54], and the QTLbase.
In addition to the effect of the variant on liver phenotypes, the presence of these QTLs may explain pleiotropic effects on extrahepatic traits [55, 56, 57]. For instance, blood eQTLS and mQTLs might explain the potential involvement of rs738409 in COVID‐19 outcomes [58, 59, 60, 61]. There are many other examples of diseases associated with rs738409 or other variants in PNPLA3, for example, kidney diseases [62] or even rare conditions such as cystathioninuria and microcephaly‐congenital cataract‐psoriasiform dermatitis syndrome among many other diseases [56].
The rs738409 has been shown to be associated with immune‐related traits [56], including plasma soluble intercellular adhesion molecule‐1 (sICAM‐1) levels in a large GWAS of 22 435 healthy women from the Women's Genome Health Study [63].
Finally, a recent report identified a colocalisation between a GWAS signal for type 2 diabetes and the missense lead variant rs738409, as well as a trans‐pQTL for ADP ribosylation factor interacting protein 2 (ARFIP2) [64]. This latter protein is strongly downregulated in individuals with type 2 diabetes, as reported by Gudjonsson et al. [64] in the AGES study. This observation prompts a number of inquiries, including the manner by which a missense variant in PNPLA3 affects the circulatory levels of ARFIP2, the role of ARFIP2 in the interpretation of PNPLA3 function and the relationship of ARFIP2 to type 2 diabetes.
Therefore, not only a complete understanding of the liver‐related traits associated with rs738409 is required but also a thorough understanding of the gene and protein interactions, the active protein ligands and, most importantly, an accurate and comprehensive assessment of the impact of the pleiotropic effects of the variant.
5. Conclusion, Unanswered Questions and Future Research Opportunities
In conclusion, PNPLA3 I148M is the most common and impactful risk variant for progressive MASLD and likely other forms of steatotic liver disease. It affects all regions and contributes significantly to global variation in MASLD. The effect of PNPLA3 I148M is magnified by obesity; thus, its impact on burden of MASLD may be even greater if the obesity rates continue to rise. Despite the well‐established role of PNPLA3 I148M in chronic liver disease, several unanswered questions remain.
One puzzling question is how a variant of such high impact on disease risk became so common.
The effect size of PNPLA3 rs738409 is unusually large given its frequency, compared to most other common variants identified in GWAS, and contradicts population genetic theories that posit that variants of large effects are kept at a low frequency as a result of negative selection [65]. This begs a question of whether the risk allele conferred a selective advantage in cold climates faced by early humans. One recent study found no evidence of selection in the last 10 000 years of human history [66], but this conclusion remains to be confirmed.
Another area requiring further study is whether rs738409 can be an explanation for extrahepatic manifestations of MASLD and MASH. In fact, this variant yields disparate outcomes contingent on the specific tissue under examination. The rs738409 cis‐eQTLs in the aortic and coronary arteries have been associated with positive effects, whereas other cis‐eQTLs in the skin and brain have been linked to negative effects (Figure 2). Furthermore, it would be of interest to ascertain whether the QTLs of rs738409 exert an influence on the global prevalence and incidence of MASLD.
Finally, given the high prevalence and effect size of PNPLA3 I148M, one might ask whether genetic testing should be considered as part of patient management, as the field moves towards precision medicine [67].
Conflicts of Interest
The authors declare no conflicts of interest.
Supporting information
TABLE S1. Attributable fraction of PNPLA3 rs738409 I148M on liver disease.
TABLE S2. QTLs associated with rs738409 and rs6006460.
FIGURE S1. Worldwide allele frequency distribution of PNPLA3 rs6006460 S453I. Allele frequencies are based on data from the 1000 Genomes Project and gnomAD database, downloaded from NCBI dbSNP: https://www.ncbi.nlm.nih.gov/snp/rs6006460. Estimates of SLD prevalence derived from recent meta‐analyses and population‐based studies [1, 26–31, 33].
Handling Editor: Luca Valenti
Funding: This work was supported by Agencia Nacional de Promoción Científica y Tecnológica Argentina, FONCyT (Grants PICT 2018‐889 and PICT 2019‐0528).
Contributor Information
Julia Kozlitina, Email: julia.kozlitina@utsouthwestern.edu.
Silvia Sookoian, Email: ssookoian@intramed.net.
Data Availability Statement
All data used in the manuscript are publicly available from cited online databases or included in Supporting Information.
References
- 1. Younossi Z. M., Golabi P., Paik J. M., Henry A., Van Dongen C., and Henry L., “The Global Epidemiology of Nonalcoholic Fatty Liver Disease (NAFLD) and Nonalcoholic Steatohepatitis (NASH): A Systematic Review,” Hepatology 77 (2023): 1335–1347, 10.1097/HEP.0000000000000004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Quek J., Chan K. E., Wong Z. Y., et al., “Global Prevalence of Non‐Alcoholic Fatty Liver Disease and Non‐Alcoholic Steatohepatitis in the Overweight and Obese Population: A Systematic Review and Meta‐Analysis,” Lancet Gastroenterology & Hepatology 8 (2023): 20–30, 10.1016/S2468-1253(22)00317-X. [DOI] [PubMed] [Google Scholar]
- 3. Younossi Z. M., Golabi P., de Avila L., et al., “The Global Epidemiology of NAFLD and NASH in Patients With Type 2 Diabetes: A Systematic Review and Meta‐Analysis,” Journal of Hepatology 71 (2019): 793–801, 10.1016/j.jhep.2019.06.021. [DOI] [PubMed] [Google Scholar]
- 4. Estes C., Anstee Q. M., Arias‐Loste M. T., et al., “Modeling NAFLD Disease Burden in China, France, Germany, Italy, Japan, Spain, United Kingdom, and United States for the Period 2016–2030,” Journal of Hepatology 69 (2018): 896–904, 10.1016/j.jhep.2018.05.036. [DOI] [PubMed] [Google Scholar]
- 5. Sookoian S. and Pirola C. J., “Genetic Predisposition in Nonalcoholic Fatty Liver Disease,” Clinical and Molecular Hepatology 23 (2017): 1–12, 10.3350/cmh.2016.0109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Yip T. C., Vilar‐Gomez E., Petta S., et al., “Geographical Similarity and Differences in the Burden and Genetic Predisposition of NAFLD,” Hepatology 77 (2023): 1404–1427, 10.1002/hep.32774. [DOI] [PubMed] [Google Scholar]
- 7. Browning J. D., Szczepaniak L. S., Dobbins R., et al., “Prevalence of Hepatic Steatosis in an Urban Population in the United States: Impact of Ethnicity,” Hepatology 40 (2004): 1387–1395, 10.1002/hep.20466. [DOI] [PubMed] [Google Scholar]
- 8. Caldwell S. H., Harris D. M., Patrie J. T., and Hespenheide E. E., “Is NASH Underdiagnosed Among African Americans?,” American Journal of Gastroenterology 97 (2002): 1496–1500, 10.1111/j.1572-0241.2002.05795.x. [DOI] [PubMed] [Google Scholar]
- 9. Browning J. D., Kumar K. S., Saboorian M. H., and Thiele D. L., “Ethnic Differences in the Prevalence of Cryptogenic Cirrhosis,” American Journal of Gastroenterology 99 (2004): 292–298. [DOI] [PubMed] [Google Scholar]
- 10. Romeo S., Kozlitina J., Xing C., et al., “Genetic Variation in PNPLA3 Confers Susceptibility to Nonalcoholic Fatty Liver Disease,” Nature Genetics 40 (2008): 1461–1465, 10.1038/ng.257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Sookoian S. and Pirola C. J., “Meta‐Analysis of the Influence of I148M Variant of Patatin‐Like Phospholipase Domain Containing 3 Gene (PNPLA3) on the Susceptibility and Histological Severity of Nonalcoholic Fatty Liver Disease,” Hepatology 53 (2011): 1883–1894, 10.1002/hep.24283. [DOI] [PubMed] [Google Scholar]
- 12. Trepo E., Nahon P., Bontempi G., et al., “Association Between the PNPLA3 (rs738409 C>G) Variant and Hepatocellular Carcinoma: Evidence from a Meta‐Analysis of Individual Participant Data,” Hepatology 59 (2014): 2170–2177, 10.1002/hep.26767. [DOI] [PubMed] [Google Scholar]
- 13. Emdin C. A., Haas M., Ajmera V., et al., “Association of Genetic Variation With Cirrhosis: A Multi‐Trait Genome‐Wide Association and Gene‐Environment Interaction Study,” Gastroenterology 160 (2021): 1620–1633.e13, 10.1053/j.gastro.2020.12.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Tian C., Stokowski R. P., Kershenobich D., Ballinger D. G., and Hinds D. A., “Variant in PNPLA3 is Associated With Alcoholic Liver Disease,” Nature Genetics 42 (2010): 21–23, 10.1038/ng.488. [DOI] [PubMed] [Google Scholar]
- 15. Stickel F., Buch S., Lau K., et al., “Genetic Variation in the PNPLA3 Gene is Associated With Alcoholic Liver Injury in Caucasians,” Hepatology 53 (2011): 86–95, 10.1002/hep.24017. [DOI] [PubMed] [Google Scholar]
- 16. Valenti L., Rumi M., Galmozzi E., et al., “Patatin‐Like Phospholipase Domain‐Containing 3 I148M Polymorphism, Steatosis, and Liver Damage in Chronic Hepatitis C,” Hepatology 53 (2011): 791–799, 10.1002/hep.24123. [DOI] [PubMed] [Google Scholar]
- 17. Vigano M., Valenti L., Lampertico P., et al., “Patatin‐Like Phospholipase Domain‐Containing 3 I148M Affects Liver Steatosis in Patients With Chronic Hepatitis B,” Hepatology 58 (2013): 1245–1252, 10.1002/hep.26445. [DOI] [PubMed] [Google Scholar]
- 18. Meffert P. J., Repp K. D., Volzke H., et al., “The PNPLA3 SNP rs738409:G Allele is Associated With Increased Liver Disease‐Associated Mortality but Reduced Overall Mortality In a Population‐Based Cohort,” Journal of Hepatology 68 (2018): 858–860, 10.1016/j.jhep.2017.11.038. [DOI] [PubMed] [Google Scholar]
- 19. Carlsson B., Linden D., Brolen G., et al., “Review Article: The Emerging Role of Genetics in Precision Medicine for Patients With Non‐Alcoholic Steatohepatitis,” Alimentary Pharmacology & Therapeutics 51 (2020): 1305–1320, 10.1111/apt.15738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Krawczyk M., Stokes C. S., Romeo S., and Lammert F., “HCC and Liver Disease Risks in Homozygous PNPLA3 p.I148M Carriers Approach Monogenic Inheritance,” Journal of Hepatology 62 (2015): 980–981, 10.1016/j.jhep.2014.10.048. [DOI] [PubMed] [Google Scholar]
- 21. Jamialahmadi O., Mancina R. M., Ciociola E., et al., “Exome‐Wide Association Study on Alanine Aminotransferase Identifies Sequence Variants in the GPAM and APOE Associated With Fatty Liver Disease,” Gastroenterology 160 (2021): 1634–1646.e7, 10.1053/j.gastro.2020.12.023. [DOI] [PubMed] [Google Scholar]
- 22. Younossi Z. M., Wong G., Anstee Q. M., and Henry L., “The Global Burden of Liver Disease,” Clinical Gastroenterology and Hepatology 21 (2023): 1978–1991, 10.1016/j.cgh.2023.04.015. [DOI] [PubMed] [Google Scholar]
- 23. Younossi Z. M., Golabi P., Paik J., et al., “Prevalence of Metabolic Dysfunction‐Associated Steatotic Liver Disease in the Middle East and North Africa,” Liver International 44 (2024): 1061–1070, 10.1111/liv.15852. [DOI] [PubMed] [Google Scholar]
- 24. Perazzo H., Pacheco A. G., Griep R. H., Gracindo R., Goulart A. C., and da Fonseca M. D., “Changing from NAFLD Through MAFLD to MASLD: Similar Prevalence and Risk Factors in a Large Brazilian Cohort,” Journal of Hepatology 80 (2024): e72–e74, 10.1016/j.jhep.2023.08.025. [DOI] [PubMed] [Google Scholar]
- 25. Younossi Z., Anstee Q. M., Marietti M., et al., “Global Burden of NAFLD and NASH: Trends, Predictions, Risk Factors and Prevention,” Nature Reviews Gastroenterology & Hepatology 15 (2018): 11–20, 10.1038/nrgastro.2017.109. [DOI] [PubMed] [Google Scholar]
- 26. Fleischman M. W., Budoff M., Zeb I., Li D., and Foster T., “NAFLD Prevalence Differs Among Hispanic Subgroups: The Multi‐Ethnic Study of Atherosclerosis,” World Journal of Gastroenterology 20 (2014): 4987–4993, 10.3748/wjg.v20.i17.4987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Kallwitz E. R., Daviglus M. L., Allison M. A., et al., “Prevalence of Suspected Nonalcoholic Fatty Liver Disease in Hispanic/Latino Individuals Differs by Heritage,” Clinical Gastroenterology and Hepatology 13 (2015): 569–576, 10.1016/j.cgh.2014.08.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Younossi Z., Tacke F., Arrese M., et al., “Global Perspectives on Nonalcoholic Fatty Liver Disease and Nonalcoholic Steatohepatitis,” Hepatology 69 (2019): 2672–2682, 10.1002/hep.30251. [DOI] [PubMed] [Google Scholar]
- 29. Sudmant P. H., Rausch T., Gardner E. J., et al., “An Integrated Map of Structural Variation in 2504 Human Genomes,” Nature 526 (2015): 75–81, 10.1038/nature15394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Chen S., Francioli L. C., Goodrich J. K., et al., “A Genomic Mutational Constraint Map Using Variation in 76,156 Human Genomes,” Nature 625 (2024): 92–100, 10.1038/s41586-023-06045-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Bryc K., Durand E. Y., Macpherson J. M., Reich D., and Mountain J. L., “The Genetic Ancestry of African Americans, Latinos, and European Americans across the United States,” American Journal of Human Genetics 96 (2015): 37–53, 10.1016/j.ajhg.2014.11.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Lazo M., Xie J., Alvarez C. S., et al., “Frequency of the PNPLA3 rs738409 Polymorphism and Other Genetic Loci for Liver Disease in a Guatemalan Adult Population,” Liver International 42 (2022): 1470–1474, 10.1111/liv.15268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Fernandes D. M., Pantangi V., Azam M., et al., “Pediatric Nonalcoholic Fatty Liver Disease in New York City: An Autopsy Study,” Journal of Pediatrics 200 (2018): 174–180, 10.1016/j.jpeds.2018.04.047. [DOI] [PubMed] [Google Scholar]
- 34. Kallwitz E. R., Tayo B. O., Kuniholm M. H., et al., “American Ancestry is a Risk Factor for Suspected Nonalcoholic Fatty Liver Disease in Hispanic/Latino Adults,” Clinical Gastroenterology and Hepatology 17 (2019): 2301–2309, 10.1016/j.cgh.2019.02.007. [DOI] [PubMed] [Google Scholar]
- 35. Witte J. S., Visscher P. M., and Wray N. R., “The Contribution of Genetic Variants to Disease Depends on the Ruler,” Nature Reviews Genetics 15 (2014): 765–776, 10.1038/nrg3786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Palmer N. D., Musani S. K., Yerges‐Armstrong L. M., et al., “Characterization of European Ancestry Nonalcoholic Fatty Liver Disease‐Associated Variants in Individuals of African and Hispanic Descent,” Hepatology 58 (2013): 966–975, 10.1002/hep.26440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Palmer N. D., Kahali B., Kuppa A., et al., “Allele‐Specific Variation at APOE Increases Nonalcoholic Fatty Liver Disease and Obesity but Decreases Risk of Alzheimer's Disease and Myocardial Infarction,” Human Molecular Genetics 30 (2021): 1443–1456, 10.1093/hmg/ddab096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Ghouse J., Sveinbjornsson G., Vujkovic M., et al., “Integrative Common and Rare Variant Analyses Provide Insights Into the Genetic Architecture of Liver Cirrhosis,” Nature Genetics 56 (2024): 827–837, 10.1038/s41588-024-01720-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Gellert‐Kristensen H., Richardson T. G., Davey Smith G., Nordestgaard B. G., Tybjaerg‐Hansen A., and Stender S., “Combined Effect of PNPLA3, TM6SF2, and HSD17B13 Variants on Risk of Cirrhosis and Hepatocellular Carcinoma in the General Population,” Hepatology 72 (2020): 845–856, 10.1002/hep.31238. [DOI] [PubMed] [Google Scholar]
- 40. Kubiliun M. J., Cohen J. C., Hobbs H. H., and Kozlitina J., “Contribution of a Genetic Risk Score to Ethnic Differences in Fatty Liver Disease,” Liver International 42 (2022): 2227–2236, 10.1111/liv.15322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Donati B., Motta B. M., Pingitore P., et al., “The rs2294918 E434K Variant Modulates Patatin‐Like Phospholipase Domain‐Containing 3 Expression and Liver Damage,” Hepatology 63 (2016): 787–798, 10.1002/hep.28370. [DOI] [PubMed] [Google Scholar]
- 42. Stender S., Kozlitina J., Nordestgaard B. G., Tybjaerg‐Hansen A., Hobbs H. H., and Cohen J. C., “Adiposity Amplifies the Genetic Risk of Fatty Liver Disease Conferred by Multiple Loci,” Nature Genetics 49 (2017): 842–847, 10.1038/ng.3855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Lazo M., Bilal U., Mitchell M. C., Potter J., Hernaez R., and Clark J. M., “Interaction Between Alcohol Consumption and PNPLA3 Variant in the Prevalence of Hepatic Steatosis in the US Population,” Clinical Gastroenterology and Hepatology 19 (2021): 2606–2614.e4, 10.1016/j.cgh.2020.08.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Cherubini A., Ostadreza M., Jamialahmadi O., et al., “Interaction Between Estrogen Receptor‐Alpha and PNPLA3 p.I148M Variant Drives Fatty Liver Disease Susceptibility in Women,” Nature Medicine 29 (2023): 2643–2655, 10.1038/s41591-023-02553-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Vilar‐Gomez E., Pirola C. J., Sookoian S., et al., “Impact of the Association Between PNPLA3 Genetic Variation and Dietary Intake on the Risk of Significant Fibrosis in Patients With NAFLD,” American Journal of Gastroenterology 116 (2021): 994–1006, 10.14309/ajg.0000000000001072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Vilar‐Gomez E., Pirola C. J., Sookoian S., Wilson L. A., Liang T., and Chalasani N., “PNPLA3 rs738409 and Risk of Fibrosis in NAFLD: Exploring Mediation Pathways Through Intermediate Histological Features,” Hepatology 76 (2022): 1482–1494, 10.1002/hep.32491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Pirola C. J., Salatino A., Quintanilla M. F., Castano G. O., Garaycoechea M., and Sookoian S., “The Influence of Host Genetics on Liver Microbiome Composition in Patients with NAFLD,” eBioMedicine 76 (2022): 103858, 10.1016/j.ebiom.2022.103858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Abul‐Husn N. S., Cheng X., Li A. H., et al., “A Protein‐Truncating HSD17B13 Variant and Protection from Chronic Liver Disease,” New England Journal of Medicine 378 (2018): 1096–1106, 10.1056/NEJMoa1712191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Vilar‐Gomez E., Pirola C. J., Sookoian S., Wilson L. A., Liang T., and Chalasani N., “The Protection Conferred by HSD17B13 rs72613567 Polymorphism on Risk of Steatohepatitis and Fibrosis May Be Limited to Selected Subgroups of Patients With NAFLD,” Clinical and Translational Gastroenterology 12 (2021): e00400, 10.14309/ctg.0000000000000400. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Seko Y., Yamaguchi K., Tochiki N., et al., “Attenuated Effect of PNPLA3 on Hepatic Fibrosis by HSD17B13 in Japanese Patients With Non‐Alcoholic Fatty Liver Disease,” Liver International 40 (2020): 1686–1692, 10.1111/liv.14495. [DOI] [PubMed] [Google Scholar]
- 51. De Benedittis C., Bellan M., Crevola M., et al., “Interplay of PNPLA3 and HSD17B13 Variants in Modulating the Risk of Hepatocellular Carcinoma among Hepatitis C Patients,” Gastroenterology Research and Practice 2020 (2020): 4216451, 10.1155/2020/4216451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Consortium GT , “The GTEx Consortium atlas of Genetic Regulatory Effects Across Human Tissues,” Science 369 (2020): 1318–1330, 10.1126/science.aaz1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Vosa U., Claringbould A., Westra H. J., et al., “Large‐Scale Cis‐ and Trans‐eQTL Analyses Identify Thousands of Genetic Loci and Polygenic Scores That Regulate Blood Gene Expression,” Nature Genetics 53 (2021): 1300–1310, 10.1038/s41588-021-00913-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Kwong A., Boughton A. P., Wang M., et al., “FIVEx: An Interactive eQTL Browser Across Public Datasets,” Bioinformatics 38 (2022): 559–561, 10.1093/bioinformatics/btab614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Pirola C. J., Salatino A., and Sookoian S., “Pleiotropy Within Gene Variants Associated With Nonalcoholic Fatty Liver Disease and Traits of the Hematopoietic System,” World Journal of Gastroenterology 27 (2021): 305–320, 10.3748/wjg.v27.i4.305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Pirola C. J. and Sookoian S., “COVID‐19 and Non‐Alcoholic Fatty Liver Disease: Biological Insights from Multi‐Omics Data,” Liver International 43 (2023): 580–587, 10.1111/liv.15509. [DOI] [PubMed] [Google Scholar]
- 57. Chen H., Wang T., Yang J., Huang S., and Zeng P., “Improved Detection of Potentially Pleiotropic Genes in Coronary Artery Disease and Chronic Kidney Disease Using GWAS Summary Statistics,” Frontiers in Genetics 11 (2020): 592461, 10.3389/fgene.2020.592461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Grimaudo S., Amodio E., Pipitone R. M., et al., “PNPLA3 and TLL‐1 Polymorphisms as Potential Predictors of Disease Severity in Patients With COVID‐19,” Frontiers in Cell and Development Biology 9 (2021): 627914, 10.3389/fcell.2021.627914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Innes H., Buch S., Barnes E., Hampe J., Marjot T., and Stickel F., “The rs738409 G Allele in PNPLA3 is Associated With a Reduced Risk of COVID‐19 Mortality and Hospitalization,” Gastroenterology 160 (2021): 2599–2601.e2, 10.1053/j.gastro.2021.02.059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Pirola C. J. and Sookoian S., “PNPLA3 and COVID‐19 Outcomes: Thinking Outside the Box Might Explain the Biology Behind Pleiotropic Effects of rs738409 on the Immune System,” Liver International 41 (2021): 2801–2804, 10.1111/liv.15043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Valenti L., Jamialahmadi O., and Romeo S., “Lack of Genetic Evidence That Fatty Liver Disease Predisposes to COVID‐19,” Journal of Hepatology 73 (2020): 709–711, 10.1016/j.jhep.2020.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Mantovani A., Pelusi S., Margarita S., et al., “Adverse Effect of PNPLA3 p.I148M Genetic Variant on Kidney Function in Middle‐Aged Individuals With Metabolic Dysfunction,” Alimentary Pharmacology & Therapeutics 57 (2023): 1093–1102, 10.1111/apt.17477. [DOI] [PubMed] [Google Scholar]
- 63. Pare G., Ridker P. M., Rose L., et al., “Genome‐Wide Association Analysis of Soluble ICAM‐1 Concentration Reveals Novel Associations at the NFKBIK, PNPLA3, RELA, and SH2B3 loci,” PLoS Genetics 7 (2011): e1001374, 10.1371/journal.pgen.1001374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Gudjonsson A., Gudmundsdottir V., Axelsson G. T., et al., “A Genome‐Wide Association Study of Serum Proteins Reveals Shared Loci With Common Diseases,” Nature Communications 13 (2022): 480, 10.1038/s41467-021-27850-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Manolio T. A., Collins F. S., Cox N. J., et al., “Finding the missing heritability of complex diseases,” Nature 461 (2009): 747–753, 10.1038/nature08494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Geier A., Trost J., Wang K., Schmid C., Krawczyk M., and Schiffels S., “PNPLA3 Fatty Liver Allele was Fixed in Neanderthals and Segregates Neutrally in Humans,” Gut 73 (2024): 1008–1014, 10.1136/gutjnl-2023-331594. [DOI] [PubMed] [Google Scholar]
- 67. Sookoian S., Rotman Y., and Valenti L., “Genetics of MASLD: The State of Art Update,” Clinical Gastroenterology and Hepatology 24 (2024): 1542–3565, 10.1016/j.cgh.2024.05.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
TABLE S1. Attributable fraction of PNPLA3 rs738409 I148M on liver disease.
TABLE S2. QTLs associated with rs738409 and rs6006460.
FIGURE S1. Worldwide allele frequency distribution of PNPLA3 rs6006460 S453I. Allele frequencies are based on data from the 1000 Genomes Project and gnomAD database, downloaded from NCBI dbSNP: https://www.ncbi.nlm.nih.gov/snp/rs6006460. Estimates of SLD prevalence derived from recent meta‐analyses and population‐based studies [1, 26–31, 33].
Data Availability Statement
All data used in the manuscript are publicly available from cited online databases or included in Supporting Information.
