Partitioned polygenic risk scores identify distinct types of metabolic dysfunction-associated steatotic liver disease

Oveis Jamialahmadi; Antonio De Vincentis; Federica Tavaglione; Francesco Malvestiti; Ruifang Li-Gao; Rosellina M Mancina; Marcus Alvarez; Kyla Gelev; Samantha Maurotti; Umberto Vespasiani-Gentilucci; Frits Richard Rosendaal; Julia Kozlitina; Päivi Pajukanta; François Pattou; Luca Valenti; Stefano Romeo

doi:10.1038/s41591-024-03284-0

. 2024 Dec 9;30(12):3614–3623. doi: 10.1038/s41591-024-03284-0

Partitioned polygenic risk scores identify distinct types of metabolic dysfunction-associated steatotic liver disease

Oveis Jamialahmadi ^1,^✉, Antonio De Vincentis ^2,³, Federica Tavaglione ^4,⁵, Francesco Malvestiti ⁶, Ruifang Li-Gao ⁷, Rosellina M Mancina ^1,^5,⁸, Marcus Alvarez ⁹, Kyla Gelev ⁹, Samantha Maurotti ¹⁰, Umberto Vespasiani-Gentilucci ^4,⁵, Frits Richard Rosendaal ⁷, Julia Kozlitina ¹¹, Päivi Pajukanta ^9,^12,¹³, François Pattou ^14,¹⁵, Luca Valenti ^6,¹⁶, Stefano Romeo ^1,^17,^18,^19,^20,^✉

¹Department of Molecular and Clinical Medicine, Institute of Medicine, Sahlgrenska Academy, Wallenberg Laboratory, University of Gothenburg, Gothenburg, Sweden

²Operative Unit of Internal Medicine, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, Italy

³Research Unit of Internal Medicine, Department of Medicine and Surgery, Università Campus Bio-Medico di Roma, Rome, Italy

⁴Operative Unit of Clinical Medicine and Hepatology, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, Italy

⁵Research Unit of Clinical Medicine and Hepatology, Department of Medicine and Surgery, Università Campus Bio-Medico di Roma, Rome, Italy

⁶Department of Pathophysiology and Transplantation, Università degli Studi di Milano, Milan, Italy

⁷Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, the Netherlands

⁸Department of Life Science, Health, and Health Professions, Link Campus University, Rome, Italy

⁹Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA USA

¹⁰Department of Experimental and Clinical Medicine, Magna Graecia University, Catanzaro, Italy

¹¹The Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX USA

¹²Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA USA

¹³Institute for Precision Health, David Geffen School of Medicine at UCLA, Los Angeles, CA USA

¹⁴Service de chirurgie générale et endocrinienne, Centre Hospitalier Universitaire de Lille, Lille, France

¹⁵European Genomic Institute for Diabetes, UMR 1190 Translational Research for Diabetes, Inserm, CHU Lille, University of Lille, Lille, France

¹⁶Precision Medicine - Biological Resource Center, Department of Transfusion Medicine, Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, Milan, Italy

¹⁷Department of Cardiology, Sahlgrenska University Hospital, Gothenburg, Sweden

¹⁸Clinical Nutrition Unit, Department of Medical and Surgical Sciences, University Magna Graecia, Catanzaro, Italy

¹⁹Department of Medicine (H7), Karolinska Institute, Huddinge, Stockholm, Sweden

²⁰Department of Endocrinology, Karolinska University Hospital, Huddinge, Stockholm, Sweden

^✉

Corresponding author.

PMCID: PMC11645285 PMID: 39653778

Abstract

Metabolic dysfunction-associated steatotic liver disease (MASLD) is characterized by an excess of lipids, mainly triglycerides, in the liver and components of the metabolic syndrome, which can lead to cirrhosis and liver cancer. While there is solid epidemiological evidence that MASLD clusters with cardiometabolic disease, several leading genetic risk factors for MASLD do not increase the risk of cardiovascular disease, suggesting no causal relationship between MASLD and cardiometabolic derangement. In this work, we leveraged measurements of visceral adiposity identifying 27 previously unknown genetic loci associated with MASLD (n = 36,394), six replicated in four independent cohorts (n = 3,903). Next, we generated two partitioned polygenic risk scores based on the presence of lipoprotein retention in the liver. The two polygenic risk scores suggest the presence of at least two distinct types of MASLD, one confined to the liver resulting in a more aggressive liver disease and one that is systemic and results in a higher risk of cardiometabolic disease. These findings shed light on the heterogeneity of MASLD and have the potential to improve the prediction of clinical trajectories and inform precision medicine approaches.

Subject terms: Risk factors, Genome-wide association studies

The application of partitioned polygenic scores to data from cohorts of patients identifies two distinct subtypes of MASLD with different cardiometabolic risk profiles.

Main

Paralleling the obesity epidemic, steatotic liver disease (SLD) is a growing burden worldwide. SLD includes a spectrum of conditions characterized by an excess of lipids, mainly triglycerides, stored in intracellular lipid droplets in the liver, potentially progressing to inflammation, fibrosis and ultimately to cirrhosis and liver cancer¹. SLD is a heterogenous disease coexisting with a metabolic derangement, including visceral adiposity, insulin resistance and hypertension, namely, metabolic dysfunction-associated SLD (or MASLD). This metabolic derangement ultimately increases the risk of cardiovascular events, including heart failure, and also increases kidney disease^2–4. Indeed, cardiovascular disease is the most frequent cause of death in individuals with MASLD, whereas liver-related death is less frequent; however, it is a common clinical observation that some individuals develop a rapidly progressing liver disease despite similar or even less-marked metabolic derangement.

MASLD has a strong inherited component; several variants that increase primarily liver lipids by impairing hepatocyte lipid droplet remodeling and lipoprotein secretion also cause the progression of MASLD⁵; however, contrarily to the epidemiological evidence, these variants result in a protection against cardiovascular disease and no association with hypertension^5–7 or heart failure, suggesting no causal relationship between MASLD and cardiometabolic derangement⁵.

Over the last 15 years, genome-wide association studies (GWAS) identified several genetic loci associated with chronic liver disease or proxies for increased liver triglyceride content^8–13. Excess in adiposity amplifies the effect size of a handful of variants¹⁴ likely by increasing ectopic visceral fat. To improve the precision of genetic studies and to identify genetic variants with primary effects on the liver, independent of adiposity, GWAS analyses are typically adjusted for body mass index (BMI); however, anthropometric measures of adiposity (BMI) and body fat distribution (waist circumference) fail to provide an accurate quantification of visceral adiposity, which is most closely related to insulin resistance and metabolic alterations. Therefore, standard adjustments for BMI may fail to capture and remove the total effect of adiposity on liver fat, limiting the precision of GWAS. In contrast, imaging (for example, visceral adipose volume) and bioelectrical impedance analysis (for example, whole-body fat mass) are more accurate measurements of body composition and are better predictors of MASLD¹⁵. We thus reasoned that adjusting for these traits could better capture the effect of adiposity on liver fat, thereby improving the power to detect previously unknown loci contributing to SLD.

Here, we show that indices of adiposity differentially contribute to the association between genetic variants and liver triglyceride content/inflammation, and we leverage these indices to identify previously unknown genetic loci associated with SLD. We identified and replicated six previously unknown loci and generated two partitioned polygenic risk scores (pPRSs) that suggest the presence of at least two distinct types of MASLD, one confined to the liver and one entwined in the systemic cardiometabolic syndrome.

Results

Visceral adipose tissue, whole-body fat mass and BMI are independent predictors of liver triglyceride content and inflammation/fibrosis

To identify the independent predictors of liver triglyceride content and inflammation/fibrosis among the indices of adiposity, we examined the pairwise correlations among different measures of adiposity and (1) liver triglyceride content measured by magnetic resonance imaging (MRI)-derived proton density fat fraction (PDFF); and (2) liver inflammation/fibrosis measured by liver iron corrected T1 (cT1) in European participants from the UK Biobank (Extended Data Fig. 1a). The strongest correlation with liver traits was observed for visceral adipose tissue (VAT) volume followed by BMI, waist-to-hip ratio (WHR) and whole-body fat mass (WFM). As expected, there was a high correlation between PDFF and cT1. Due to high multicollinearity among the adiposity indices, we used three penalized regression models (Methods) to assess their predictive contribution to PDFF and cT1. The standardized coefficients from the best performing algorithm (Ridge regression) showed that VAT was the strongest independent predictor of PDFF and cT1, followed by WFM and BMI for PDFF (Extended Data Fig. 1b). In the penalized regression analysis, WHR and impedance of whole body had almost no independent predictive power and therefore, we used WFM, BMI and VAT as covariates in the genetic association studies.

Extended Data Fig. 1 — In (a), the phenotypic correlation between different measures of adiposity, liver triglyceride content measured by proton density fat fraction (PDFF), and inflammation/fibrosis measured by liver iron corrected T1 (cT1); pairwise Spearman’s correlation coefficients have been shown on the heatmap. All correlations had a Benjamini–Hochberg False Discovery Rate (FDR) <0.05. (b) penalized Ridge regression analysis of different adiposity indices in predicting PDFF and liver iron corrected T1. Each dot represents standardized coefficients, and dashed line represents the lack of contribution of each trait to the liver outcomes. Both target variables were rank-based inverse normal transformed before the regression analysis. VAT: visceral adipose tissue, WFM: whole-body fat mass, BMI: body mass index, IWB: impedance of whole body, WHR: waist-to-hip ratio.

Identification of 17 previously unknown loci for liver triglyceride content and 9 for liver inflammation/fibrosis by the multi-adiposity-adjusted GWAS

To capitalize on the independent contribution of indices of adiposity to proxies of liver triglyceride content (PDFF) and inflammation/fibrosis (cT1), we conducted eight GWAS (four GWAS per each trait), together referred to as the multi-adiposity-adjusted GWAS. Each GWAS was adjusted for a specific index of adiposity (VAT, BMI and WFM), and one unadjusted (Supplementary Table 1)¹⁶. Genetic heritability estimates for PDFF showed that adjustment for adiposity index explains up to 6% more heritability compared to the unadjusted model (Supplementary Table 2). For cT1, all adiposity measurements yielded similar results to the unadjusted model. These data suggest that liver triglyceride content is dependent on adiposity, whereas inflammation is less correlated. Furthermore, genetic correlations among different adiposity adjustments showed that BMI and WFM adjustments shared the largest overlap for both PDFF and cT1 (Fig. 1a and Supplementary Table 3), consistent with the epidemiological correlation.

Fig. 1 — a, Genetic correlation among different multi-adiposity-adjusted PDFF and liver iron corrected T1 was estimated using LD score regression analysis. The asterisks denote Benjamini–Hochberg false discovery rate (FDR) <0.05. The color bar represents the genetic correlation values. Detailed summary statistics for genetic correlations have been reported in Supplementary Table 3. b, Circular Manhattan plot of PDFF and liver iron corrected T1 for different adiposity adjustments. The association analyses were performed using REGENIE adjusting for adiposity index, age, sex, age × sex, age² and age²× sex, first ten genomic principal components and array batch. Each dot represents an independent genetic locus. Yellow represents loci associated with liver PDFF and purple represents those associated with liver cT1. Large dots represent pleiotropic loci (where the association with either PDFF or liver cT1 was shared among two or more adiposity adjustments). Small dots show adiposity-trait specific associations. Loci in bold are shared among both traits irrespective of the adiposity adjustment. Only loci with a genome-wide significant P <5 × 10⁻⁸ calculated by a whole-genome regression model (Methods) are shown. P values were two-sided and not corrected for multiple testing among four different models (unadjusted, adjusted for BMI, WFM and VAT).

Statistically independent genetic loci for each adiposity-adjusted GWAS were identified by linkage disequilibrium (LD) clumping¹⁷ and conditional analysis (Methods)¹⁸. Next, we performed pleiotropic analysis¹⁹ to identify independent genetic loci from the four adiposity adjustments. In this context, pleiotropic analysis refers to genetic loci that are shared among more than two adiposity adjustments for each liver trait (Supplementary Table 4). Specifically, we assigned the same locus number to lead variants from adiposity-adjusted GWAS (four for PDFF and four for cT1) located within 1 Mb of each other, provided that more than two GWAS lead variants were in LD (r²> 0.2). Finally, the strongest (lowest GWAS P value) association at each locus was selected as the independent lead variant for that trait (PDFF or cT1).

A total of 37 and 18 independent genetic loci for PDFF and cT1 reached the genome-wide significance level (P < 5 × 10⁻⁸ not adjusted for the number of GWAS carried out), respectively (Fig. 1b and Table 1). Multiple loci showed the strongest associations when adjusted for specific adiposity indices (Supplementary Table 5).

Table 1.

Genome-wide significant loci for multi-adiposity-adjusted PDFF and liver iron corrected T1 in the UK Biobank

Trait	CHR	POS	Variant ID	n	Consequence	A1	A2	A1Freq	β	s.e.m.	P	locus	MAGMA
PDFF (WFM)	1	110232983	rs140584594	34,367	missense_variant	A	G	0.267	−0.049	0.007	1.42 × 10⁻¹³	*GSTM1*
PDFF (BMI)	1	172323134	rs17277932	35,146	intron_variant	A	G	0.145	0.049	0.008	5.91 × 10⁻⁹	*DNM3*
PDFF (VAT)	1	220970028	rs2642438	33,540	missense_variant	A	G	0.296	−0.061	0.006	7.89 × 10⁻²⁴	MARC1	5.91 × 10⁻¹⁴
PDFF (BMI)	2	27730940	rs1260326	35,146	missense_variant	T	C	0.392	0.063	0.006	4.23 × 10⁻²⁵	GCKR	4.51 × 10⁻¹⁴
PDFF (WFM)	2	165501927	rs5835988	34,367	intergenic_variant	TG	T	0.408	−0.046	0.006	3.25 × 10⁻¹⁴	COBLL1	3.58 × 10⁻¹¹
PDFF (WFM)	2	227100579	2:227100579_TC_T	34,367	intergenic_variant	TC	T	0.348	−0.042	0.006	1.63 × 10⁻¹¹	IRS1
PDFF (WFM)	3	12393125	rs1801282	34,367	missense_variant	G	C	0.122	−0.059	0.009	6.61 × 10⁻¹¹	PPARG
PDFF (WFM)	3	25484121	rs79905393	34,367	intron_variant	A	G	0.036	−0.094	0.016	2.86 × 10⁻⁹	*RARB*
PDFF (WFM)	3	150066540	rs62271373	34,367	regulatory_region_variant	A	T	0.060	0.082	0.013	1.02 × 10⁻¹⁰	*TSC22D2*
PDFF (NA)	4	100239319	rs1229984	36,394	missense_variant	T	C	0.024	−0.165	0.023	1.26 × 10⁻¹²	ADH1B
PDFF (VAT)	4	100503761	rs11937107	33,540	intron_variant	T	C	0.252	−0.044	0.006	4.87 × 10⁻¹²	MTTP	8.20 × 10⁻¹²
PDFF (BMI)	4	100562374	rs36029295	35,146	intron_variant	GA	G	0.039	0.085	0.015	2.68 × 10⁻⁸	*RP11-766F14.2*
PDFF (VAT)	5	55808342	rs392794	33,540	intron_variant	C	T	0.254	−0.063	0.006	5.09 × 10⁻²³	AC022431.2	6.83 × 10⁻¹³
PDFF (BMI)	5	72951153	5:72951153_GT_G	35,146	intron_variant	G	GT	0.462	0.033	0.006	3.45 × 10⁻⁸	*ARHGEF28*
PDFF (BMI)	6	26093141	rs1800562	35,146	missense_variant	A	G	0.077	0.062	0.011	3.27 × 10⁻⁸	HFE
PDFF (BMI)	7	28172732	rs702814	35,146	intron_variant	C	T	0.492	0.035	0.006	3.73 × 10⁻⁹	JAZF1
PDFF (VAT)	7	65431686	rs6955582	33,540	intron_variant	A	G	0.451	−0.034	0.006	1.72 × 10⁻⁹	*GUSB*	1.03 × 10⁻⁸
PDFF (VAT)	8	25464670	rs73221948	33,540	intergenic_variant	T	G	0.294	0.046	0.006	3.05 × 10⁻¹³	*CDCA2*
PDFF (BMI)	8	126500031	rs28601761	35,146	intron_variant	G	C	0.420	−0.072	0.006	1.63 × 10⁻³²	TRIB1	4.72 × 10⁻⁷
PDFF (VAT)	9	132566666	rs7029757	33,540	non_coding_transcript_exon_variant	A	G	0.096	−0.063	0.009	3.15 × 10⁻¹¹	TOR1B
PDFF (VAT)	10	113910721	rs1129555	33,540	3_prime_UTR_variant	A	G	0.273	0.062	0.006	2.52 × 10⁻²³	GPAM	4.04 × 10⁻¹⁴
PDFF (VAT)	10	114024573	10:114024573_GA_G	33,540	intergenic_variant	G	GA	0.286	−0.039	0.006	2.24 × 10⁻¹⁰	*TECTB*	4.90 × 10⁻⁹
PDFF (WFM)	11	823586	rs140201358	34,367	missense_variant	G	C	0.015	0.147	0.025	2.35 × 10⁻⁹	PNPLA2
PDFF (VAT)	11	75456134	rs531117	33,540	downstream_gene_variant	T	C	0.157	−0.042	0.008	3.91 × 10⁻⁸	*MOGAT2*
PDFF (NA)	12	3685100	rs1985912	36,394	intron_variant	G	A	0.326	−0.043	0.008	2.67 × 10⁻⁸	*PRMT8*
PDFF (WFM)	12	124476873	rs12833624	34,367	intron_variant	T	C	0.337	−0.039	0.006	3.37 × 10⁻¹⁰	*FAM101A*	1.01 × 10⁻⁶
PDFF (VAT)	17	7116853	rs446994	33,540	intron_variant	C	A	0.422	0.033	0.006	3.43 × 10⁻⁹	*DLG4*	1.04 × 10⁻⁸
PDFF (NA)	17	17974014	rs11439486	36,394	downstream_gene_variant	T	TA	0.330	0.046	0.008	7.63 × 10⁻⁹	GID4	2.19 × 10⁻⁷
PDFF (VAT)	17	64210580	rs1801689	33,540	missense_variant	C	A	0.029	0.099	0.016	1.63 × 10⁻⁹	*APOH*
PDFF (NA)	19	18229208	rs56252442	36,394	intron_variant	T	G	0.253	0.048	0.008	5.11 × 10⁻⁹	MAST3	5.13 × 10⁻⁷
PDFF (VAT)	19	19379549	rs58542926	33,540	missense_variant	T	C	0.075	0.292	0.011	4.7 × 10⁻¹⁶⁸	TM6SF2	5.00 × 10⁻¹⁰
PDFF (VAT)	19	33834096	rs73026242	33,540	downstream_gene_variant	G	A	0.072	0.060	0.011	3.59 × 10⁻⁸	*CEBPG*
PDFF (BMI)	19	45411941	rs429358	35,146	missense_variant	C	T	0.151	−0.102	0.008	2.92 × 10⁻³⁵	APOE	1.67 × 10⁻¹⁵
PDFF (NA)	19	45830763	rs344790	36,394	upstream_gene_variant	A	C	0.421	0.043	0.007	2.68 × 10⁻⁹	*CKM*
PDFF (VAT)	19	54671421	rs60204587	33,540	intron_variant	A	G	0.426	0.045	0.006	1.42 × 10⁻¹⁵	MBOAT7	5.27 × 10⁻¹⁵
PDFF (WFM)	20	39142516	rs2207132	34,367	intergenic_variant	A	G	0.034	0.092	0.016	1.73 × 10⁻⁸	*MAFB*
PDFF (VAT)	22	44324730	rs738408	33,540	synonymous_variant	T	C	0.214	0.237	0.007	1.7 × 10⁻²⁶⁹	PNPLA3	1.38 × 10⁻¹⁴
cT1 (BMI)	1	15810346	rs12130283	29,312	intron_variant	C	T	0.484	0.045	0.007	5.33 × 10⁻¹⁰	*CELA2B*	6.91 × 10⁻⁸
cT1 (BMI)	1	220100497	rs759359281	29,312	splice_polypyrimidine_tract_variant	C	CA	0.055	0.129	0.016	4.74 × 10⁻¹⁵	SLC30A10
cT1 (VAT)	4	103188709	rs13107325	27,782	missense_variant	T	C	0.070	0.556	0.015	<4.94 × 10⁻³²⁴	SLC39A8	2.01 × 10⁻¹⁸
cT1 (VAT)	6	25878848	rs55925606	27,782	intron_variant	G	A	0.073	−0.149	0.015	1.07 × 10⁻²⁴	SLC17A3	3.32 × 10⁻⁹
cT1 (VAT)	8	9173209	rs7012637	27,782	intron_variant	A	G	0.472	0.043	0.007	9.39 × 10⁻⁹	*RP11-10A14.4*
cT1 (BMI)	8	18272881	rs1495741	29,312	regulatory_region_variant	G	A	0.219	−0.064	0.009	4.5 × 10⁻¹³	*NAT2*
cT1 (NA)	9	136149830	rs532436	30,481	intron_variant	A	G	0.183	0.079	0.010	7.47 × 10⁻¹⁵	*ABO*
cT1 (WFM)	10	113921825	rs2792735	28,638	intron_variant	G	A	0.277	0.046	0.008	4.05 × 10⁻⁸	GPAM	4.81 × 10⁻⁷
cT1 VAT)	11	61549025	rs174533	27,782	intron_variant	A	G	0.351	0.052	0.008	2.49 × 10⁻¹¹	*MYRF*	3.51 × 10⁻⁷
cT1 (NA)	12	51511269	rs59372312	30,481	intron_variant	G	A	0.067	−0.088	0.016	2.33 × 10⁻⁸	*TFCP2*
cT1 (BMI)	14	24572932	rs111723834	29,312	missense_variant	A	G	0.016	0.383	0.029	1.7 × 10⁻³⁹	PCK2	9.99 × 10⁻⁸
cT1 (VAT)	16	77423427	rs2454933	27,782	intron_variant	T	C	0.082	0.075	0.014	3.69 × 10⁻⁸	*ADAMTS18*
cT1 (NA)	18	59323966	18:59323966_CTT_C	30,481	intron_variant	C	CTT	0.287	−0.050	0.009	1.28 × 10⁻⁸	*CDH20*
cT1 (WFM)	19	19379549	rs58542926	28,638	missense_variant	T	C	0.074	0.141	0.014	4.22 × 10⁻²³	TM6SF2	4.46 × 10⁻⁷
cT1 (VAT)	19	33890838	rs10406327	27,782	intron_variant	G	C	0.477	−0.042	0.007	1.24 × 10⁻⁸	*PEPD*	1.84 × 10⁻⁰⁶
cT1 (BMI)	19	49206172	rs516246	29,312	intron_variant	C	T	0.489	−0.053	0.007	3.67 × 10⁻¹³	FUT2	1.00 × 10⁻¹³
cT1 (VAT)	22	37469192	rs6000553	27,782	intron_variant	A	G	0.466	0.070	0.007	7.53 × 10⁻²¹	TMPRSS6	4.81 × 10⁻¹²
cT1 (BMI)	22	44324727	rs738409	29,312	missense_variant	G	C	0.214	0.120	0.009	3.2 × 10⁻⁴¹	PNPLA3	2.85 × 10⁻¹³

Open in a new tab

The association between common genetic variants and PDFF under different adiposity adjustments was performed using REGENIE adjusting for adiposity index, age, sex, age × sex, age² and age²× sex, first ten genomic principal components and array batch. Each adiposity adjustment is shown in parentheses. Genomic loci in bold represent the previously unknown loci identified in the present work. The locus column shows the nearest gene to the lead variant (from COJO analysis) at each locus. MAGMA column shows significant gene-based associations at each locus exceeding Bonferroni threshold (P < 2.65 × 10⁻⁶). Sample size used in each GWAS has been shown in column N. P values were not corrected for multiple testing among four different models (unadjusted, adjusted for BMI, WFM and VAT). NA, unadjusted; MAGMA, multi-marker analysis of genomic annotation.

We found 17 and 9 previously unknown genetic loci associating with PDFF and cT1, respectively (Methods, Table 1 and Supplementary Table 6). Four loci (PNPLA3, TM6SF2, GPAM and HFE/SLC17A3) were associated with both traits with at least one adiposity adjustment; however, only PNPLA3 and TM6SF2 loci were associated with both traits at a genome-wide level irrespective of the adjustment (Fig. 1b).

Identification of the putative causal loci associated with liver traits

To identify the putative causal loci, we fine-mapped the independent genome-wide significant loci associated with adiposity-adjusted PDFF and cT1. Independent lead variants at multiple loci had a posterior inclusion probability (PIP) > 0.95, suggesting that these GWAS lead variants are causal variants (Supplementary Table 7). Notably, a missense variant on ADH1B (rs1229984) had a PIP of 1 at ADH1B, MTTP and RP11-766F14.2 loci, suggesting that the observed effect from all three loci may derive from the same putative causal variant. In fact, ADH1B rs1229984 and MTTP rs11937107 have a D′ of 1 in Europeans²⁰ (Supplementary Table 7).

To examine whether the set of independent variants could potentially perturb the gene expression patterns of nearby genes, we performed a Bayesian colocalization (Methods). We were able to colocalize 13 and 7 GWAS signals with at least one eQTL evidence for PDFF and cT1, respectively (Supplementary Table 8).

Functional analyses of independent loci associated with liver traits

Independent genetic loci for adiposity-adjusted PDFF and cT1 were mapped to genes and ranked using multiple approaches (Methods). Out of 37 and 18 independent loci for PDFF and cT1, respectively, the majority (31 and 12) loci had the highest rank for the nearest genes (Supplementary Table 9). For the remaining loci, multiple candidate genes were found. To gain a deeper understanding of the biological implications of genome-wide significant loci, we conducted a functional gene-set enrichment analysis using mapped genes with the highest evidence (Supplementary Table 10). Mapped genes for PDFF were enriched in genes mostly expressed in liver and they were involved in lipid metabolism (Supplementary Table 10a and Supplementary Fig. 1a). Conversely, mapped genes for liver iron corrected T1 were enriched in metal ion metabolism (Supplementary Table 10b and Supplementary Fig. 1b).

Previously unknown genetic loci, liver and metabolic traits

Given the causal relationship between liver triglyceride content and inflammation/fibrosis, we examined the association of the previously unknown variants identified by PDFF with cT1 and vice versa. Notably, most of the variants were associated with both traits and directionally concordant (Extended Data Fig. 2). This is consistent with the notion that liver triglyceride content causes inflammation²¹. A total of 5 (29 %) and 4 (44 %) loci were associated with either PDFF or cT1, suggesting a specificity of the effect on lipid or inflammation pathways. Furthermore, we examined the association between these previously unknown variants and indices of liver damage, fibrosis and liver disease (Extended Data Fig. 2 and Supplementary Table 11). More than 80% of variants associated with PDFF also associated with alanine aminotransferase (ALT) (one-sided Fisher’s exact test P = 0.028); however, there was no significant correlation between PDFF and cT1 loci associations with aspartate aminotransferase (one-sided Fisher’s exact test P = 0.613). Most variants were associated with plasma lipoproteins and glucose metabolism traits, including diabetes (Extended Data Fig. 2 and Supplementary Table 11).

Extended Data Fig. 2 — Heatmap of the Z-score of associations for the effect (risk) allele between previously unknown genetic *loci* and liver or metabolic-related traits (columns) in n=397,780 UKBB participants after excluding individuals with available PDFF or liver iron corrected T1 (n=36,748). The association analyses were performed by linear or logistic regression analysis using REGENIE and adjusted for adiposity index, age, sex, age×sex, age² and age²×sex, first 10 genomic principal components and array batch. Upper and lower boxes correspond to liver iron corrected T1 and PDFF genetic *loci*, respectively. Full summary statistics have been reported in Supplementary Table 11. P values were two-sided and not corrected for multiple hypothesis testing. VAT: Visceral adipose tissue; WFM: Whole-body fat mass (kg/m2); cT1: liver iron corrected T1; PDFF: proton density fat fraction; CLD: chronic liver disease.

Indices of adiposity contribute differentially to the association between genetic variants and liver triglycerides

Adiposity is a well-known risk factor for MASLD and there is no evidence on causal impact of SLD on adiposity²². Hence, it is safe to assume that adjusting for adiposity does not suffer from collider bias. Therefore, we hypothesized that the association between PDFF or cT1 and genetic loci depends on the measures of adiposity. To explore this, we performed different statistical analyses (Methods). As reported in Supplementary Tables 12 and 13, while the overall associations are consistent across multi-adiposity-adjusted GWAS, some loci display different associations depending on the adiposity adjustment.

For instance, we found no association between PDFF and rs73026242CEBPG with BMI or WFM adjustments, but a strong genome-wide association with VAT adjustment. This locus has a strong association with VAT but in the opposite direction to that of PDFF, and mediation analysis suggests an inconsistent mediation, namely, a partial mediation in the opposite direction. This locus has been recently linked with visceral to abdominal subcutaneous fat ratio¹⁵. Our gene mapping suggests that CEBPA, known for its role in adipogenesis through PPARγ, is the potential causal gene (Supplementary Table 9a)²³.

Conversely, while the PPARG locus shows evidence of interaction with all three adiposity measures, a putative inconsistent effect was only observed for WFM adjustment. This variant decreases PPARγ activity²⁴ and confers a protection against diabetes^25,26. While the contribution of this variant to SLD is a matter of debate²⁷, we observed a modest positive association with WFM. Another interesting finding is FAM101A locus, where there is a nominally significant association with BMI/WFM and VAT but in opposite directions. Hence, adjusting for VAT mitigated the association (P^VAT = 0.01). The top-ranked gene at this locus, CCDC92, has been shown to play a role in insulin resistance and subcutaneous adipose and peripheral fat²⁸. On the other hand, we also encounter the opposite scenario, where adiposity may act as a positive partial mediator, as for the PRMT8, MAST3 and CKM loci.

For cT1, while most loci have consistent associations over different adiposity adjustments, the mediation analysis suggests a putative inconsistent effect for VAT-adjusted model at PEPD locus. The top-ranked gene at this locus is CEBPA; however, whether the putative mechanism is similar to the one described above for liver triglyceride content is unclear because PEPD has a similar rank and is associated with diabetes and adiposity²⁹.

Finally, we performed sensitivity analysis to examine the robustness of mediation analysis to unmeasured confounders, and observed a strong (for PDFF, ρ = 0.6) and moderate (for cT1, ρ = 0.4) robustness of mediation estimates to the sequential ignorability assumption³⁰. Given the causality assumptions in mediation analysis, these potential mechanisms should be considered with caution. Furthermore, due to the statistical equivalence of mediation and confounding effects³¹, the observed inconsistent mediations may be interpreted as negative confounding effects that may enable the discovery of unknown genetic loci.

Considering this extensive body of evidence, measures of adiposity contribute to the association between genetic loci and proxies of liver triglyceride content (PDFF) and inflammation/fibrosis (cT1), thus supporting our multi-adiposity-adjusted GWAS approach.

The association between six previously unknown loci and liver triglyceride content was replicated in independent cohorts

Based on the strong genetic correlation between PDFF and cT1, to validate the previously unknown SNPs, we meta-analyzed the association between all the previously unknown 26 variants and liver triglyceride content in 3,903 individuals of European ancestry from four independent cohorts (Fig. 2 and Supplementary Table 14). We were able to replicate the association between six of the previously unknown loci (CEBPG, TSC22D2, ABO, GUSB, TECTB and TFCP2) and liver triglyceride content. The direction of the association in the replication cohort was consistent with the discovery cohort.

Fig. 2 — The association between each genetic variant and rank-based inverse normal transformed hepatic triglyceride content was performed using a linear regression analysis adjusted for age, sex, age², age × sex, age² × sex (shown as circles). Proxy variants were used for variants not available in the replication cohorts (r² > 0.4 within a window of 1.5 Mb around each lead variant in the UK Biobank) as marked with an asterisk. Pooled effect estimates were calculated using inverse-variance-weighted fixed-effect meta-analysis (shown as diamonds). Genomic loci in bold are those with a P value <0.05 in the fixed-effects model. Error bars represent the 95% confidence intervals from the regression models or meta-analysis. Full summary statistics have been reported in Supplementary Table 14. P values are two-sided and not adjusted for multiple testing. NEO, Netherlands Epidemiology of Obesity study; DHS, Dallas Heart Study.

Partitioned polygenic risk scores identify a steatotic liver-specific disease and a systemic MASLD

Triglyceride secretion is a key mechanism regulating hepatocyte triglyceride homeostasis. Triglyceride secretion is mediated by very-low-density lipoprotein (VLDL) secretion that in fasting conditions are proxied by circulating triglyceride levels. Variants in genes hampering VLDL secretion, including APOB, MTTP, TM6SF2 and PNPLA3, cause retention of liver triglycerides mirrored by lower circulating lipoproteins. Carriers of these variants have an increased risk of MASLD and lower risk for cardiovascular disease due to lower lipoproteins^32–34.

Based on this mechanism, we allocated the previously unidentified replicated and the previously known variants into two pPRSs: (1) a group showing discordant association between PDFF and circulating triglycerides (n = 10), suggesting that liver triglyceride content is primarily influenced by liver retention, and (2) a group with concordant associations (n = 13), indicating that liver triglyceride content may result from an increase in uptake, synthesis of energy substrates or a reduction in β-oxidation (Methods, Extended Data Fig. 3 and Supplementary Table 15). The variance explained by the discordant pPRS was higher than the concordant pPRS mirroring the fact that discordant pPRS is composed of PNPLA3 and TM6SF2 variants (Supplementary Table 16).

Extended Data Fig. 3 — The heatmap shows the Z-score of associations for the effect (risk) allele in Europeans (n=397,780) after excluding individuals with available PDFF or liver iron corrected T1 (n=36,748). The association was performed by linear regression analysis using REGENIE and adjusted for adiposity index, age, sex, age×sex, age² and age²×sex, first 10 genomic principal components and array batch. Upper and lower boxes correspond to liver iron corrected T1 and PDFF genetic *loci*, respectively. Previously unknown replicated genetic *loci* have been marked in blue. Full summary statistics have been reported in Supplementary Table 15. P values were two-sided and not corrected for multiple hypothesis testing.

Both pPRSs were associated with an increased risk of MASLD with the largest association being with hepatocellular carcinoma (HCC); however, the association was stronger for the discordant pPRS (Fig. 3a, Supplementary Fig. 2 and Supplementary Tables 17 and 18). Of note, only the discordant pPRS was associated with autoimmune liver diseases.

Fig. 3 — a,b, The case–control (a) and prospective (b) association between two PDFF-circulating TGs pPRS and liver-related, cardiometabolic and chronic kidney failure traits in the UK Biobank. Effect plot of the association between concordant and discordant PDFF-circulating TGs PRS with each disease was tested using either logistic (a) or Cox proportional hazard (b) regression analysis adjusted for BMI, age, sex, age × sex, age² and age² × sex, first ten genomic principal components and array batch. The x axis shows either the odds ratio (OR) or hazard ratio. All association analyses have been performed after excluding individuals with available PDFF (n = 36,394). Error bars represent the 95% confidence intervals from the regression models. Full summary statistics have been reported in Supplementary Table 18. P values were two-sided and not corrected for multiple hypothesis testing. TG, triglyceride.

Discordant pPRS was associated with a decreased risk of cardiovascular, whereas concordant PRS was associated with a substantial increased risk of cardiovascular disease and heart failure. Both pPRSs conferred an increased predisposition to diabetes, suggesting that hepatic triglyceride accumulation predisposes to diabetes irrespective of the underlying cause. Conversely, the larger effect size of the concordant pPRS for diabetes despite the lower effect on liver triglyceride content would suggest that the association of diabetes in the concordant pPRS is not mediated by liver damage. In the case of hypertension and chronic kidney failure, discordant pPRS showed no association, whereas the concordant pPRS increased the risk of both diseases; however, when we adjusted for hypertension, the association with chronic kidney failure was no longer significant, whereas the other associations remained (Supplementary Table 18). Further adjustment for diabetes, total cholesterol and alcohol intake did not change the results (Supplementary Table 18). The prospective risk conferred by the pPRS to develop liver and cardiometabolic disease in the UK Biobank was virtually identical (Fig. 3b and Supplementary Table 18).

Functional gene-set enrichment analysis for both pPRSs also revealed a distinct metabolic pattern. While gene sets of discordant pPRS were mostly enriched in lipid and triglyceride homeostasis (Supplementary Table 10c and Supplementary Fig. 3), concordant pPRS gene sets were enriched in insulin receptor signaling and glucose homeostasis pathways, overall consistent with an impact on stimulation of de novo lipogenesis (Supplementary Table 10d and Supplementary Fig. 3).

In addition to our hypothesis-driven approach, we performed an unsupervised soft clustering approach³⁵ (Methods). Of 1,000 iterations, 90% converged to two clusters and 10% to one cluster. One genetic locus, rs738408PNPLA3, appeared in both clusters³⁶ (Supplementary Fig. 4). We used two top-weighted traits to name the clusters: (1) low-density lipoprotein (negative)/triglycerides (negative); and (2) triglycerides (positive)/ALT (positive). When examining the association of pPRS clusters with the same outcomes, we observed a similar dissociation to the PDFF-circulating TGs pPRS; however, differences in the risk of diseases defining the two types of MASLD were larger in our hypothesis-driven approach (Supplementary Table 19). This may be attributed to the soft clustering feature of the Bayesian non-negative matrix factorization (bNMF) algorithm, where rs738408PNPLA3 was included in both clusters (Supplementary Fig. 5).

When comparing plasma biomarkers between individuals in the upper and lower quartiles of pPRS, those in the upper quartile of discordant pPRS had the largest differences in lipoprotein levels (Supplementary Table 20), consistent with the protective effect of the liver-specific subtype compared to the systemic subtype. In addition, individuals in the top quartile of concordant pPRS had higher lipoproteins and blood pressure and lower creatinine levels, consistent with the increased risk of cardiovascular disease, heart failure and kidney failure.

Sex-specific effect of the association between pPRS and the feature of cardiometabolic syndrome

Liver diseases have a different prevalence between males and females. For instance, HCC is more frequent in men while liver autoimmune disease is more prevalent in females. Moreover, there are sex-specific differences in carriers of the PNPLA3 rs738409 between males and females³⁷. Therefore, we examined the association between the two pPRS and cardiometabolic syndrome stratified by sex. Results of the stratified analyses are consistent with the pooled analyses, with the following exceptions: (1) HCC is associated with the concordant pPRS only in males; (2) heart failure is associated with protection in the discordant pPRS specifically in females; and (3) chronic kidney failure is increased in the concordant pPRS only in males (Supplementary Fig. 6 and Supplementary Table 21).

mRNA expression of loci from the liver-specific PRS is more abundant in the liver

We further examined the messenger RNA expression of mapped genes within concordant and discordant pPRS using paired bulk RNA-seq of liver (n = 244) and VAT (n = 261) from participants with obesity from the MAFALDA cohort (Molecular Architecture of Fatty Liver Disease in Patients with Obesity Undergoing Bariatric Surgery study). Notably, only the mapped genes of discordant pPRS showed a significant overlap with upregulated differentially expressed genes in the liver (one-sided Fisher’s exact test, P = 0.007; Fig. 4). Given the tight interplay between VAT and liver in the MASLD, this finding suggests a liver-specific nature of discordant pPRS compared to its metabolic counterpart, concordant pPRS.

Fig. 4 — Differential expression analysis of paired liver and VAT bulk RNA-seq data for mapped gene sets of concordant and discordant pPRS. The lower right bar plot shows the fraction of upregulated differentially expressed (DE) genes in the liver compared to VAT. The enrichment of pPRS with upregulated DE genes in the liver was calculated using one-sided Fisher’s exact test. FC, fold change.

Discussion

The main findings of this study are (1) the identification of previously unknown loci associated with SLD; and (2) the identification of two distinct types of MASLD, namely, a liver-specific and a systemic type.

BMI, as a proxy of adiposity, amplifies the genetic predisposition to SLD given by common variants^14,22; however, BMI does not consider body fat distribution and body composition. To identify previously unknown genetic loci associated with SLD, we compared a range of measurements of adiposity finding that VAT volume, WFM and BMI were the best independent predictors.

Next, we performed multi-adiposity-adjusted GWAS on PDFF and iron corrected T1, as a measure of liver triglyceride content and inflammation/fibrosis. Our data demonstrate that adiposity indices may confound association between genetic variants and liver triglyceride content and inflammation/fibrosis. By using this approach, we identified 17 previously unknown genetic loci for liver triglyceride content and 9 for liver inflammation, and replicated 6 of these loci in four independent cohorts.

The heritability of liver triglyceride content was influenced by the multi-adiposity adjustment explaining in the best-case scenario approximately 6% more heritability compared to the unadjusted; however, for inflammation this was not the case, suggesting that heritability of inflammation is not directly influenced by adiposity. We have previously demonstrated a causal link between liver triglyceride content and increased risk of liver inflammation using Mendelian randomization²¹. This finding is further supported by our observation that approximately 80% of the genetic loci associated with PDFF also associate with cT1 in the same direction.

Intrahepatocyte triglyceride homeostasis is governed by three fundamental mechanisms: triglyceride synthesis, lipoprotein secretion and energy substrate utilization. Hindering lipoprotein secretion causes liver triglyceride accumulation by retention. Indeed, loss-of-function variants in TM6SF2 and PNPLA3 cause liver triglyceride retention by reducing lipoprotein secretion^38,39. Consistently with hepatic lipoprotein retention, carriers of these variants have lower risk for cardiovascular disease due to the lower circulating lipoproteins^32,40.

Therefore, we generated two pPRSs: one composed of variants in which the association between liver triglyceride content and circulating triglycerides were discordant and one in which they were concordant. ‘Partitioned’ polygenic scores may elucidate disease pathogenesis and capture specific signatures driving the individual disease progression, hence providing a framework for tailored therapeutic interventions⁴¹.

Of note, the concordant pPRS predicts the entire spectrum of cardiometabolic disease. On the contrary, the discordant pPRS is associated with liver disease mirrored by protection from cardiovascular disease due to lipoprotein retention, despite a marginal increase in the risk of diabetes. The liver specificity of the discordant pPRS is further supported by a higher mRNA expression of the genes composing this score in liver versus visceral adipose paired biopsies from individuals with obesity. We additionally generated pPRSs using hypothesis-free soft clustering³⁵, which was very similar to our hypothesis-driven approach.

Our data suggest the presence of at least two distinct types of MASLD with specific disease-causing molecular mechanisms: one specific for the liver and one systemic and entwined with cardiometabolic syndrome (Extended Data Fig. 4). Understanding the molecular mechanisms underlying these components may allow us to find effective treatments for MASLD and cardiometabolic syndrome. Clinically, these entities reflect the presence of individuals rapidly progressing to later stages of MASLD and those with a slow-progressing MASLD associated with the entire metabolic cardiometabolic syndrome. These types of MASLD may account for the disease heterogeneity and help explain why several drugs have failed in clinical trials to treat MASLD.

Extended Data Fig. 4 — a) In the steatotic liver-specific disease, the primary increase in the liver triglyceride content is due to the hepatic retention of very low-density lipoproteins (VLDL). This retention is causally related to liver inflammation, fibrosis, and hepatocellular carcinoma. In this type of MASLD, the higher risk of diabetes is due to the degree of liver fibrosis, while the lower risk of cardiovascular disease (CVD) to lipoprotein retention. b) In the systemic MASLD, the liver is entwined in the crosstalk among metabolic organs. In this type of MASLD, a dysfunctional visceral adipose tissue may increase the diabetes risk and may release free fatty acids that are incorporated into triglycerides in the hepatocytes causing liver steatosis. In turn, liver steatosis causes an overproduction of VLDL with a subsequent increase in circulating low-density lipoproteins (LDL) resulting in a higher risk of CVD. Additionally, the systemic MASLD associates with an increased blood pressure resulting in kidney failure and further increasing the CVD risk. This figure was created with BioRender.com. CKD: chronic kidney disease (failure); VAT: visceral adipose tissue; LD: lipid droplets.

Currently Mendelian randomization studies are carried out by selecting variants associated with a trait and using them to explain the causal relationship with a different trait. In this study, the pPRS had opposite effects on cardiovascular risk, indicating that if we had pooled the variants together, we may have nullified the association. Therefore, our findings support the notion that the pPRSs constructed by integrating variants into physiological pathways may allow clarifying the heterogeneity of disease pathogenesis. Ultimately, this will lead to precision medicine, improving outcome prediction and therapy.

A strength of this study is that the partitioning of the PRS was based on a hypothesis-driven approach with solid knowledge of intracellular lipid homeostasis. While the finding on cardiovascular disease may be expected, the associations with hypertension and diabetes were not granted. Alcohol consumption may have an additive effect on SLD and cardiovascular disease. Alcohol is converted into triglycerides in hepatocytes and alcoholic and nonalcoholic SLD share common genetic determinants, suggesting common disease-causing mechanisms⁴². Therefore, we did not exclude individuals based on alcohol consumption. Nonetheless, sensitivity analyses showed that adjusting for alcohol did not change the results. Finally, we obtained similar results by using a completely different approach, namely, unsupervised clustering, in cohorts of individuals in whom liver biopsy was available (REF).

A limitation of our study is that the identified and replicated genetic loci were based on study cohorts of European ancestry limiting the applicability in non-Europeans. Future studies are warranted to validate these loci and the pPRS in non-European populations. Furthermore, while we performed genetic colocalization and enrichment analyses, the functional implications of the identified loci are yet to be established by in vitro and in vivo experiments.

In conclusion, we identified six previously unknown loci associated with SLD and two distinct types of SLD, namely, one that is liver specific and another that is entwined with the full spectrum of cardiometabolic syndrome.

Methods

UK Biobank

The UK Biobank study has recruited over 500,000 participants aged between 40 and 69 years across the United Kingdom between 2006 and 2010, with extensive phenotypic and genetic data⁴³. The UK Biobank received ethical approval from the National Research Ethics Service Committee North West Multi-Centre Haydock (reference 16/NW/0274). Data used in this study were obtained under application number 37142. European ancestry was defined previously¹² by removing outliers using genomic principal components. Additionally, participants were excluded if they fell into any of these categories: (1) more than ten putative third-degree relatives; (2) a mismatch between self-reported and genetically inferred sex; (3) putative sex chromosome aneuploidy; (4) heterozygosity and missingness outliers; and (5) withdrawn consent⁴³.

Genotypes and imputation

UK Biobank participants were genotyped using two highly similar (>95% overlap) genotyping arrays, which were then imputed centrally by the UK Biobank based on the 1000 Genomes Project phase 3, UK 10K haplotype and Haplotype Reference Consortium reference panels. Starting from approximately 97 million variants, we kept only 9,356,431 variants with a minor allele frequency (MAF) > 1%, imputation quality (INFO) score > 0.8 and Hardy–Weinberg equilibrium P > 1 × 10⁻¹⁰ (refs. ^12,43).

Definition of traits

We used adiposity measures directly provided by the UK Biobank, including VAT (data field 22407), WFM (data field 23100) and impedance of whole body (data field 23106). WHR was calculated by dividing the waist-to-hip circumference. MRI-derived PDFF and liver iron corrected T1 (cT1) were provided directly by the UK Biobank (data fields 40061 and 40062). The details of liver MRI protocols can be found elsewhere⁴⁴. In brief, individuals were scanned using a Siemens 1.5T Magnetom Aera. Two sequences were then used for data acquisition, a multiecho-spoiled gradient-echo and a modified look locker inversion sequence (ShMOLLI) for PDFF and cT1, respectively⁴⁴. The definition of binary traits can be found in Supplementary Table 22.

Phenotypic prediction models

To address the multicollinearity between different measures of adiposity and to verify their contribution in predicting PDFF and cT1 values, we fit penalized linear regression models and carried out a model selection in a tenfold nested cross-validation (CV) using Least Absolute Shrinkage and Selection Operator (LASSO), Ridge and Elastic Net. LASSO penalizes the regression model using the L1-norm, effectively reducing the influence of non-contributing features to zero. On the other hand, Ridge regression utilizes the L2-norm, allowing it to shrink regression coefficients toward zero. Elastic Net combines elements of both LASSO and Ridge by incorporating both L1 and L2 penalties through a mixing parameter α.

To conduct the CV process, the dataset was initially divided into training (80%) and test (20%) sets. Within the training set, the outer CV assessed the performance of each model, while the inner CV was utilized for hyperparameter tuning. This tuning was accomplished by minimizing the mean squared error across a grid of α and shrinkage values in each fold of the outer CV. The best performing model with the lowest mean squared error was then trained on the entire training set within a tenfold CV framework. Subsequently, its performance was evaluated using the remaining test set. Finally, the model with the optimal set of hyperparameters, determined in the previous step, was fitted to the entire dataset for final evaluation⁴⁵. Adiposity indices were standardized before the training, whereas PDFF and cT1 values were rank-based inverse normal transformed. All models were adjusted for age, sex, age², age × sex and age² × sex. All analyses were performed in MATLAB (MathWorks) R2023a.

Genome-wide association analysis

The association between 9 million imputed common variants and PDFF or cT1 under different adiposity adjustments under an additive genetic model was performed using a whole-genome regression model as implemented in REGENIE (v.3.2.8)¹⁶. The analysis was adjusted for age at MRI, sex, age², age × sex, age² × sex, the first ten principal components (PCs) of ancestry, genotyping array and adiposity index, where adiposity index was VAT, WFM, BMI or no adiposity adjustments.

Similarly, we tested the association between independent lead variants from multi-adiposity-adjusted GWAS of PDFF and cT1, and other binary or continuous metabolic traits using either a logistic or a linear whole-genome regression model in REGENIE and adjusted for the same set of covariates, including consistent adiposity adjustments. Individuals with an available PDFF or cT1 measurements were excluded before the association analysis (n = 36,748). In cases where the trait was measured at baseline, we used WHR instead of VAT adjustment, as the latter was not available at baseline. To fit the whole-genome regression model in step 1 of REGENIE, a subset of directly genotyped common variants (MAF > 1%) was used. After excluding variants on long-range LD and major histocompatibility complex (MHC) regions, variants with a missingness <0.01 and with Hardy–Weinberg equilibrium P > 1 × 10⁻¹⁵ were retained. Finally, 146,833 markers left following an LD pruning with a window of 500,000 base pairs and pairwise r²< 0.1 (ref. ¹⁷). Continuous traits were rank-based inverse normal transformed before the analyses.

Identifying independent variants

We first performed LD clumping (PLINK v.1.90b6.26 parameters: –clump-p1 5 × 10⁻⁸ –clump-r2 0.01 –clump-kb 1,000, after excluding individuals with third-degree or closer relatives^17,43) to identify approximately independent loci. Next, to detect statistically independent variants, we conducted approximate step-wise model selection in conditional and joint multiple-single nucleotide polymorphism (SNP) analysis implemented in Genome-wide Complex Trait Analysis (GCTA-COJO¹⁸, v.1.94.0), with an LD window of 10 Mb and using 50,000 randomly selected unrelated Europeans from the UK Biobank for in-sample LD structure, as described previously¹². To examine whether the identified genetic loci were previously reported, we searched the NHGRI-EBI GWAS catalog database⁴⁶ in a window of 1 Mb around each lead variant.

Estimating heritability and genetic correlations

SNP heritability and confounding bias were estimated with LD score regression analysis (LDSC; v.1.0.1, https://github.com/bulik/ldsc/)⁴⁷ using the baseline LD model (v.2.2; https://data.broadinstitute.org/alkesgroup/LDSCORE/), containing 97 annotations, including functional annotations and MAF-/LD-dependent architectures⁴⁸. Similarly, pairwise genetic correlations were calculated using LDSC analysis⁴⁷ after excluding variants in the MHC region (chromosome 6, 25–34 Mb) due to the complex LD structure. Trait pairs with a Benjamini−Hochberg FDR < 0.05 were considered to have a significant genetic correlation. In all analyses, we set the LDSC parameter chisq-max to an arbitrary large number (99,999) to keep large-effect associations.

Pleiotropy analysis

We evaluated whether the independent genome-wide significant loci, adjusted for different adiposity measures, were specific to each adiposity measure, common between PDFF and cT1 GWAS or shared across both. Therefore, if two independent lead variants within 1 Mb of each other were in LD (r²> 0.2), they were assigned the same locus ID (Supplementary Table 4). Circular Manhattan plots were visualized using Circos⁴⁹.

Functionally informed fine-mapping

Functionally informed genetic fine-mapping was performed using PolyFun v.1.0.0 and Sum of Single Effects (SuSiE, v.0.11.92)^50,51. PolyFun was used to estimate per-SNP heritability using L2-regularized extension of stratified LDSC (S-LDSC) and baseline LD model v.2.2 containing 187 annotations^47,48,50. The estimated per-SNP heritabilities were used as prior causal probabilities in SuSiE with a maximum of ten causal variants in each region. The subset of 337,000 unrelated white-British individuals from UK Biobank were used for in-sample LD structure. After excluding the MHC region on chromosome 6, fine-mapping per each locus was performed in a window of 1.5 Mb around the lead genetic variants (P < 5 × 10⁻⁸).

Colocalization

Colocalization was performed between independent genetic loci identified by COJO-GCTA, and summary statistics of gene expression quantitative trait loci (eQTL) of 49 tissues in GTEx (v.8) from the eQTL catalog release 4 (refs ^52,53). The coordinates of GWAS summary statistics were first converted from Build 37 to 38 using liftOver function of rtracklayer R package (v.1.54.0)⁵⁴. We performed colocalization using COLOC-SuSiE assuming the presence of multiple causal variants (coloc R package⁵⁵, v.5.1.0) with default priors and considered variants in a window of 1.5 Mb around the index variant at each locus. We considered only genes with at least one significant variant (FDR P < 0.1, eGenes) and performed the colocalization in a window of 1.5 Mb around each eGene. If SuSiE did not converge after 1,000 iterations, conventional (single causal variant) colocalization was used. Finally, an H4 posterior probability (PP) > 0.8 was considered as strong evidence that both traits share the same causal variant.

Variant annotation

Independent genome-wide significant and fine-mapped variants were annotated using Ensembl Variant Effect Predictor (VEP) accessed from REST API (https://rest.ensembl.org/).

Gene mapping and functional enrichment analysis

To map and prioritize potential candidate genes for independent genetic loci, we employed multiple approaches. (1) SNP2GENE module of FUMA v.1.5.4 (ref. ⁵⁶) was used for positional mapping of lead variants to genes with a maximum distance of 50 kb. (2) eQTL mapping using FUMA by considering only genes with at least one significant eQTL association (FDR < 0.05). (3) 3D chromatin interaction mapping using FUMA by considering only significant interactions (FDR < 1 × 10⁻⁶) within 250–500 bp upstream and downstream of the transcription start site, respectively. (4) Multi-marker analysis of genomic annotation (MAGMA, v.1.08)⁵⁷ implemented in FUMA was carried out to perform genome-wide gene association analysis using 19,535 curated protein-coding genes. Only genes with a Bonferroni threshold below 0.05/19,535 = 2.56 × 10⁻⁶ were kept for gene mapping. Variants within the MHC region were excluded before the analysis. (5) Colocalized genes from colocalization analysis with at least one tissue and an H4 PP > 0.8. (6) Nearest gene(s) to the fine-mapped variants with the maximum PP per each locus. (7) Genes with the highest overall V2G score at each locus were based on Open Targets Genetics⁵⁸. Finally, to prioritize the mapped genes, we calculated an unweighted ranking score by summing over the evidence from the above-mentioned approaches.

By using the set of genes with the maximum ranking score at each locus, we performed functional gene-set enrichment analysis using Enrichr tool⁵⁹ against, ARCHS4 tissues, Reactome biological pathways and Gene Ontology Biological Processes. Significant terms with a Benjamini–Hochberg FDR-corrected P value <0.05 per each database were reported. For visualization, both adjusted P values and Enrichr combined scores (−log(P) × OR) were used.

Partitioned polygenic risk scores of liver triglyceride content

To define the pPRSs, genetic loci (full list in Supplementary Table 15) were assigned into two groups based on their concordant or discordant associations with PDFF and circulating triglycerides. We excluded the genetic loci that did not associate with circulating triglycerides. Finally, the pPRSs were generated by taking the weighted sum of genetic variants, where the strongest association at each locus was used as the weight, following the pleiotropy analysis.

Replication cohorts

NEO

NEO is a population-based cohort study in men and women aged 45 to 65 years, with oversampling of individuals having BMI over 27 kg m⁻² from Leiden and surrounding areas in the Netherlands. At baseline, 6,671 participants were included and around 35% of the NEO participants were randomly selected to undergo hepatic triglyceride content (HTGC) measurements by magnetic resonance spectroscopy. Genotyping was performed using Illumina HumanCoreExome-24 BeadChip and imputed to TOPMed reference genome panel⁶⁰. In the present work, a total of 1,822 individuals of European ancestry with an available HTGC were used.

Liver BIBLE

The Liver BIBLE-2022 cohort comprises 1,144 healthy middle-aged individuals (40–65 years) with metabolic dysfunction (at least three criteria for metabolic syndrome among BMI ≥ 35 kg m⁻², arterial hypertension ≥135/80 mm Hg or therapy, fasting glucose ≥100 mg dl⁻¹ or diabetes, low high-density lipoprotein <45/55 mg dl⁻¹ in males/females and high triglycerides ≥150 mg dl⁻¹) who presented for blood donation from June 2019 to February 2021 at the Transfusion Medicine unit of Fondazione IRCCS Ca’ Granda Hospital (Milan, Italy)⁶¹. Hepatic fat content was estimated noninvasively by controlled attenuation parameter (CAP) with FibroScan device (Echosens). Genotyping was performed by Illumina GlobalScreeningArray (GSA)-24 v.3.0 plus Multidisease Array (Illumina) and further imputed to TOPMed reference genome panel⁶². At the time of analysis, genomic data passing quality control with an available CAP measure were available for 1,081 patients of European ancestry.

MAFALDA

The MAFALDA study started in May 2020 and ended in April 2022. It comprised a total of 468 consecutive participants with morbid obesity (BMI ≥ 35 kg m⁻²) who underwent bariatric surgery at Campus Bio-Medico University of Rome, Italy. In MAFALDA participants, SLD diagnosis was assessed only by liver histology in n = 116, only by vibration-controlled transient elastography, including CAP measurement with FibroScan (Echosens)⁶³ in 141 individuals, with both in 148 individuals and 63 with neither CAP nor liver biopsy. In this study, only individuals with liver fat content estimated by CAP were included (n = 172). Genotyping was performed in the same manner as that of the Liver BIBLE cohort. MAFALDA includes a total of 264 paired visceral and liver biopsies with available bulk transcriptomic data.

Dallas Heart Study

In this study, only 828 European Americans from the Dallas Heart Study (DHS-1) were used. The DHS is a population-based sample study of Dallas County, Texas, USA, where liver triglyceride content was measured by magnetic spectroscopy. Details of this study can be found elsewhere¹³.

Ethics

This research complies with the principles outlined in the Declaration of Helsinki. The UK Biobank received ethical approval from the National Research Ethics Service Committee Northwest Multi-Centre Haydock (reference 16/NW/0274). Data used in this study were obtained under application number 37142. The NEO study was approved by the medical ethical committee of the Leiden University Medical Center. The Liver BIBLE study was approved by the ethical committee of the Fondazione IRCCS Ca’ Granda (ID 1650, revision 23 June 2020). The MAFALDA study was approved by the Local Research Ethics Committee (no. 16/20). The DHS was approved by the institutional review board of the University of Texas Southwestern Medical Center. Each participant provided written informed consent. The baseline characteristics for these cohorts are listed in Supplementary Table 23.

Meta-analysis

The association between previously unknown independent loci for PDFF and cT1 and magnetic resonance spectroscopy liver fat (DHS-1 and NEO studies) or CAP measurement (MAFALDA and Liver BIBLE) was performed using a linear regression analysis adjusted for age, sex, age², age × sex, age² × sex and BMI after a rank-based inverse normal transformation of the response. An inverse-variance meta-analysis was then performed with fixed-effect model using the meta R package (v.6.5.0). For genetic variants not available in either of replication cohorts, a proxy variant was used instead: variants in LD (R²> 0.4) with the lead variant in the UK Biobank within a window of 1.5 Mbp. If no such variant was found in the UK Biobank, the LDproxy tool with Europeans from 1000 Genomes Project was used instead²⁰. In case of multiple proxy variants, the one with the highest LD and functional consequence was selected.

RNA-seq analysis

Total RNA for 264 paired liver and VAT samples from the MAFALDA cohort was isolated using miRNeasy Advanced Mini kit (QIAGEN). RNA sequencing and library preparation was performed in a paired-end 150-bp mode using the Illumina NovaSeq PE150 (Novogene). Reads were aligned to GRCh38 reference genome by STAR⁶⁴ (v.2.7.10a) after quality check (FastQC software v.0.12.0, Babraham Bioinformatics) and trimming of low-quality reads and potential contaminating adapters by Trimmomatic⁶⁵ (v.0.39). Gene-level read counts were quantified by RSEM⁶⁶ (v.1.3.3) tool against the Ensembl (release 107). Samples with insufficient mapping specificity (uniquely to total mapped reads <0.7) were excluded before the analysis. Finally, a paired differential expression analysis of 261 VAT and 244 liver samples was carried out using DESeq2 (ref. ⁶⁷) (v.1.38.3), while adjusting for RNA integrity number, individual ID and five surrogate variables detected by surrogate variable analysis⁶⁸.

Follow-up analysis

The longitudinal association of PRS with the occurrence of the outcomes was tested through Cox proportional hazard regression and expressed as hazard ratios with 95% confidence intervals. The median follow-up was 14.5 years and individuals with any of diagnoses at the baseline were excluded before the analysis (Supplementary Table 22). The proportional hazard assumption was checked through the consideration of Schoenfeld residuals and no violations were detected. Prospective associations were performed in R v.4.0.2 (R Foundation for Statistical Computing).

Gene–adiposity interaction analysis

Gene–adiposity interaction for independent loci from multi-adiposity-adjusted GWAS was performed in REGENIE (v.3.2.8) using robust standard errors (sandwich estimators HC3) to guard against heteroskedasticity¹⁶. The analysis was adjusted for age at MRI, sex, age², age × sex, age² × sex, the first ten PCs of ancestry, genotyping array and adiposity index, where adiposity index was VAT, WFM or BMI. Due to the sensitivity of interaction effect sizes to the trait transformation, PDFF and liver iron corrected T1 were log-transformed before the interaction analyses^69,70.

Mediation analysis

To examine whether the impact of identified independent loci on PDFF or liver iron corrected T1 are mediated via the measures of adiposity, we performed mediation analysis using the mediation R package⁷¹. All models were adjusted for age at MRI, sex, age², age × sex, age² × sex, the first ten PCs of ancestry, genotyping array and the polygenic covariate estimated in step 1 of REGENIE¹⁶. We additionally considered the scenario with a genetic variant-mediator interaction term. The significance of mediation (P values and 95% CIs) was assessed via a nonparametric bootstrap method (1,000 simulations) on the rank-based inverse normal transformed PDFF and liver iron corrected T1 as outcomes and adiposity measures as mediators. We also performed sensitivity analyses to examine sequential ignorability assumption (possible existence of unobserved confounders between adiposity indices and liver traits)³⁰. This was carried out by examining the correlation coefficient between error terms of liver traits and adiposity index models, at which estimated mediation effect was zero (95% CI contains 0).

Association analysis with adiposity measures

The association between independent loci from multi-adiposity-adjusted GWAS and BMI, WFM or VAT was performed using a whole-genome regression model as implemented in REGENIE (v.3.2.8)¹⁶. All models were adjusted for age at MRI, sex, age², age × sex, age² × sex, the first ten PCs of ancestry and genotyping array. Measures of adiposity were rank-based inverse normal transformed before the analysis.

bNMF clustering

We applied bNMF, an unsupervised soft clustering approach, to define ‘hypothesis-free’ clusters of independent loci from multi-adiposity-adjusted GWAS³⁵. This approach has been successfully employed to find physiologically relevant partitioned polygenic risk scores for type 2 diabetes³⁵. We first performed the association analysis between ALT, aspartate aminotransferase, glycated hemoglobin, circulating triglycerides, low-density lipoprotein cholesterol, glucose, creatinine, systolic blood pressure and cystatin C using REGENIE adjusted for age, sex, age², age × sex, age² × sex, the first ten PCs of ancestry and adiposity index, where adiposity index was chosen based on the strongest association from the multi-adiposity-adjusted GWAS. As VAT was not available at baseline, we used WHR instead. Next, a variant-trait association matrix of standardized z-scores (m × n, where m and n are the number of independent loci associating with PDFF and continuous traits mentioned above, respectively) was constructed while accounting for different sample sizes in each GWAS on continuous traits³⁵. This scaled matrix was then aligned to PDFF-increasing alleles. bNMF clustering was performed using the bNMF R pipeline (https://github.com/gwas-partitioning/bnmf-clustering) by setting the maximum number of clusters, K, to 7 for 1,000 iterations and removing highly correlated traits (Pearson correlation coefficient >0.85). After determining the maximum posterior solution at the most probable K, a cutoff maximizing the signal-to-noise ratio (1.08) was used to keep variants in each cluster³⁵. Two top-weighted traits at each cluster were used to define the cluster names. Finally, pPRSs were generated by a weighted sum of genetic variants at each cluster, where weights were derived from the multi-adiposity-adjusted GWAS of PDFF.

Comparison between bNMF and PDFF-TGs pPRS

To compare the PDFF-TGs hypothesis-driven pPRS approach and two pPRSs identified by bNMF algorithm to distinguish between liver, cardiometabolic and kidney outcomes, we performed a Wald test as follows:

W = \frac{{\hat{β}}_{p P R S 1} - {\hat{β}}_{p P R S 2}}{\sqrt{{S E}_{p P R S 1}^{2} + {S E}_{p P R S 2}^{2}}}

where ${\hat{β}}_{pPRS 1}$ , ${S E}_{p P R S 1}^{2}$ and ${\hat{β}}_{pPRS 2}$ , ${S E}_{p P R S 2}^{2}$ are the log OR and standard error of either discordant/concordant or bNMF pPRSs, respectively. In the equation above, W is the Wald test statistic and $W^{2} ~ χ_{1}^{2}$ . Similarly, we compared the model fit, by calculating the difference between Akaike information criterion between the pPRS (Supplementary Table 19).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Online content

Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41591-024-03284-0.

Supplementary information

Supplementary Information^{(6MB, pdf)}

Supplementary Figs. 1–6.

Reporting Summary^{(3.5MB, pdf)}

41591_2024_3284_MOESM3_ESM.xlsx^{(423.4KB, xlsx)}

Supplementary Table 1 Characteristics of European participants from UK Biobank with available PDFF or liver iron corrected T1 (cT1) stratified by sex. Supplementary Table 2 Heritability estimates of PDFF and liver iron corrected T1 under different adiposity adjustments. Supplementary Table 3 Genetic correlation among PDFF and liver iron corrected T1 under different adiposity adjustments. Supplementary Table 4 Pleiotropy analysis of PDFF and liver iron corrected T1 under different adiposity adjustments. Supplementary Table 5 Multi-adipo-adjusted GWAS of PDFF and cT1 across different adjustments for adiposity indices in the UK Biobank. Supplementary Table 6a Overlapping between previously reported genomic loci GWAS catalog and independent loci found in present study for PDFF. Supplementary Table 6b Overlapping between previously reported genomic loci GWAS catalog and independent loci found in present study for liver iron corrected T1. Supplementary Table 7a Functionally informed fine-mapping results for PDFF. Supplementary Table 7b Functionally informed fine-mapping results for liver iron corrected T1. Supplementary Table 8a Colocalization with GTEx eQTLs for PDFF. Supplementary Table 8b Colocalization with GTEx eQTLs for liver iron corrected T1. Supplementary Table 9a Gene mapping for independent genetic loci for PDFF. Supplementary Table 9b Gene mapping for independent genetic loci for liver iron corrected T1. Supplementary Table 10a Gene-set enrichment analysis of mapped genes from PDFF independent genetic loci. Supplementary Table 10b Gene-set enrichment analysis of mapped genes from liver iron corrected T1 independent genetic loci. Supplementary Table 10c Gene-set enrichment analysis of mapped genes for independent genetic loci in PDFF-circulating TGs discordant pPRS. Supplementary Table 10d Gene-set enrichment analysis of mapped genes for independent genetic loci in PDFF-circulating TGs concordant pPRS. Supplementary Table 11 The association between independent novel genetic loci and liver and metabolic-related outcomes in the UK Biobank. Supplementary Table 12 Summarized results of mediation, interaction and association analyses for PDFF and cT1 across different measures of adiposity. Supplementary Table 13 Summary statistics of mediation, interaction and association analyses for PDFF and cT1 across different measures of adiposity. Supplementary Table 14 Meta-analysis of the associations between independent novel genetic loci and HTGC in four independent cohorts. Supplementary Table 15 The association between independent previously known and replicated novel genetic loci from adipo-adjusted PDFF and liver iron corrected T1 and circulating triglycerides in the UK Biobank. Supplementary Table 16 Goodness-of-fit for pPRS. Supplementary Table 17 The association between genetic variants used in PDFF-circulating TGs pPRS and liver-related, cardiometabolic and chronic kidney failure traits in the UK Biobank. Supplementary Table 18 Case–control and prospective association of discordant and concordant pPRS with liver-related, cardiometabolic and chronic kidney failure traits in the UK Biobank. Supplementary Table 19 Comparison of PDFF-TGs and bNMF clustering pPRS approaches. Supplementary Table 20 Baseline characteristics of individuals stratified by the extremes of pPRS. Supplementary Table 21 Sex-specific association of discordant and concordant pPRS with liver-related, cardiometabolic and chronic kidney failure traits in the UK Biobank. Supplementary Table 22 Definition of binary traits. Supplementary Table 23 Baseline characteristics of European participants in replication cohorts stratified by sex.

Acknowledgements

S.R. was supported by the Swedish Cancerfonden (22 2270 Pj), the Swedish Research Council (Vetenskapsradet (VR), 2023-02079), the Swedish state under the Agreement between the Swedish government and the county councils (the ALF agreement, ALFGBG-965360), the Swedish Heart Lung Foundation (20220334), the Wallenberg Academy Fellows from the Knut and Alice Wallenberg Foundation (KAW 2017.0203), the Novo Nordisk Distinguished Investigator Grant - Endocrinology and Metabolism (NNF23OC0082114) and Novo Nordisk Project grants in Endocrinology and Metabolism (NNF20OC0063883). L.V. was supported by the Italian Ministry of Health (Ministero della Salute), Ricerca Finalizzata 2016, RF-2016-02364358 (‘Impact of whole exome sequencing on the clinical management of patients with advanced nonalcoholic fatty liver and cryptogenic liver disease’), Italian Ministry of Health, Ricerca Finalizzata 2021 ‘Targeting the epigenetic regulators Suv420h1/2 in hepatocytes to treat nonalcoholic fatty liver disease’ (TERS) RF-2021-12373889, Italian Ministry of Health (national coordinator) (2023-2026) Ricerca Finalizzata PNRR 2022 ‘RATIONAL: Risk strAtificaTIon Of Nonalcoholic fAtty Liver’ PNRR-MAD-2022-12375656 (L.V.); Italian Ministry of Health (Ministero della Salute), Rete Cardiologica ‘CV-PREVITAL’; Italian Ministry of Health (Ministero della Salute), Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, Ricerca Corrente (L.V. and D.P.); Fondazione Patrimonio Ca’ Granda, ‘Liver BIBLE’ (PR-0361); Innovative Medicines Initiative 2 joint undertaking of the European Union (EU)’s Horizon 2020 research and innovation program and EFPIA EU Program Horizon 2020 (under grant agreement no. 777377) for the project LITMUS; and The EU, H2020-ICT-2018-20/H2020-ICT-2020-2 program ‘Photonics’ under grant agreement no. 101016726 - REVEAL; Gilead_IN-IT-989-5790. The EU, HORIZON-MISS-2021-CANCER-02-03 program ‘Genial’ under grant agreement ‘101096312’. Italian Ministry of University and Research, PNRR – M4 - C2 ‘di R&S su alcune Key Enabling Technologies’ ‘National Center for Gene Therapy and Drugs based on RNA Technology’ CN3 Spoke 4, group ASSET: a sex-specific approach to NAFLD targeting. P.P. was supported by National Institutes of Health grants R01HG010505, R01DK132775 and R01HL170604. The NEO study is supported by the participating departments, the Division and the Board of Directors of the Leiden University Medical Center and by the Leiden University, Research Profile Area ‘Vascular and Regenerative Medicine’.

Extended data

Author contributions

O.J. and S.R. conceived and designed the study. O.J. conducted the main analyses in the UK Biobank. A.D.V. conducted and interpreted the follow-up studies in the UK Biobank. F.T., F.M., R.L.G., J.K., U.V.G. and F.R.R. contributed to the replication in the independent cohorts. O.J., M.A. and K.G. contributed to differential expression and functional enrichment analyses. O.J., S.R., L.V., J.K. and R.M.M. curated data and results. O.J., S.R., L.V., J.K., R.M.M., S.M., P.P., F.P., F.R.R. and U.V.G. provided feedback on the analyses and supervised the project. S.R., O.J., L.V., J.K., R.M.M. and A.D.V. wrote the initial draft of the paper. All authors contributed and approved the final version of the paper.

Peer review

Peer review information

Nature Medicine thanks Ewan Pearson and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Anna Maria Ranzoni, in collaboration with the Nature Medicine team.

Funding

Open access funding provided by University of Gothenburg.

Data availability

All data associated with this study are presented in the paper or the Supplementary Information. Multi-adiposity-adjusted GWAS of PDFF and liver iron corrected T1 (GRCh37) are publicly available on GWAS catalog with the following accession IDs: GCST90446646, GCST90446647, GCST90446648, GCST90446649, GCST90446650, GCST90446651, GCST90446652 and GCST90446653. All external GWAS summary statistics accessed via the GWAS catalog are publicly available and have been cited in Supplementary Table 6a,b. For the UK Biobank, all individual-level phenotype/genotype data are accessible via a formal application to the UK Biobank at http://www.ukbiobank.ac.uk. The ethical approval of the MAFALDA study restricts the public sharing of individual data; however, the data of the liver visceral adipose biopsies from the MAFALDA cohort researchers can submit a proposal to access either raw or analyzed data between 9 to 36 months after publication. Proposals should be directed to S.R. at stefano.romeo@ki.se. S.R. will review each request to assess data availability. Responses will be provided within 8 weeks of receiving the request. It is important to note that patient-related data may be restricted due to confidentiality regulations. If approved for sharing, data will be transferred under a material transfer agreement. NEO study requests should be sent to f.r.rosendaal@lumc.nl. Liver BIBLE study requests should be sent to luca.valenti@unimi.it. Dallas Heart Study requests should be sent to dallasheartstudy@utsouthwestern.edu. The following online databases have been used: GWAS catalog, https://www.ebi.ac.uk/gwas/ and baseline LD model: https://data.broadinstitute.org/alkesgroup/LDSCORE/.

Code availability

Competing interests

S.R. has been consulting for AstraZeneca, GSK, Celgene Corporation, Ribo-cure AB and Pfizer in the last 5 years and received a research grant from AstraZeneca. The funders had no role in the design of the study; in the collection, analyses or interpretation of data; in the writing of the paper or in the decision to publish the results. L.V. has received speaking fees from MSD, Gilead, AlfaSigma and AbbVie, served as a consultant for Gilead, Pfizer, AstraZeneca, Novo Nordisk, Intercept, Diatech Pharmacogenetics, Ionis Pharmaceuticals, Boehringer Ingelheim and Resalis Therapeutics, and received unrestricted research grants from Gilead. R.L.G. is a part-time contractor for Metabolon. O.J. is a part-time consultant to Ribo-cure AB. The other authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Oveis Jamialahmadi, Email: oveis.jamialahmadi@wlab.gu.se.

Stefano Romeo, Email: stefano.romeo@ki.se.

Extended data

is available for this paper at 10.1038/s41591-024-03284-0.

Supplementary information

The online version contains supplementary material available at 10.1038/s41591-024-03284-0.

References

1.Sanyal, A. J. et al. Prospective study of outcomes in adults with nonalcoholic fatty liver disease. N. Engl. J. Med.385, 1559–1569 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Pais, R., Redheuil, A., Cluzel, P., Ratziu, V. & Giral, P. Relationship among fatty liver, specific and multiple-site atherosclerosis, and 10-year Framingham score. Hepatology69, 1453–1463 (2019). [DOI] [PubMed] [Google Scholar]
3.Anstee, Q. M., Targher, G. & Day, C. P. Progression of NAFLD to diabetes mellitus, cardiovascular disease or cirrhosis. Nat. Rev. Gastroenterol. Hepatol.10, 330–344 (2013). [DOI] [PubMed] [Google Scholar]
4.Toh, J. Z. K. et al. A meta-analysis on the global prevalence, risk factors and screening of coronary heart disease in nonalcoholic fatty liver disease. Clin. Gastroenterol. Hepatol.20, 2462–2473.e2410 (2022). [DOI] [PubMed] [Google Scholar]
5.Romeo, S., Sanyal, A. & Valenti, L. Leveraging human genetics to identify potential new treatments for fatty liver disease. Cell Metab.31, 35–45 (2020). [DOI] [PubMed] [Google Scholar]
6.Mantovani, A. et al. Adverse effect of PNPLA3 p.I148M genetic variant on kidney function in middle-aged individuals with metabolic dysfunction. Aliment Pharm. Ther.57, 1093–1102 (2023). [DOI] [PubMed] [Google Scholar]
7.Sun, D. Q. et al. An international Delphi consensus statement on metabolic dysfunction-associated fatty liver disease and risk of chronic kidney disease. Hepatobiliary Surg. Nutr.12, 386–403 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Chen, Y. et al. Genome-wide association meta-analysis identifies 17 loci associated with nonalcoholic fatty liver disease. Nat. Genet. 10.1038/s41588-023-01497-6 (2023). [DOI] [PMC free article] [PubMed]
9.Vujkovic, M. et al. A multiancestry genome-wide association study of unexplained chronic ALT elevation as a proxy for nonalcoholic fatty liver disease with histological and radiological validation. Nat. Genet.54, 761–771 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Emdin, C. A. et al. A missense variant in mitochondrial amidoxime reducing component 1 gene and protection against liver disease. PLoS Genet.16, e1008629 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Miao, Z. et al. Identification of 90 NAFLD GWAS loci and establishment of NAFLD PRS and causal role of NAFLD in coronary artery disease. HGG Adv.3, 100056 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Jamialahmadi, O. et al. Exome-wide association study on alanine aminotransferase identifies sequence variants in the GPAM and APOE associated with fatty liver disease. Gastroenterology160, 1634–1646.e1637 (2021). [DOI] [PubMed] [Google Scholar]
13.Romeo, S. et al. Genetic variation in PNPLA3 confers susceptibility to nonalcoholic fatty liver disease. Nat. Genet.40, 1461–1465 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Romeo, S. et al. Morbid obesity exposes the association between PNPLA3 I148M (rs738409) and indices of hepatic injury in individuals of European descent. Int J. Obes.34, 190–194 (2010). [DOI] [PubMed] [Google Scholar]
15.Agrawal, S. et al. Inherited basis of visceral, abdominal subcutaneous and gluteofemoral fat depots. Nat. Commun.13, 3771 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Mbatchou, J. et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat. Genet.53, 1097–1103 (2021). [DOI] [PubMed] [Google Scholar]
17.Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience4, 7 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet.44, 369–375 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Kanai, M. et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat. Genet.50, 390–400 (2018). [DOI] [PubMed] [Google Scholar]
20.Machiela, M. J. & Chanock, S. J. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics31, 3555–3557 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Dongiovanni, P. et al. Causal relationship of hepatic fat with liver damage and insulin resistance in nonalcoholic fatty liver. J. Intern. Med.283, 356–370 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Stender, S. et al. Adiposity amplifies the genetic risk of fatty liver disease conferred by multiple loci. Nat. Genet.49, 842–847 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Madsen, M. S., Siersbæk, R., Boergesen, M., Nielsen, R. & Mandrup, S. Peroxisome proliferator-activated receptor γ and C/EBPα synergistically activate key metabolic adipocyte genes by assisted loading. Mol. Cell. Biol.34, 939–954 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Dongiovanni, P. & Valenti, L. Peroxisome proliferator-activated receptor genetic polymorphisms and nonalcoholic fatty liver disease: any role in disease susceptibility? PPAR Res.2013, 452061 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Sarhangi, N. et al. PPARG (Pro12Ala) genetic variant and risk of T2DM: a systematic review and meta-analysis. Sci. Rep.10, 12764 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Stumvoll, M. & Häring, H. The peroxisome proliferator-activated receptor-γ2 Pro12Ala polymorphism. Diabetes51, 2341–2347 (2002). [DOI] [PubMed] [Google Scholar]
27.Dongiovanni, P., Romeo, S. & Valenti, L. Genetic factors in the pathogenesis of nonalcoholic fatty liver and steatohepatitis. BioMed. Res. Int.2015, 460190 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Lotta, L. A. et al. Integrative genomic analysis implicates limited peripheral adipose storage capacity in the pathogenesis of human insulin resistance. Nat. Genet.49, 17–26 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Ji, Y. et al. Genome-wide and abdominal MRI data provide evidence that a genetically determined favorable adiposity phenotype is characterized by lower ectopic liver fat and lower risk of type 2 diabetes, heart disease and hypertension. Diabetes68, 207–219 (2019). [DOI] [PubMed] [Google Scholar]
30.Imai, K. & Yamamoto, T. Identification and sensitivity analysis for multiple causal mechanisms: revisiting evidence from framing experiments. Political Anal.21, 141–171 (2013). [Google Scholar]
31.MacKinnon, D. P., Krull, J. L. & Lockwood, C. M. Equivalence of the mediation, confounding and suppression effect. Prev. Sci.1, 173–181 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Dongiovanni, P. et al. Transmembrane 6 superfamily member 2 gene variant disentangles nonalcoholic steatohepatitis from cardiovascular disease. Hepatology61, 506–514 (2015). [DOI] [PubMed] [Google Scholar]
33.Tada, H., Usui, S., Sakata, K., Takamura, M. & Kawashiri, M. A. Low-density lipoprotein cholesterol level cannot be too low: considerations from clinical trials, human genetics and biology. J. Atheroscler. Thromb.27, 489–498 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Lauridsen, B. K. et al. Liver fat content, non-alcoholic fatty liver disease and ischaemic heart disease: Mendelian randomization and meta-analysis of 279013 individuals. Eur. Heart J.39, 385–393 (2018). [DOI] [PubMed] [Google Scholar]
35.Smith, K. et al. Multi-ancestry polygenic mechanisms of type 2 diabetes. Nat. Med.30, 1065–1074 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Udler, M. S. et al. Type 2 diabetes genetic loci informed by multi-trait associations point to disease mechanisms and subtypes: a soft clustering analysis. PLoS Med.15, e1002654 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Cherubini, A. et al. Interaction between estrogen receptor-α and PNPLA3 p.I148M variant drives fatty liver disease susceptibility in women. Nat. Med.29, 2643–2655 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Prill, S. et al. The TM6SF2 E167K genetic variant induces lipid biosynthesis and reduces apolipoprotein B secretion in human hepatic 3D spheroids. Sci. Rep.9, 11585 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Pirazzi, C. et al. Patatin-like phospholipase domain-containing 3 (PNPLA3) I148M (rs738409) affects hepatic VLDL secretion in humans and in vitro. J. Hepatol.57, 1276–1282 (2012). [DOI] [PubMed] [Google Scholar]
40.Holmen, O. L. et al. Systematic evaluation of coding variation identifies a candidate causal variant in TM6SF2 influencing total cholesterol and myocardial infarction risk. Nat. Genet.46, 345–351 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Udler, M. S., McCarthy, M. I., Florez, J. C. & Mahajan, A. Genetic risk scores for diabetes diagnosis and precision medicine. Endocr. Rev.40, 1500–1520 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Bianco, C., Casirati, E., Malvestiti, F. & Valenti, L. Genetic predisposition similarities between NASH and ASH:identification of new therapeutic targets. JHEP Rep.3, 100284 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature562, 203–209 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Parisinos, C. A. et al. Genome-wide and Mendelian randomisation studies of liver MRI yield insights into the pathogenesis of steatohepatitis. J. Hepatol.73, 241–251 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Jamialahmadi, O., Tavaglione, F., Rawshani, A., Ljungman, C. & Romeo, S. Fatty liver disease, heart rate and cardiac remodelling: evidence from the UK Biobank. Liver Int.43, 1247–1255 (2023). [DOI] [PubMed] [Google Scholar]
46.Buniello, A. et al. The NHGRI-EBI GWAS catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res.47, D1005–D1012 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet.47, 291–295 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Gazal, S. et al. Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection. Nat. Genet.49, 1421–1427 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res.19, 1639–1645 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Weissbrod, O. et al. Functionally informed fine-mapping and polygenic localization of complex trait heritability. Nat. Genet.52, 1355–1363 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Ser. B Stat. Methodol.82, 1273–1300 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Kerimov, N. et al. A compendium of uniformly processed human gene expression and splicing quantitative trait loci. Nat. Genet.53, 1290–1299 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Consortium, G. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science369, 1318–1330 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Lawrence, M., Gentleman, R. & Carey, V. rtracklayer: an R package for interfacing with genome browsers. Bioinformatics25, 1841–1842 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Wallace, C. A more accurate method for colocalisation analysis allowing for multiple causal variants. PLoS Genet.17, e1009440 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun.8, 1826 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
57.de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol.11, e1004219 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Ghoussaini, M. et al. Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics. Nucleic Acids Res.49, D1311–D1320 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Xie, Z. et al. Gene set knowledge discovery with Enrichr. Curr. Protoc.1, e90 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
60.de Mutsert, R. et al. The Netherlands Epidemiology of Obesity (NEO) study: study design and data collection. Eur. J. Epidemiol.28, 513–523 (2013). [DOI] [PubMed] [Google Scholar]
61.Valenti, L. et al. Definition of healthy ranges for alanine aminotransferase levels: a 2021 update. Hepatol. Commun.5, 1824–1832 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
62.Ellinghaus, D. et al. Genomewide association study of severe Covid-19 with respiratory failure. N. Engl. J. Med.383, 1522–1534 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Tavaglione, F. et al. Accuracy of controlled attenuation parameter for assessing liver steatosis in individuals with morbid obesity before bariatric surgery. Liver Int. 10.1111/liv.15127 (2021). [DOI] [PubMed]
64.Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics29, 15–21 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
65.Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics30, 2114–2120 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform.12, 323 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol.15, 550 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
68.Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E. & Storey, J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics28, 882–883 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
69.Marderstein, A. R. et al. Leveraging phenotypic variability to identify genetic interactions in human phenotypes. Am. J. Hum. Genet. 108, 49–67 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
70.Sulc, J. et al. Quantification of the overall contribution of gene-environment interaction for obesity-related traits. Nat. Commun.11, 1385 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
71.Tingley, D., Yamamoto, T., Hirose, K., Keele, L. & Imai, K. mediation: R package for causal mediation analysis. J. Stat. Softw.10.18637/jss.v059.i05 (2014).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information^{(6MB, pdf)}

Supplementary Figs. 1–6.

Reporting Summary^{(3.5MB, pdf)}

41591_2024_3284_MOESM3_ESM.xlsx^{(423.4KB, xlsx)}

Data Availability Statement

[CR1] 1.Sanyal, A. J. et al. Prospective study of outcomes in adults with nonalcoholic fatty liver disease. N. Engl. J. Med.385, 1559–1569 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Pais, R., Redheuil, A., Cluzel, P., Ratziu, V. & Giral, P. Relationship among fatty liver, specific and multiple-site atherosclerosis, and 10-year Framingham score. Hepatology69, 1453–1463 (2019). [DOI] [PubMed] [Google Scholar]

[CR3] 3.Anstee, Q. M., Targher, G. & Day, C. P. Progression of NAFLD to diabetes mellitus, cardiovascular disease or cirrhosis. Nat. Rev. Gastroenterol. Hepatol.10, 330–344 (2013). [DOI] [PubMed] [Google Scholar]

[CR4] 4.Toh, J. Z. K. et al. A meta-analysis on the global prevalence, risk factors and screening of coronary heart disease in nonalcoholic fatty liver disease. Clin. Gastroenterol. Hepatol.20, 2462–2473.e2410 (2022). [DOI] [PubMed] [Google Scholar]

[CR5] 5.Romeo, S., Sanyal, A. & Valenti, L. Leveraging human genetics to identify potential new treatments for fatty liver disease. Cell Metab.31, 35–45 (2020). [DOI] [PubMed] [Google Scholar]

[CR6] 6.Mantovani, A. et al. Adverse effect of PNPLA3 p.I148M genetic variant on kidney function in middle-aged individuals with metabolic dysfunction. Aliment Pharm. Ther.57, 1093–1102 (2023). [DOI] [PubMed] [Google Scholar]

[CR7] 7.Sun, D. Q. et al. An international Delphi consensus statement on metabolic dysfunction-associated fatty liver disease and risk of chronic kidney disease. Hepatobiliary Surg. Nutr.12, 386–403 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Chen, Y. et al. Genome-wide association meta-analysis identifies 17 loci associated with nonalcoholic fatty liver disease. Nat. Genet. 10.1038/s41588-023-01497-6 (2023). [DOI] [PMC free article] [PubMed]

[CR9] 9.Vujkovic, M. et al. A multiancestry genome-wide association study of unexplained chronic ALT elevation as a proxy for nonalcoholic fatty liver disease with histological and radiological validation. Nat. Genet.54, 761–771 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] 10.Emdin, C. A. et al. A missense variant in mitochondrial amidoxime reducing component 1 gene and protection against liver disease. PLoS Genet.16, e1008629 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Miao, Z. et al. Identification of 90 NAFLD GWAS loci and establishment of NAFLD PRS and causal role of NAFLD in coronary artery disease. HGG Adv.3, 100056 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Jamialahmadi, O. et al. Exome-wide association study on alanine aminotransferase identifies sequence variants in the GPAM and APOE associated with fatty liver disease. Gastroenterology160, 1634–1646.e1637 (2021). [DOI] [PubMed] [Google Scholar]

[CR13] 13.Romeo, S. et al. Genetic variation in PNPLA3 confers susceptibility to nonalcoholic fatty liver disease. Nat. Genet.40, 1461–1465 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Romeo, S. et al. Morbid obesity exposes the association between PNPLA3 I148M (rs738409) and indices of hepatic injury in individuals of European descent. Int J. Obes.34, 190–194 (2010). [DOI] [PubMed] [Google Scholar]

[CR15] 15.Agrawal, S. et al. Inherited basis of visceral, abdominal subcutaneous and gluteofemoral fat depots. Nat. Commun.13, 3771 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Mbatchou, J. et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat. Genet.53, 1097–1103 (2021). [DOI] [PubMed] [Google Scholar]

[CR17] 17.Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience4, 7 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet.44, 369–375 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Kanai, M. et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat. Genet.50, 390–400 (2018). [DOI] [PubMed] [Google Scholar]

[CR20] 20.Machiela, M. J. & Chanock, S. J. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics31, 3555–3557 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Dongiovanni, P. et al. Causal relationship of hepatic fat with liver damage and insulin resistance in nonalcoholic fatty liver. J. Intern. Med.283, 356–370 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Stender, S. et al. Adiposity amplifies the genetic risk of fatty liver disease conferred by multiple loci. Nat. Genet.49, 842–847 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Madsen, M. S., Siersbæk, R., Boergesen, M., Nielsen, R. & Mandrup, S. Peroxisome proliferator-activated receptor γ and C/EBPα synergistically activate key metabolic adipocyte genes by assisted loading. Mol. Cell. Biol.34, 939–954 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Dongiovanni, P. & Valenti, L. Peroxisome proliferator-activated receptor genetic polymorphisms and nonalcoholic fatty liver disease: any role in disease susceptibility? PPAR Res.2013, 452061 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Sarhangi, N. et al. PPARG (Pro12Ala) genetic variant and risk of T2DM: a systematic review and meta-analysis. Sci. Rep.10, 12764 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Stumvoll, M. & Häring, H. The peroxisome proliferator-activated receptor-γ2 Pro12Ala polymorphism. Diabetes51, 2341–2347 (2002). [DOI] [PubMed] [Google Scholar]

[CR27] 27.Dongiovanni, P., Romeo, S. & Valenti, L. Genetic factors in the pathogenesis of nonalcoholic fatty liver and steatohepatitis. BioMed. Res. Int.2015, 460190 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Lotta, L. A. et al. Integrative genomic analysis implicates limited peripheral adipose storage capacity in the pathogenesis of human insulin resistance. Nat. Genet.49, 17–26 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Ji, Y. et al. Genome-wide and abdominal MRI data provide evidence that a genetically determined favorable adiposity phenotype is characterized by lower ectopic liver fat and lower risk of type 2 diabetes, heart disease and hypertension. Diabetes68, 207–219 (2019). [DOI] [PubMed] [Google Scholar]

[CR30] 30.Imai, K. & Yamamoto, T. Identification and sensitivity analysis for multiple causal mechanisms: revisiting evidence from framing experiments. Political Anal.21, 141–171 (2013). [Google Scholar]

[CR31] 31.MacKinnon, D. P., Krull, J. L. & Lockwood, C. M. Equivalence of the mediation, confounding and suppression effect. Prev. Sci.1, 173–181 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR32] 32.Dongiovanni, P. et al. Transmembrane 6 superfamily member 2 gene variant disentangles nonalcoholic steatohepatitis from cardiovascular disease. Hepatology61, 506–514 (2015). [DOI] [PubMed] [Google Scholar]

[CR33] 33.Tada, H., Usui, S., Sakata, K., Takamura, M. & Kawashiri, M. A. Low-density lipoprotein cholesterol level cannot be too low: considerations from clinical trials, human genetics and biology. J. Atheroscler. Thromb.27, 489–498 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] 34.Lauridsen, B. K. et al. Liver fat content, non-alcoholic fatty liver disease and ischaemic heart disease: Mendelian randomization and meta-analysis of 279013 individuals. Eur. Heart J.39, 385–393 (2018). [DOI] [PubMed] [Google Scholar]

[CR35] 35.Smith, K. et al. Multi-ancestry polygenic mechanisms of type 2 diabetes. Nat. Med.30, 1065–1074 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] 36.Udler, M. S. et al. Type 2 diabetes genetic loci informed by multi-trait associations point to disease mechanisms and subtypes: a soft clustering analysis. PLoS Med.15, e1002654 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR37] 37.Cherubini, A. et al. Interaction between estrogen receptor-α and PNPLA3 p.I148M variant drives fatty liver disease susceptibility in women. Nat. Med.29, 2643–2655 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR38] 38.Prill, S. et al. The TM6SF2 E167K genetic variant induces lipid biosynthesis and reduces apolipoprotein B secretion in human hepatic 3D spheroids. Sci. Rep.9, 11585 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR39] 39.Pirazzi, C. et al. Patatin-like phospholipase domain-containing 3 (PNPLA3) I148M (rs738409) affects hepatic VLDL secretion in humans and in vitro. J. Hepatol.57, 1276–1282 (2012). [DOI] [PubMed] [Google Scholar]

[CR40] 40.Holmen, O. L. et al. Systematic evaluation of coding variation identifies a candidate causal variant in TM6SF2 influencing total cholesterol and myocardial infarction risk. Nat. Genet.46, 345–351 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] 41.Udler, M. S., McCarthy, M. I., Florez, J. C. & Mahajan, A. Genetic risk scores for diabetes diagnosis and precision medicine. Endocr. Rev.40, 1500–1520 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR42] 42.Bianco, C., Casirati, E., Malvestiti, F. & Valenti, L. Genetic predisposition similarities between NASH and ASH:identification of new therapeutic targets. JHEP Rep.3, 100284 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature562, 203–209 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Parisinos, C. A. et al. Genome-wide and Mendelian randomisation studies of liver MRI yield insights into the pathogenesis of steatohepatitis. J. Hepatol.73, 241–251 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR45] 45.Jamialahmadi, O., Tavaglione, F., Rawshani, A., Ljungman, C. & Romeo, S. Fatty liver disease, heart rate and cardiac remodelling: evidence from the UK Biobank. Liver Int.43, 1247–1255 (2023). [DOI] [PubMed] [Google Scholar]

[CR46] 46.Buniello, A. et al. The NHGRI-EBI GWAS catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res.47, D1005–D1012 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR47] 47.Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet.47, 291–295 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR48] 48.Gazal, S. et al. Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection. Nat. Genet.49, 1421–1427 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR49] 49.Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res.19, 1639–1645 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR50] 50.Weissbrod, O. et al. Functionally informed fine-mapping and polygenic localization of complex trait heritability. Nat. Genet.52, 1355–1363 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR51] 51.Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Ser. B Stat. Methodol.82, 1273–1300 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR52] 52.Kerimov, N. et al. A compendium of uniformly processed human gene expression and splicing quantitative trait loci. Nat. Genet.53, 1290–1299 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR53] 53.Consortium, G. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science369, 1318–1330 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR54] 54.Lawrence, M., Gentleman, R. & Carey, V. rtracklayer: an R package for interfacing with genome browsers. Bioinformatics25, 1841–1842 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR55] 55.Wallace, C. A more accurate method for colocalisation analysis allowing for multiple causal variants. PLoS Genet.17, e1009440 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR56] 56.Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun.8, 1826 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR57] 57.de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol.11, e1004219 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR58] 58.Ghoussaini, M. et al. Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics. Nucleic Acids Res.49, D1311–D1320 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR59] 59.Xie, Z. et al. Gene set knowledge discovery with Enrichr. Curr. Protoc.1, e90 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR60] 60.de Mutsert, R. et al. The Netherlands Epidemiology of Obesity (NEO) study: study design and data collection. Eur. J. Epidemiol.28, 513–523 (2013). [DOI] [PubMed] [Google Scholar]

[CR61] 61.Valenti, L. et al. Definition of healthy ranges for alanine aminotransferase levels: a 2021 update. Hepatol. Commun.5, 1824–1832 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR62] 62.Ellinghaus, D. et al. Genomewide association study of severe Covid-19 with respiratory failure. N. Engl. J. Med.383, 1522–1534 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR63] 63.Tavaglione, F. et al. Accuracy of controlled attenuation parameter for assessing liver steatosis in individuals with morbid obesity before bariatric surgery. Liver Int. 10.1111/liv.15127 (2021). [DOI] [PubMed]

[CR64] 64.Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics29, 15–21 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR65] 65.Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics30, 2114–2120 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR66] 66.Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform.12, 323 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR67] 67.Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol.15, 550 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR68] 68.Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E. & Storey, J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics28, 882–883 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR69] 69.Marderstein, A. R. et al. Leveraging phenotypic variability to identify genetic interactions in human phenotypes. Am. J. Hum. Genet. 108, 49–67 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR70] 70.Sulc, J. et al. Quantification of the overall contribution of gene-environment interaction for obesity-related traits. Nat. Commun.11, 1385 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR71] 71.Tingley, D., Yamamoto, T., Hirose, K., Keele, L. & Imai, K. mediation: R package for causal mediation analysis. J. Stat. Softw.10.18637/jss.v059.i05 (2014).

PERMALINK

Partitioned polygenic risk scores identify distinct types of metabolic dysfunction-associated steatotic liver disease

Oveis Jamialahmadi

Antonio De Vincentis

Federica Tavaglione

Francesco Malvestiti

Ruifang Li-Gao

Rosellina M Mancina

Marcus Alvarez

Kyla Gelev

Samantha Maurotti

Umberto Vespasiani-Gentilucci

Frits Richard Rosendaal

Julia Kozlitina

Päivi Pajukanta

François Pattou

Luca Valenti

Stefano Romeo

Abstract

Main

Results

Visceral adipose tissue, whole-body fat mass and BMI are independent predictors of liver triglyceride content and inflammation/fibrosis

Extended Data Fig. 1. Measures of adiposity are highly correlated with liver triglycerides and inflammation/fibrosis, with VAT, WFM, and BMI being independent predictors of liver outcomes.

Identification of 17 previously unknown loci for liver triglyceride content and 9 for liver inflammation/fibrosis by the multi-adiposity-adjusted GWAS

Fig. 1. Overview of the identified loci for liver triglycerides and inflammation/fibrosis by the multi-adiposity-adjustment GWAS.

Table 1.

Identification of the putative causal loci associated with liver traits

Functional analyses of independent loci associated with liver traits

Previously unknown genetic loci, liver and metabolic traits

Extended Data Fig. 2. Previously unknown genetic loci were associated with liver disease and metabolic traits.

Indices of adiposity contribute differentially to the association between genetic variants and liver triglycerides

The association between six previously unknown loci and liver triglyceride content was replicated in independent cohorts

Fig. 2. The association between six previously unknown loci and hepatic triglyceride content in independent cohorts.

Partitioned polygenic risk scores identify a steatotic liver-specific disease and a systemic MASLD

Extended Data Fig. 3. Association between 26 previously known and 6 previously unknown replicated genetic loci and circulating triglycerides in the UK Biobank.

Fig. 3. Partitioned polygenic risk scores identify a steatotic liver-specific disease and a systemic MASLD.

Sex-specific effect of the association between pPRS and the feature of cardiometabolic syndrome

mRNA expression of loci from the liver-specific PRS is more abundant in the liver

Fig. 4. mRNA expression of loci from the liver-specific (discordant) polygenic risk score is more abundant in the liver compared to the visceral adipose tissue.

Discussion

Extended Data Fig. 4. Putative model of the two different types of MASLD.

Methods

UK Biobank

Genotypes and imputation

Definition of traits

Phenotypic prediction models

Genome-wide association analysis

Identifying independent variants

Estimating heritability and genetic correlations

Pleiotropy analysis

Functionally informed fine-mapping

Colocalization

Variant annotation

Gene mapping and functional enrichment analysis

Partitioned polygenic risk scores of liver triglyceride content

Replication cohorts

NEO

Liver BIBLE

MAFALDA

Dallas Heart Study

Ethics

Meta-analysis

RNA-seq analysis

Follow-up analysis

Gene–adiposity interaction analysis

Mediation analysis

Association analysis with adiposity measures

bNMF clustering

Comparison between bNMF and PDFF-TGs pPRS

Reporting summary

Online content

Supplementary information

Acknowledgements

Extended data

Author contributions

Peer review

Peer review information

Funding

Data availability

Code availability