Abstract
Understanding how genetic variants influence disease risk and complex traits (variant-to-function) is one of the major challenges in human genetics. Here we present a model-driven framework to leverage human genome-scale metabolic networks to define how genetic variants affect biochemical reaction fluxes across major human tissues, including skeletal muscle, adipose, liver, brain and heart. As proof of concept, we build personalised organ-specific metabolic flux models for 524,615 individuals of the INTERVAL and UK Biobank cohorts and perform a fluxome-wide association study (FWAS) to identify 4312 associations between personalised flux values and the concentration of metabolites in blood. Furthermore, we apply FWAS to identify 92 metabolic fluxes associated with the risk of developing coronary artery disease, many of which are linked to processes previously described to play in role in the disease. Our work demonstrates that genetically personalised metabolic models can elucidate the downstream effects of genetic variants on biochemical reactions involved in common human diseases.
Subject terms: Metabolomics, Biochemical networks, Cardiovascular diseases
Here, the authors present a method to build genetically personalised metabolic models across tissues to estimate individualised reaction fluxes. A fluxome-wide association study in UK Biobank identifies fluxes associated with metabolites and coronary artery disease.
Introduction
Genome-wide association studies (GWAS) have identified more than 50,000 genetic variants associated with complex traits or diseases1. While the contribution of individual variants to a given phenotype is generally small, the effect of multiple genetic variants can be aggregated into polygenic scores (PGS), which are highly predictive of disease incidence and enhance existing risk models2–4. However, while GWAS and PGS can be useful for risk stratification5–7, the mechanisms by which genetic variants influence disease risk, i.e., variant to function (V2F), remain largely unsolved. Addressing V2F is a major challenge in human genetics and has the potential to unveil many new therapeutic targets6,8,9.
An approach to address the V2F challenge is to quantify how genetic variation causes disease through the regulation of molecular traits. To this end, genetic variants affecting gene expression are identified and subsequently aggregated into models that can impute the abundance of transcripts and proteins10–12. For example, PredictDB is a database that offers a collection of linear models to impute transcript levels in specific organs of the human body13. PredictDB models were trained in the GTEx dataset, which contains genotype profiling and tissue-specific transcript abundance from post-mortem donors14. Imputed transcript or protein levels can be used to perform transcriptome-wide or proteome-wide association analyses, respectively, to identify gene products associated with disease10,15. Alternatively, PGSs for disease can be used to identify proteins and other gene products which may disrupt polygenic risk16. However, transcripts and proteins do not exert their effects in isolation but in highly connected and complex biological networks. Indeed, previous studies have shown the merit of analysing genetic variation in the context of gene co-expression and gene interaction networks to characterise how the effects of genetic variants contribute to complex traits or diseases by propagating through biological networks17–20.
Metabolism is one of the most prominent biological networks and a comparatively tractable experimental setting in which to address the V2F challenge. Essentially, metabolism is a set of interconnected chemical reactions and transport processes occurring in a highly ordered, regulated and coordinated manner across multiple organs in the human body21. The metabolic phenotype of a given organ is defined by both metabolite concentrations and metabolic fluxes (i.e., the rates at which substrates are converted to products through reactions) and emerges from the complex interaction of metabolites, enzymes, and transmembrane carriers22,23. Metabolite concentrations offer a static snapshot of metabolite distributions, whereas metabolic fluxes provide a map of metabolite traffic through metabolic pathways24.
Genome-scale metabolic models (GSMMs), mathematical representations of the metabolic reaction network arising from the human genome25,26, simulate steady-state metabolic fluxes by formulating network stoichiometry as sets of linear equations and directionality constraints27. GSMMs have emerged as a useful approach to integrate transcriptomics, proteomics, and metabolomics to characterise metabolic flux maps28,29. For example, proteomics, metabolomics, and physiological data have been used to build human organ-specific GSMMs30. Similarly, there is increasing interest in integrating individual measures to build personalised GSMMs that reflect the specific metabolic phenotype in each individual, thus facilitating personalised medicine30–34.
Since gene expression is highly heritable10,13, it may be feasible to leverage human genome-scale metabolic networks to analyse the system-wide effects of genetic variants on metabolism and build genetically personalised GSMMs. To this end, we present a framework where transcript levels imputed from genetic data can be used to simulate personalised and organ-specific, genome-scale flux maps using the quadratic metabolic transformation algorithm (qMTA). Such flux maps provide genetically personalised metabolic models at a genome scale for each tissue. As proof of concept, we build personalised organ-specific flux maps for over 520,000 individuals across the INTERVAL35,36 and UK Biobank (UKB)37 cohorts, then perform a fluxome-wide association study (FWAS) to test the association between organ-specific flux values and directly measured blood metabolite levels. Finally, we apply FWAS to identify fluxes associated with coronary artery disease (CAD), thus demonstrating the potential of genome-scale flux maps for V2F by elucidating intermediary biochemical reactions between genetic variation and common disease.
Results
A computational framework for genetically personalised organ-specific GSMMs
We developed a framework for building personalised organ-specific flux maps from genotype data (Fig. 1; Methods). First, we extract the organ-specific models from the Harvey/Harvetta multiorgan model30, which provide a set of curated metabolic networks for the main organs of the human body. Harvey/Harvetta models were built from the Recon3D human GSMM25, which has been superseded by HUMAN126. HUMAN1 shares 97% of reactions with Recon3D, but it incorporates a myriad of improvements in gene-reaction rules, reaction reversibility and stoichiometric consistency compared to the latter. Hence, we performed a liftover of the Harvey/Harvetta organ-specific models to HUMAN1 (Methods).
With the HUMAN1-based organ-specific models, the next step is to compute a reference flux distribution for each organ under consideration. This is achieved by defining organ-specific metabolic objectives that must be fulfilled (e.g., synthesis of neurotransmitters in the brain), obtaining average organ transcript abundances from GTEx14 and using them as an input for the GIM3E algorithm38. GIM3E is an algorithm that, subject to fulfilling the organ-specific metabolic objectives, seeks to minimise the overall flux through the network using transcript abundance data to give each reaction a minimisation weight inversely proportional to the expression of the enzymes catalysing it. Subsequently, flux sampling39 is applied to identify a representative flux distribution (i.e., sets of flux values) in the solution space within 99% of the GIM3E optimal solution. The resulting set of flux values, termed reference flux distribution, is both enzymatically efficient and consistent with the average transcript abundances in each organ (Supplementary Fig. 1). The flux distribution can be assumed to represent the average metabolic state of each modelled organ in the general population.
Subsequently, models from PredictDB13 are used to impute personalised organ-specific transcript abundances from genotype data. The resulting imputed transcript data are mapped to reactions in the organ-specific subnetworks as putative reaction activity fold changes relative to the average organ-specific transcript expression in GTEx14. The imputed personalised reaction activity fold changes and the reference flux distributions are then utilised by the qMTA to compute genetically personalised organ-specific flux maps. Briefly, qMTA finds the flux distributions most consistent with the putative reaction activity fold changes in each individual (Supplementary Fig. 2; Methods).
Building flux maps for >520,000 individuals
Using the above framework, we built personalised organ-specific flux maps for 37,220 and 487,395 individuals from the INTERVAL35,36 and UKB37 cohorts, respectively. Personalised models were generated for skeletal muscle, adipose tissue, liver, brain, and heart, which together account for roughly 66% of body weight in an average adult40. Overall, 14,220 reaction flux values were computed for each individual. Metabolic fluxes “flow” through pathways where the product of one reaction is the substrate of successive reactions; thus, many of the flux values computed in each individual will have inherent dependencies (Supplementary Fig. 3A–C). As such, from the 14,220 reaction flux values, we selected a subset of 4300 flux values without strong pairwise correlations (ρ < 0.9) for further analysis (Supplementary Fig. 3D; Methods).
Principal component analysis of the personalised organ-specific flux values for individuals of INTERVAL and UKB showed the underlying structure in the data (Supplementary Fig. 4). Fluxes with the greatest loadings on top principal components (PCs) tended to be related to the known metabolism of each organ (Supplementary Data 1). For example, in the liver, fluxes through reactions and transport processes of amino acid, glycerophospholipid, and nucleotide metabolism exhibited large loadings along the first five PCs. Key reactions in cholesterol and bile acid biosynthesis also had large PC loadings, reflecting the function of the liver in cholesterol homoeostasis21. In both skeletal muscle and heart, the top PCs were associated with fluxes through transport processes of amino acids and reactions related to fatty acid β-oxidation, processes which play key roles in skeletal muscle and heart41–44. Notably, in the brain, the main loadings on the top principal components were distributed across a wide range of pathways. For instance, PC1 was associated with reactions and transport processes involving bile acids and their precursors. Bile acids, which can be synthesised within the brain and can also be transported across the blood-brain barrier, have been reported to act as regulators of neurological functions45,46. Likewise, PC2, and to a lesser extent PC3, were related to reactions and transport processes from amino acid metabolism, including reactions linked to neurotransmitters such as dopamine, glycine, glutamate, and nitric oxide. PC4 was associated with reactions of fatty acid metabolism, most notably several reactions involving arachidonic acid, a conditionally essential fatty acid with many roles in brain function in health and disease47,48. Lastly, PC5 was primarily associated with reactions of nucleotide metabolism. Finally, in adipose tissue, all PCs were strongly associated with fatty acid metabolism, including reactions involved in their oxidation, biosynthesis and transport. However, PC2 and PC3 were also associated with reactions of steroid metabolism, reflecting adipose tissue’s capacity to synthesise and convert steroids49.
Fluxome-wide association study for blood metabolites
We next validated that genetically personalised GSMMs could generate reliable and meaningful flux predictions across cohorts. As phenotypes, organ-specific flux maps are expected to lead to distinct profiles in the blood metabolome. To demonstrate this, we performed an association analysis by individually regressing each measured blood metabolic feature against the 4300 personalised fluxes computed in the INTERVAL35,36 and UKB37 cohorts (Supplementary Fig. 2; Methods). The blood metabolome for INTERVAL comprised both Nightingale Health NMR assays (N = 37,720 participants) and Metabolon HD4 mass spec assays (N = 8115 participants)50. In UKB, blood samples for 120,266 participants were profiled with Nightingale Health NMR51.
For INTERVAL, an FDR-adjusted significance threshold of P < 1.0 × 10−6 was applied to control for all tested pairs (Methods). We identified 4312 significant associations between flux values and blood metabolic features in total, of which 1066 were for the Nightingale platform and 3246 for Metabolon (Supplementary Data 2, Fig. 2A, B). Consistent with the role of the liver in whole-body metabolic homoeostasis21, the liver was the organ with the most associations (1301), followed by the heart (1005), skeletal muscle (896), brain (593), and adipose tissue (517) (Fig. 2C, Supplementary Data 2). We externally validated the INTERVAL flux associations with Nightingale metabolites using UKB (Fig. 2D). We found 83% of the INTERVAL associations replicated in UKB with an FDR-adjusted significance threshold of P < 1.0 × 10−6 and consistent direction of the effect sizes. Effect sizes were themselves highly correlated (ρ = 0.82) between INTERVAL and UKB (Supplementary Fig. 5).
Finally, we also evaluated the effect of the underlying genome-scale reconstructions of human metabolism in the FWAS for blood metabolic features. With this aim, we used organ-specific models built from the Recon3D human GSMM25,30 to compute genetically-personalised fluxes for the INTERVAL cohort35,36, test their association to blood metabolic features, and compare the results to the above-described FWAS that had used fluxes computed with HUMAN1-based models. We identified 3895 significant associations between blood metabolic features and the genetically personalised flux values computed using Recon3D-based organ-specific models (Supplementary Data 2). There was a significant overlap with HUMAN1 models as 1761 of these associations could be replicated in the HUMAN1-based FWAS, and the associated effect sizes on blood metabolites were highly correlated between HUMAN1 and Recon3D analyses (ρ = 0.72). However, 2134 associations were only statistically significant on the Recon3D-based analysis and could not be replicated with HUMAN1 models. Likewise, of the 4312 significant associations between blood metabolic features and fluxes computed using HUMAN1 models, 2551 associations could not be detected with Recon3D-based models. Such discrepancy between HUMAN1- and Recon3D-based analyses is not surprising; HUMAN126, which is a newer reconstruction of human metabolism than Recon3D25, expands gene reaction annotations and refines reaction reversibility, both of which can have significant effects on how genetic variation propagates through the network and, thus, can lead to significant differences in the resulting personalised flux maps and the downstream FWAS. Indeed, many discrepancies between the Recon3D and HUMAN1 results are likely artefacts emerging from erroneous or incomplete annotations in Recon3D. Throughout this work, we focus on the analyses and discussion of HUMAN1-based fluxes, as HUMAN1 has been established to be a better representation of human metabolism26, but results obtained with Recon3D-based models will also be provided in the appropriate supplementary data (Supplementary Data 2, Supplementary Data 3, and Supplementary Data 4).
Fluxome associations by metabolic feature class and reaction system
The 4312 significant associations comprised 229 unique blood metabolic features and 763 unique organ-specific metabolic fluxes. Consistent with the coverage of the Nightingale Health and Metabolon HD4 platforms, we found that most of these blood metabolic features were lipid-related (Fig. 3A, Supplementary Data 2). Glycerides and phospholipids were enriched in associations across all organs relative to all features profiled with the Metabolon HD4 assay (Methods), suggesting an association with core reactions (i.e., active in all modelled organs). The liver and adipose tissue were also enriched in associations with steroids, reflecting the role of such organs in cholesterol21 and steroid hormone metabolism49.
We further assessed the metabolic systems of the 763 organ-specific metabolic fluxes from the significant associations (Fig. 3B, Supplementary Data 2) and found that most reactions were functionally part of lipid metabolism, consistent with a large number of associations with lipid metabolic features. Reactions of fatty acid metabolism were significantly enriched in associations with blood metabolic features in all organs relative to all analysed reactions in each organ-specific metabolic network. In the liver, reactions of glyceride and phospholipid metabolism and bile acid metabolism were also enriched.
There was widespread consistency between biochemical pathways and blood metabolic feature classes (Fig. 3C, Supplementary Data 2). For example, reactions from the glycerides and phospholipids system were primarily associated with blood metabolic features of glycerides and phospholipids as well as lipoprotein fractions and their constituents. We found that reactions of fatty acid metabolism were associated mainly with blood glyceride and phospholipids, followed by fatty acids, which themselves provide acyl chains to glycerides and phospholipids. Similarly, reactions from nucleotide metabolism and amino acid metabolism were primarily associated with blood metabolic features of nucleotides and amino acids, respectively.
Fluxes of the hepatic triacylglycerol to cholesteryl ester pathway and blood lipoproteins
In the liver, we identified 555 associations between fluxes and lipoprotein fractions (Supplementary Data 2). Most of these associations were to reactions of glycerides and phospholipids metabolism, which were enriched in associations relative to all analysed liver fluxes (Methods; Fig. 3B). FWAS revealed that a major determinant of triacylglycerols (TAG), free cholesterol (FC), and cholesteryl esters (CE) fractions in lipoproteins was a sequential set of reactions which we term the TAG to cholesterol esterification (TAG-CE) pathway (Fig. 4). In the TAG-CE pathway, TAGs are hydrolysed to diglycerides and fatty acids in the liver, diacylglycerides are then used as a substrate to synthesise phospholipids (i.e., phosphatidylcholine and phosphatidylethanolamine) which are subsequently used as substrates to esterify FC. We found that fluxes through reactions of the TAG-CE pathway were strongly associated with an increased percentage of CE in HDL and decreased TAG levels in LDL and HDL (Table 1, Supplementary Data 3). The pathway was also strongly associated with reduced HDL size, likely driven by a reduction of TAG levels in HDL52. While the associations were primarily found in the liver-specific flux map with mediation by liver-expressed enzymes, these pathways are not necessarily constrained to the liver. For example, the hepatic TAG lipase localises to both the liver and blood53. Similarly, phospholipids synthesised in the liver may be transferred to HDL in circulation, where they can fuel cholesterol esterification catalysed by the liver-secreted lecithin-cholesterol acyltransferase (LCAT)52,54.
Table 1.
Reaction | Blood metabolic feature | Effect size | FDR-adjusted P value (INTERVAL) | FDR-adjusted P value (UKB) |
---|---|---|---|---|
Triacylglycerol lipase | HDL_CE_pct | + | 3.48 × 10−31 | 2.22 × 10−72 |
HDL_CE_pct_C | + | 1.86 × 10−73 | <2.225 × 10−308 | |
HDL_size | − | 2.97 × 10−50 | 4.67 × 10−292 | |
HDL_TG | − | 9.88 × 10−74 | 1.12 × 10−245 | |
LDL_TG | − | 3.37 × 10−78 | 3.30 × 10−250 | |
Ethanolamine phosphotransferase | HDL_CE_pct | + | 1.49 × 10−29 | 5.98 × 10−70 |
HDL_CE_pct_C | + | 4.39 × 10−74 | <2.225 × 10−308 | |
HDL_size | − | 4.25 × 10−53 | 9.52 × 10−307 | |
HDL_TG | − | 2.86 × 10−73 | 1.60 × 10−249 | |
LDL_TG | − | 3.99 × 10−79 | 1.97 × 10−259 | |
Lecithin-cholesterol acyltransferase | HDL_CE_pct | + | 5.45 × 10−22 | 1.15 × 10−52 |
HDL_CE_pct_C | + | 4.60 × 10−46 | 1.12 × 10−235 | |
HDL_size | − | 4.24 × 10−33 | 2.17 × 10−206 | |
HDL_TG | − | 7.02 × 10−54 | 3.70 × 10−171 | |
LDL_TG | − | 9.03 × 10−55 | 2.05 × 10−171 | |
Cholesteryl ester release | HDL_CE_pct | + | 3.76 × 10−26 | 4.17 × 10−62 |
HDL_CE_pct_C | + | 5.19 × 10−59 | 8.75 × 10−291 | |
HDL_size | − | 2.17 × 10−41 | 8.02 × 10−254 | |
HDL_TG | − | 1.97 × 10−61 | 8.76 × 10−206 | |
LDL_TG | − | 1.23 × 10−66 | 1.35 × 10−215 | |
Cholesterol esterase | HDL_CE_pct | − | 1.50 × 10−23 | 4.31 × 10−57 |
HDL_CE_pct_C | − | 4.32 × 10−60 | 2.10 × 10−264 | |
HDL_size | + | 5.11 × 10−37 | 1.29 × 10−226 | |
HDL_TG | + | 3.90 × 10−51 | 7.37 × 10−186 | |
LDL_TG | + | 2.92 × 10−60 | 2.56 × 10−200 | |
Retinyl ester hydrolase | HDL_CE_pct | − | 3.76 × 10−23 | 1.04 × 10−53 |
HDL_CE_pct_C | − | 8.90 × 10−57 | 1.24 × 10−261 | |
HDL_size | + | 2.15 × 10−37 | 1.87 × 10−227 | |
HDL_TG | + | 3.14 × 10−49 | 1.54 × 10−175 | |
LDL_TG | + | 2.19 × 10−57 | 3.34 × 10−193 |
A complete list of significant associations is provided in Supplementary Data 3. The statistical significance of each flux to lipoprotein associations was evaluated with linear regression (two-tailed t-test for flux effect size; Methods). HDL_CE_pct cholesteryl esters to total lipids ratio in HDL, HDL_CE_pct_C esterified cholesterol to total cholesterol ratio in HDL, HDL_size mean diameter for HDL particles, HDL_TG triglycerides in HDL, LDL_TG triglycerides in LDL.
Components of the TAG-CE pathway have been the subject of various studies. For example, rare deficiencies in hepatic lipase activity have been linked to increased TAG levels and decreased CE levels in HDL55. Similarly, genetic variants in hepatic lipase have been associated with total cholesterol levels in HDL56,57. For phospholipids, blocking phosphatidylcholine synthesis has been shown to result in cellular accumulation of TAG both in vitro and in vivo58,59. Similarly, LCAT deficiencies have been associated with reduced cholesterol esterification and increased triglycerides in plasma60,61. It has also been suggested that cholesterol may be inefficiently esterified by LCAT in patients with CAD, leading to a lower CE to FC ratio62.
Conversely, flux through reactions disrupting TAG-CE, such as cholesterol esterase, are predicted by our FWAS to have the opposite effect and are associated with increased TAG levels and decreased cholesterol esterification (Table 1, Supplementary Data 3). Among such reactions, there is the hydrolysis of retinyl esters which can act as an alternative source of free fatty acids inhibiting TAG lipase activity (Fig. 4). Retinyl esters are the most abundant form of vitamin A in the human body and are its most common form in diets and vitamin supplements63. Dietary retinol is esterified in enterocytes, and most of it is transported to hepatocytes by means of lipoproteins, where it is subsequently hydrolysed and transferred to stellate cells for storage64. Notably, the administration of high doses of retinol derivatives has been reported to increase total triglyceride levels and, in some instances, increase total cholesterol in LDL while decreasing total cholesterol in HDL65–68. We hypothesise that this occurs because retinyl esters disrupt the hepatic TAG-CE pathway, inhibiting triglyceride lipase and reducing cholesterol esterification, thus reducing the capacity of HDL to collect FC from other lipoproteins such as LDL52,62,69.
FWAS identifies metabolic fluxes associated with coronary artery disease
We extended our approach of fluxome-wide analysis to common diseases and performed a multi-tissue FWAS for CAD in UKB. We evaluated the association of the 4300 metabolic fluxes with CAD using Cox regression (Methods), which identified 92 significant associations (FDR-adjusted P value < 0.05 controlling for all tested fluxes). Of such associations, 31 could be replicated with genetically personalised fluxes computed with Recon3D-based models, whereas 61 were specific to the HUMAN1-based models. Liver fluxes yielded the largest number of significant associations with CAD (N = 32), followed by fluxes from the adipose tissue (N = 26), heart (N = 15), brain (N = 10), and skeletal muscle (N = 9) (Fig. 5; Supplementary Data 4).
The flux of histamine synthesis through histidine decarboxylase was shown to be strongly associated with CAD in adipose tissue with a hazard ratio (HR) per s.d. of log-transformed flux value of 1.060 and a P value of 2.33 × 10−27 (Supplementary Data 4). Such association was also detected in the liver, where both the fluxes through histidine uptake (HR = 1.024 per s.d., P = 1.65 × 10−5) and histidine decarboxylation (HR = 1.027 per s.d., P = 8.60 × 10−7) were associated with increased CAD risk. Histamine is an inflammatory mediator synthesised from histidine primarily in mast cells70, which reside in most tissues, including liver and adipose tissue71,72. Histamine has been reported to be associated with atherosclerosis via blood lipids and lipoprotein fractions as well as by promoting inflammation73–76. In adipose tissue, polyamine synthesis was also associated with reduced CAD risk (spermidine synthase: HR = 0.9517 per s.d., P = 6.06 × 10−21). Notably, polyamine-rich diets have been established to have a protective effect against cardiovascular disease77,78. Moreover, it has recently been determined that polyamines produced by adipose endothelial cells might protect against obesity, a known risk factor for CAD79, by promoting vascularisation and lipolysis in white adipose tissue80.
Concerning lipid metabolism, the fluxes through the TAG lipase reactions in adipose tissue (HR = 0.9652 per s.d., P = 6.48 × 10−11), heart (HR = 0.9675 per s.d., P = 1.06 × 10−9) and skeletal muscle (HR = 0.9693 per s.d., P = 9.83 × 10−9) were strongly associated with reduced CAD risk, consistent with the anti-atherogenic effect of lipoprotein lipase activity in these organs81. Similarly, the release of free fatty acids from adipose tissue was also associated with reduced CAD risk (e.g., the release of oleic acid: HR = 0.9688 per s.d., P = 5.09 × 10−19 and release of myristic acid: HR = 0.9691 per s.d., P = 6.64 × 10−9). Interestingly, not only is the release of free fatty acids part of normal adipocyte function21, but it is also a key part of the polyamine-driven signalling cascade in adipose tissue80. Conversely, the flux through the phospholipase reaction was associated with increased CAD risk in adipose tissue (HR = 1.027 per s.d., P = 9.48 × 10−7). Notably, phospholipase activities have been suggested to have a causal role in atherosclerosis and have been investigated as potential pharmacological targets to prevent atherosclerosis and CAD82–85.
The fluxes through several transport processes were also identified as associated with CAD. For instance, histamine transport in the liver appears to be associated with CAD risk in a transport process-specific manner, with histamine transport through uniport being associated with decreased CAD risk (HR = 0.942 per s.d., P = 2.60 × 10−28) and its antiport with glutathione being associated to increased risk (HR = 1.051 per s.d., P = 9.77 × 10−20). Also in the liver, transport of bilirubin conjugates was associated to decreased CAD risk (transport of bilirubin-monoglucuronoside: HR = 0.9629 per s.d., P = 2.74 × 10−12 and transport of bilirubin-bisglucuronoside: HR = 0.9808 per s.d., P = 3.49 × 10−4). Notably, the transport process of bilirubin-monoglucuronoside is mediated by SLCO1B1, which also mediates the hepatic uptake of statins, enhancing their therapeutic efficacy86,87. Interestingly, high levels of total bilirubin in blood have been associated with decreased risk of CAD88,89, likely mediated by the modulation of arterial diameter and reactivity90. In both the brain and heart, the flux of prostaglandin E2 transport was also associated with increased CAD risk (brain: HR = 1.058 per s.d., P = 4.23 × 10−26 and heart: HR = 1.049 per s.d., P = 5.27 × 10−19). Prostaglandin E2 is an inflammatory mediator that promotes inflammation and has been reported to contribute to the development of atherosclerotic lesions91,92. Additionally, in the brain, the transport of norepinephrine, a neurotransmitter that can increase blood pressure and may play a role in atherosclerosis93,94, was also associated with increased risk of CAD (HR = 1.041 per s.d., P = 4.49 × 10−14).
Discussion
Here, we present a new framework that uses metabolic modelling to leverage the stoichiometric relationships of enzymes in human genome-scale metabolic networks to characterise how genetic variants affect metabolic phenotypes. We achieve this by integrating genetic effects on transcript levels into organ-specific GSMMs and simulating how they propagate and interact into genome-scale flux maps of major human organs. To validate our method, we built organ-specific models for the liver, heart, skeletal muscle, brain, and adipose tissue for over 520,000 individuals from the INTERVAL35,36 and UKB37 cohorts, surpassing by more than two orders of magnitude the number of personalised GSMMs built in previous works30–33. Association analyses were performed between genetically-personalised flux values and directly measured blood metabolites in both INTERVAL and UKB, identifying many significant and replicable associations. As expected, we found that most blood metabolic features were associated with functionally related flux pathways. Finally, we demonstrate that fluxome-wide analysis can be used to identify putative metabolic drivers of CAD.
With cardiovascular disease being a leading cause of mortality and comorbidity worldwide95, the identification of specific biochemical reactions linked to CAD using population-scale genomic data is of significant interest to both basic discovery science and the development of therapeutics. Indeed, many of the 92 flux associations we identified involve pathways or metabolites that have been associated with CAD in existing studies, such as histamine73,76, TAGs96, or phospholipase activity82,83,85. The modulation of some of these fluxes has been explored as therapies for CAD, namely several phospholipase inhibitors83,84.
Our analysis has several limitations. For instance, as a proof of concept, this study focused on modelling only the five most prominent human organs40, and thus we can only identify flux to phenotype associations in the liver, heart, skeletal muscle, brain, and adipose tissue. However, given the availability of models to impute tissue- or cell-specific transcript abundance from genotype13, this analysis can easily be expanded to other tissues and cell types. Indeed, we envision that future applications may select organs for modelling based on the target diseases or phenotypes. Furthermore, the modelling framework presented here is limited to only simulating the effect of genetic variants affecting transcript levels. In the future, it could also be expanded to model the impact of gain or loss of function variants97 and environmental variables (e.g., diet, lifestyle, and medication) on the personalised flux maps. Additionally, while transcript levels are widely used in genome-scale metabolic modelling26,29,38, protein levels have a more direct effect on enzymatic activity, and new methods are being developed to fully integrate them into GSMMs26,98. With models to impute the levels of proteins becoming increasingly available12,15, we expect that the framework for computing genetically personalised fluxes will be extended to integrate the protein layer in the future. Finally, an inherent limitation of our analysis is that it is dependent on the quality of the underlying metabolic networks and their gene-reaction annotations. Indeed, we determined that an important number of the associations between fluxes and blood metabolomics or CAD risk could not be replicated with models based on an earlier reconstruction of human metabolism (i.e., Recon3D25). However, with human GSMMs becoming increasingly more well-annotated26, differences in FWAS results using models built from different genome-scale reconstructions of human metabolism will progressively become more subtle.
Concerning translating genetically personalised models and fluxes to clinical applications, GSMMs have already been established to have utility for drug discovery and repositioning32,99–102. Therefore, FWAS may enable identifying fluxes associated with disease states and, by extension, the gene knockdowns or metabolic interventions (e.g., dietary supplements or metabolic inhibitors) to target them. FWAS to blood metabolic features may also help screen for potential adverse side effects of metabolic interventions. For example, we identified that retinyl esters might increase TAG levels and reduce cholesterol esterification in lipoproteins, consistent with reports that administering high doses of vitamin A derivatives results in hypertriglyceridemia and dysregulation of cholesterol levels65–68. Furthermore, while it is very early days, personalised fluxes associated with disease risk could also be incorporated into existing risk prediction models, potentially enhancing their predictive capacity.
Overall, this work demonstrates that genome-scale metabolic modelling can contribute to addressing the V2F challenge by characterising how the effects of genetic variants propagate through the metabolic networks of specific human organs.
Methods
INTERVAL cohort
INTERVAL is a cohort of approximately 50,000 participants nested within a randomised trial studying the safety of varying the frequency of blood donation (https://clinicaltrials.gov/ct2/show/NCT01610635). Participants were blood donors aged 18 years and older (median 44 years of age; 50% women) recruited between 2012 and 2014 from 25 NHS Blood and Transplant centres35,36. Genetically personalised fluxes were computed for the 37,220 individuals with genotype and blood metabolome data that had passed quality control.
Genotyping of INTERVAL samples, their quality control and imputation were performed as previously described:103 Participants were genotyped in ten batches using Affymetrix UK Biobank arrays. Duplicate samples, samples with extreme heterozygosity or sex mismatch, were removed, and participants of non-European descent were excluded. First- or second-degree relatives (identity-by-descent ) were also removed, keeping one sample at random from each pair of close relatives. Genotyped variants were removed if they had a call rate <99%, were monomorphic, or had Hardy-Weinberg equilibrium P value < 5 × 10−6. Variants were subsequently phased using SHAPEIT3, then imputed to the UK10K/1000 Genomes reference panels using the Sanger Imputation Server (https://imputation.sanger.ac.uk).
UK Biobank
UKB is a cohort of approximately 500,000 participants from the general UK population (https://www.ukbiobank.ac.uk/). Participants were between age 40 and 69 at recruitment (median 58 years of age; 54% women) and accepted an invitation to attend one of the assessment centres that were established across the United Kingdom between 2006 and 201037. Genetically personalised fluxes were computed for the 487,395 individuals in the version 3 release of the UK Biobank genotype data104 (https://biobank.ndph.ox.ac.uk/showcase/label.cgi?id=263), which has been imputed to the UK10K/1000 genomes and haplotype reference consortium (HRC)105 panels.
Building organ-specific models
For each analysed organ (i.e., adipose tissue, brain, liver, heart, and skeletal muscle), the set of organ-specific metabolic reactions was extracted from the Harvey/Harvetta models (version 1_03c)30, which contain manually curated metabolic networks for the major organs of the human body. To avoid any gender biases, any reaction present in either the male (Harvey) or female (Harvetta) models was included.
Harvey/Harvetta models were built from the Recon3D human GSMM25, which has been superseded by HUMAN126. Hence, we performed a liftover to update the Harvey/Harvetta organ-specific models to HUMAN1. Briefly, the IDs of the organ-specific metabolic reactions from the Harvey/Harvetta models were mapped to HUMAN1 (version 1.11.0) using the mapping provided in the HUMAN1 reaction annotations26. Subsequently, the resulting set of HUMAN1 reaction IDs was used to assemble organ-specific models from HUMAN1 reactions. Manual curation was used to identify and, when possible, correct gaps and missmaps. Some reactions in the Harvey/Harvetta models that were not present on the base Recon3D and thus could not be mapped to HUMAN1, were also added to the resulting network. These reactions included phospholipase, cholesterol esterase, and extracellular LCAT. Additionally, the side acyl chains of triglycerides and phospholipids were simplified to a stoichiometric mix of 1/3 oleoyl, 1/6 palmitoleoyl, 1/6 palmitoyl, 1/6 stearoyl, 1/6 myristoyl in line with the ratio used in Harvey/Harvetta for non-essential fatty acids30. Boundaries for the exchange reactions fluxes (i.e., rates of metabolite uptake or secretion) between each organ-specific model and blood or bile were set as the average bounds of the corresponding reactions in the Harvey and Harvetta models. In some instances, the ranges of metabolite uptake and secretion were further constrained to ensure that they were physiologically relevant. In the brain-specific model, exchange reactions to blood were mapped to the exchange reactions between blood and cerebrospinal fluid defined in Harvey/Harvetta. Such reactions had been defined, taking into consideration the selective permeability of the blood-brain barrier30. Thus, only metabolites permeable to this barrier can be exchanged between blood and the brain-specific model. Next, metabolites in blood or bile were made boundary conditions (i.e., assumed constant), allowing each organ subnetwork to function independently. Finally, given that most HUMAN1 reactions lack a name attribute, unnamed reactions in the resulting network were named using their corresponding name in Recon3D. In a number of instances, ambiguously named reactions were manually renamed.
To validate the resulting organ-specific models, we performed flux variability analysis (FVA)106 to test the capacity of reactions in the networks to carry a significant amount of flux (>10−6 mol/day), and 93% were shown to be capable of carrying a significant flux. Furthermore, models were also evaluated against a set of essential metabolic tasks (i.e., tasks all organs are expected to perform to be viable) and organ-specific metabolic tasks obtained from the HUMAN1 repository26 (Supplementary Data 5). Each organ-specific model was shown to be capable of successfully performing all essential tasks as well as its organ-specific tasks. The resulting organ-specific GSMMs are available on GitHub and permanently archived by Zenodo107.
Additionally, a set of Recon3D-based organ-specific models were also built. Such models were obtained by applying the steps described above without performing the liftover to HUMAN1.
Computing organ-specific reference flux maps
The GIM3E algorithm was applied to compute the reference flux map for each organ. The GIM3E algorithm applies a flux minimisation weighted by transcript abundances allowing to find solutions that are enzymatically efficient, consistent with gene expression data and fulfil a set of metabolic objectives38 (Supplementary Fig. 1). First, a set of metabolic objectives was defined for each organ representing major metabolic functions that each organ fulfils in the conditions under study (Supplementary Data 6). These were added in each organ subnetwork as lower bounds for flux values through reactions associated with those metabolic objectives. Lower bounds were set relative to the maximum flux feasible through such reactions identified with FVA106.
Next, organ-specific transcript abundances were obtained as transcripts per million from the GTEx Portal14 (GTEx Analysis Release V8; dbGaP Accession phs000424.v8.p2; accessed on 05/05/2021) and the average abundance of each transcript in each organ was computed. In the heart, adipose tissue, and brain, there were transcripts abundances measured from multiple source sites. Hierarchical clustering analysis indicated that source sites from each organ were clustered together (Supplementary Fig. 6). Hence, the average of the source sites in each organ was used for the heart, adipose tissue and brain. Average transcript abundances were mapped to the organ-specific subnetworks using the gene reaction annotations of HUMAN126. More in detail, transcript abundances of isoenzymes and enzyme subunits catalysing each reaction or transport process were added and, subsequently, log2 transformed. The resulting values were used as input to apply the flux minimisation weighted by reaction expression38:
1 |
subject to:
where, is a vector of steady-state flux values; is a vector of average transcript abundances mapped to reactions of the organ-specific network; is the 95th percentile of the average transcript abundance values mapped to reactions of the organ-specific network; s is the stoichiometric matrix. Its product with defines the metabolic steady state constraint (i.e., input and output fluxes must be balanced for each metabolite in the network); lb and ub are vectors defining the lower and upper bounds of reactions, respectively. The organ-specific metabolic objectives are defined as lower bounds greater than 0 (i.e., constraining such reactions to being active) for the relevant reactions.
Subsequently, FVA was used to identify the feasible flux ranges within 99% of the optimal value of the GIM3E objective function38. Finally, the resulting solution space was sampled using the Artificially Centred hit-and-run (ACHR) algorithm39 implemented into COBRApy108,109. ACHR was run with a thinning factor of 1000, and 1000 sets of steady-state flux distributions were computed. The average of those flux samples was used as each organ’s reference flux map.
Following the same approach, reference fluxes were also computed for the Recon3D organ-specific models using the gene reaction annotations of Recon3D25.
Imputing individual-specific gene expression data
The elastic net models from PredictDB13 were used to impute organ-specific gene expression levels from individual-level genotypes. These are well-established models that have been extensively validated13,110–112. The latest release of PredictDB models, which had been trained with GTEx v8 data, were obtained from https://predictdb.org/. They were used with PLINK2113 to predict relative transcripts abundances using genotype data from the INTERVAL35,36 and UKB37 cohorts. For adipose, brain and heart tissue, the average of the imputed abundances in each source site was used.
Mapping individual-specific gene expression data to reactions in the model
Imputed individual-specific expression patterns from metabolic genes (i.e., genes coding for enzymes, enzyme subunits, or transmembrane carriers) were mapped to organ-specific models using the gene reaction annotations of HUMAN126. Imputed values were expressed as log2 fold changes relative to average gene expression in GTEx and mapped to reactions in the organ-specific model considering the relative transcript abundance of isoenzymes and enzyme subunits in GTEx:
2 |
where, is the organ-specific score for gene g in individual n computed using the elastic net models from PredictDB; is the average organ-specific gene expression of gene g in GTEx; are the genes associated with reaction R in the organ-specific network; is the imputed reaction activity fold change for reaction R in individual n.
Reaction activity fold changes were also computed for the Recon3D organ-specific models using the gene reaction annotations of Recon3D25.
The quadratic metabolic transformation algorithm
Building upon the principle of the metabolic transformation algorithm101,102, we developed qMTA. qMTA seeks to identify the flux map most consistent with a set of reaction activity fold changes starting from a reference flux distribution (Supplementary Fig. 2). To this end, it minimises the difference between the simulated flux values and the target fluxes (i.e., the product of the flux value in the reference flux distribution and the reaction activity fold change). Additionally, it also minimises the deviation from the reference flux distribution in reactions not mapped to any gene expression fold changes. Furthermore, the two terms of the optimisation function are scaled by the reference flux distribution to prevent biases towards reactions with high flux values.
3 |
Subject to:
where, is the weight given to minimising variation in reactions not mapped to imputed gene expression; are reactions not mapped to imputed gene expression; is the flux vector of the reference flux distribution; is the simulated flux value for reaction i in individual n; m is the minimum value allowed for the scaling factor; are reactions mapped to imputed gene expression.
Personalised flux maps computed with qMTA were subsequently log2 transformed and standardised to zero-mean and unit variance.
Additionally, two hyperparameters in qMTA ( and m) were tuned using the regression analysis with blood metabolic features in the INTERVAL cohort. For each simulated organ, a grid search (w → [100,10,1,0.1,0.01], m → [10−6, 10−7, 10−8, 10−9, 10−10, 10−11, 10−12]) was performed to identify the parameters that resulted in flux maps with the strongest association with both Nightingale Health and Metabolon HD4 metabolic features. This was measured as the summation of the amount of variance explained (R2) for each blood feature-flux value pair when testing associations between metabolic fluxes and blood metabolic features. The resulting parameters were subsequently used in the analysis of the samples from UKB. The process was repeated to identify the best set of hyperparameters for the Recon3D-based models.
Metabolomics
The Nightingale NMR platform quantifies 230 and 249 analytes in INTERVAL and UKB, respectively, including lipoprotein subfractions and ratios, lipids and low molecular weight metabolites (e.g., amino acids)51. In INTERVAL, blood samples were profiled with the Nightingale platform at the baseline of the blood donation assay (N = 37,720). In UKB, metabolite concentrations were determined in 117,981 participants at baseline assessment and 5141 participants at repeat assessment, among which there were 1427 participants with measurements at both time points. For participants with measurements at both baseline and repeat assessment, the measurement at baseline assessment was used114. Values were adjusted for technical covariates using the ukbnmr R package114 and subsequently regressed for age, sex, BMI, and the first 5 PCs of genetic ancestry. Composite biomarkers and ratios were recomputed after adjustment, including 98 and 76 additional biomarker ratios in INTERVAL and UKB, respectively, not provided by the Nightingale platform. Metabolic features not present in both INTERVAL and UKB were excluded from downstream analyses. Likewise, 68 features with markedly distinct variance between INTERVAL and UKB (|log2(sdINTERVAL/sdUKB)| > log2(2.5)) were also excluded. Finally, acetate was excluded due to a large number of NA (>75%) in INTERVAL. Subsequently, measures were standardised to zero-mean and unit variance.
The Metabolon HD4 assay measures ~1000 metabolites (~700 named, ~300 unknown), including lipids, xenobiotics, amino acids and energy-related metabolites. A subset of INTERVAL participants (N=8,115) had their blood profiled with this assay, predominantly using baseline blood samples. Nineteen features were excluded due to a large number of NA (>75%). Values were regressed against technical covariates age, sex, BMI, and the first 5 PCs of genetic ancestry. Subsequently, measures were standardised to zero mean and unit-variance.
Testing associations between metabolic fluxes and blood metabolic features
Due to the linear nature of many metabolic pathways, some flux values were highly intercorrelated (Fig. S3). To remove reaction flux pairs with a strong correlation, for each pair of reaction flux values with ρ > 0.9, the feature with the largest mean absolute correlation to other flux values was removed115. Likewise, both the Nightingale and Metabolon platforms had some metabolic features with strong correlations, and those features with ρ > 0.75 were removed using the same approach used for reaction fluxes. Overall, 4300 scaled flux values and 57 Nightingale Health and 718 Metabolon HD4 blood metabolic features were selected to perform FWAS.
Then, the association of each metabolic feature to each personalised flux value was evaluated using linear regression (Supplementary Fig. 2).
4 |
where, are the measured levels of a blood metabolic feature; is the effect size of flux i on ; is the residual.
Statistical significance was evaluated with a t-test (two-tailed) on effect sizes. The resulting P values were adjusted for multiple testing against all evaluated blood metabolic features—reaction flux pairs using the Benjamini and Yosef Hochberg (i.e., FDR) method.
To evaluate the association between metabolic fluxes computed with Recon3D-based models and blood metabolic features, the set of 4300 uncorrelated flux values in HUMAN1 was mapped to equivalent reactions in the Recon3D-based models. This set of flux values was then used to perform FWAS to the same set of 57 Nightingale Health and 718 Metabolon HD4 blood metabolic features as the HUMAN1 analysis.
Classes of blood metabolic features
Nightingale/Metabolon platforms provide sets of Groups/Sub-pathways to stratify metabolic features. We harmonised both annotations systems to define a set of curated groups that could be applied to both Nightingale and Metabolon features. For instance, the Metabolon features annotated to “Glycerolipid Metabolism” and “Phospholipid metabolism”, and the Nightingale features annotated to “Phospholipids” were all assigned to the curated group “Glycerides and phospholipids”. Some Metabolon features were not annotated (i.e., unknown) and could not be assigned to any curated group. Unknown features were included in the FWAS but omitted from the enrichment analysis. Fisher’s exact test (one-sided) was used to identify metabolite classes enriched in features with significant association to personalised flux values relative to the set of all uncorrelated blood metabolic features. An FDR-adjusted significance threshold of P < 0.05 was applied to control for all tested classes of blood metabolic features across all organs.
Reaction systems
Subsystem annotations for reactions were obtained from the HUMAN1 model26. As some subsystems contained a low number of reactions, functionally related subsystems were merged into larger reaction systems. For instance, the purine metabolism, pyrimidine metabolism and nucleotide metabolism subsystems were aggregated into a reaction system termed nucleotide metabolism. Additionally, transport processes (i.e., annotated in the transport or exchange reactions subsystems) were assigned a system based on the specific metabolites being transported in each process. Briefly, we first assigned a system to each metabolite based on the most frequent reaction system annotation in the reactions in which it participates. For instance, alanine was assigned to the system “amino acid metabolism” since it was the system annotated most in reactions in which alanine participated. Next, each transport process/exchange reaction in HUMAN1 was assigned the system most numerous in the metabolites being transported. For the purpose of this assignment, metabolites that are often cofactors in transport processes (e.g., Na+, K+, H+, and ATP/ADP) were set to give less weight than other metabolites. For instance, the alanine-sodium symporter (alanine[e] + Na+[e] → alanine[c] + Na+[c]) was assigned to the system “amino acid metabolism” as alanine (system: amino acid metabolism) was given more weight than Na+ (system: Miscellaneous). Reaction systems are solely used as annotations and have no influence on network stoichiometry or genetically personalised flux values.
Fisher’s exact test (one-sided) was used to identify reaction systems enriched in reactions with significant association with blood metabolic features relative to the set of all evaluated reactions in each organ. An FDR-adjusted significance threshold of P < 0.05 was applied to control for all tested systems across all organs.
Testing associations between metabolic fluxes and coronary artery disease
Using PheWAS Catalogue (version 1.2), we used the WHO International Classification of Diseases (ICD) diagnosis codes in versions 9 (ICD-9) and 10 (ICD-10) of Phecode 411.4 for CAD case definition in UKB. In detail, we searched for the presence of any of the constituent ICD-9/10 codes in linked health records (including in-patient Hospital Episode Statistics data, and primary and secondary cause of death information from the death registry) and converted the earliest coded date to the age of phenotype onset. Individuals without any codes for CAD were recorded as controls and censored according to the maximum follow-up of the health linkage data (January 31, 2020) or the date of death.
We recorded 34,121 events of CAD and 428,669 controls in UKB, which were used to evaluate the association of genetically personalised fluxes to CAD risk. Association was tested using an age-as-time-scale Cox proportional hazards regression. The Cox models were stratified by sex and adjusted by genotyping array, 10 genetic PCs, BMI and smoking status and fitted using the CoxPHFitter function from the lifelines package for python116. The significance of the flux to CAD risk associations was evaluated with a two-tailed Wald test for the flux HRs. The resulting P values were adjusted for multiple testing against all tested fluxes using the Benjamini and Yosef Hochberg (i.e., FDR) method.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
Participants in the INTERVAL trial were recruited with the active collaboration of NHS Blood and Transplant (www.nhsbt.nhs.uk), which has supported fieldwork and other elements of the trial. DNA extraction and genotyping were co-funded by the National Institute for Health and Care Research (NIHR), the NIHR BioResource (http://bioresource.nihr.ac.uk) and the NIHR Cambridge Biomedical Research Centre (BRC) (no. BRC-1215-20014)*. Nightingale Health NMR assays were funded by the European Commission Framework Programme 7 (HEALTH-F2-2012-279233). Metabolon Metabolomics assays were funded by the NIHR BioResource and the NIHR Cambridge Biomedical Research Centre (BRC-1215-20014)*. The academic coordinating centre for INTERVAL was supported by core funding from the NIHR Blood and Transplant Research Unit in Donor Health and Genomics (no. NIHR BTRU-2014-10024), NIHR BTRU in Donor Health and Behaviour (NIHR203337), UK Medical Research Council (MRC) (no. MR/L003120/1), British Heart Foundation (nos SP/09/002, RG/13/13/30194 and RG/18/13/33946) and the NIHR Cambridge BRC (no. BRC-1215-20014)*. *The views expressed are those of the author(s) and not necessarily those of the NIHR, NHSBT or the Department of Health and Social Care. A complete list of the investigators and contributors to the INTERVAL trial is provided in ref. 36. The academic coordinating centre would like to thank blood donor centre staff and blood donors for participating in the INTERVAL trial. This work was supported by Health Data Research UK, which is funded by the UK MRC, Engineering and Physical Sciences Research Council (EPSRC), Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation and Wellcome. This research has been conducted using the UK Biobank Resource under Application 7439. This work was performed using resources provided by the Cambridge Service for Data-Driven Discovery (CSD3) operated by the University of Cambridge Research Computing Service (www.csd3.cam.ac.uk), provided by Dell EMC and Intel using Tier-2 funding from the Engineering and Physical Sciences Research Council (capital grant EP/P020259/1), and DiRAC funding from the Science and Technology Facilities Council (www.dirac.ac.uk). C.F. is funded Health Data Research UK. S.R. is funded by the NIHR Cambridge Biomedical Research Centre (BRC-1215-20014). S.L. is supported by a Canadian Institutes of Health Research postdoctoral fellowship (MFE-171279). E.P. was funded by the EU/EFPIA Innovative Medicines Initiative Joint Undertaking BigData@Heart grant 116074 and the NIHR BTRU in Donor Health and Genomics (NIHR BTRU-2014-10024) and is funded by the NIHR BTRU in Donor Health and Behaviour (NIHR203337). E.E.D. is supported by the Wellcome Trust grant (206194, 108413/A/15/D). J.D. holds a British Heart Foundation Professorship and an NIHR Senior Investigator Award. M.I. is supported by the Munz Chair of Cardiovascular Prediction and Prevention and the NIHR Cambridge Biomedical Research Centre (BRC-1215-20014). M.I. was also supported by the UK Economic and Social Research 878 Council (ES/T013192/1).
Author contributions
Conceptualisation: C.F., C.Y., and M.I.; Formal Analysis: C.F., Y.X., S.C.R., and M.I.; Investigation: S.C.R., S.A.L., E.P., A.P.N., E.E.D., D.J.R., D.S.P., E.D.A., J.D., A.S.B., and M.I.; Writing—original draft: C.F., C.Y., and M.I.; Supervision: C.Y. and M.I.; All authors reviewed and approved the final paper.
Peer review
Peer review information
Nature Communications thanks Priyanka Baloni and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
The data from the INTERVAL35,36 and UK Biobank37 cohorts is under restricted access as it contains potentially identifying and sensitive patient information. It can be accessed by making a reasoned request to the INTERVAL coordination centre (https://www.intervalstudy.org.uk) and UKB (https://www.ukbiobank.ac.uk/), respectively. Response times from the data access committees are typically under one month. The summary statistics for the FWAS to blood metabolic features and CAD are provided in Supplementary Data 2, Supplementary Data 3 and Supplementary Data 4. The organ-specific genome-scale metabolic models generated in this work are available on the cobrafunctions GitHub repository, which is permanently archived by Zenodo107. HUMAN126 (version 1.11.0) can be obtained from the Human-GEM GitHub repository. The Harvey and Harvetta models (1_03c) are available in the Supporting Information of reference 30. The elastic net PredictDB models (GTEx v8) models13 are available at https://predictdb.org. The GTEx14 gene expression data (GTEx Analysis Release V8; dbGaP Accession phs000424.v8.p2) can be obtained from https://gtexportal.org.
Code availability
The code used to generate personalised organ-specific flux maps from imputed gene expression data is available on GitHub and permanently archived by Zenodo107. qMTA requires the proprietary solver CPLEX(2.6 or newer), which is freely available to academic users.
Competing interests
A.S.B. has received grants (outside of this work) from AstraZeneca, Bayer, Biogen, BioMarin, Bioverativ, Merck, Novartis, Regeneron, and Sanofi. J.D. serves on scientific advisory boards for AstraZeneca, Novartis, and UK Biobank, and has received multiple grants from academic, charitable and industry sources outside of the submitted work. During the preparation of the paper, D.S.P. became a full-time employee of AstraZeneca. The remaining authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Carles Foguet, Email: cf545@medschl.cam.ac.uk.
Michael Inouye, Email: mi336@medschl.cam.ac.uk.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-35017-7.
References
- 1.Tam V, et al. Benefits and limitations of genome-wide association studies. Nat. Rev. Genet. 2019;20:467–484. doi: 10.1038/s41576-019-0127-1. [DOI] [PubMed] [Google Scholar]
- 2.Lambert SA, et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat. Genet. 2021;53:420–425. doi: 10.1038/s41588-021-00783-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wand H, et al. Improving reporting standards for polygenic scores in risk prediction studies. Nature. 2021;591:211–219. doi: 10.1038/s41586-021-03243-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Polygenic Risk Score Task Force of the International Common Disease Alliance. Responsible use of polygenic risk scores in the clinic: potential benefits, risks and gaps. Nat. Med. 10.1038/s41591-021-01549-6 (2021). [DOI] [PubMed]
- 5.Brittain HK, Scott R, Thomas E. The rise of the genome and personalised medicine. Clin. Med. 2017;17:545–551. doi: 10.7861/clinmedicine.17-6-545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Dugger SA, Platt A, Goldstein DB. Drug development in the era of precision medicine. Nat. Rev. Drug Discov. 2018;17:183–196. doi: 10.1038/nrd.2017.226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lambert SA, Abraham G, Inouye M. Towards clinical utility of polygenic risk scores. Hum. Mol. Genet. 2019;28:R133–R142. doi: 10.1093/hmg/ddz187. [DOI] [PubMed] [Google Scholar]
- 8.Nandakumar SK, Liao X, Sankaran VG. In the blood: connecting variant to function in human hematopoiesis. Trends Genet. 2020;36:563–576. doi: 10.1016/j.tig.2020.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Claussnitzer M, Susztak K. Gaining insight into metabolic diseases from human genetic discoveries. Trends Genet. 2021;37:1081–1094. doi: 10.1016/j.tig.2021.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Wainberg M, et al. Opportunities and challenges for transcriptome-wide association studies. Nat. Genet. 2019;51:592–599. doi: 10.1038/s41588-019-0385-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Brandes N, Linial N, Linial M. PWAS: proteome-wide association study—linking genes and phenotypes by functional variation in proteins. Genome Biol. 2020;21:173. doi: 10.1186/s13059-020-02089-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Xu, Y. et al. An atlas of genetic scores to predict multi-omic traits. Preprint at bioRxiv10.1101/2022.04.17.488593 (2022).
- 13.Gamazon ER, et al. A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 2015;47:1091–1098. doi: 10.1038/ng.3367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 2013;45:580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Wingo TS, et al. Brain proteome-wide association study implicates novel proteins in depression pathogenesis. Nat. Neurosci. 2021;24:810–817. doi: 10.1038/s41593-021-00832-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Ritchie, S. C. et al. Integrative analysis of the plasma proteome and polygenic risk of cardiometabolic diseases. Nat. Metab. 10.1038/s42255-021-00478-5 (2021). [DOI] [PMC free article] [PubMed]
- 17.Zhu X, Duren Z, Wong WH. Modeling regulatory network topology improves genome-wide analyses of complex human traits. Nat. Commun. 2021;12:1–15. doi: 10.1038/s41467-021-22588-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Carlin DE, et al. A fast and flexible framework for network-assisted genomic association. iScience. 2019;16:155–161. doi: 10.1016/j.isci.2019.05.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Talukdar HA, et al. Cross-tissue regulatory gene networks in coronary artery disease. Cell Syst. 2016;2:196–208. doi: 10.1016/j.cels.2016.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ghosh S, et al. Systems genetics analysis of genome-wide association study reveals novel associations between key biological processes and coronary artery disease. Arterioscler. Thromb. Vasc. Biol. 2015;35:1712–1722. doi: 10.1161/ATVBAHA.115.305513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Frayn, K. N. Metabolic Regulation: A Human Perspective. (2010).
- 22.Stephanopoulos G. Metabolic fluxes and metabolic engineering. Metab. Eng. 1999;1:1–11. doi: 10.1006/mben.1998.0101. [DOI] [PubMed] [Google Scholar]
- 23.Nielsen J. It is all about metabolic fluxes. J. Bacteriol. 2003;185:7031–7035. doi: 10.1128/JB.185.24.7031-7035.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zamboni N, Saghatelian A, Patti GJ. Defining the metabolome: size, flux, and regulation. Mol. Cell. 2015;58:699–706. doi: 10.1016/j.molcel.2015.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Brunk E, et al. Recon3D enables a three-dimensional view of gene variation in human metabolism. Nat. Biotechnol. 2018;36:272–281. doi: 10.1038/nbt.4072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Robinson JL, et al. An atlas of human metabolism. Sci. Signal. 2020;13:1–12. doi: 10.1126/scisignal.aaz1482. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Orth JD, Thiele I, Palsson BØ. What is flux balance analysis? Nat. Biotechnol. 2010;28:245–248. doi: 10.1038/nbt.1614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.de Mas IM, et al. Cancer cell metabolism as new targets for novel designed therapies. Future Med. Chem. 2014;6:1791–1810. doi: 10.4155/fmc.14.119. [DOI] [PubMed] [Google Scholar]
- 29.Jamialahmadi O, Hashemi-Najafabadi S, Motamedian E, Romeo S, Bagheri F. A benchmark-driven approach to reconstruct metabolic networks for studying cancer metabolism. PLoS Comput. Biol. 2019;15:e1006936. doi: 10.1371/journal.pcbi.1006936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Thiele, I. et al. Personalized whole‐body models integrate metabolism, physiology, and the gut microbiome. Mol. Syst. Biol. 10.15252/msb.20198982 (2020). [DOI] [PMC free article] [PubMed]
- 31.Agren R, et al. Reconstruction of genome-scale active metabolic networks for 69 human cell types and 16 cancer types using INIT. PLoS Comput. Biol. 2012;8:e1002518. doi: 10.1371/journal.pcbi.1002518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Folger O, et al. Predicting selective drug targets in cancer through metabolic networks. Mol. Syst. Biol. 2014;7:501–501. doi: 10.1038/msb.2011.35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lewis JE, Forshaw TE, Boothman DA, Furdui CM, Kemp ML. Personalized genome-scale metabolic models identify targets of redox metabolism in radiation-resistant tumors. Cell Syst. 2021;12:68–81.e11. doi: 10.1016/j.cels.2020.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Heinken A, Basile A, Hertel J, Thinnes C, Thiele I. Genome-scale metabolic modeling of the human microbiome in the era of personalized medicine. Annu. Rev. Microbiol. 2021;75:199–222. doi: 10.1146/annurev-micro-060221-012134. [DOI] [PubMed] [Google Scholar]
- 35.Moore C, et al. The INTERVAL trial to determine whether intervals between blood donations can be safely and acceptably decreased to optimise blood supply: study protocol for a randomised controlled trial. Trials. 2014;15:363. doi: 10.1186/1745-6215-15-363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Di Angelantonio E, et al. Efficiency and safety of varying the frequency of whole blood donation (INTERVAL): a randomised trial of 45 000 donors. Lancet. 2017;390:2360–2371. doi: 10.1016/S0140-6736(17)31928-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Sudlow C, et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12:1–10. doi: 10.1371/journal.pmed.1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Schmidt BJ, et al. GIM3E: condition-specific models of cellular metabolism developed from metabolomics and expression data. Bioinformatics. 2013;29:2900–2908. doi: 10.1093/bioinformatics/btt493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kaufman DE, Smith RL. Direction choice for accelerated convergence in hit-and-run sampling. Oper. Res. 1998;46:84–95. doi: 10.1287/opre.46.1.84. [DOI] [Google Scholar]
- 40.Gallagher, D., Chung, S. & Akram, M. Body Composition. in Encyclopedia of Human Nutrition 191–199 (Elsevier, 2013). 10.1016/B978-0-12-375083-9.00027-1.
- 41.Grynberg A, Demaison L. Fatty acid oxidation in the heart. J. Cardiovasc. Pharmacol. 1996;28:S11–S17. doi: 10.1097/00005344-199600003-00003. [DOI] [PubMed] [Google Scholar]
- 42.Drake KJ, Sidorov VY, McGuinness OP, Wasserman DH, Wikswo JP. Amino acids as metabolic substrates during cardiac ischemia. Exp. Biol. Med. 2012;237:1369–1378. doi: 10.1258/ebm.2012.012025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Dickinson JM, Rasmussen BB. Amino acid transporters in the regulation of human skeletal muscle protein metabolism. Curr. Opin. Clin. Nutr. Metab. Care. 2013;16:638–644. doi: 10.1097/MCO.0b013e3283653ec5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Lundsgaard A-M, Fritzen AM, Kiens B. Molecular regulation of fatty acid oxidation in skeletal muscle during aerobic exercise. Trends Endocrinol. Metab. 2018;29:18–30. doi: 10.1016/j.tem.2017.10.011. [DOI] [PubMed] [Google Scholar]
- 45.Monteiro-Cardoso VF, Corlianò M, Singaraja RR. Bile acids: a communication channel in the gut-brain axis. Neuromol. Med. 2021;23:99–117. doi: 10.1007/s12017-020-08625-z. [DOI] [PubMed] [Google Scholar]
- 46.McMillin M, DeMorrow S. Effects of bile acids on neurological function and disease. FASEB J. 2016;30:3658–3668. doi: 10.1096/fj.201600275R. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Rapoport SI. Arachidonic acid and the brain. J. Nutr. 2008;138:2515–2520. doi: 10.1093/jn/138.12.2515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Bosetti F. Arachidonic acid metabolism in brain physiology and pathology: lessons from genetically altered mouse models. J. Neurochem. 2007;102:577–586. doi: 10.1111/j.1471-4159.2007.04558.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Li J, Papadopoulos V, Vihma V. Steroid biosynthesis in adipose tissue. Steroids. 2015;103:89–104. doi: 10.1016/j.steroids.2015.03.016. [DOI] [PubMed] [Google Scholar]
- 50.Lotta LA, et al. A cross-platform approach identifies genetic regulators of human metabolism and health. Nat. Genet. 2021;53:54–64. doi: 10.1038/s41588-020-00751-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Julkunen H, Cichońska A, Slagboom PE, Würtz P. Metabolic biomarker profiling for identification of susceptibility to severe pneumonia and COVID-19 in the general population. Elife. 2021;10:1–20. doi: 10.7554/eLife.63033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Rye K-A, Bursill CA, Lambert G, Tabet F, Barter PJ. The metabolism and anti-atherogenic properties of HDL. J. Lipid Res. 2009;50:S195–S200. doi: 10.1194/jlr.R800034-JLR200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Santamarina-Fojo S, González-Navarro H, Freeman L, Wagner E, Nong Z. Hepatic lipase, lipoprotein metabolism, and atherogenesis. Arterioscler. Thromb. Vasc. Biol. 2004;24:1750–1754. doi: 10.1161/01.ATV.0000140818.00570.2d. [DOI] [PubMed] [Google Scholar]
- 54.Rousset X, Vaisman B, Amar M, Sethi AA, Remaley AT. Lecithin: cholesterol acyltransferase—from biochemistry to role in cardiovascular disease. Curr. Opin. Endocrinol. Diabetes Obes. 2009;16:163–171. doi: 10.1097/MED.0b013e328329233b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Connelly PW, Hegele RA. Hepatic lipase deficiency. Crit. Rev. Clin. Lab. Sci. 1998;35:547–572. doi: 10.1080/10408369891234273. [DOI] [PubMed] [Google Scholar]
- 56.Hodoğlugil U, Williamson DW, Mahley RW. Polymorphisms in the hepatic lipase gene affect plasma HDL-cholesterol levels in a Turkish population. J. Lipid Res. 2010;51:422–430. doi: 10.1194/jlr.P001578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.McCaskie P, et al. The C-480T hepatic lipase polymorphism is associated with HDL-C but not with risk of coronary heart disease. Clin. Genet. 2006;70:114–121. doi: 10.1111/j.1399-0004.2006.00659.x. [DOI] [PubMed] [Google Scholar]
- 58.Machado MV, et al. Mouse models of diet-induced nonalcoholic steatohepatitis reproduce the heterogeneity of the human disease. PLoS ONE. 2015;10:e0127991. doi: 10.1371/journal.pone.0127991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Testerink N, van der Sanden MHM, Houweling M, Helms JB, Vaandrager AB. Depletion of phosphatidylcholine affects endoplasmic reticulum morphology and protein traffic at the Golgi complex. J. Lipid Res. 2009;50:2182–2192. doi: 10.1194/jlr.M800660-JLR200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Sakai N, et al. Targeted disruption of the mouse lecithin:cholesterol acyltransferase (LCAT) gene. J. Biol. Chem. 1997;272:7506–7510. doi: 10.1074/jbc.272.11.7506. [DOI] [PubMed] [Google Scholar]
- 61.Yamashita, S. & Matsuzawa, Y. Low HDL and high HDL syndromes. in Encyclopedia of Endocrine Diseases.vol. 1 327–339 (Elsevier, 2018).
- 62.Gerl MJ, et al. Cholesterol is inefficiently converted to cholesteryl esters in the blood of cardiovascular disease patients. Sci. Rep. 2018;8:14764. doi: 10.1038/s41598-018-33116-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Ross, A. C. Retinol: properties and determination. in Encyclopedia of Food and Healthvol. 1821 604–609 (Elsevier, 2016).
- 64.Shirakami Y, Lee SA, Clugston RD, Blaner WS. Hepatic metabolism of retinoids and disease associations. Biochim. Biophys. Acta. 2012;1821:124–136. doi: 10.1016/j.bbalip.2011.06.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Murray JC, Gilgor RS, Lazarus GS. Serum triglyceride elevation following high-dose vitamin A treatment for pityriasis rubra pilaris. Arch. Dermatol. 1983;119:675–676. doi: 10.1001/archderm.1983.01650320049015. [DOI] [PubMed] [Google Scholar]
- 66.Vahlquist C, Michaëlsson G, Vahlquist A, Vessby B. A sequential comparison of etretinate (Tigason) and isotretinoin (Roaccutane) with special regard to their effects on serum lipoproteins. Br. J. Dermatol. 1985;112:69–76. doi: 10.1111/j.1365-2133.1985.tb02293.x. [DOI] [PubMed] [Google Scholar]
- 67.Bershad S, et al. Changes in plasma lipids and lipoproteins during isotretinoin therapy for acne. N. Engl. J. Med. 1985;313:981–985. doi: 10.1056/NEJM198510173131604. [DOI] [PubMed] [Google Scholar]
- 68.Redlich CA, et al. Effect of long-term beta-carotene and vitamin A on serum cholesterol and triglyceride levels among participants in the Carotene and Retinol Efficacy Trial (CARET) Atherosclerosis. 1999;145:425–432. doi: 10.1016/S0021-9150(99)00266-X. [DOI] [PubMed] [Google Scholar]
- 69.Duong PT, Weibel GL, Lund-Katz S, Rothblat GH, Phillips MC. Characterization and properties of pre beta-HDL particles formed by ABCA1-mediated cellular lipid efflux to apoA-I. J. Lipid Res. 2008;49:1006–1014. doi: 10.1194/jlr.M700506-JLR200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Huang H, Li Y, Liang J, Finkelman FD. Molecular regulation of histamine synthesis. Front. Immunol. 2018;9:1–7. doi: 10.3389/fimmu.2018.01392. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Lopez-Perez D, et al. In patients with obesity, the number of adipose tissue mast cells is significantly lower in subjects with type 2 diabetes. Front. Immunol. 2021;12:1–13. doi: 10.3389/fimmu.2021.664576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Jarido V, et al. The emerging role of mast cells in liver disease. Am. J. Physiol. 2017;313:G89–G101. doi: 10.1152/ajpgi.00333.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Clejan S, et al. Blood histamine is associated with coronary artery disease, cardiac events and severity of inflammation and atherosclerosis. J. Cell. Mol. Med. 2002;6:583–592. doi: 10.1111/j.1582-4934.2002.tb00456.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Inouye, M. et al. An immune response network associated with blood lipid levels. PLoS Genet. 6, e1001113 (2010). [DOI] [PMC free article] [PubMed]
- 75.Inouye, M. et al. Metabonomic, transcriptomic, and genomic variation of a population cohort. Mol. Syst. Biol. 6, 1–10 (2010). [DOI] [PMC free article] [PubMed]
- 76.Wang KY, et al. Histamine deficiency decreases atherosclerosis and inflammatory response in apolipoprotein e knockout mice independently of serum cholesterol level. Arterioscler. Thromb. Vasc. Biol. 2011;31:800–807. doi: 10.1161/ATVBAHA.110.215228. [DOI] [PubMed] [Google Scholar]
- 77.Eisenberg T, et al. Cardioprotection and lifespan extension by the natural polyamine spermidine. Nat. Med. 2016;22:1428–1438. doi: 10.1038/nm.4222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Soda K, Kano Y, Chiba F. Food polyamine and cardiovascular disease—an epidemiological study. Glob. J. Health Sci. 2012;4:170–178. doi: 10.5539/gjhs.v4n6p170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Cercato C, Fonseca FA. Cardiovascular risk and obesity. Diabetol. Metab. Syndr. 2019;11:74. doi: 10.1186/s13098-019-0468-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Monelli E, et al. Angiocrine polyamine production regulates adiposity. Nat. Metab. 2022;4:327–343. doi: 10.1038/s42255-022-00544-6. [DOI] [PubMed] [Google Scholar]
- 81.Mead JR, Ramji DP. The pivotal role of lipoprotein lipase in atherosclerosis. Cardiovasc. Res. 2002;55:261–269. doi: 10.1016/S0008-6363(02)00405-4. [DOI] [PubMed] [Google Scholar]
- 82.Sato H, et al. Analyses of group III secreted phospholipase A2 transgenic mice reveal potential participation of this enzyme in plasma lipoprotein modification, macrophage foam cell formation, and atherosclerosis. J. Biol. Chem. 2008;283:33483–33497. doi: 10.1074/jbc.M804628200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Rosenson RS, Hurt-Camejo E. Phospholipase A2 enzymes and the risk of atherosclerosis. Eur. Heart J. 2012;33:2899–2909. doi: 10.1093/eurheartj/ehs148. [DOI] [PubMed] [Google Scholar]
- 84.Giordanetto F, et al. Discovery of AZD2716: a novel secreted phospholipase A2 (sPLA2) inhibitor for the treatment of coronary artery disease. ACS Med. Chem. Lett. 2016;7:884–889. doi: 10.1021/acsmedchemlett.6b00188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Akinkuolie AO, et al. Group IIA secretory phospholipase A2, vascular inflammation, and incident cardiovascular disease: an analysis from the JUPITER trial. Arterioscler. Thromb. Vasc. Biol. 2019;39:1182–1190. doi: 10.1161/ATVBAHA.118.311894. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Oshiro C, Mangravite L, Klein T, Altman R. PharmGKB very important pharmacogene: SLCO1B1. Pharmacogenet. Genomics. 2010;20:211–216. doi: 10.1097/FPC.0b013e328333b99c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Adhyaru BB, Jacobson TA. Safety and efficacy of statin therapy. Nat. Rev. Cardiol. 2018;15:757–769. doi: 10.1038/s41569-018-0098-5. [DOI] [PubMed] [Google Scholar]
- 88.Lin J-P, et al. Association between the UGT1A1*28 allele, bilirubin levels, and coronary heart disease in the Framingham Heart Study. Circulation. 2006;114:1476–1481. doi: 10.1161/CIRCULATIONAHA.106.633206. [DOI] [PubMed] [Google Scholar]
- 89.Suh S, et al. Relationship between serum bilirubin levels and cardiovascular disease. PLoS ONE. 2018;13:e0193041. doi: 10.1371/journal.pone.0193041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.McArdle PF, et al. Association between bilirubin and cardiovascular disease risk factors: using Mendelian randomization to assess causal inference. BMC Cardiovasc. Disord. 2012;12:16. doi: 10.1186/1471-2261-12-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Babaev VR, et al. Macrophage EP4 deficiency increases apoptosis and suppresses early atherosclerosis. Cell Metab. 2008;8:492–501. doi: 10.1016/j.cmet.2008.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Gomez I, Foudi N, Longrois D, Norel X. The role of prostaglandin E2 in human vascular inflammation. Prostaglandins Leukot. Essent. Fat. Acids. 2013;89:55–63. doi: 10.1016/j.plefa.2013.04.004. [DOI] [PubMed] [Google Scholar]
- 93.Bauch HJ, Grünwald J, Vischer P, Gerlach U, Hauss WH. A possible role of catecholamines in atherogenesis and subsequent complications of atherosclerosis. Exp. Pathol. 1987;31:193–204. doi: 10.1016/S0232-1513(87)80001-4. [DOI] [PubMed] [Google Scholar]
- 94.Foulon P, De Backer D. The hemodynamic effects of norepinephrine: far more than an increase in blood pressure! Ann. Transl. Med. 2018;6:S25–S25. doi: 10.21037/atm.2018.09.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Pinaire J, Azé J, Bringay S, Cayla G, Landais P. Hospital burden of coronary artery disease: Trends of myocardial infarction and/or percutaneous coronary interventions in France 2009–2014. PLoS ONE. 2019;14:1–21. doi: 10.1371/journal.pone.0215649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Harchaoui KEL, Visser ME, Kastelein JJP, Stroes ES, Dallinga-Thie GM. Triglycerides and cardiovascular risk. Curr. Cardiol. Rev. 2009;5:216–222. doi: 10.2174/157340309788970315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Jamshidi N, Palsson B. Systems biology of SNPs. Mol. Syst. Biol. 2006;2:1–4. doi: 10.1038/msb4100077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Sánchez BJ, et al. Improving the phenotype predictions of a yeast genome-scale metabolic model by incorporating enzymatic constraints. Mol. Syst. Biol. 2017;13:935. doi: 10.15252/msb.20167411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Agren R, et al. Identification of anticancer drugs for hepatocellular carcinoma through personalized genome-scale metabolic modeling. Mol. Syst. Biol. 2014;10:721–721. doi: 10.1002/msb.145122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Raškevičius V, et al. Genome scale metabolic models as tools for drug design and personalized medicine. PLoS ONE. 2018;13:1–14. doi: 10.1371/journal.pone.0190636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Yizhak K, Gabay O, Cohen H, Ruppin E. Model-based identification of drug targets that revert disrupted metabolism and its application to ageing. Nat. Commun. 2013;4:2632. doi: 10.1038/ncomms3632. [DOI] [PubMed] [Google Scholar]
- 102.Valcárcel, L. V, Torrano, V., Tobalina, L., Carracedo, A. & Planes, F. J. rMTA: Robust metabolic transformation analysis. Bioinformatics10.1093/bioinformatics/btz231 (2019). [DOI] [PubMed]
- 103.Astle WJ, et al. The allelic landscape of human blood cell trait variation and links to common complex disease. Cell. 2016;167:1415–1429.e19. doi: 10.1016/j.cell.2016.10.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Bycroft C, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–209. doi: 10.1038/s41586-018-0579-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Loh P-R, et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 2016;48:1443–1448. doi: 10.1038/ng.3679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Gudmundsson S, Thiele I. Computationally efficient flux variability analysis. BMC Bioinforma. 2010;11:489. doi: 10.1186/1471-2105-11-489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Foguet, C. Cobrafunctions V1.0. Zenodohttps://zenodo.org/record/7277058 (2022).
- 108.Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. COBRApy: COnstraints-Based Reconstruction and Analysis for Python. BMC Syst. Biol. 2013;7:74. doi: 10.1186/1752-0509-7-74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Heirendt L, et al. Creation and analysis of biochemical constraint-based models using the COBRA Toolbox v.3.0. Nat. Protoc. 2019;14:639–702. doi: 10.1038/s41596-018-0098-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Li B, et al. Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression. Pac. Symp. Biocomput. 2018;23:448–459. [PMC free article] [PubMed] [Google Scholar]
- 111.Tavares, V., Monteiro, J., Vassos, E., Coleman, J. & Prata, D. Evaluation of genotype-based gene expression model performance: a cross-framework and cross-dataset study. Genes (Basel). 12, 1–12 (2021). [DOI] [PMC free article] [PubMed]
- 112.Hale AT, et al. Multi-omic analysis elucidates the genetic basis of hydrocephalus. Cell Rep. 2021;35:109085. doi: 10.1016/j.celrep.2021.109085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Chang CC, et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Ritchie, S. C. et al. Quality control and removal of technical variation of NMR metabolic biomarker data in ~120,000 UK Biobank participants. Preprint at medRxiv10.1101/2021.09.24.21264079 (2021). [DOI] [PMC free article] [PubMed]
- 115.Kuhn, M. Building Predictive Models in R Using the caret Package. J. Stat. Softw. 28, 1–26 (2008).
- 116.Davidson-Pilon C. lifelines: survival analysis in Python. J. Open Source Softw. 2019;4:1317. doi: 10.21105/joss.01317. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data from the INTERVAL35,36 and UK Biobank37 cohorts is under restricted access as it contains potentially identifying and sensitive patient information. It can be accessed by making a reasoned request to the INTERVAL coordination centre (https://www.intervalstudy.org.uk) and UKB (https://www.ukbiobank.ac.uk/), respectively. Response times from the data access committees are typically under one month. The summary statistics for the FWAS to blood metabolic features and CAD are provided in Supplementary Data 2, Supplementary Data 3 and Supplementary Data 4. The organ-specific genome-scale metabolic models generated in this work are available on the cobrafunctions GitHub repository, which is permanently archived by Zenodo107. HUMAN126 (version 1.11.0) can be obtained from the Human-GEM GitHub repository. The Harvey and Harvetta models (1_03c) are available in the Supporting Information of reference 30. The elastic net PredictDB models (GTEx v8) models13 are available at https://predictdb.org. The GTEx14 gene expression data (GTEx Analysis Release V8; dbGaP Accession phs000424.v8.p2) can be obtained from https://gtexportal.org.
The code used to generate personalised organ-specific flux maps from imputed gene expression data is available on GitHub and permanently archived by Zenodo107. qMTA requires the proprietary solver CPLEX(2.6 or newer), which is freely available to academic users.