Optimization of a modeling platform to predict oncogenes from genome‐scale metabolic networks of non‐small‐cell lung cancers

You‐Tyun Wang; Min‐Ru Lin; Wei‐Chen Chen; Wu‐Hsiung Wu; Feng‐Sheng Wang

doi:10.1002/2211-5463.13231

. 2021 Jul 20;11(8):2078–2094. doi: 10.1002/2211-5463.13231

Optimization of a modeling platform to predict oncogenes from genome‐scale metabolic networks of non‐small‐cell lung cancers

You‐Tyun Wang ¹, Min‐Ru Lin ¹, Wei‐Chen Chen ¹, Wu‐Hsiung Wu ¹, Feng‐Sheng Wang ^1,^✉

PMCID: PMC8329960 PMID: 34137202

Abstract

Cancer cell dysregulations result in the abnormal regulation of cellular metabolic pathways. By simulating this metabolic reprogramming using constraint‐based modeling approaches, oncogenes can be predicted, and this knowledge can be used in prognosis and treatment. We introduced a trilevel optimization problem describing metabolic reprogramming for inferring oncogenes. First, this study used RNA‐Seq expression data of lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) samples and their healthy counterparts to reconstruct tissue‐specific genome‐scale metabolic models and subsequently build the flux distribution pattern that provided a measure for the oncogene inference optimization problem for determining tumorigenesis. The platform detected 45 genes for LUAD and 84 genes for LUSC that lead to tumorigenesis. A high level of differentially expressed genes was not an essential factor for determining tumorigenesis. The platform indicated that pyruvate kinase (PKM), a well‐known oncogene with a low level of differential gene expression in LUAD and LUSC, had the highest fitness among the predicted oncogenes based on computation. By contrast, pyruvate kinase L/R (PKLR), an isozyme of PKM, had a high level of differential gene expression in both cancers. Phosphatidylserine synthase 1 (PTDSS1), an oncogene in LUAD, was inferred to have a low level of differential gene expression, and overexpression could significantly reduce survival probability. According to the factor analysis, PTDSS1 characteristics were close to those of the template, but they were unobvious in LUSC. Angiotensin‐converting enzyme 2 (ACE2) has recently garnered widespread interest as the SARS‐CoV‐2 virus receptor. Moreover, we determined that ACE2 is an oncogene of LUSC but not of LUAD. The platform developed in this study can identify oncogenes with low levels of differential expression and be used to identify potential therapeutic targets for cancer treatment.

Keywords: cancer cell metabolism, constraint‐based modeling, flux balance analysis, tissue‐specific metabolic models, trilevel optimization

Identifying differentially expressed genes is critical in exploring molecular mechanisms of cancers. A high level of differential gene expression is not an essential factor for tumorigenesis. We developed an optimization platform that can identify oncogenes with low levels of differential expression and potential therapeutic targets for cancer treatment. The platform was applied on LUAD and LUSC to illustrate its performance.

graphic file with name FEB4-11-2078-g003.jpg

Abbreviations

ACAA2: acetyl‐CoA acyltransferase 2
ACE2: angiotensin‐converting enzyme 2
BCAT1: branched chain amino acid transaminase 1
COBRA: constraint‐based reconstruction and analysis toolbox
CORDA: cost optimization reaction dependency assessment
COSMIC: Catalogue Of Somatic Mutations In Cancer
DEG: differentially expressed gene
ENO1: enolase 1
FVA: flux variability analysis
GLO1: glyoxalase I
GPR: gene–protein–reaction
GSMN: genome‐scale metabolic network
HPA: human protein atlas
iMAT: integrative metabolic analysis tool
LUAD: lung adenocarcinoma
LUSC: lung squamous cell carcinoma
MFVA: metabolite‐flow variability analysis
NCI‐60: 60 human tumor cell line anticancer drug screen from the US National Cancer Institute
NF‐κB: nuclear factor‐κB
NHDE: nested hybrid differential evolution
NSCLC: non‐small‐cell lung carcinoma
PKLR: pyruvate kinase L/R
PKM: pyruvate kinase
PSPH: phosphoserine phosphatase
PTDSS1: phosphatidylserine synthase 1
SARS‐CoV‐2: severe acute respiratory syndrome coronavirus 2
SHMT1: serine hydroxymethyltransferase 1
SLC: solute carrier
TCGA: The Cancer Genome Atlas
TLOP: trilevel optimization problem
VMH: Virtual Metabolic Human

Lung carcinoma is one of the most common malignancies, resulting in the largest number of cancer‐related deaths worldwide [1]. Two main subtypes of lung cancer exist, namely small‐cell lung carcinoma and non‐small‐cell lung carcinoma (NSCLC), accounting for 15% and 85% of all lung cancers, respectively [2]. Lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC), two main subtypes of NSCLC, are predominant lung cancers, accounting for 40% and 33%, respectively, of cancer deaths worldwide [3]. Lung carcinoma is related to genetic and epigenetic dysregulation, and understanding its biological mechanism is crucial for developing effective treatment.

Systems biology approaches and big database mining have been applied to construct genetic and epigenetic networks with next‐generation sequencing data of LUAD and LUSC for comparing the differential genetic and epigenetic progression mechanisms [4]. Such approaches might enable the deciphering of genotype discrepancy for both tissues. However, genes and their expression alone do not always constitute a reliable indicator of cellular phenotype. The recent availability of omics datasets allows the analysis of cellular characteristics at the levels of genes, mRNAs, proteins, and metabolites. A genome‐scale metabolic network (GSMN) can offer a biological mechanism that links genotype to phenotype; it can help us understand cell physiology and certain disease phenotypes caused by metabolic dysregulation [5, 6]. Human metabolism is complex and specialized in different tissue and cell types. The reprogramming of tissue‐specific metabolism in GSMNs will provide deeper insights into the metabolic basis of various physiological and pathological processes. Such metabolic reprogramming approaches have been applied to predict oncogenes, essential enzymes, and drug targets for developing novel medical treatments [7, 8, 9, 10, 11].

Cancer metabolism is an emerging hallmark of cancer [12]. Genetic alterations and epigenetic modifications of cancer cells result in the abnormal regulation of cellular metabolic pathways that differ from normal cells. GSMNs combined with constraint‐based modeling approaches can predict the metabolic reprogramming of cancer cells to reveal oncogenes, essential enzymes, and drug targets for developing novel medical treatments [13, 14, 15, 16, 17, 18, 19, 20, 21, 22]. The first large‐scale reconstructed metabolic model for cancer was built based on the gene expression data of all cancer cell lines in the NCI‐60 collection and the human general metabolic network (Recon 1) [14, 23]. This model was applied on the identification of essential genes and cytostatic drug targets of cancer cell lines [14] and prediction of metabolic targets for inhibiting cancer migration [23]. The GSMN (iHepatocytes2322) for hepatocytes was reconstructed by extending Recon 1 using data from Human Metabolic Reaction 2.0 database and proteomics data in Human Protein Atlas (https://metabolicatlas.org/). This GSMN was used to identify PSPH, SHMT1, and BCAT1 as potential therapeutic targets for the treatment of nonalcoholic steatohepatitis using the transcriptomics data obtained from patients with nonalcoholic fatty liver disease [16]. Another small‐scale constraint‐based model was created and combined with machine‐learning techniques to investigate the mechanism of pyruvate dehydrogenase under hypoxia [13].

Due to the complexity and specialization of human metabolism in different tissue and cancer cells, mapping tissue‐specific metabolisms in GSMNs can advance the understanding of cancer metabolism [24]. Recon 2.2 and Recon 3D are the most comprehensive human genome‐scale network reconstructions [25, 26]. Recon 2.2 was incorporated with the Human Protein Atlas (HPA) [27] to reconstruct GSMNs of the colorectal tissue [18] and head‐and‐neck tissue [21] at normal and cancerous states. Recon 2M.2 [28, 29] integrated with RNA‐Seq data from NCI‐60 cell lines presented a systematic framework for the generation of gene–transcript–protein–reaction that enables the accurate prediction of metabolic behaviors. Recon 3D is a human general genome‐scale network reconstruction that includes three‐dimensional metabolite and protein structure data and enables an integrated analysis of metabolic functions in humans. In this study, we first applied the CORDA method [30] integrated with The Cancer Genome Atlas (TCGA) database [31] and Recon 3D to reconstruct GSMNs for LUAD, LUSC, and their healthy cells. Multivariate analysis was used to analyze the reactions, metabolites, and enzyme‐encoding genes of these GSMNs to discriminate differential expressions between normal and cancer cells. The oncogene inference optimization formulation [18, 21] was used to mimic gene screening procedures in a wet laboratory to evaluate the mechanism by which gene dysregulations induce tumorigenesis.

Materials and methods

Reconstruction of tissue‐specific metabolic models

This study applied RNA‐Seq data from TCGA database to reconstruct genome‐scale metabolic models (Fig. 1) for LUAD and LUSC and their corresponding healthy tissues. These metabolic models were entirely flux‐consistent inspected by the 'findFluxConsistentSubset' function of COBRA toolbox. In total, 533 and 502 samples for LUAD and LUSC and 59 and 49 corresponding healthy samples were collected, respectively. Quantile normalization was applied to normalize the raw data for healthy and cancerous samples to compute the mean, confidence interval, and coefficient of dispersion for each gene. Such data were then used to evaluate supportive genes and obtain a high differential expression of enzyme‐encoding genes between the cancer and healthy cells. Recon 3D consisted of 2247 enzyme‐encoding genes, which were classified into four levels based on their participation, namely high, medium, low, and not detected. Four groups of confidence reactions, namely high, medium, negative, and others, were obtained through gene–protein–reaction association in Recon 3D. The tissue‐specific GSMNs of healthy and cancer cells were reconstructed using the CORDA algorithm and saved in SBML format. We developed a systems biology program (SBP) platform to automatically develop the stoichiometric models and GPR model in GAMS format to perform the simulation.

Oncogene inference optimization

The oncogene inference optimization framework was modified from a trilevel optimization problem (TLOP) that has been applied to analyze colorectal cancer [18]. In this study, the objectives in the TLOP considered the consistency between the change trends of mutant fluxes/metabolite flows compared with the template and that of cancer cells, and applied fuzzy equal constraints to quantitatively limit the change ratios of the mutant as close as possible to the template. These objectives in the modified approach can qualitatively and quantitatively optimize the flux pattern of a mutant as close as possible to the template. This oncogene inference optimization framework has been used to detect tumor suppressor genes in head‐and‐neck squamous cell carcinoma [21] and can be used as a mimic experiment for gene screening to predict oncogenes, similar to metabolic transformation algorithm for identification of drug targets [10, 32]. The outer optimization aims to infer the dysregulation of enzyme‐encoding genes that alter the metabolism of normal cells leading to cancer, and the inner optimization problems present perturbed behaviors of mutant cells. The mathematical formulation is expressed as follows:

\{\begin{cases} Outer optimization problem: \\ Similarity ratio of metabolite-flow rates and fluxes to the template: \\ max_{δ, z_{i}} S R_{M}, max_{δ, z_{i}} S R_{F} \\ Fuzzy equal grade of metabolite-flow rates and fluxes \\ compared to the template: \\ \tilde{\underset{δ, z_{i}}{Equal}} L F C_{M}^{MUBL} \approx L F C_{M}^{CABL}, \tilde{\underset{δ, z_{i}}{Equal}} L F C_{F}^{MUBL} \approx L F C_{F}^{CABL} \\ subject to the inner optimization problems: \\ \{\begin{cases} Flux balance analysis (FBA) problem \\ max_{v_{f / b}} o b j \equiv (w_{ATP} v_{ATP} + w_{biomass} v_{biomass}) \\ subject to \\ N (v_{f} ‐ v_{b}) = 0 \\ v_{f / b, i}^{LB} \leq v_{f / b, i} \leq v_{f / b, i}^{UB}, z_{i} \notin Ω^{MU} \\ v_{f / b, j}^{L B, M U} \leq v_{f / b, j} \leq v_{f / b, j}^{U B, M U}, z_{j} \in Ω^{MU} \end{cases} \\ \{\begin{cases} Uniform flux distribution (UFD) problem \\ min_{v_{f / b}} \sum_{i \in Ω^{Int}} {(v_{f, k})}^{2} + {(v_{b, k})}^{2} \\ subject to \\ N (v_{f} ‐ v_{b}) = 0 \\ v_{f / b, i}^{LB} \leq v_{f / b, i} \leq v_{f / b, i}^{UB}, z_{i} \notin Ω^{MU} \\ v_{f / b, j}^{L B, M U} \leq v_{f / b, j} \leq v_{f / b, j}^{U B, M U}, z_{j} \in Ω^{MU} \\ o b j \geq o b j^{*} \end{cases} \end{cases}

(1)

where v _f/b is the forward/backward flux vector of reversible reactions; N is an m × n stoichiometric matrix where m is the number of metabolites, and n is the number of reactions; $v_{f / b, i}^{LB}$ and $v_{f / b, i}^{UB}$ are the positive lower and upper bounds of the ith forward/backward flux, respectively; $v_{f / b, i}^{L B, M U}$ and $v_{f / b, i}^{U B, M U}$ are the positive lower and upper bounds of the ith upregulation, downregulation, or knockout flux in the set of mutated reactions Ω ^MU due to the ith enzyme dysregulation, which is determined using the GPR model; obj ^* is the maximum cellular objective obtained from the flux balance analysis (FBA) problem; $\tilde{Equal}$ is the fuzzy equal objective function that represent the fuzzy goals. For example, the LFC_m ^MUBL and LFC_m ^CABL should be restored to a state that is as close as possible; the integer vector z is used to determine mutated enzymes; and δ is the regulated strength parameter for the mutants with a value within (0, 1].

The GPR model used a pseudo‐enzyme coding number strategy to represent GPR associations in Recon 3D [18]. It identified redundant pseudoenzymes and isozymes in the model such that the reactions were catalyzed through reduced association. Therefore, pseudoenzymes were applied to determine modulated genes, and the level of the mutated bounds was computed using the following equations:

\begin{matrix} Upregulation: \\ \{\begin{cases} (1 ‐ δ) v_{f, i}^{basal} + δ v_{f, i}^{UB} \leq v_{f, i} \leq v_{f, i}^{UB} \\ v_{b, i}^{LB} \leq v_{b, i} \leq (1 ‐ δ) v_{b, i}^{basal} + δ v_{b, i}^{LB} ; i \in Ω^{MU} \end{cases} \\ Downregulation : \\ \{\begin{cases} v_{f, i}^{LB} \leq v_{f, i} \leq (1 ‐ δ) v_{f, i}^{basal} + δ v_{f, i}^{LB} \\ (1 ‐ δ) v_{b, i}^{basal} + δ v_{b, i}^{UB} \leq v_{b, i} \leq v_{b, i}^{UB} ; i \in Ω^{MU} \ Ω^{IZ} \end{cases} \\ \{\begin{cases} v_{f, i}^{LB} \leq v_{f, i} \leq v_{f, i}^{UB} \\ v_{b, i}^{LB} \leq v_{b, i} \leq v_{b, i}^{UB} ; i \in Ω^{MU} \cap Ω^{IZ} \end{cases} \\ Knockout : \\ \{\begin{cases} v_{f, i} = 0 \\ v_{b, i} = 0 ; i \in Ω^{MU} \ Ω^{IZ} \end{cases} \\ \{\begin{cases} v_{f, i}^{LB} \leq v_{f, i} \leq v_{f, i}^{UB} \\ v_{b, i}^{LB} \leq v_{b, i} \leq v_{b, i}^{UB} ; i \in Ω^{MU} \cap Ω^{IZ} \end{cases} \end{matrix}

(2)

where $v_{f / b . i}^{basal}$ is the basal flux in the normal state, and Ω ^IZ is the set of reactions regulated by isozymes represented in the GPR model.

Multiple objectives are considered in the outer optimization problem in Eqn (1). In the first and second objectives, the similarity ratios of metabolite‐flow rates and fluxes (SR_M and SR_F ) are maximized for determining a dysregulated metabolite‐flow/flux pattern that is as similar as possible to the template. The third and fourth objectives are used to obtain a mutant log2 fold change, $L F C_{M / F}^{MUBL}$ , of metabolite‐flow rates/fluxes as close as possible to that of the template, $L F C_{M / F}^{CABL}$ . The similarity ratios of the metabolite‐flow rates/fluxes (SR_M and SR_F ) for a mutant are evaluated as follows:

S R_{M / F} = \frac{\sum_{m = 1}^{N_{M / F}} |μ_{m}^{M / F}|}{N_{M / F}}

(3)

where the similarity indicator ( $μ_{m}^{M / F}$ ) for each metabolite‐flow rate or flux in the metabolic network is defined as follows:

μ_{m}^{M / F} = \{\begin{cases} 1, if L F C_{M / F, m}^{MUBL} > t o l_{+} and L F C_{M / F, m}^{CABL} > t o l_{+} \\ ‐ 1, if L F C_{M / F, m}^{MUBL} < t o l_{‐} and L F C_{M / F, m}^{CABL} < t o l_{‐} \\ 0, otherwise \end{cases}

(4)

where log2 fold changes, $L F C_{M / F}^{CABL}$ , of the metabolite‐flow rates/fluxes of templates are computed from the reconstructed GSMNs for cancer and normal cells in advance.

The tolerances for increase or decrease are defined as $t o l_{+} = \log_{2} (1 + ε)$ and $t o l_{‐} = \log_{2} (1 ‐ ε)$ , respectively, and ε is the percentage of flux alteration. A numerical example is provided (Doc. S1) to illustrate the computation of flux template, similarity ratio, and logarithmic fold change. The log2 fold change between the metabolite‐flow rate of the mth metabolite in cancer or dysregulated (denoted as MU) and normal states (denoted as BL) is computed as follows:

L F C_{M, m}^{MUBL} = \log_{2} (\frac{r_{m, M U}}{r_{m, B L}})

(5)

where the metabolite‐flow rate is a pool of flux‐sum synthesis rates of the mth metabolite in the dysregulated or normal cells, and expressed as follows:

r_{m} = \sum_{s \in Ω^{c}} (\sum_{N_{ij} > 0, j} N_{ij} v_{f, j} ‐ \sum_{N_{ij} < 0, j} N_{ij} v_{b, j}), m \in Ω^{m}

(6)

where N_ij is the stoichiometric coefficient of the ith metabolite participating in the jth reaction; Ω ^c is the set of metabolites located in different compartments; and Ω ^m is the set of metabolites in the GSMN. The bracket in Eqn (6) indicates the synthesis rates of the mth metabolite at its located compartment (i.e., sum up the forward fluxes, v_f,j , and backward fluxes, v_b,j , of the metabolite). The log2 fold change of the forward/backward flux in dysregulated and normal states is defined as follows:

L F C_{F, f / b}^{MUBL} = \log_{2} (\frac{v_{f / b, M U}}{v_{f / b, B L}})

(7)

The fold changes of the template, $L F C_{M / F, m}^{CABL}$ can be obtained by applying to above definition of $L F C_{M / F, m}^{MUBL}$ on a reconstructed cancer model instead of dysregulated models. Note that the templates are the flux distribution patterns for cancer and normal tissue. The templates can obtain from clinical data if they are available; otherwise, they were computed from the FBA and UFD problems without the dysregulated restrictions.

Fitness evaluation

The TLOP in Eqn (1) is a mixed‐integer optimization problem that is NP‐hard [33]. Classical algorithms for solving bilevel optimization problems use duality theory to convert the inner‐level optimization problem into constraints in the outer‐level problem. However, duality transformation is difficult for multilevel optimization problems, such as the TLOP in this study. We applied the NHDE algorithm (Doc. S2), which has been used to solve oncogene inference problems [18, 21], to infer the oncogenes of LUAD and LUSC. The problem [Eqn (1)] consisted of the crisp objectives and fuzzy equal objective to introduce a combination of weighted‐sum and minimum decisions for evaluating the fitness, η _D , which was used in the NHDE algorithm as follows:

η_{D} = [(η_{S} + η_{E}) / 2 + min \{η_{S}, η_{E}\}] / 2

(8)

where η _S is the average similarity ratios of SR_M and SR_F , and the membership grade, η _E , is used to measure how close the fuzzy equal objective (logarithmic fold change of the metabolite‐flow rates/fluxes) of the mutant is to the template.

The fuzzy equal objective for each metabolite is quantified by eliciting a membership function. In this study, the membership function is a combination of the left‐hand ( $η_{m}^{L}$ ) and right‐hand ( $η_{m}^{R}$ ) side linear membership functions, as shown in Fig. 2. The mathematical expressions are respectively formulated as follows:

η_{m}^{L} (L F C_{m}^{MUBL}) = \frac{L F C_{m}^{MUBL} ‐ L F C_{m}^{C A B L, L B}}{L F C_{m}^{CABL} ‐ L F C_{m}^{C A B L, L B}}

(9)

η_{m}^{R} (L F C_{m}^{MUBL}) = \frac{L F C_{m}^{C A B L, U B} ‐ L F C_{m}^{MUBL}}{L F C_{m}^{C A B L, U B} ‐ L F C_{m}^{CABL}}

(10)

where $L F C_{m}^{C A B L, L B}$ and $L F C_{m}^{C A B L, U B}$ are the lower and upper bounds of the log2 fold change of metabolite‐flow rate/flux in the cancer and basal states for the mth metabolite, and their levels can be provided by the user in advance as follows:

L F C_{m}^{C A B L, L B} = \{\begin{cases} L F C_{m}^{CABL} / 4, if L F C_{m}^{CABL} > 0 \\ 4 L F C_{m}^{CABL}, if L F C_{m}^{CABL} < 0 \end{cases}

(11)

L F C_{m}^{C A B L, U B} = \{\begin{cases} 4 L F C_{m}^{CABL}, if L F C_{m}^{CABL} \geq 0 \\ L F C_{m}^{CABL} / 4, if L F C_{m}^{CABL} \leq 0 \end{cases}

(12)

_{The fuzzy equal membership grade for each metabolite/flux is elicited as follows:}

η_{m} (L F C_{m}^{MUBL}) = max \{min [η_{m}^{L} (L F C_{m}^{MUBL}), η_{m}^{R} (L F C_{m}^{MUBL}), 1], 0\}

(13)

Fig. 2 — Fuzzy equal membership function. Fuzzy equal membership grade, $η_{m} (L F C_{m}^{MUBL})$ , of a mutant consisting of a left‐hand side membership function, $η_{m}^{L}$ , and right‐hand side membership function, $η_{m}^{R}$ , applied to evaluate the closeness for $L F C_{m}^{MUBL}$ of the mutant to the template. The membership grade is zero if $L F C_{m}^{MUBL}$ exceeds the lower bound ( $L F C_{m}^{C A B L, L B}$ ) or upper bound ( $L F C_{m}^{C A B L, U B}$ ). Conversely, the membership grade is between 0 and 1 if the membership function is within its bounds.

The decision grade of the network sums the membership grades for all metabolites/fluxes as $η_{E} = \frac{1}{M} \sum_{m = 1}^{M} η_{m} (L F C_{m}^{MUBL})$ , which is between 0 and 1. The decision grade differs from a least‐square error criterion in regression methods. According to the definition of the fuzzy equal membership function in Eqn (13), the grade is zero if $L F C_{m}^{MUBL}$ exceeds the lower bound ( $L F C_{m}^{C A B L, L B}$ ) or upper bound ( $L F C_{m}^{C A B L, U B}$ ). Conversely, the membership grade is between 0 and 1 if the membership function is within its bounds.

Metabolite‐flow variability analysis

Generally, the optimal fluxes of FBA in the TLOP problem could have many distributions with an identical objective value. Bias in similarity ratios may be yielded through such an evaluation. To overcome such a drawback, flux variability analysis (FVA) can be applied in a posterior inspection to determine the maximum and minimum values of all fluxes that satisfy the constraints and allow for the same optimal objective value. FVA can be applied to compute minimum and maximum fluxes to yield a flux space of a metabolic network [34]. Moreover, it must cover all objective values of the cell growth because cancer cell growth may not proliferate sustainably at its maximum rate. In this study, we introduced metabolite‐flow variability analysis (MFVA) to compute the minimum and maximum quantities of each metabolite for the normal model and the mutants, respectively. The MFVA formulation was expressed as follows:

\begin{matrix} MFVA Problem for cancer, normal, and mutant cases \\ \{\begin{cases} \underset{ζ \in (0, 1]}{max / min} r_{m} \\ subject to the inner optimization problems: \\ \{\begin{cases} FBA Problem: \\ max_{v_{f / b}} o b j \equiv (w_{ATP} v_{ATP} + w_{biomass} v_{biomass}) \\ N^{C A / B L} (v_{f} ‐ v_{b}) = 0 \\ v_{f / b, j}^{LB} \leq v_{f / b, j} \leq v_{f / b, j}^{UB} \end{cases} \\ \{\begin{cases} UFD problem: \\ min_{v_{f / b}} (\sum_{f} v_{f}^{2} + \sum_{b} v_{b}^{2}) \\ N^{C A / B L} (v_{f} ‐ v_{b}) = 0 \\ v_{f / b, i}^{LB} \leq v_{f / b, i} \leq v_{f / b, i}^{UB} \\ o b j \geq ζ o b j^{*} \end{cases} \end{cases} \end{matrix}

(14)

The metabolite‐flow intervals, $[r_{m}^{min}, r_{m}^{max}]$ , for cancer and normal cell and each mutant can be obtained through MFVA, and were used to determine the trend of flux change between dysregulated case and normal situation in terms of seven categories of classification [21]. However, the categories were a qualitative measure to determine the trend of flux change between mutant and normal case. In this study, we introduced the interval arithmetic [35, 36] for the fuzzy equal membership grades in Eqn (13) to yield the interval membership grade as a quantitative measure to determine how much close to the template for each mutant. The interval decision grade, [η _E ] _i , for GSMN of the ith mutant was calculated as follows:

{[η_{E}]}_{i} = {[η_{E, min}, η_{E, max}]}_{i} = \frac{1}{M} {[\sum_{m = 1}^{M} η_{m, min}, \sum_{m = 1}^{M} η_{m, max}]}_{i}

(15)

The computational procedures of the minimum and maximum log2 fold changes (η _E _,min and η _E _,max) are explained in detail in Doc. S3.

Results and discussions

Analysis of tissue‐specific metabolic models

The GSMN of Recon 3D were downloaded from VHM database (https://www.vmh.life) and consisted of 5835 metabolites, 10600 reactions, and 2247 associated genes. For LUAD and LUSC, 533 and 502 cancer samples and 59 and 49 corresponding healthy samples, respectively, were obtained from TCGA database. The GSMNs for healthy and cancerous lung tissues were reconstructed using the CORDA algorithm, and statistics of the metabolites and reactions for LUAD and LUSC are presented in Fig. 3. The four models had 3360 metabolites, 5125 reactions, and 1747 genes in common, as shown in the overlapping region in Fig. 3. The cancer models comprised 3773 metabolites, 6158 reactions, and 1901 genes for LUAD and 3836 metabolites, 6290 reactions, and 1962 genes for LUSC (Fig. 3). The corresponding heathy models consisted of 4227 metabolites, 6803 reactions, and 1907 genes for LUAD and 4254 metabolites, 6761 reactions, and 1916 genes for LUSC. Twelve metabolic pathways with top‐ranked number of metabolites and reactions for both tissues are shown in Fig. 4. From the classification, we observed more than 900 and 300 metabolites for fatty acyls and carboxylic acids, respectively, for both GSMNs; more than 1500 and 900 reactions for extracellular transports and fatty acid oxidation, respectively, were observed.

Fig. 3 — Statistics of reconstructed metabolic models. Statistics of reconstructed metabolic models for LUAD and LUSC and their corresponding healthy models. LUAD‐CA and LUSC‐CA indicate the cancer models for LUAD and LUSC, respectively, and LUAD‐HT and LUSC‐HT denote their corresponding healthy models, respectively. The number in the overlapping regions of two, three, and four models indicates their common elements.

Fig. 4 — Statistics of major metabolic pathways. Classification of metabolites and reactions for the reconstructed GSMNs for LUAD and LUSC. The classifications for metabolites and reactions were sought through the definitions in the HMDB database (https://hmdb.ca/) and the VMH database (https://www.vmh.life/), respectively.

Identifying differentially expressed genes (DEGs) is critical in exploring molecular mechanisms of biological conditions [37]. We assessed P values and fold changes (log₂(CA/HT)) using ANOVA in the sas® software (https://www.sas.com/) to determine the differential expressions of enzyme‐encoding genes for LUAD and LUSC between the normal and tumor samples from TCGA database (Doc. S4). In total, 159 and 241 enzyme‐encoding genes for LUAD and LUSC, respectively, were within the absolute values of log2 fold change > 2 and P < 0.05, as shown in the volcano plots (Doc. S4). Such DEGs were used as a set of candidate genes to solve the oncogene inference optimization problem.

Inferred oncogenes

The Catalogue Of Somatic Mutations In Cancer (COSMIC) database (https://cancer.sanger.ac.uk/cosmic) have collected 723 cancer genes that are somatically mutated and causally implicated in human cancer, including 45 enzyme‐encoding genes involved in Recon 3D. Total 25 out of the 45 enzyme‐encoding genes regulate reactions in Recon 3D according to the GPR association. To demonstrate the effectiveness of the oncogene inference optimization algorithm, the similarity ratio (η _S ) and membership grade (η _E ) of each dysregulation of the 25 enzyme‐encoding genes for LUAD and LUSC tissue‐specific GSMNs reconstructed based on data from different databases (TCGA and HPA) were computed (Fig. 5). The results show most of the similarity ratios for LUAD are greater than 0.76, except CANT1 and SLC34A2, and greater than 0.8 for LUSC. However, the membership grade varies from case to case (with value ranging from 0.16 to 0.76).

Fig. 5 — Similarity ratios and membership grades for dysregulated genes. The dysregulated genes were retrieved from the COSMIC database (https://cancer.sanger.ac.uk/cosmic). TCGA and HPA indicate the similarity ratios and membership grades computed using the GSMNs reconstructed based on data from the TCGA and HPA databases, respectively. *ATP1A1* and *CHST11* genes are not available in the GPR association of the reconstructed GSMNs.

The NHDE algorithm in [18, 21] was applied to evaluate the fitness in inferring carcinogenesis for all candidate enzyme‐encoding genes. High DEGs (159 genes for LUAD and 241 genes for LUSC) for both tissues were first applied individually to enable the TLOP to determine the carcinogenicity of each gene. We obtained 9 of 159 genes for LUAD and 21 of 241 genes for LUSC that have high levels of differential expression in the computation. The optimal fitness, η _D , for these dysregulations were within [0.69, 0.76] for LUAD and [0.76, 0.81] for LUSC (Doc. S5). Additionally, the NHDE algorithm was also applied to identify oncogenes that have a low level of differential expression. We determined that 36 and 63 oncogenes for LUAD and LUSC had > 0.72 and > 0.76 optimal fitness, respectively (Doc. S5), in the computation.

Tables 1 and 1, 2 list the 15 top‐ranked oncogenes for LUAD (η _D > 0.75) and LUSC (η _D > 0.85), respectively. Pyruvate kinase (PKM) had the highest fitness (η _D = 0.781), and it plays a pivotal role in regulating glucose‐derived carbon from catabolic to biosynthetic pathways. Its dysregulation is a hallmark of tumorigenesis [38, 39, 40] and leads to several cancers, such as breast cancer, renal cell carcinoma, hepatocellular carcinoma, and colorectal cancer. There are four isozymes of pyruvate kinase in mammals (L, R, M1, and M2) encoded by two different genes: PKLR and PKM. The L and R isozymes are generated from the PKLR by differential splicing of RNA; the M1 and M2 forms are produced from the PKM gene by differential splicing. The PKLR gene achieved the ninth highest inferred oncogene, η _D = 0.744, in Table 1. The optimal fitness for the dysregulated enzymes was > 0.744, which indicated that metabolic alterations for both dysregulations were > 74% consistent with the template. We also reconstructed the tissue‐specific GSMNs of LUAD and LUSC using gene expression data from the HPA database. Such models were applied to inspect the fitness of the inferred oncogenes from TCGA database, as shown in Tables 1 and 2. In comparison with data obtained from both approaches, we could decipher that higher fitness value indicates higher possibility of tumorigenesis.

Table 1.

Top 15 inferred oncogenes of LUAD. η _S ^TCGA and η _S ^HPA are the average similarity ratios for reconstructed GSMNs based on the data from the TCGA and HPA databases, respectively. η _E ^TCGA and η _E ^HPA are the membership grades of the fuzzy equal function for reconstructed GSMNs based on the data from the TCGA and HPA databases, respectively. Higher η _E values indicate higher consistency of the flux alterations with the template. DEG and P value were calculated in sas® software. The pathway for each gene was found from the GeneCards (https://www.genecards.org/) and VMH (https://www.vmh.life/) databases. A gene is biological significant if |DEG| > 2 and P value < 0.05.

Gene	DEG ^a	P value ^b	(η _S ^TCGA , η _E ^TCGA )	(η _S ^HPA _, η _E ^HPA )	Pathway	Disease (score) ^c
PKM	0.98	1.01E−62	(0.842, 0.761)	(0.874, 0.734)	Abacavir pathway	Breast adenocarcinoma (0.90)
ENO1	1.36	4.78E−78	(0.839, 0.745)	(0.856, 0.747)	HIF‐1‐α transcription factor network	Lung cancer susceptibility 3 (0.72)
PTDSS1	0.17	1.29E−06	(0.835, 0.741)	(0.851, 0.725)	Glycerophospholipid biosynthetic pathway	Polyneuropathy (1.34)
GLTP	−0.17	6.10E−06	(0.834, 0.736)	(0.872, 0.737)	Sphingolipid metabolism	Cervical squamous cell carcinoma (0.44)
OCRL	0.52	3.91E−49	(0.831, 0.738)	NA	3‐phosphoinositide degradation	Lowe oculocerebrorenal syndrome (2.11)
CA12	1.60	5.26E−12	(0.831, 0.736)	(0.870, 0.439)	Nitrogen metabolism	Hemangioma of subcutaneous tissue (1.42), lung cancer (0.37)
SLC22A7	0.79	3.10E−03	(0.837, 0.728)	(0.873, 0.737)	Zidovudine pathway	renal cell carcinoma (0.64)
PLPP1	−0.17	3.30E−02	(0.825, 0.737)	(0.545, 0.165)	Triacylglycerol biosynthesis	Myxosarcoma (1.33)
PKLR	2.17	4.34E−04	(0.830, 0.715)	(0.875, 0.740)	Abacavir pathway	Intracortical osteogenic sarcoma (1.50)
MTHFD2	1.53	1.15E−47	(0.832, 0.727)	(0.878, 0.757)	Nucleotide metabolism	Mitochondrial complex I deficiency (0.87)
SLC25A11	−0.13	6.69E−03	(0.842, 0.716)	(0.868, 0.734)	Glucose metabolism	Paragangliomas 6 (2.83)
ALDH4A1	0.23	8.64E−04	(0.835, 0.723)	(0.858, 0.731)	Alanine, aspartate and glutamate metabolism	Hyperprolinemia, type Ii (2.48)
GAPDH	1.93	4.52E−61	(0.826, 0.731)	(0.855, 0.733)	Cori cycle	Angioimmunoblastic T‐cell lymphoma (1.14)
SLC13A5	3.24	2.64E−07	(0.835, 0.721)	(0.860, 0.731)	Transport of glucose and other sugars	Nasal cavity benign neoplasm (1.50)
SLC20A1	0.92	2.65E‐22	(0.834, 0.721)	(0.867, 0.736)	Glucose/energy metabolism	Leukemia (0.86)

Open in a new tab

^{^a}

DEG = log2(CA/HT) denotes a differential expression gene and is computed from cancer and healthy samples of TCGA datasets.

^{^b}

P value is computed from cancerous and healthy samples of TCGA datasets.

^{^c}

Diseases and scores are obtained from the GeneCards database.

Table 2.

Top 15 inferred oncogenes of LUSC. η _S ^TCGA and η _S ^HPA are the average similarity ratios for reconstructed GSMNs based on the data from the TCGA and HPA databases, respectively. η _E ^TCGA and η _E ^HPA are the membership grades of the fuzzy equal function for reconstructed GSMNs based on the data from the TCGA and HPA databases, respectively. Higher η _E values indicate higher consistency of the flux alterations with the template. DEG and P value were calculated in sas® software. The pathway for each gene was found from the GeneCards (https://www.genecards.org/) and VMH (https://www.vmh.life/) databases. A gene is biological significant if |DEG| > 2 and P value < 0.05.

Gene	DEG ^a	P value ^b	(η _S ^TCGA , η _E ^TCGA )	(η _S ^HPA _, η _E ^HPA )	Pathway	Disease (score) ^c
SLCO2B1	−2.22	8.80E−20	(0.917, 0.829)	(0.871, 0.842)	Atenolol pathway	Ileum cancer (1.31)
SLC9A1	−0.22	3.08E−03	(0.911, 0.822)	(0.847, 0.826)	Osteoclast signaling	Gastroesophageal reflux (1.00)
SLC7A10	2.68	1.14E−02	(0.911, 0.819)	(0.862, 0.835)	Differentiation of white and brown adipocyte	Follicular lymphoma (0.91)
SLC20A1	0.42	3.45E−04	(0.904, 0.822)	(0.870, 0.843)	Glucose/energy metabolism	Leukemia (0.86)
AQP8	−1.27	1.85E−05	(0.906, 0.817)	(0.878, 0.854)	Detoxification of reactive oxygen species	Colorectal adenoma (0.89)
AGL	0.42	1.49E−12	(0.908, 0.814)	NA	Glycogen metabolism	Bladder lateral wall cancer (1.26)
SLC12A4	−1.11	8.74E−40	(0.906, 0.815)	(0.871, 0.841)	Transport of glucose and other sugars	Fish‐eye disease (1.50)
KYAT1	0.74	2.25E−36	(0.909, 0.812)	(0.865, 0.838)	Selenocompound metabolism	Schizophrenia (0.74)
SLC6A2	4.35	1.46E−16	(0.897, 0.818)	NA	Methylphenidate pathway	Adrenal medulla cancer (1.21)
SLC5A12	4.93	1.70E−63	(0.898, 0.816)	(0.867, 0.631)	NRF2 pathway	Follicular lymphoma (0.91)
SLC4A2	−0.32	3.49E−07	(0.903, 0.810)	(0.867, 0.631)	Bile secretion	Hepatocellular carcinoma (0.68)
SLCO1C1	−0.91	1.11E−04	(0.898, 0.813)	(0.867, 0.631)	Transport of vitamins and nucleosides	Allan–Herndon–Dudley syndrome (1.20)
SLC23A2	−0.26	1.75E−04	(0.897, 0.814)	(0.866, 0.839)	Metabolism of water‐soluble vitamins and cofactors	Hepatitis C virus (0.87)
ACE2	0.77	2.46E−06	(0.896, 0.815)	(0.867, 0.631)	A‐beta plaque formation and APP metabolism	Renal oncocytoma (0.72)
SLC43A1	−0.97	9.28E−14	(0.897, 0.810)	(0.866, 0.840)	Amino acid transport across the plasma membrane	Seminoma (1.07)

Open in a new tab

^{^a}

DEG = log2(CA/HT) denotes a differential expression gene and is computed from cancer and healthy samples of TCGA datasets.

^{^b}

P value is computed from cancerous and healthy samples of TCGA datasets.

^{^c}

Diseases and scores are obtained from the GeneCards database.

Both PKM and PKLR isozymes are involved in the catalysis reaction (R_PYK) of phosphoenolpyruvate‐to‐pyruvate conversion. Furthermore, PKM not only catalyzes R_PYK but also the reaction of R_RE2954C, that is, the conversion of phosphoenolpyruvate and deoxythymidine‐5’‐diphosphate to pyruvate and deoxythymidine‐5'‐triphosphate. Moreover, PKLR catalyzes the R_r0280 reaction to convert phosphoenolpyruvate and deoxyadenosine diphosphate to pyruvate and deoxyadenosine triphosphate. The three aforementioned reactions used different metabolites to transform phosphoenolpyruvate to pyruvate. In addition, the products, namely deoxythymidine‐5'‐triphosphate, adenosine triphosphate, and deoxyadenosine triphosphate, of the three reactions acted as precursors of biomass reactions. We applied a reaction‐based approach discussed in the oncogene inference optimization problem [18] to determine which reaction was a dominant malfunction in LUAD and LUSC. The optimal similarity ratios (η _S ) and membership grades (η _E ) for these dysregulated reactions were similar (Table 3). PKM dysregulated R_PKY and R_RE2954C, which yielded a slightly higher η _S and η _E than did PKLR. Both PKM and PKLR dysregulated three reactions to obtain nearly the same characteristic. The flux fold change of each reaction was greater than twofold in pyruvate synthesis, which was consistent with the Warburg effect of enhanced pyruvate formation to increase lactate production (Table 3).

Table 3.

Comparison of carcinogenicity caused by reaction‐based and enzyme‐based dysregulations in LUAD and LUSC, respectively. Flux fold change (LFC) is denoted as log2 fold change between cancer and healthy states; LFC was computed as log2(r _cancer/r _normal). η _S and η _E indicate the average similarity ratio and fuzzy equal membership grade, respectively. H, proton; PEP, phosphoenolpyruvate; DTDP, deoxythymidine‐5'‐diphosphate; ADP, adenosine diphosphate; DADP, deoxyadenosine diphosphate; PYR, pyruvate; ATP, adenosine triphosphate; DTTP, deoxythymidine‐5'‐triphosphate; DATP, deoxyadenosine triphosphate.

Dysregulated reactions/genes	Reactions	LUAD		LUSC
Dysregulated reactions/genes	Reactions	LFC	(η _S _, η _E )	LFC	(η _S _, η _E )
R_PYK	H + ADP + PEP → ATP + PYR	3.753	(0.829, 0.723)	3.128	(0.876, 0.754)
R_RE2954C	H + PEP + DTDP ↔ PYR + DTTP	3.671	(0.843, 0.745)	3.687	(0.867, 0.738)
R_r0280	H + PEP + DADP → PYR + DATP	3.901	(0.830, 0.714)	3.361	(0.882, 0.760)
R_RE2954C + R_r0280	H + PEP + D PYR + DTTP H + PEP + DADP → PYR + DATP	2.620 2.324	(0.841, 0.743)	2.60 2.239	(0.868, 0.740)
PKM	H + ADP + PEP → ATP + PYR H + PEP + DTDP ↔ PYR + DTTP	2.260 2.868	(0.842, 0.761)	1.857 2.424	(0.860, 0.728)
PKLR	H + ADP + PEP → ATP + PYR H + PEP + DADP → PYR + DATP	2.563 2.878	(0.830, 0.715)	2.395 2.680	(0.869, 0.752)
PKM + PKLR	H + ADP + PEP → ATP + PYR H + PEP + DTDP ↔ PYR + DTTP H + PEP + DADP → PYR + DATP	1.749 2.300 2.019	(0.848, 0.754)	1.735 2.286 1.984	(0.863, 0.717)

Open in a new tab

Enolase 1 (ENO1), along with PKM and PKLR, is a glycolytic enzyme. Glycolysis is an ATP‐generating step that is pivotal in cancer cell proliferation and metastasis. ENO1 is overexpressed in several tumor types, including NSCLC [41, 42, 43]. ENO1 is an upstream enzyme of PKM and PKLR that catalyzes 2‐phosphoglycerate to form phosphoenolpyruvate. We observed that its DEG for LUAD obtained from RNA‐Seq datasets in TCGA was not statistically significant (DEG = 1.36). Furthermore, the computation results revealed that the flux fold change increased by 1.33 times from normal to cancer states. The average similarity ratio (η _S = 0.839) and membership grade (η _E = 0.745) of ENO1 were slightly smaller than those of PKM.

Phosphatidylserine synthase, encoded by PTDSS1, is involved in phosphatidylserine biosynthetic pathway, which is a part of phospholipid metabolism. Phosphatidylserine is a precursor in the biomass reaction of the reconstructed metabolic model. Its dysregulation causes Lenz–Majewski syndrome, which is a rare disease characterized by complex craniofacial, dental, cutaneous, and limb abnormalities combined with intellectual disability [44]. Phosphatidylserine signaling is highly dysregulated in the tumor microenvironment and autoimmune diseases [45]. According to the computation results, 1.1% upregulation of PTDSS1 in LUAD could increase the production of phosphatidylserine that exhibited carcinogenic effects that yielded the average similarity ratio (η _S = 0.835) and membership grade (η _E = 0.741). By contrast, PTDSS1 was one of the 34 top‐ranked oncogenes for LUSC that yielded η _S and η _E values of 0.899 and 0.789 (Doc. S5), respectively. A survival analysis for oncogenes can be applied to investigate the clinical significance of metabolic alterations. In this study, we surveyed a survival analysis from the HPA database to explain the survival significance of the inferred oncogenes. The high expression of PTDSS1 in LUAD could significantly reduce the survival probability compared with that of PKM, PKLR, and ENO1 (Doc. S6).

Moreover, we also reconstructed LUAD and LUSC GSMNs for comparison using the iMAT algorithm [46] based on data from different databases (TCGA and HPA). The similarity ratios and membership grades for each dysregulated gene are shown in Doc. S7. The results show that the similarity ratios of LUAD and LUSC could fulfill the prediction. Because the GSMNs reconstructed by the iMAT algorithm are more parsimonious than by the CORDA algorithm, some genes are not available in the GPR association of the GSMNs.

Dysregulation of membrane transporters

The computation results (Doc. S5) reveal that 12 out of 45 genes for LUAD and 45 out of 84 genes for LUSC served as solute carriers (SLCs). SLC genes could easily cause carcinogenesis, and we found that 11 SLC genes for LUAD (except SLC6A4) are common to those for LUSC. These SLC gene‐encoding proteins are categorized into SLC families and SLC anion transporter families. Such transmembrane transporters could mediate the influx and efflux of substances such as ions, nucleotides, and sugars across biological membranes. The dysregulation of these genes can drive metabolic diseases, such as type II diabetes. Reports have indicated that the mediation of SLC5A2 and SLC13A5 genes may be therapeutic targets for treating type II diabetes and nonalcoholic fatty liver diseases [47, 48]. In the computation, SLC13A5 was determined to be an oncogene in LUAD and LUSC, and it had high gene differential expression (DEG = 3.239 for LUAD and 5.126 for LUSC) between cancer tissues and healthy tissues. However, SLC5A2 was identified in LUSC only, but it possessed low gene differential expression (DEG = −1.1228). SLC5A2 encodes a member of the sodium glucose cotransporter family, which is a sodium‐dependent glucose transport protein. The dysregulation of SLC5A2 could lead to non‐small‐cell lung cancer and pancreatic [49] and prostate adenocarcinomas [50].

SLC13A5 is a sodium‐coupled citrate transporter that plays a key role in importing citrate from the bloodstream into human cells. Surveys conducted using PubMed and GeneCards indicated that SLC13A5 is related to nasal cavity neoplasm and hepatocellular carcinoma [51]. The computation revealed that the fold change of its mediated flux increased more than sixfold. Other common genes, namely SLC25A11, SLC20A1, and SLC22A7, achieved high fitness (Table 1). The oxoglutarate carrier SLC25A11 is important for ATP production in cancer during NADH transportation from the cytosol to mitochondria as a malate. The dysregulation of SLC25A11 could lead to non‐small‐cell lung cancer [52] and liver cancer [53]. Sodium‐dependent phosphate transporter 1 (SLC20A1) plays a fundamental housekeeping role in phosphate transport, such as absorbing phosphate from interstitial fluid for normal cellular functions such as cellular metabolism, signal transduction, and nucleic acid and lipid synthesis. Some review articles [54] have indicated that the phosphate transporter is overexpressed in tumor cells, which was consistent with the fold change of DEG (0.92 for LUAD and 0.42 for LUSC) evaluated from TCGA database; therefore, it has been considered a key promoter of tumorigenesis.

Angiotensin‐converting enzyme 2, encoded by ACE2, has recently garnered widespread interest as the SARS‐CoV‐2 receptor. It appears to be an infective agent responsible for coronavirus disease 2019 and associated cardiovascular diseases [33, 55]. We determined that ACE2 was an oncogene of LUSC and achieved an average similarity ratio of 0.896 and membership grade of 0.815, making it one of the 15 top‐ranked oncogenes. However, ACE2 was not implicated to cause tumorigenesis in LUAD because although its average similarity ratio could reach 0.82, its membership grade was 0.385, which was smaller than those of the 15 top‐ranked oncogenes. Furthermore, we inspected RNA‐Seq data from TCGA and found that fold changes in DEG (1.67 for LUAD and 0.77 for LUSC) for both cancers increased nonsignificantly. According to the HPA database, the high expression of ACE2 in LUSC indicated high survival probability during the initial stages, but the results after 11 years were identical (Doc. S6). By contrast, high or low ACE2 expression in LUAD was not differentiated.

Results of MFVA

In this study, we applied the TLOP to infer oncogenes in GSMNs of lung adenocarcinoma and lung squamous cell carcinoma. FBA was involved in the inner optimization problem of TLOP. FBA can calculate steady‐state metabolic fluxes for GSMNs in a reasonable computational time with modern personal computers, but it is a biased method in constraint‐based modeling approaches for yielding optimal flux distributions. Monte Carlo sampling methods for GSMNs can cope with such a biased prediction, but still spend a lot of computer time [56]. In this study, we introduced an interval arithmetic [35, 36] for MFVA, an extension of FBA, to determine the robustness of metabolic models in various simulation conditions. However, its use has been somewhat limited by the long computation time compared with FBA. MFVA is generally incapable of embedding in the oncogene inference problem [Eqn (1)] due to the computational burden, but it could be applied to investigate whether the optimal results were achieved. MFVA was applied to calculate the lower and upper flux‐sum bounds (i.e., 4366 observed components) for each dysregulated gene. The metabolic flux‐sum bound matrix was formed by representing each dysregulated gene as a column of lower and upper flux‐sum bounds (Fig. 6) as calculated through MFVA. Moreover, the flux‐sum bound column of the template was added to the matrix. Such metabolic flux‐sum bound matrices for all mutants of LUAD and LUSC were then used to perform a factor analysis to yield two factors (Fig. 7A and B).

Fig. 6 — Concept of MFVA. Concept for establishing flux‐sum bounds of each mutant through MFVA for factor analysis.

In total, 23 genes of LUAD in Factor 1 had factor loadings > 0.73 (Fig. 7A; numerical data are listed in Doc. S8). Acetyl‐CoA acyltransferase 2, encoded by ACAA2, and lactoylglutathione lyase, encoded by GLO1, had the two highest factor loadings (0.914) in Factor 1. ACAA2 catalyzes the final step of the mitochondrial fatty acid beta‐oxidation pathway. To date, clinical diseases caused by mutations or variants of ACAA2 have not been identified. However, the ACAA2 locus has been associated with abnormal blood lipid levels, particularly HDL and LDL cholesterol levels [55]. GLO1 participates in pyruvate metabolism to convert S‐lactoylglutathione into methylglyoxal and glutathione, and regulates TNF‐induced transcriptional activity of NF‐κB. For Factor 2, phosphatidylserine synthase 1, encoded by PTDSS1, had a factor loading of 0.84, which was close to the template (0.81). PTDSS1 mainly catalyzes the conversion of phosphatidylcholine in the phosphatidylserine biosynthesis pathway, which is part of phospholipid metabolism.

The factor analysis of 42 separate dysregulated genes for LUSC in two groups is shown in Fig 7B (Doc. S8). The 23 genes of Factor 1 in the first group had factor loadings > 0.64 (Fig. 7B). The template was still close to PTDSS1 (0.74) and yielded a factor loading of 0.51. The 18 genes of Factor 2 in the second group had factor loadings > 0.81 (Fig. 7B).

The lower and upper flux‐sum bounds obtained through MFVA were applied to compute the minimum and maximum membership grades (Fig. 8). In other words, the interval range of each mutant was compared with the lower and upper bounds of the template through interval computation (Doc. S9). A numerical example in Doc. S3 describes the computation of interval numbers to yield the minimum and maximum membership grades (Fig. 8). The optimal membership grade obtained through FBA in the oncogene inference problem was within the range of each mutant. PTDSS1 achieved the highest range among the mutants in LUAD, and the maximum membership grade of the mutants for LUSC was smoothly variant. According to the factor analysis, LUAD characteristics were similar to those of PTDSS1 but unobvious to those of LUSC.

Conclusion

This study integrated the RNA‐Seq data of healthy and cancerous lung tissues downloaded from TCGA database with the genome‐scale metabolite and protein structure data of Recon 3D to reconstruct tissue‐specific GSMNs. The models were first used to generate flux patterns as a template/control in the trilevel oncogene inference optimization framework for inferring tumorigenic genes. The similarity ratios and fuzzy equal membership grades are the objectives in the oncogene inference optimization platform. The similarity ratio was used as a quality criterion to evaluate the similarity between the dysregulated flux pattern and that of the template. The fuzzy equal membership grade was used as a quantity metric to measure how close to the template the fold change of a dysregulated flux pattern is. The platform involved with the template could detect 45 and 84 tumorigenic genes for LUAD and LUSC, respectively. We observed that a high level of DEGs was not an essential factor for determining tumorigenesis. Nine out of 45 and 21 out of 84 genes for LUAD and LUSC, respectively, with high levels of differential expression based on cancer and healthy samples in TCGA database; the other genes, such as PKM and PTDSS1, have low levels of differential expression. The MFVA used the interval arithmetic to yield the interval membership grade as a quantitative measure of biased predictions from FBA.

Conflict of interest

The authors declare no conflict of interest.

Author contributions

FSW was responsible for study conception and design and drafted the original manuscript. WHW wrote the program and revised the manuscript. YTW, MRL, and WCC performed the data analysis and database survey. All authors have read and approved the final manuscript.

Supporting information

Doc. S1. A numerical example to illustrate the computation of the template, similarity ratio, and LFC.

Click here for additional data file.^{(1MB, pdf)}

Doc. S2. Detail description of the NHDE algorithm.

Click here for additional data file.^{(896.8KB, pdf)}

Doc. S3. A numerical example to illustrate the interval computation of membership grade of fuzzy equal function.

Click here for additional data file.^{(1.2MB, pdf)}

Doc. S4. Differential expressions of enzyme‐encoding genes and volcano plots for LUAD and LUSC.

Click here for additional data file.^{(7.7MB, xlsx)}

Doc. S5. Inferred oncogenes for LUAD and LUSC.

Click here for additional data file.^{(42.8KB, xlsx)}

Doc. S6. Survival analysis obtained from the HPA database to explain survival significance of the inferred oncogenes.

Click here for additional data file.^{(906KB, pdf)}

Doc. S7. Similarity ratios and membership grades of the dysregulated genes from Tables 1 and 2 for the LUAD and LUSC GSMNs reconstructed by integrating the iMAT algorithm with data from different databases (TCGA or HPA).

Click here for additional data file.^{(407.1KB, pdf)}

Doc. S8. Results of factor analysis.

Click here for additional data file.^{(62.2KB, xlsx)}

Doc. S9. The lower and upper flux‐sum bounds obtained through MFVA.

Click here for additional data file.^{(44.5KB, xlsx)}

Acknowledgements

This study was supported by the Ministry of Science and Technology of Taiwan (Grant No. MOST106‐2221‐E‐194‐049‐MY3 and MOST109‐2320‐B‐194‐003).

Data accessibility

The data that support the findings of this study are available in the Supporting information of this article.

References

1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA and Jemal A (2018) Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68, 394–424. [DOI] [PubMed] [Google Scholar]
2. Zappa C and Mousa SA (2016) Non‐small cell lung cancer: current treatment and future advances. Trans Lung Cancer Res 5, 288–300. [DOI] [PMC free article] [PubMed] [Google Scholar]
3. Herbst RS, Morgensztern D and Boshoff C (2018) The biology and management of non‐small cell lung cancer. Nature 553, 446–454. [DOI] [PubMed] [Google Scholar]
4. Yeh S‐J, Chang C‐A, Li C‐W, Wang LH‐C and Chen B‐S (2019) Comparing progression molecular mechanisms between lung adenocarcinoma and lung squamous cell carcinoma based on genetic and epigenetic networks: big data mining and genome‐wide systems identification. Oncotarget 10, 3760–3806. [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Asgari Y, Zabihinpour Z, Salehzadeh‐Yazdi A, Schreiber F and Masoudi‐Nejad A (2015) Alterations in cancer cell metabolism: The Warburg effect and metabolic adaptation. Genomics 105, 275–281. [DOI] [PubMed] [Google Scholar]
6. Jalili M, Gebhardt T, Wolkenhauer O and Salehzadeh‐Yazdi A (2018) Unveiling network‐based functional features through integration of gene expression into protein networks. Biochim Biophys Acta Mol Basis Dis 1864, 2349–2359. [DOI] [PubMed] [Google Scholar]
7. Auslander N, Cunningham CE, Toosi BM, McEwen EJ, Yizhak K, Vizeacoumar FS, Parameswaran S, Gonen N, Freywald T, Bhanumathy KK et al. (2017) An integrated computational and experimental study uncovers FUT9 as a metabolic driver of colorectal cancer. Mol Syst Biol 13, 956. [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Gatto F, Ferreira R and Nielsen J (2020) Pan‐cancer analysis of the metabolic reaction network. Metab Eng 57, 51–62. [DOI] [PubMed] [Google Scholar]
9. Hu J, Locasale JW, Bielas JH, O'Sullivan J, Sheahan K, Cantley LC, Heiden MGV and Vitkup D (2013) Heterogeneity of tumor‐induced gene expression changes in the human metabolic network. Nat Biotechnol 31, 522–529. [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Yizhak K, Gabay O, Cohen H and Ruppin E (2013) Model‐based identification of drug targets that revert disrupted metabolism and its application to ageing. Nat Commun 4, 2632. [DOI] [PubMed] [Google Scholar]
11. Zielinski DC, Jamshidi N, Corbett AJ, Bordbar A, Thomas A and Palsson BO (2017) Systems biology analysis of drivers underlying hallmarks of cancer cell metabolism. Sci Rep 7, 41241. [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Pavlova NN and Thompson CB (2016) The emerging hallmarks of cancer metabolism. Cell Metab 23, 27–47. [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Eyassu F and Angione C (2017) Modelling pyruvate dehydrogenase under hypoxia and its role in cancer metabolism. R Soc Open Sci 4, 170360. [DOI] [PMC free article] [PubMed] [Google Scholar]
14. Folger O, Jerby L, Frezza C, Gottlieb E, Ruppin E and Shlomi T (2011) Predicting selective drug targets in cancer through metabolic networks. Mol Syst Biol 7, 501. [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Gottstein W, Olivier BG, Bruggeman FJ and Teusink B (2016) Constraint‐based stoichiometric modelling from single organisms to microbial communities. J R Soc Interface 13, 20160627. [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Mardinoglu A, Agren R, Kampf C, Asplund A, Uhlen M and Nielsen J (2014) Genome‐scale metabolic modelling of hepatocytes reveals serine deficiency in patients with non‐alcoholic fatty liver disease. Nat Commun 5, 3083. [DOI] [PubMed] [Google Scholar]
17. Wang Y, Eddy JA and Price ND (2012) Reconstruction of genome‐scale metabolic models for 126 human tissues using mCADRE. BMC Syst Biol 6, 153. [DOI] [PMC free article] [PubMed] [Google Scholar]
18. Wang F‐S, Wu W‐H, Hsiu W‐S, Liu Y‐J and Chuang K‐W (2019) Genome‐scale metabolic modeling with protein expressions of normal and cancerous colorectal tissues for oncogene inference. Metabolites 10, 16. [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Wu M and Chan C (2012) Human metabolic network: reconstruction, simulation, and applications in systems biology. Metabolites 2, 242–253. [DOI] [PMC free article] [PubMed] [Google Scholar]
20. Wu H‐Q, Cheng M‐L, Lai J‐M, Wu H‐H, Chen M‐C, Liu W‐H, Wu W‐H, Chang PM‐H, Huang C‐YF, Tsou A‐P et al. (2017) Flux balance analysis predicts Warburg‐like effects of mouse hepatocyte deficient in miR‐122a. PLoS Comput Biol 13, e1005618. [DOI] [PMC free article] [PubMed] [Google Scholar]
21. Wu W‐H, Li F‐Y, Shu Y‐C, Lai J‐M, Chang PM‐H, Huang C‐YF and Wang F‐S (2020) Oncogene inference optimization using constraint‐based modelling incorporated with protein expression in normal and tumour tissues. R Soc Open Sci 7, 191241. [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Yizhak K, Chaneton B, Gottlieb E and Ruppin E (2015) Modeling cancer metabolism on a genome scale. Mol Syst Biol 11, 817. [DOI] [PMC free article] [PubMed] [Google Scholar]
23. Yizhak K, Le Dévédec SE, Rogkoti VM, Baenke F, de Boer VC , Frezza C, Schulze A, van de Water B and Ruppin E (2014) A computational study of the Warburg effect identifies metabolic targets inhibiting cancer migration. Mol Syst Biol 10, 744. [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Aurich MK, Paglia G, Rolfsson Ó, Hrafnsdóttir S, Magnúsdóttir M, Stefaniak MM, Palsson BØ, Fleming RMT and Thiele I (2015) Prediction of intracellular metabolic states from extracellular metabolomic data. Metabolomics 11, 603–619. [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Brunk E, Sahoo S, Zielinski DC, Altunkaya A, Dräger A, Mih N, Gatto F, Nilsson A, Preciat Gonzalez GA, Aurich MK et al. (2018) Recon3D enables a three‐dimensional view of gene variation in human metabolism. Nat Biotechnol 36, 272–281. [DOI] [PMC free article] [PubMed] [Google Scholar]
26. Swainston N, Smallbone K, Hefzi H, Dobson P, Brewer J, Hanscho M, Zielinski D, Ang KS, Gardiner N, Gutierrez J et al. (2016) Recon 2.2: from reconstruction to model of human metabolism. Metabolomics 12, 109. [DOI] [PMC free article] [PubMed] [Google Scholar]
27. Uhlen M, Oksvold P, Fagerberg L, Lundberg E, Jonasson K, Forsberg M, Zwahlen M, Kampf C, Wester K, Hober S et al. (2010) Towards a knowledge‐based human protein atlas. Nat Biotechnol 28, 1248–1250. [DOI] [PubMed] [Google Scholar]
28. Blais EM, Rawls KD, Dougherty BV, Li ZI, Kolling GL, Ye P, Wallqvist A and Papin JA (2017) Reconciled rat and human metabolic networks for comparative toxicogenomics and biomarker predictions. Nat Commun 8, 14250. [DOI] [PMC free article] [PubMed] [Google Scholar]
29. Ryu JY, Kim HU and Lee SY (2017) Framework and resource for more than 11,000 gene‐transcript‐protein‐reaction associations in human metabolism. Proc Natl Acad Sci USA 114, E9740–E9749. [DOI] [PMC free article] [PubMed] [Google Scholar]
30. Schultz A and Qutub AA (2016) Reconstruction of tissue‐specific metabolic networks using CORDA. PLoS Comput Biol 12, e1004808. [DOI] [PMC free article] [PubMed] [Google Scholar]
31. National Cancer Institute of U.S. (2019) Department of Health and Human Services. The Cancer Genome Atlas Program, Washington, D.C.
32. Valcárcel LV, Torrano V, Tobalina L, Carracedo A and Planes FJ (2019) rMTA: robust metabolic transformation analysis. Bioinformatics 35, 4350–4355. [DOI] [PubMed] [Google Scholar]
33. Pozo C, Miró A, Guillén‐Gosálbez G, Sorribas A, Alves R and Jiménez L (2015) Gobal optimization of hybrid kinetic/FBA models via outer‐approximation. Comput Chem Eng 72, 325–333. [Google Scholar]
34. Heirendt L, Arreckx S, Pfau T, Mendoza SN, Richelle A, Heinken A, Haraldsdóttir HS, Wachowiak J, Keating SM, Vlasov V et al. (2019) Creation and analysis of biochemical constraint‐based models using the COBRA Toolbox vol 3.0. Nat Protoc 14, 639–702. [DOI] [PMC free article] [PubMed] [Google Scholar]
35. Kaufmann A and Gupta MM (1991) Introduction to Fuzzy Arithmetic: Theory and Applications. International Thomson Computer Press, MI. [Google Scholar]
36. Zettervall H (2014) Fuzzy Set Theory Applied to Make Medical Prognoses for Cancer Patients. Blekinge Institute of Technology, Sweden. [Google Scholar]
37. Crow M, Lim N, Ballouz S, Pavlidis P and Gillis J (2019) Predictability of human differential gene expression. Proc Natl Acad Sci 116, 6491–6500. [DOI] [PMC free article] [PubMed] [Google Scholar]
38. Luo W and Semenza GL (2012) Emerging roles of PKM2 in cell metabolism and cancer progression. Trends Endocrinol Metab 23, 560–566. [DOI] [PMC free article] [PubMed] [Google Scholar]
39. Morita M, Sato T, Nomura M, Sakamoto Y, Inoue Y, Tanaka R, Ito S, Kurosawa K, Yamaguchi K, Sugiura Y et al. (2018) PKM1 confers metabolic advantages and promotes cell‐autonomous tumor cell growth. Cancer Cell 33, 355–367. [DOI] [PubMed] [Google Scholar]
40. Zahra K, Dey T, Ashish M, Mishra SP and Pandey U (2020) Pyruvate kinase M2 and cancer: the role of PKM2 in promoting tumorigenesis. Front Oncol 10, 159. [DOI] [PMC free article] [PubMed] [Google Scholar]
41. Altenberg B and Greulich KO (2004) Genes of glycolysis are ubiquitously overexpressed in 24 cancer classes. Genomics 84, 1014–1020. [DOI] [PubMed] [Google Scholar]
42. Fu Q‐F, Liu Y, Fan Y, Hua S‐N, Qu H‐Y, Dong S‐W, Li R‐L, Zhao M‐Y, Zhen Y, Yu X‐L et al. (2015) Alpha‐enolase promotes cell glycolysis, growth, migration, and invasion in non‐small cell lung cancer through FAK‐mediated PI3K/AKT pathway. J Hematol Oncol 8, 22. [DOI] [PMC free article] [PubMed] [Google Scholar]
43. Ji M, Wang Z, Chen J, Gu L, Chen M, Ding Y and Liu T (2019) Up‐regulated ENO1 promotes the bladder cancer cell growth and proliferation via regulating β‐catenin. Biosci Rep 39. [DOI] [PMC free article] [PubMed] [Google Scholar]
44. Sohn M, Ivanova P, Brown HA, Toth DJ, Varnai P, Kim YJ and Balla T (2016) Lenz‐Majewski mutations in PTDSS1 affect phosphatidylinositol 4‐phosphate metabolism at ER‐PM and ER‐Golgi junctions. Proc Natl Acad Sci USA 113, 4314–4319. [DOI] [PMC free article] [PubMed] [Google Scholar]
45. Birge RB, Boeltz S, Kumar S, Carlson J, Wanderley J, Calianese D, Barcinski M, Brekken RA, Huang X, Hutchins JT et al. (2016) Phosphatidylserine is a global immunosuppressive signal in efferocytosis, infectious disease, and cancer. Cell Death Differ 23, 962–978. [DOI] [PMC free article] [PubMed] [Google Scholar]
46. Zur H, Ruppin E and Shlomi T (2010) iMAT: an integrative metabolic analysis tool. Bioinformatics 26, 3140–3142. [DOI] [PubMed] [Google Scholar]
47. Schumann T, König J, Henke C, Willmes DM, Bornstein SR, Jordan J, Fromm MF and Birkenfeld AL (2020) Solute carrier transporters as potential targets for the treatment of metabolic disease. Pharmacol Rev 72, 343–379. [DOI] [PubMed] [Google Scholar]
48. Superti‐Furga G, Lackner D, Wiedmer T, Ingles‐Prieto A, Barbosa B, Girardi E, Goldmann U, Gürtl B, Klavins K, Klimek C et al. (2020) The RESOLUTE consortium: unlocking SLC transporters for drug discovery. Nat Rev Drug Discov 19, 429–430. [DOI] [PubMed] [Google Scholar]
49. Taira N, Atsumi E, Nakachi S, Takamatsu R, Yohena T, Kawasaki H, Kawabata T and Yoshimi N (2018) Comparison of GLUT‐1, SGLT‐1, and SGLT‐2 expression in false‐negative and true‐positive lymph nodes during the ¹⁸F‐FDG PET/CT mediastinal nodal staging of non‐small cell lung cancer. Lung Cancer 123, 30–35. [DOI] [PubMed] [Google Scholar]
50. Scafoglio C, Hirayama BA, Kepe V, Liu J, Ghezzi C, Satyamurthy N, Moatamed NA, Huang J, Koepsell H, Barrio JR et al. (2015) Functional expression of sodium‐glucose transporters in cancer. Proc Natl Acad Sci USA 112, E4111–E4119. [DOI] [PMC free article] [PubMed] [Google Scholar]
51. Li Z, Li D, Choi EY, Lapidus R, Zhang L, Huang S‐M, Shapiro P and Wang H (2017) Silencing of solute carrier family 13 member 5 disrupts energy homeostasis and inhibits proliferation of human hepatocarcinoma cells. J Biol Chem 292, 13890–13901. [DOI] [PMC free article] [PubMed] [Google Scholar]
52. Lee J‐S, Lee H, Lee S, Kang JH, Lee S‐H, Kim S‐G, Cho ES, Kim NH, Yook JI and Kim S‐Y (2019) Loss of SLC25A11 causes suppression of NSCLC and melanoma tumor formation. EBioMedicine 40, 184–197. [DOI] [PMC free article] [PubMed] [Google Scholar]
53. Pan G, Wang R, Jia S, Li Y, Jiao Y and Liu N (2020) SLC25A11 serves as a novel prognostic biomarker in liver cancer. Sci Rep 10, 9871. [DOI] [PMC free article] [PubMed] [Google Scholar]
54. Lacerda‐Abreu MA, Russo‐Abrahão T, de Queiroz Monteiro R , Rumjanek FD and Meyer‐Fernandes JR (2018) Inorganic phosphate transporters in cancer: Functions, molecular mechanisms and possible clinical applications. Biochim Biophys Acta, Rev Cancer 1870, 291–298. [DOI] [PubMed] [Google Scholar]
55. Kathiresan S, Melander O, Guiducci C, Surti A, Burtt NP, Rieder MJ, Cooper GM, Roos C, Voight BF, Havulinna AS et al. (2008) Six new loci associated with blood low‐density lipoprotein cholesterol, high‐density lipoprotein cholesterol or triglycerides in humans. Nat Genet 40, 189–197. [DOI] [PMC free article] [PubMed] [Google Scholar]
56. Schellenberger J, Que R, Fleming RMT, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NE, Rahmanian S et al. (2011) Quantitative prediction of cellular metabolism with constraint‐based models: the COBRA Toolbox v2.0. Nat Protoc 6, 1290–1307. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Doc. S1. A numerical example to illustrate the computation of the template, similarity ratio, and LFC.

Click here for additional data file.^{(1MB, pdf)}

Doc. S2. Detail description of the NHDE algorithm.

Click here for additional data file.^{(896.8KB, pdf)}

Doc. S3. A numerical example to illustrate the interval computation of membership grade of fuzzy equal function.

Click here for additional data file.^{(1.2MB, pdf)}

Doc. S4. Differential expressions of enzyme‐encoding genes and volcano plots for LUAD and LUSC.

Click here for additional data file.^{(7.7MB, xlsx)}

Doc. S5. Inferred oncogenes for LUAD and LUSC.

Click here for additional data file.^{(42.8KB, xlsx)}

Doc. S6. Survival analysis obtained from the HPA database to explain survival significance of the inferred oncogenes.

Click here for additional data file.^{(906KB, pdf)}

Click here for additional data file.^{(407.1KB, pdf)}

Doc. S8. Results of factor analysis.

Click here for additional data file.^{(62.2KB, xlsx)}

Doc. S9. The lower and upper flux‐sum bounds obtained through MFVA.

Click here for additional data file.^{(44.5KB, xlsx)}

Data Availability Statement

The data that support the findings of this study are available in the Supporting information of this article.

[feb413231-bib-0001] 1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA and Jemal A (2018) Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68, 394–424. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0002] 2. Zappa C and Mousa SA (2016) Non‐small cell lung cancer: current treatment and future advances. Trans Lung Cancer Res 5, 288–300. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0003] 3. Herbst RS, Morgensztern D and Boshoff C (2018) The biology and management of non‐small cell lung cancer. Nature 553, 446–454. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0004] 4. Yeh S‐J, Chang C‐A, Li C‐W, Wang LH‐C and Chen B‐S (2019) Comparing progression molecular mechanisms between lung adenocarcinoma and lung squamous cell carcinoma based on genetic and epigenetic networks: big data mining and genome‐wide systems identification. Oncotarget 10, 3760–3806. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0005] 5. Asgari Y, Zabihinpour Z, Salehzadeh‐Yazdi A, Schreiber F and Masoudi‐Nejad A (2015) Alterations in cancer cell metabolism: The Warburg effect and metabolic adaptation. Genomics 105, 275–281. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0006] 6. Jalili M, Gebhardt T, Wolkenhauer O and Salehzadeh‐Yazdi A (2018) Unveiling network‐based functional features through integration of gene expression into protein networks. Biochim Biophys Acta Mol Basis Dis 1864, 2349–2359. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0007] 7. Auslander N, Cunningham CE, Toosi BM, McEwen EJ, Yizhak K, Vizeacoumar FS, Parameswaran S, Gonen N, Freywald T, Bhanumathy KK et al. (2017) An integrated computational and experimental study uncovers FUT9 as a metabolic driver of colorectal cancer. Mol Syst Biol 13, 956. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0008] 8. Gatto F, Ferreira R and Nielsen J (2020) Pan‐cancer analysis of the metabolic reaction network. Metab Eng 57, 51–62. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0009] 9. Hu J, Locasale JW, Bielas JH, O'Sullivan J, Sheahan K, Cantley LC, Heiden MGV and Vitkup D (2013) Heterogeneity of tumor‐induced gene expression changes in the human metabolic network. Nat Biotechnol 31, 522–529. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0010] 10. Yizhak K, Gabay O, Cohen H and Ruppin E (2013) Model‐based identification of drug targets that revert disrupted metabolism and its application to ageing. Nat Commun 4, 2632. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0011] 11. Zielinski DC, Jamshidi N, Corbett AJ, Bordbar A, Thomas A and Palsson BO (2017) Systems biology analysis of drivers underlying hallmarks of cancer cell metabolism. Sci Rep 7, 41241. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0012] 12. Pavlova NN and Thompson CB (2016) The emerging hallmarks of cancer metabolism. Cell Metab 23, 27–47. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0013] 13. Eyassu F and Angione C (2017) Modelling pyruvate dehydrogenase under hypoxia and its role in cancer metabolism. R Soc Open Sci 4, 170360. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0014] 14. Folger O, Jerby L, Frezza C, Gottlieb E, Ruppin E and Shlomi T (2011) Predicting selective drug targets in cancer through metabolic networks. Mol Syst Biol 7, 501. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0015] 15. Gottstein W, Olivier BG, Bruggeman FJ and Teusink B (2016) Constraint‐based stoichiometric modelling from single organisms to microbial communities. J R Soc Interface 13, 20160627. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0016] 16. Mardinoglu A, Agren R, Kampf C, Asplund A, Uhlen M and Nielsen J (2014) Genome‐scale metabolic modelling of hepatocytes reveals serine deficiency in patients with non‐alcoholic fatty liver disease. Nat Commun 5, 3083. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0017] 17. Wang Y, Eddy JA and Price ND (2012) Reconstruction of genome‐scale metabolic models for 126 human tissues using mCADRE. BMC Syst Biol 6, 153. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0018] 18. Wang F‐S, Wu W‐H, Hsiu W‐S, Liu Y‐J and Chuang K‐W (2019) Genome‐scale metabolic modeling with protein expressions of normal and cancerous colorectal tissues for oncogene inference. Metabolites 10, 16. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0019] 19. Wu M and Chan C (2012) Human metabolic network: reconstruction, simulation, and applications in systems biology. Metabolites 2, 242–253. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0020] 20. Wu H‐Q, Cheng M‐L, Lai J‐M, Wu H‐H, Chen M‐C, Liu W‐H, Wu W‐H, Chang PM‐H, Huang C‐YF, Tsou A‐P et al. (2017) Flux balance analysis predicts Warburg‐like effects of mouse hepatocyte deficient in miR‐122a. PLoS Comput Biol 13, e1005618. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0021] 21. Wu W‐H, Li F‐Y, Shu Y‐C, Lai J‐M, Chang PM‐H, Huang C‐YF and Wang F‐S (2020) Oncogene inference optimization using constraint‐based modelling incorporated with protein expression in normal and tumour tissues. R Soc Open Sci 7, 191241. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0022] 22. Yizhak K, Chaneton B, Gottlieb E and Ruppin E (2015) Modeling cancer metabolism on a genome scale. Mol Syst Biol 11, 817. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0023] 23. Yizhak K, Le Dévédec SE, Rogkoti VM, Baenke F, de Boer VC , Frezza C, Schulze A, van de Water B and Ruppin E (2014) A computational study of the Warburg effect identifies metabolic targets inhibiting cancer migration. Mol Syst Biol 10, 744. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0024] 24. Aurich MK, Paglia G, Rolfsson Ó, Hrafnsdóttir S, Magnúsdóttir M, Stefaniak MM, Palsson BØ, Fleming RMT and Thiele I (2015) Prediction of intracellular metabolic states from extracellular metabolomic data. Metabolomics 11, 603–619. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0025] 25. Brunk E, Sahoo S, Zielinski DC, Altunkaya A, Dräger A, Mih N, Gatto F, Nilsson A, Preciat Gonzalez GA, Aurich MK et al. (2018) Recon3D enables a three‐dimensional view of gene variation in human metabolism. Nat Biotechnol 36, 272–281. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0026] 26. Swainston N, Smallbone K, Hefzi H, Dobson P, Brewer J, Hanscho M, Zielinski D, Ang KS, Gardiner N, Gutierrez J et al. (2016) Recon 2.2: from reconstruction to model of human metabolism. Metabolomics 12, 109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0027] 27. Uhlen M, Oksvold P, Fagerberg L, Lundberg E, Jonasson K, Forsberg M, Zwahlen M, Kampf C, Wester K, Hober S et al. (2010) Towards a knowledge‐based human protein atlas. Nat Biotechnol 28, 1248–1250. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0028] 28. Blais EM, Rawls KD, Dougherty BV, Li ZI, Kolling GL, Ye P, Wallqvist A and Papin JA (2017) Reconciled rat and human metabolic networks for comparative toxicogenomics and biomarker predictions. Nat Commun 8, 14250. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0029] 29. Ryu JY, Kim HU and Lee SY (2017) Framework and resource for more than 11,000 gene‐transcript‐protein‐reaction associations in human metabolism. Proc Natl Acad Sci USA 114, E9740–E9749. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0030] 30. Schultz A and Qutub AA (2016) Reconstruction of tissue‐specific metabolic networks using CORDA. PLoS Comput Biol 12, e1004808. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0031] 31. National Cancer Institute of U.S. (2019) Department of Health and Human Services. The Cancer Genome Atlas Program, Washington, D.C.

[feb413231-bib-0032] 32. Valcárcel LV, Torrano V, Tobalina L, Carracedo A and Planes FJ (2019) rMTA: robust metabolic transformation analysis. Bioinformatics 35, 4350–4355. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0033] 33. Pozo C, Miró A, Guillén‐Gosálbez G, Sorribas A, Alves R and Jiménez L (2015) Gobal optimization of hybrid kinetic/FBA models via outer‐approximation. Comput Chem Eng 72, 325–333. [Google Scholar]

[feb413231-bib-0034] 34. Heirendt L, Arreckx S, Pfau T, Mendoza SN, Richelle A, Heinken A, Haraldsdóttir HS, Wachowiak J, Keating SM, Vlasov V et al. (2019) Creation and analysis of biochemical constraint‐based models using the COBRA Toolbox vol 3.0. Nat Protoc 14, 639–702. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0035] 35. Kaufmann A and Gupta MM (1991) Introduction to Fuzzy Arithmetic: Theory and Applications. International Thomson Computer Press, MI. [Google Scholar]

[feb413231-bib-0036] 36. Zettervall H (2014) Fuzzy Set Theory Applied to Make Medical Prognoses for Cancer Patients. Blekinge Institute of Technology, Sweden. [Google Scholar]

[feb413231-bib-0037] 37. Crow M, Lim N, Ballouz S, Pavlidis P and Gillis J (2019) Predictability of human differential gene expression. Proc Natl Acad Sci 116, 6491–6500. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0038] 38. Luo W and Semenza GL (2012) Emerging roles of PKM2 in cell metabolism and cancer progression. Trends Endocrinol Metab 23, 560–566. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0039] 39. Morita M, Sato T, Nomura M, Sakamoto Y, Inoue Y, Tanaka R, Ito S, Kurosawa K, Yamaguchi K, Sugiura Y et al. (2018) PKM1 confers metabolic advantages and promotes cell‐autonomous tumor cell growth. Cancer Cell 33, 355–367. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0040] 40. Zahra K, Dey T, Ashish M, Mishra SP and Pandey U (2020) Pyruvate kinase M2 and cancer: the role of PKM2 in promoting tumorigenesis. Front Oncol 10, 159. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0041] 41. Altenberg B and Greulich KO (2004) Genes of glycolysis are ubiquitously overexpressed in 24 cancer classes. Genomics 84, 1014–1020. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0042] 42. Fu Q‐F, Liu Y, Fan Y, Hua S‐N, Qu H‐Y, Dong S‐W, Li R‐L, Zhao M‐Y, Zhen Y, Yu X‐L et al. (2015) Alpha‐enolase promotes cell glycolysis, growth, migration, and invasion in non‐small cell lung cancer through FAK‐mediated PI3K/AKT pathway. J Hematol Oncol 8, 22. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0043] 43. Ji M, Wang Z, Chen J, Gu L, Chen M, Ding Y and Liu T (2019) Up‐regulated ENO1 promotes the bladder cancer cell growth and proliferation via regulating β‐catenin. Biosci Rep 39. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0044] 44. Sohn M, Ivanova P, Brown HA, Toth DJ, Varnai P, Kim YJ and Balla T (2016) Lenz‐Majewski mutations in PTDSS1 affect phosphatidylinositol 4‐phosphate metabolism at ER‐PM and ER‐Golgi junctions. Proc Natl Acad Sci USA 113, 4314–4319. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0045] 45. Birge RB, Boeltz S, Kumar S, Carlson J, Wanderley J, Calianese D, Barcinski M, Brekken RA, Huang X, Hutchins JT et al. (2016) Phosphatidylserine is a global immunosuppressive signal in efferocytosis, infectious disease, and cancer. Cell Death Differ 23, 962–978. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0046] 46. Zur H, Ruppin E and Shlomi T (2010) iMAT: an integrative metabolic analysis tool. Bioinformatics 26, 3140–3142. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0047] 47. Schumann T, König J, Henke C, Willmes DM, Bornstein SR, Jordan J, Fromm MF and Birkenfeld AL (2020) Solute carrier transporters as potential targets for the treatment of metabolic disease. Pharmacol Rev 72, 343–379. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0048] 48. Superti‐Furga G, Lackner D, Wiedmer T, Ingles‐Prieto A, Barbosa B, Girardi E, Goldmann U, Gürtl B, Klavins K, Klimek C et al. (2020) The RESOLUTE consortium: unlocking SLC transporters for drug discovery. Nat Rev Drug Discov 19, 429–430. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0049] 49. Taira N, Atsumi E, Nakachi S, Takamatsu R, Yohena T, Kawasaki H, Kawabata T and Yoshimi N (2018) Comparison of GLUT‐1, SGLT‐1, and SGLT‐2 expression in false‐negative and true‐positive lymph nodes during the ¹⁸F‐FDG PET/CT mediastinal nodal staging of non‐small cell lung cancer. Lung Cancer 123, 30–35. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0050] 50. Scafoglio C, Hirayama BA, Kepe V, Liu J, Ghezzi C, Satyamurthy N, Moatamed NA, Huang J, Koepsell H, Barrio JR et al. (2015) Functional expression of sodium‐glucose transporters in cancer. Proc Natl Acad Sci USA 112, E4111–E4119. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0051] 51. Li Z, Li D, Choi EY, Lapidus R, Zhang L, Huang S‐M, Shapiro P and Wang H (2017) Silencing of solute carrier family 13 member 5 disrupts energy homeostasis and inhibits proliferation of human hepatocarcinoma cells. J Biol Chem 292, 13890–13901. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0052] 52. Lee J‐S, Lee H, Lee S, Kang JH, Lee S‐H, Kim S‐G, Cho ES, Kim NH, Yook JI and Kim S‐Y (2019) Loss of SLC25A11 causes suppression of NSCLC and melanoma tumor formation. EBioMedicine 40, 184–197. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0053] 53. Pan G, Wang R, Jia S, Li Y, Jiao Y and Liu N (2020) SLC25A11 serves as a novel prognostic biomarker in liver cancer. Sci Rep 10, 9871. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0054] 54. Lacerda‐Abreu MA, Russo‐Abrahão T, de Queiroz Monteiro R , Rumjanek FD and Meyer‐Fernandes JR (2018) Inorganic phosphate transporters in cancer: Functions, molecular mechanisms and possible clinical applications. Biochim Biophys Acta, Rev Cancer 1870, 291–298. [DOI] [PubMed] [Google Scholar]

[feb413231-bib-0055] 55. Kathiresan S, Melander O, Guiducci C, Surti A, Burtt NP, Rieder MJ, Cooper GM, Roos C, Voight BF, Havulinna AS et al. (2008) Six new loci associated with blood low‐density lipoprotein cholesterol, high‐density lipoprotein cholesterol or triglycerides in humans. Nat Genet 40, 189–197. [DOI] [PMC free article] [PubMed] [Google Scholar]

[feb413231-bib-0056] 56. Schellenberger J, Que R, Fleming RMT, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NE, Rahmanian S et al. (2011) Quantitative prediction of cellular metabolism with constraint‐based models: the COBRA Toolbox v2.0. Nat Protoc 6, 1290–1307. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Optimization of a modeling platform to predict oncogenes from genome‐scale metabolic networks of non‐small‐cell lung cancers

You‐Tyun Wang

Min‐Ru Lin

Wei‐Chen Chen

Wu‐Hsiung Wu

Feng‐Sheng Wang

Abstract

Abbreviations

Materials and methods

Reconstruction of tissue‐specific metabolic models

Fig. 1.

Oncogene inference optimization

Fitness evaluation

Fig. 2.

Metabolite‐flow variability analysis

Results and discussions

Analysis of tissue‐specific metabolic models

Fig. 3.

Fig. 4.

Inferred oncogenes

Fig. 5.

Table 1.

Table 2.

Table 3.

Dysregulation of membrane transporters

Results of MFVA

Fig. 6.

Fig. 7.

Fig. 8.

Conclusion

Conflict of interest

Author contributions

Supporting information

Acknowledgements

Data accessibility

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases