Skip to main content
International Journal of Genomics logoLink to International Journal of Genomics
. 2019 Oct 1;2019:1750291. doi: 10.1155/2019/1750291

Computational Identification of Metabolic Pathways of Plasmodium falciparum using the k-Shortest Path Algorithm

Jelili Oyelade 1,2,, Itunuoluwa Isewon 1,2, Olufemi Aromolaran 1,2, Efosa Uwoghiren 1,2, Titilope Dokunmu 2,3, Solomon Rotimi 2,3, Oluwadurotimi Aworunse 4, Olawole Obembe 4, Ezekiel Adebiyi 1,2
PMCID: PMC6791207  PMID: 31662957

Abstract

Plasmodium falciparum, a malaria pathogen, has shown substantial resistance to treatment coupled with poor response to some vaccines thereby requiring urgent, holistic, and broad approach to prevent this endemic disease. Understanding the biology of the malaria parasite has been identified as a vital approach to overcome the threat of malaria. This study is aimed at identifying essential proteins unique to malaria parasites using a reconstructed iPfa genome-scale metabolic model (GEM) of the 3D7 strain of Plasmodium falciparum by filling gaps in the model with nineteen (19) metabolites and twenty-three (23) reactions obtained from the MetaCyc database. Twenty (20) currency metabolites were removed from the network because they have been identified to produce shortcuts that are biologically infeasible. The resulting modified iPfa GEM was a model using the k-shortest path algorithm to identify possible alternative metabolic pathways in glycolysis and pentose phosphate pathways of Plasmodium falciparum. Heuristic function was introduced for the optimal performance of the algorithm. To validate the prediction, the essentiality of the reactions in the reconstructed network was evaluated using betweenness centrality measure, which was applied to every reaction within the pathways considered in this study. Thirty-two (32) essential reactions were predicted among which our method validated fourteen (14) enzymes already predicted in the literature. The enzymatic proteins that catalyze these essential reactions were checked for homology with the host genome, and two (2) showed insignificant similarity, making them possible drug targets. In conclusion, the application of the intelligent search technique to the metabolic network of P. falciparum predicts potential biologically relevant alternative pathways using graph theory-based approach.

1. Introduction

Malaria remains one of the leading global health challenges with about 216 million cases and more than 445,000 deaths recorded in 2016. According to the World Health Organization (WHO), 88% of these deaths occurred in Africa [1] and Plasmodium falciparum accounted for the majority of the cases. P. falciparum has developed resistance to all antimalarial medications including the most potent one—artemisinin [24]. Some genetic changes and metabolic alterations confer survival advantage that enables the parasite to evade drug effects thereby ensuring its survival. However, there is still a poor understanding of the processes utilized by P. falciparum to evade drug effects, which also hamper vaccine development [5]. The incomplete knowledge of the metabolic pathways of P. falciparum has also been identified as a major impediment towards the development of an effective treatment [6]; hence, the need for intensified research that is aimed at understanding better the parasite biology. Computational approach, which predicts metabolic networks and possible alternative pathways pertinent to the parasite's survival, can identify candidate drug targets to eliminate the disease globally. Some previous studies have been done in this regard. Figure 1 explains the various paths (a), pathways (b), and complete network constructions (c) in metabolic pathways.

Figure 1.

Figure 1

(a) describes the paths in a metabolic network construction; (b) describes the pathways; and (c) describes the entire networks of the metabolic construction.

The identification of these undiscovered pathways within the metabolism of the malaria parasite entails enumerating not only the shortest path within the metabolic pathway but also the list of other feasible paths within a source compound and a target compound. This could be regarded as an optimization problem. The metabolic system or metabolism of a specific cell or an organism is the entire system of metabolic reactions of the cell or organism. A metabolic pathway is an associated subsystem of the metabolic system either as a representation of particular processes or characterized by functional boundaries, e.g., the system between a glucose (initial substance) and a pyruvate (final substance). The k-shortest path technique enables the optimal and suboptimal metabolic pathways to be identified within the metabolic network [7].

Croes et al. [8] presented a reaction to the representation of the metabolic network in order to identify relevant pathways in a biological network. However, the path finding algorithm applied was unable to handle a large dataset. Faust et al. [9] combined a random walk-based reduction of the graph with the shortest path-based algorithm and applied on a yeast metabolic network. This approach is computationally intensive due to several runs by the shortest path algorithm used. Oyelade et al. [10] applied a colour coding algorithm to search for minimum pathways in the P. falciparum interaction network. They discovered “identified” and “unknown” genes and signal transduction pathway involved in metabolic activities of P. falciparum. However, this approach is only capable of obtaining the shortest signal pathway. Also, in another work of Oyelade et al. [11], the qualitative Petri net model for the glycolysis pathway in P. falciparum was built and analysed for its structural and quantitative properties using the Petri net theory which only give insights into the complex net behavior of the pathway. Essential reactions in the metabolic network of P. falciparum have been predicted using several computational techniques such as in silico knockout screening [12], load and choke point [13], centrality measures [14], flux balance analysis [15], and machine learning approach [16] etc.

The goal of this study is to computationally identify branching metabolic pathways of P. falciparum using the enhanced k-shortest path technique with improved prediction precision. This technique was used in our previous work [17] although limited to finding only alternative paths in selected metabolic pathways. In this work, we obtained the genome-scale metabolic model of the 3D7 strain of P. falciparum from a previous study [18]. The dataset contains 325 genes and 670 metabolic reactions. The underlying architecture of the metabolic network was modelled using a reaction graph where reactions are designated as the nodes and two reactions are neighbors, if the product of a reaction is the substrate of the other. Compound graph representation is not suitable for path finding algorithms because a reaction can have more than one compound as both input and output, e.g., A + B ≥ C + D, thereby leading to a bunch of redundant edges. The existence of two different types of nodes in a bipartite graph makes it complex to traverse by the path finding algorithm. The choice of a reaction graph was informed by the aim of this study, which is to identify more alternative metabolic pathways, and the pathways are known to be chains of reactions and also due to the applicability of path finding algorithms which requires a single type of node to compute the shortest paths within the network. The T∗ algorithm, the k-shortest path technique developed by Kadivar [19], was adapted to extract the shortest paths from a defined source node to a target node. Only two metabolic pathways were considered for the alternative path analysis in this work due to the overlapping nature of the pathways that made it difficult to identify source and target reactions relating to a specific pathway.

We set the value of k of the path finding algorithm to equal five (5) because as k increases, the biological relevance of the path reduces. The identification of essential enzymes in a particular network allows a possible drug target to be identified [16, 2022]. Therefore, in drug development, essential enzymes are generally recognized as perfect drug target candidates for potential new drugs and vaccines to treat and prevent diseases since their deletion from a network can compromise its integrity [21, 22]. Hence, the essentiality of reactions was carried out using the betweenness centrality measure to determine essential reactions within a reconstructed network and subsequently predicted 32 essential enzymes.

2. Materials and Methods

2.1. Reconstructing the Metabolic Network

Metabolic reaction data were obtained from Chiappino-Pepe et al.'s study [18]; also 19 metabolites and 23 reactions were obtained from the MetaCyc database [23] to fill gaps in the iPfa GEM (see appendix A for the list of reactions obtained from the MetaCyc database). In a metabolic network, there are several metabolites that are commonly involved in reactions that cause shortcuts without biological meaning when computing paths in a simple graph [24]. These metabolites are referred to as pool metabolites or currency metabolites such as proton and water. Most of them often produce biological misinterpretations due to the artifactual links between nodes.

Kim et al. [25] identified twenty-five (25) currency metabolites out of which we eliminated twenty (20) from the network before reconstructing the graph. The remaining five (5) currency metabolites identified by [16] were left in the network since their presence does not have a significant effect that could lead to shortcuts without biological meaning, and a uniform weight is assigned to all edges in the network.

2.2. The Algorithm

The T∗ algorithm was selected among the other k-shortest paths because of its superior computational performance (as shown in Table 1). This algorithm requires topological sorting of the graph nodes as its input, which is only possible for a directed acyclic graph. Figure 2 shows the algorithm flow chart of the metabolic network construction.

Table 1.

Time complexity of T∗, K∗, Yen, Feng, EA, and LVEA algorithms. As shown in the Table 1, the T∗ algorithm has better computational performance in terms of running time when compared to others [19].

Algorithm Time complexity
T O(m + nk log d)
K O(m + n log n + k)
Lazy variant of Eppstein's algorithm (LVEA) O(m + n log n + k log k)
Eppstein's algorithm (EA) O(m + n log n + k log k)
Feng O(kn(m + n log n))
Yen O(1/2(Kn3))

Figure 2.

Figure 2

T∗ algorithm flow chart for metabolic network reconstruction.

Given that P[i] = {P1i, P2i, ⋯, Pki}as the set of the ki-shortest paths from s to node i and L(P1i) ≤ L(P2i) ≤ L(Pki), where L(P) denotes the length of path P. Let (i, j) ∈ A, because of topological ordering and since order (j) is larger than the other nodes in all si paths; Pli ⋃ {(i, j)} is a loopless path for each l ∈ {1, 2, ⋯, ki}. Therefore if P[s] = ∅, then P[i] = ⋃jA−(i)PϵPjP ⋃ {(j, i)} (stated in Algorithm 1).

Algorithm 1.

Algorithm 1

The T∗ algorithm was modified in two significant ways:

Firstly, we eliminated the first stage (topological ordering) due to the nature of the dataset where several reactions within the dataset could be on the same level in the topological order thereby varying the results from each run of the algorithm. For instance, if reactions A, B, and C have the same topological order, then it implies that at different runs of the algorithm, reaction A could come first or reaction B or reaction C, which varies the result of the algorithm at different runs. The metabolic network of P. falciparum contains over six hundred compounds and over a thousand reactions. The inconsistency inherent in the ordering of the reactions will have severe biological implication for the eventual shortest paths generated by the algorithm. For instance, if the order of a vital reaction within the pathway is less than that of the source compound for a particular pathway, then the reaction will be omitted from the shortest paths because the algorithm starts traversing from the source node to the target node in the topological order. Secondly, the topological ordering is not applicable to networks with a cycle since metabolic networks are known to contain loops; hence, we introduced heuristic function to enhance the prediction precision of the alternative paths. Figure 3 shows the modified T algorithm for metabolic network construction. The reactions in the annotated pathway were retrieved and used to guide the search of the algorithm to list biological feasible paths.

Figure 3.

Figure 3

Modified T algorithm flow chart for metabolic network reconstruction.

Therefore, the modified Talgorithm 2 is given.

Algorithm 2.

Algorithm 2

2.2.1. Reward Shaping Formulation

The concept of reward shaping was also introduced to the k-shortest path algorithm to improve the quality of biologically relevant alternative path prediction. Reward shaping is the addition of an extra reward signal that encodes some heuristic knowledge of the system designer or domain expert, thus encouraging the learning agent to explore parts of the state space believed to contain good solutions [2628]. In practice, the choice of reward function is intuitively selected [29] and this has been successfully applied to speed up reinforcement learning techniques in complex domains [30, 31].

The reward Rk of each feasible path between a source reaction and target reaction is computed using the number of reactions within the path that can be found in the set of annotated reactions RA. An annotated pathway was obtained from the KEGG database. The justification for this model is based on the assumption that the annotated pathway for a metabolic process represents the shortest path; hence, the next shortest alternative path is most likely to have branched from the annotated path thereby containing most of the reactions in annotation. The reactions in the feasible path were compared to the annotated path to determine the number of intersecting reactions. However, the total cost of a path is a function of its length and the number of annotated reaction contained in the path.

Mk=i=1nxicri,x=1,if riϵRA,0,otherwise, (1)

where Mk represents the initial reward for path k, c(ri) represents the coefficient for reaction (ri) which has a constant value of 1. xi will assume the value of 1 if ri is present in the set of annotated reaction for that pathway and 0 if otherwise.

In addition to the objective, obtaining biologically relevant paths is also to obtain the shortest path; hence, a penalty function is introduced which is meant to reduce the reward of paths based on their length; this is represented in

Pk=nkt, (2)

where t is a “balancing factor” which is an arbitrary value that is suitable to provide a penalty value that balances the positiveness of annotated reactions present in the path and the disadvantages of the path length.

The reward for a path k is given as the difference between its initial reward and penalty score as shown in

Rk=MkPk. (3)

In this study, we chose t to be 0.5 for a fair penalty score and the adjusted cost is the difference between the number of reaction in the path and the reward. We then applied the modified T algorithm to the glycolysis and pentose phosphate pathway of P. falciparum to obtain the k-shortest paths for k = 5.

3. Results

We then applied the modified T algorithm to the glycolysis and pentose phosphate pathway of P. falciparum to obtain the k-shortest paths for k = 5. Table 2 shows the number of reactions present in a predicted path and its computed cost. The five alternative paths identified by our method for each of the target metabolic pathways were further verified manually to identify artifacts and biologically plausible paths. Two alternative paths were identified to be a feasible path for metabolic activities in the pentose phosphate pathway. One alternative path was identified in the glycolysis pathway, which could serve as an alternative path for metabolic activities in the pathway. The reactions represented in gold in Figures 4 and 5 indicate reactions identified by this study to have created alternative paths in the respective metabolic pathways. The reaction represented in blue in Figure 5 represents a nonannotated reaction identified by this study to create an alternative path glucose 6-P to glycolysis.

Table 2.

Computed cost of predicted paths.

K No. of reaction in the path No. of reaction in annotated pathway Reward (Rk) Adjusted cost
Glycolysis
1 9 8 3.5 5.5
2 10 9 4 6
3 11 9 3.5 7.5
4 11 9 3.5 7.5
5 12 10 4 8

Pentose phosphate
1 8 8 4 4
2 9 9 4.5 4.5
3 10 9 4 6
4 11 11 5.5 5.5
5 11 11 5.5 5.5

Figure 4.

Figure 4

Predicted alternative metabolic paths in the glycolysis pathway. The compound (D-glucose-1-P) highlighted in gold represents the predicted alternative path to the final product (pyruvate).

Figure 5.

Figure 5

Predicted alternative metabolic paths in the pentose phosphate pathway. The compounds (D-xylulose-5P and glycerone phosphate) highlighted in gold represent the predicted alternative paths to the final product. The compound (sedoheptulose 1,7 biphosphate) highlighted in blue is the predicted nonannotated compound in the pathway.

For the glycolysis pathway, we predicted a pathway, which could provide an alternative path for the generation of pyruvate. This metabolic reaction involves alpha-D-glucose-1-phosphate uridylyltransferase. The predicted alternative paths for the glycolysis pathway are presented in Figure 4.

The predicted alternative paths for the pentose phosphate pathway are presented in Figure 5. Two (2) additional metabolic reactions were predicted to have created an alternative path in the pathway; they are D-glyceraldehyde-3-phosphate glycolaldehyde transferase and sedoheptulose 7-phosphate 1-phosphotransferase reactions, respectively.

3.1. Essential Reaction Prediction

To verify the predicted pathways, a test of the essentiality of these genes was done on the predicted pathways. Betweenness centrality measure was applied to the reconstructed metabolic network to each reaction in the network to determine the essentiality of a particular metabolic pathway that is critical for the production of a target compound identified. The betweenness centrality method is stated as follows:

bikpj=gikpjgik, (4)

where bik(pj) is the proportion of the shortest path linking pi to pk that contain pj, gik(pj) is the number of the shortest path that contain point pj as an intermediary in the shortest path from pi to pk, and gik is the number of the shortest path from pi to pk.

dij=k=1nbikpjijk, (5)

where dij is the pair dependency which represents the degree to which a point, pi, must depend upon another, pj.

CBpj=i=1ndij, (6)

where CB(pj) is the betweenness point centrality.

The genes that coded the enzymes responsible for the essential reactions discovered were obtained and blasted against the human genome to evaluate the existence of homology in the host. We used a centrality score of 0.25 as the level of significance because the centrality score of some of our predicted reactions that validated the gold standard set of essential enzymes and other predicted ones in literature has a centrality score of approximately 0.3. Table 3 shows the reactions with a centrality score from 0.25 and above as the predicted essential reactions. A total of 32 reactions were predicted as essential, out of which 13 have been validated in the literature.

Table 3.

Reactions with the centrality score above the specified level of significance.

Reaction ID Reaction name EC number Biological process Centrality score Reference
HBDG_c Hemoglobin digestion Cysteine and methionine 0.521053
T_c_to_a_C00073 L-Methionine transport reaction Cysteine and methionine 0.442105
R00896_c L-Cysteine:2-oxoglutarate aminotransferase 2.6.1.1 Cysteine and methionine 0.426316
T_c_to_r_C00641 1,2-Diacyl-sn-glycerol transport reaction Glycerolipid 0.409357
R02251_r Acyl-CoA:1,2-diacyl-sn-glycerol O-acyltransferase 2.3.1.20 Glycerolipid pathway 0.324619
T_c_to_r_C00641 1,2-Diacyl-sn-glycerol transport reaction Glycerophospholipid pathway 0.42986
R00177_c ATP:L-methionine S-adenosyltransferase 2.5.1.6 Glycerophospholipid pathway 0.264179 Oyelade et al. [12]
R02037_R06868
_R06869_c
S-Adenosyl-L-methionine:(methyl)ethanolamine-phosphate N-methyltransferase 2.1.1.103 Glycerophospholipid pathway 0.255603 Choubey et al. [40], Pessi et al. [41], Witola et al. [42]
R00497_c Gamma-L-glutamyl-L-cysteine:glycine ligase (ADP-forming) 6.3.2.3 Glycine, serine and threonine 0.287619 Chiappino-Pepe et al. [18], Huthmacher et al. [39]
R01061_c D-Glyceraldehyde-3-phosphate:NAD+ oxidoreductase (phosphorylating) 1.2.1.12 1.2.1.59 Glycolysis pathway 0.552381
R01068_c D-Fructose-1,6-bisphosphate D-glyceraldehyde-3-phosphate-lyase (glycerone-phosphate-forming) 4.1.2.13 Glycolysis pathway 0.495238 Wanidworanun et al. [43]
R00769_c UTP:D-fructose-6-phosphate 1-phosphotransferase 2.7.1.11 Glycolysis pathway 0.490476
R03427_c Inositol-1,4-bisphosphate 1-phosphatase 3.1.3.57 Inositol phosphate pathway 0.533333
R04372_c Phosphatidylinositol-3,4-bisphosphate 4-phosphatase 3.1.3.66 Inositol phosphate pathway 0.533333
R01187_c 1D-Myo-inositol 3-phosphate phosphohydrolase 3.1.3.25 Inositol phosphate pathway 0.47619
R01802_c CDP-diacylglycerol:myo-inositol 3-phosphatidyltransferase 2.7.8.11 Inositol phosphate pathway 0.419048 Chiappino-Pepe et al. [18], Fatumo et al. [44]
R03429_c Inositol-1,3,4-trisphosphate 5/6-kinase 2.7.1.159 Inositol phosphate pathway 0.285714
R01641_c Sedoheptulose-7-phosphate:D-glyceraldehyde-3-phosphate glycolaldehyde transferase 2.2.1.1 Pentose phosphate pathway 0.328478
R00720_c Inosine 5′-triphosphate pyrophosphohydrolase 3.6.1.8 3.6.1.19 Purine metabolism 0.448718
R01231_c Xanthosine-5′-phosphate:L-glutamine amido-ligase (AMP-forming) 6.3.5.2 Purine metabolism 0.37833 McConkey [46]
R00576_c Glutamine-pyruvate transaminase 2.6.1.15 Purine metabolism 0.345098
R02024_c 2′-Deoxycytidine diphosphate:oxidized-thioredoxin 2′-oxidoreductase 1.17.4.1 Purine metabolism 0.340121 Barker et al. [47], Chakrabarti et al. [48], Lytton et al. [49]
R01135_c IMP:L-aspartate ligase (GDP-forming) 6.3.4.4 Purine metabolism 0.330982 Eaazhisai et al. [50]
R00328_c GDP phosphohydrolase 3.6.1.5; 3.6.1.6; 3.6.1.42 Purine metabolism 0.287731
R00570_c ATP:CDP phosphotransferase 2.7.4.6 Pyrimidine 0.45977 Oyelade et al. [12], Chiappino-Pepe et al. [18]
R02024_c 2′-Deoxycytidine diphosphate:oxidized-thioredoxin 2′-oxidoreductase 1.17.4.1 Pyrimidine 0.419157 Barker et al. [47], Chakrabarti et al. [48], Chiappino-Pepe et al. [18], Lytton et al. [49]
R00571_c UTP:ammonia ligase (ADP-forming) 6.3.4.2 Pyrimidine 0.255172 Chiappino-Pepe et al. [18]
R00573_c UTP:L-glutamine amido-ligase (ADP-forming) 6.3.4.2 Pyrimidine 0.255172 Chiappino-Pepe et al. [18]
R06517_c Acyl-CoA:sphingosine N-acyltransferase 2.3.1.24 Sphingolipid 0.431917 Gerold and Schwarz [32]
R02251_c Acyl-CoA:1,2-diacyl-sn-glycerol O-acyltransferase 2.3.1.20 Sphingolipid 0.325708
R00351_m Acetyl-CoA:oxaloacetate C-acetyltransferase (thioester-hydrolysis) 2.3.3.1 2.3.3.3 TCA 0.377739
R02569_m Acetyl-CoA:enzyme N6-(dihydrolipoyl)lysine S-acetyltransferase 2.3.1.12 TCA 0.281995 Chiappino-Pepe et al. [18]

3.2. Visualization of Predicted Pathway

Figures 6(a) and 6(b) are the visualization of the predicted glycolysis pathway where Figure 6(a) represents the unevaluated version of the particular pathway while Figure 6(b) represents the pathways of the betweenness centrality measure. The nodes with the biggest structure in Figure 6(b) represent the predicted essential reactions of the pathway. Similarly, Figures 7(a) and 7(b) are the visualization of the predicted glycolysis pathway where Figure 7(a) represents the unevaluated version of the particular pathway while Figure 7(b) represents the pathways that the betweenness centrality measure has been applied. The biggest nodes in Figure 7(b) denote the predicted essential reactions of the pathway.

Figure 6.

Figure 6

(a) Predicted glycolysis pathway. (b) Predicted essential glycolysis pathway.

Figure 7.

Figure 7

(a) Predicted pentose phosphate pathway. (b) Predicted essential pentose phosphate pathway.

4. Discussion

Several P. falciparum metabolic network reconstructions have been done previously, but this study is aimed at predicting alternative metabolic pathways for energy generation in the malaria parasite via the pentose phosphate and glycolytic reactions using the k-shortest path algorithm. The essentiality of these reactions to the survival of these parasites was also evaluated in the predicted models (Table 3) and genes with no human homologs identified (Table 4). Sequence alignment was performed for all the 32 enzymes to identify the existence of homology; two of the enzymes show insignificant similarity with the host. Energy production in apicomplexans involves both the aerobic and anaerobic pathways, and this varies from species to species [32]. P. falciparum is a fast growing organism in the human erythrocyte. Therefore, in order to meet its energy need for growth and cell division, it relies on anaerobic oxidation of glucose to generate ATP. This is achieved through a series of enzyme-catalyzed breakdown of glucose to pyruvate, a metabolic process otherwise called the glycolysis or Embden–Meyerhof–Parnas pathway [33]. This process does not require oxygen to generate ATP, but results in the accumulation of pyruvate. While the main purpose of glycolysis is to generate ATP, one of its intermediates, glucose-6-phosphate, is the precursor for the pentose phosphate pathway [34]. This pathway serves an important purpose of generating NADPH and pentose sugars, which are moieties needed for the synthesis of nucleic acids and membrane lipids that are molecular raw materials needed by the parasite for cell division.

Table 4.

Predicted essential genes with no homologs in the human genome.

Gene name Enzyme name E.C. number Biological process
PF3D7_1343000 Phosphoethanolamine N-methyltransferase 2.1.1.103 Glycerophospholipid
PF3D7_1437200 Ribonucleoside-diphosphate reductase 1.17.4.1 Purine/pyrimidine

Similarly, different stages of malaria parasites have been reported to have different energy requirements while P. falciparum-resistant strains have been shown to alter their genetic and metabolic pathways, in order to utilize nutrient requirements for fitness and survival in a drug environment. A recent study identified metabolic changes in mutant parasites, which shows differential transport and utilization of nutrients, as shown in computational prediction that incorporates metabolomics data from sensitive and resistant isolates [35]. Malaria parasite also lacks capability to store carbohydrates [36, 37]; it constantly requires the production of this through glucose and can utilize other carbohydrate precursors scavenged from a host such as UDP-glucose; hence, the identified alternative metabolic pathways that P. Falciparum utilizes for survival are crucial for curbing drug resistance and can be targeted as vaccine candidates to reduce malaria transmission. Variable regulation of host UTP-glucose-1-phosphate uridylyltransferase (UDP-glucose phosphorylase), which catalyzes energy-generating reactions such as galactose metabolism and glycogen synthesis, has been reported [34] thereby making it a good target. This was identified as an essential pathway in glycolysis in the present study. Other predictions from this study corroborate the findings from other studies; these are highlighted in Table 3. Amino acid deprivation can alter the growth of P. falciparum in vitro; however, it may not completely kill the parasite [38]. This regulation of nutrient requirement to cope with deficit is one of the many survival mechanisms already identified in the parasite. However, other essential amino acid pathways identified can be targeted together to achieve a detrimental effect on the parasite's survival. Our study identified gamma-L-glutamyl-L-cysteine:glycine ligase as an essential reaction for glycine, serine, and threonine metabolism. This finding has been previously reported in other studies [18, 39].

For the cysteine and methionine pathways, hemoglobin digestion is essential for the malaria parasite; L-methionine transport reaction and L-cysteine:2-oxoglutarate aminotransferase were also predicted as essential reactions. The altered transport of nutrients across the membrane may also vary in sensitive and resistant parasites [35]; hence, vaccines or therapies that target these pathways will provide potentially lethal effects on the parasites. Other predicted essential reactions within the glycerophospholipid pathway are 1,2-diacyl-sn-glycerol transport reaction, L-methionine s-adenosyltransferase, and s-adenosyl-L-methionine (only the latter has validated predictions in the literature [4042]). In the glycolytic pathway, three reactions: D-glyceraldehyde-3-phosphate; D-fructose-1,6-bisphosphate D-glyceraldehyde-3-phosphate-lyase; and UTP:D-fructose-6-phosphate 1-phosphotransferase, were predicted as essential reactions with D-glyceraldehyde-3-phosphate-lyase validated in the literature [43]. Five reactions, namely, inositol-1,4-bisphosphate 1-phosphatase; phosphatidylinositol-3,4-bisphosphate 4-phosphatase; 1-D-myoinositol-3-phosphate phosphohydrolase; CDP-diacylglycerol:myo-inositol 3-phosphatidyltransferase; and inositol-1,3,4-trisphosphate 5/6-kinase, were the predicted essential reactions within the inositol phosphate pathway out of which CDP-diacylglycerol:myo-inositol 3-phosphatidyltransferase is validated [18, 44]. It is noteworthy that D-glyceraldehyde-3-phosphate dehydrogenase has homolog in the human genome (Table 4).

Similarly, our study predicted a metabolite—sedoheptulose-1,7-diphosphate—from the breakdown of xylulose, in the pentose phosphate pathway, which may be utilized by the parasite to scavenge for energy. Only sedoheptulose-7-phosphate:D-glyceraldehyde-3-phosphate glycolaldehyde transferase was predicted as an essential reaction within the pentose phosphate pathway. P. falciparum essentially synthesize nucleotides de novo. Hence, it cannot salvage pyrimidines from an extracellular environment. This biosynthetic pathway has been identified as a good target for malaria control [45]. For the purine and pyrimidine pathways, five and four reactions were predicted as essential and have been reported in other studies (see details in Table 3). The implications of these hypothesis are that in addition to other predicted pathways, if biologically validated through disruption of genes, encoding the enzymes for these pathways in P. falciparum will further elucidate survival and alternate energy generation pathways in the parasite. The hypothesized potential pharmacological enzyme targets essential for the parasite can be targeted to control malaria infection globally.

5. Conclusions

In this work, we have been able to predict alternative metabolic paths in the glycolysis and pentose phosphate pathways of P. falciparum. We predicted two (2) essential proteins in the glycerophospholipid, purine, pyrimidine, and glycolysis metabolic pathways (Table 4) without homology with a host. With the use of heuristic function to enhance the k-shortest path algorithm, we have been able to identify potential biologically relevant paths using a computational graph-based technique which hitherto is less utilized due to its very high false positive result. Biologically targeting these candidate proteins in Plasmodium is recommended to improve understanding of the predicted alternative pathways and the precision of the graph theory-based method.

Acknowledgments

This work is fully supported by the Covenant University and Covenant University Center for Research, Innovation and Discovery with Grant no. CUCRID RG 034.04.15/FS. We also thank Dr. Jelili Oyelade for the contribution from his PhD thesis.

Data Availability

The datasets used in this study are available in http://lcsb-databases.epfl.ch/pathways/Gems.

Conflicts of Interest

The authors disclose no potential conflicts of interest.

Authors' Contributions

All authors contributed to this work. Jelili Oyelade contributed to the original idea and conception. The implementation of the concept was achieved by Olufemi Aromolaran. Titilope Dokumu and Solomon Rotimi did the biological interpretation of the results. Jelili Oyelade, Itunuoluwa Isewon, and Olufemi Aromolaran wrote the manuscript. Jelili Oyelade, Itunuoluwa Isewon, and Olufemi Aromolaran were responsible for the structure and argument for the paper. Jelili Oyelade, Itunuoluwa Isewon, Efosa Uwoghiren, Olufemi Aromolaran, Titilope Dokunmu, Solomon Rotimi, Oluwadurotimi Aworunse, Olawole Obembe, and Ezekiel Adebiyi were responsible for the critical revision and approval of the final version.

.

Supplementary Materials

Table S1: reactions obtained from the MetaCyc database to fill gaps in the iPfa GEM.

References

  • 1.WHO. World Malaria Report 2016. World Health Organization; 2016. [Google Scholar]
  • 2.Chakraborty A. Emerging drug resistance in Plasmodium falciparum: a review of well-characterized drug targets for novel antimalarial chemotherapy. Asian Pacific Journal of Tropical Disease. 2016;6(7):581–588. doi: 10.1016/S2222-1808(16)61090-3. [DOI] [Google Scholar]
  • 3.Cowman A. F., Healer J., Marapana D., Marsh K. Malaria: biology and disease. Cell. 2016;167(3):610–624. doi: 10.1016/j.cell.2016.07.055. [DOI] [PubMed] [Google Scholar]
  • 4.World Health Organization. Antimicrobial Resistance: 2014 Global Report on Surveillance. World Health Organization; 2014. [Google Scholar]
  • 5.Birkett A. J., Moorthy V. S., Loucq C., Chitnis C. E., Kaslow D. C. Malaria vaccine R&D in the decade of vaccines: breakthroughs, challenges and opportunities. Vaccine. 2013;31:B233–B243. doi: 10.1016/j.vaccine.2013.02.040. [DOI] [PubMed] [Google Scholar]
  • 6.Koff W. C., Burton D. R., Johnson P. R., et al. Accelerating next-generation vaccine development for global disease prevention. Science. 2013;340(6136):p. 1232910. doi: 10.1126/science.1232910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Eppstein D. Finding the k shortest paths. Proceedings 35th Annual Symposium on Foundations of Computer Science; 1994; Santa Fe, Mexico. pp. 154–165. [DOI] [Google Scholar]
  • 8.Croes D., Couche F., Wodak S. J., Van Helden J. Metabolic PathFinding: inferring relevant pathways in biochemical networks. Nucleic Acids Research. 2005;33(Web Server):W326–W330. doi: 10.1093/nar/gki437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Faust K., Dupont P., Callut J., Van Helden J. Pathway discovery in metabolic networks by subgraph extraction. Bioinformatics. 2010;26(9):1211–1218. doi: 10.1093/bioinformatics/btq105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Oyelade J., Ewejobi I., Brors B., Eils R., Adebiyi E. Computational identification of signalling pathways in Plasmodium falciparum. Infection, Genetics and Evolution. 2011;11(4):755–764. doi: 10.1016/j.meegid.2010.11.006. [DOI] [PubMed] [Google Scholar]
  • 11.Oyelade J., Isewon I., Rotimi S., Okunoren I. Modeling of the glycolysis pathway in Plasmodium falciparum using Petri nets. Bioinformatics and Biology Insights. 2016;10:p. BBI.S37296. doi: 10.4137/BBI.S37296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Oyelade J., Isewon I., Uwoghiren E., Aromolaran O., Oladipupo O. In silico knockout screening of Plasmodium falciparum reactions and prediction of novel essential reactions by analysing the metabolic network. BioMed Research International. 2018;2018:11. doi: 10.1155/2018/8985718.8985718 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Rahman S. A., Schomburg D. Observing local and global properties of metabolic pathways:“load points” and “choke points” in the metabolic networks. Bioinformatics. 2006;22(14):1767–1774. doi: 10.1093/bioinformatics/btl181. [DOI] [PubMed] [Google Scholar]
  • 14.Acencio M. L., Lemke N. Towards the prediction of essential genes by integration of network topology, cellular localization and biological process information. BMC Bioinformatics. 2009;10(1) doi: 10.1186/1471-2105-10-290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Becker S. A., Feist A. M., Mo M. L., Hannum G., Palsson B. Ø., Herrgard M. J. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox. Nature Protocols. 2007;2(3):727–738. doi: 10.1038/nprot.2007.99. [DOI] [PubMed] [Google Scholar]
  • 16.Plaimas K., Eils R., König R. Identifying essential genes in bacterial metabolic networks with machine learning methods. BMC Systems Biology. 2010;4(1):p. 56. doi: 10.1186/1752-0509-4-56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Oyelade J., Isewon I., Aromolaran O., Uwoghiren E. Computational prediction of alternative metabolic pathways of Plasmodium falciparum. Proceedings of the 10th International Conference on Bioinformatics and Computational Biology, BICOB 2018; 2018. [Google Scholar]
  • 18.Chiappino-Pepe A., Tymoshenko S., Ataman M., Soldati-Favre D., Hatzimanikatis V. Bioenergetics-based modeling of Plasmodium falciparum metabolism reveals its essential genes, nutritional requirements, and thermodynamic bottlenecks. PLoS Computational Biology. 2017;13(3, article e1005397) doi: 10.1371/journal.pcbi.1005397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kadivar M. A new $ O (m+ kn log overline {d}) $ algorithm to find the $ k $ shortest paths in acyclic digraphs. Transactions on Combinatorics. 2016;5(3):23–31. [Google Scholar]
  • 20.Yeh I., Hanekamp T., Tsoka S., Karp P. D., Altman R. B. Computational analysis of Plasmodium falciparum metabolism: organizing genomic information to facilitate drug discovery. Genome Research. 2004;14(5):917–924. doi: 10.1101/gr.2050304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Jing M. A. Metabolic Network Based Gene Essentiality Analysis. Doctoral dissertation; 2012. [Google Scholar]
  • 22.Qin C., Sun Y., Dong Y. A new method for identifying essential proteins based on network topology properties and protein complexes. PLoS One. 2016;11(8, article e0161042) doi: 10.1371/journal.pone.0161042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Caspi R., Altman T., Billington R., et al. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Research. 2014;42(D1):D459–D471. doi: 10.1093/nar/gkt1103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Arita M. The metabolic world of Escherichia coli is not small. Proceedings of the National Academy of Sciences. 2004;101(6):1543–1547. doi: 10.1073/pnas.0306458101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kim T., Dreher K., Nilo-Poyanco R., et al. Patterns of metabolite changes identified from large-scale gene perturbations in Arabidopsis using a genome-scale metabolic network. Plant Physiology. 2015;167(4):1685–1698. doi: 10.1104/pp.114.252361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Brys T., Harutyunyan A., Vrancx P., Taylor M. E., Kudenko D., Nowé A. Multi-objectivization of reinforcement learning problems by reward shaping. International Joint Conference on Neural Networks (IJCNN); 2014; pp. 2315–2322. IEEE. [DOI] [Google Scholar]
  • 27.Mataric M. J. Reward functions for accelerated learning. Machine Learning Proceedings 1994; 1994; pp. 181–189. Elsevier. [DOI] [Google Scholar]
  • 28.Randløv J., Alstrøm P. ICML. Vol. 98. Citeseer; 1998. Learning to drive a bicycle using reinforcement learning and shaping; pp. 463–471. [Google Scholar]
  • 29.Sutton R. S., Barto A. G. Reinforcement learning: An introduction. Vol. 1. Cambridge: MIT press; 1998. [Google Scholar]
  • 30.Devlin S., Kudenko D., Grześ M. An empirical study of potential-based reward shaping and advice in complex, multi-agent systems. Advances in Complex Systems. 2011;14(02):251–278. doi: 10.1142/S0219525911002998. [DOI] [Google Scholar]
  • 31.Efthymiadis K., Kudenko D. 2013 IEEE Conference on Computational Inteligence in Games (CIG) IEEE; 2013. Using plan-based reward shaping to learn strategies in StarCraft: Broodwar; pp. 1–8. [Google Scholar]
  • 32.Gerold P., Schwarz R. T. Biosynthesis of glycosphingolipids de-novo by the human malaria parasite Plasmodium falciparum. Molecular and Biochemical Parasitology. 2001;112(1):29–37. doi: 10.1016/S0166-6851(00)00336-4. [DOI] [PubMed] [Google Scholar]
  • 33.van Niekerk D. D., Penkler G. P., du Toit F., Snoep J. L. Targeting glycolysis in the malaria parasite Plasmodium falciparum. The FEBS Journal. 2016;283(4):634–646. doi: 10.1111/febs.13615. [DOI] [PubMed] [Google Scholar]
  • 34.Bozdech Z., Ginsburg H. Data mining of the transcriptome of Plasmodium falciparum: the pentose phosphate pathway and ancillary processes. Malaria journal. 2005;4(1):p. 17. doi: 10.1186/1475-2875-4-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Carey M. A., Papin J. A., Guler J. L. Novel Plasmodium falciparum metabolic network reconstruction identifies shifts associated with clinical antimalarial resistance. BMC Genomics. 2017;18(1):p. 543. doi: 10.1186/s12864-017-3905-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Crawfod M. J., Fraunholz M. J., Roos D. S. Molecular and Medical Parasitology. Academic Press; 2003. Part III-Energy metabolism in the apicomplexa; pp. 154–169. [DOI] [Google Scholar]
  • 37.Alonso M. D., Lomako J., Lomako W. M., Whelan W. J. A new look at the biogenesis of glycogen. FASEB Journal. 1995;9(12):1126–1137. doi: 10.1096/fasebj.9.12.7672505. [DOI] [PubMed] [Google Scholar]
  • 38.Babbitt S. E., Altenhofen L., Cobbold S. A., et al. Plasmodium falciparum responds to amino acid starvation by entering into a hibernatory state. Proceedings of the National Academy of Sciences. 2012;109(47):E3278–E3287. doi: 10.1073/pnas.1209823109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Huthmacher C., Hoppe A., Bulik S., Holzhütter H. G. Antimalarial drug targets in Plasmodium falciparum predicted by stage-specific metabolic network analysis. BMC Systems Biology. 2010;4(1) doi: 10.1186/1752-0509-4-120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Choubey V., Maity P., Guha M., et al. Inhibition of Plasmodium falciparum choline kinase by hexadecyltrimethylammonium bromide: a possible antimalarial mechanism. Antimicrobial Agents and Chemotherapy. 2007;51(2):696–706. doi: 10.1128/AAC.00919-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Pessi G., Kociubinski G., Mamoun C. B. A pathway for phosphatidylcholine biosynthesis in Plasmodium falciparum involving phosphoethanolamine methylation. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(16):6206–6211. doi: 10.1073/pnas.0307742101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Witola W. H., El Bissati K., Pessi G., Xie C., Roepe P. D., Mamoun C. B. Disruption of the Plasmodium falciparum PfPMT gene results in a complete loss of phosphatidylcholine biosynthesis via the serine-decarboxylase-phosphoethanolamine-methyltransferase pathway and severe growth and survival defects. Journal of Biological Chemistry. 2008;283(41):27636–27643. doi: 10.1074/jbc.M804360200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Wanidworanun C., Nagel R. L., Shear H. L. Antisense oligonucleotides targeting malarial aldolase inhibit the asexual erythrocytic stages of Plasmodium falciparum. Molecular and Biochemical Parasitology. 1999;102(1):91–101. doi: 10.1016/S0166-6851(99)00087-0. [DOI] [PubMed] [Google Scholar]
  • 44.Fatumo S., Plaimas K., Mallm J.-P., et al. Estimating novel potential drug targets of Plasmodium falciparum by analysing the metabolic network of knock-out strains in silico. Infection, Genetics and Evolution. 2009;9(3):351–358. doi: 10.1016/j.meegid.2008.01.007. [DOI] [PubMed] [Google Scholar]
  • 45.Krungkrai S. R., Krungkrai J. Insights into the pyrimidine biosynthetic pathway of human malaria parasite Plasmodium falciparum as chemotherapeutic target. Asian Pacific Journal of Tropical Medicine. 2016;9(6):525–534. doi: 10.1016/j.apjtm.2016.04.012. [DOI] [PubMed] [Google Scholar]
  • 46.McConkey G. A. Plasmodium falciparum: isolation and characterisation of a gene encoding protozoan GMP synthase. Experimental Parasitology. 2000;94(1):23–32. doi: 10.1006/expr.1999.4467. [DOI] [PubMed] [Google Scholar]
  • 47.Barker R. H., Metelev V., Rapaport E., Zamecnik P. Inhibition of Plasmodium falciparum malaria using antisense oligodeoxynucleotides. Proceedings of the National Academy of Sciences. 1996;93(1):514–518. doi: 10.1073/pnas.93.1.514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Chakrabarti D., Schuster S. M., Chakrabarti R. Cloning and characterization of subunit genes of ribonucleotide reductase, a cell-cycle-regulated enzyme, from Plasmodium falciparum. Proceedings of the National Academy of Sciences. 1993;90(24):12020–12024. doi: 10.1073/pnas.90.24.12020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Lytton S. D., Mester B., Libman J., Shanzer A., Cabantchik Z. I. Mode of action of iron (III) chelators as antimalarials: II. Evidence for differential effects on parasite iron-dependent nucleic acid synthesis. Blood. 1994;84(3):910–915. [PubMed] [Google Scholar]
  • 50.Eaazhisai K., Jayalakshmi R., Gayathri P., et al. Crystal structure of fully ligated adenylosuccinate synthetase from Plasmodium falciparum. Journal of Molecular Biology. 2004;335(5):1251–1264. doi: 10.1016/j.jmb.2003.11.036. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Materials

Table S1: reactions obtained from the MetaCyc database to fill gaps in the iPfa GEM.

Data Availability Statement

The datasets used in this study are available in http://lcsb-databases.epfl.ch/pathways/Gems.


Articles from International Journal of Genomics are provided here courtesy of Wiley

RESOURCES