Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2023 Feb 27;13:3309. doi: 10.1038/s41598-023-30517-y

Predicting anti-cancer activity in flavonoids: a graph theoretic approach

Simon Mukwembi 1, Farai Nyabadza 2,
PMCID: PMC9971244  PMID: 36849585

Abstract

In drug design, there are two major causes of drug failure in the clinic. First, the drug has to work, and second, the drug should be safe. Identifying compounds that work for certain ailments require enormous experimental time and, in general, is cost intensive. In this paper, we are concerned with melanoma, a special type of cancer that affects the skin. In particular, we seek to provide a mathematical model that can predict the ability of flavonoids, a vast and natural class of compounds that are found in plants, in reversing or alleviating melanoma. The basis for our model is the conception of a new graph parameter called, for lack of better terminology, graph activity, which captures melanoma cancer healing properties of the flavonoids. With a superior coefficient of determination, R2=1, the new model faithfully reproduces anti-cancer activities of some known data-sets. We demonstrate that the model can be used to rank the healing abilities of flavonoids which could be a powerful tool in the screening, and identification, of compounds for drug candidates.

Subject terms: Drug discovery, Mathematics and computing

Introduction

In humans, a natural pigment, melanin, does not only provide a major defense mechanism against the ultraviolet light of the sun1, but also determines the colour of the skin, hair and eyes2. The production of melanin is catalyzed by the enzyme, tyrosinase. The overproduction of melanin leads to undesirable skin conditions, such as hyperpigmentation and melanoma skin cancer, that have an effect on the quality of life. The cancer, melanoma, has increasingly become a common complaint among patients consulting with dermatologists3.

Several agents, such as hydroquinone, corticosteroids, kojic acid and arbutin3,4, that reduce the production of melanin by inhibiting the activity of the catalyst, tyrosinase, are well known. Unlike natural tyrosinase inhibitors, which are generally considered to be cheap and free of harmful side effects, the existing traditional anti-tyrosinase agents suffer a legion of limitations such as high levels of toxicity, low stability, poor skin-penetration, and insufficient activity (see, for instance3). The effectiveness of a therapeutic agent is generally measured by the inhibition concentration at 50%, IC50, i.e., the quantity of the inhibitory agent required to inhibit the biological process, such as tyrosinase activity, by 50%5. The quantity IC50, is synonymous to the half-saturation constant in ecology. The half-saturation constant is defined as the resource availability at which half of the maximum intake is reached and it determines the outcome of models and may contribute to explain behavioural traits, life-strategies and species occurrence6.

A well researched class of natural agents, flavonoids, commonly found in plants, have been reported to have anti-tyrosinase activity1,2,5,7. Experiments to determine anti-tyrosinase activity of possible candidates for drug agents come with their own burdens. Among the burdens are high costs, laboratory experiment time, human efforts and the problems associated with animal sacrifice. Of particular concern however, as Hughes et al.8 puts it, drugs fail in the clinic for two main reasons; the first is that they do not work and the second is that they are not safe. Consequently, the development of mathematical models that can predict the effectiveness of compounds in inhibiting certain biological processes become handy.

In this paper, we will exploit graph theory to develop a model that predicts the anti-tyrosinase activity, i.e., IC50 values, of flavonoids. Our model can be used, not only to rationalise existing data, but also to predict new or unknown anti-tyrosinase activity in flavonoids.

Graphs

A graph G=(V,E) is a mathematical object which consists of a finite set V of elements called vertices, together with a set E, of 2-element subsets of V, called the edges of G. As early as 1875, Cayley (see for instance9), in his quest to enumerate chemical molecules called alkanes, he made an observation that molecules can be modelled by graphs where atoms are represented by vertices and two vertices are joined by an edge if the corresponding atoms are linked by bonds. This graph model became widely known as the molecular graph. The degree, deg v, of a vertex v of G is the number of edges incident with it. We say that v is an end vertex if its degree is 1. We will sometimes refer to the set of all end vertices as external vertices and call the set of all vertices of degree greater than 1 internal vertices. The irregularity index, t(v), of a vertex (introduced in10 and applied to studies in chemistry in11) is defined as the number of neighbours of v with distinct degrees. The distance dG(u,v) between vertices u and v in G is defined as the length of a shortest path joining u and v in G. The eccentricity ec(v) of a vertex v of G is the distance between v and a vertex furthest away from v in G.

The graph parameters

Consider a connected graph G=(V,E) of order n, i.e., with n vertices. Over a hundred of graph parameters that are used to find relationships between the structure of a molecule and its physical properties in order to predict the physicochemical, biomedical, environmental and toxicological properties of a compound directly from its molecular structure are legion in the literature1217. It turns out that, to date, no relationships on the known parameters and anti-tyrosinase activity have been reported on. We create here a parameter that is broken down into two individual indices; one index specifically for external vertices, while the other one looks at internal vertices. We will then show that this new parameter can predict anti-tyrosinase activity.

Let S be the external vertices of G, i.e., the set of all end vertices in G. Let Q be the set of internal vertices of G. For a vertex v of G, the score, s(v), of v is the quantity

s(v):=uSd(u,v)ifvS,t(v)degv·ec(v)ifvQ.

We define two invariants, D(G) and ζ(G), of G as follows:

D(G)=vSs(v)n3,

and

ζ(G)=vQs(v)n2.

For lack of better terminology, we will call D(G) the external activity of G, and ζ(G), the internal activity of G, respectively. For instance, for the molecular graph T of the known standard tyrosinase inhibitor depicted in Fig. 1, Taxfolin, we have

D(T)=196223=0.0184072andζ(T)=85.8333222=0.1773415. 1

Figure 1.

Figure 1

Tyrosinase inhibitor, Taxifolin (5,7,3,4-flavan-on-ol), also known as dihydroquercetin.

The model

Consider a flavonoid F whose molecular graph is G. We combine the internal and external activities of G to predict the anti-tyrosinase activity, i.e., IC50 value, of F. Table 1 presents flavonoids which were isolated from a plant in1. We will capitalise on this data-set to determine the values of the constants for our model, (2). We compute D(G) and ζ(G) for each of the flavonoid in Table 1. The graphs are shown in Fig. 2.

Table 1.

Data for nine flavonoids.

Flavonoid D(G) ζ(G) IC50 (μM)
Dihydroquercetin-4-methylether 0.016848853 0.187145558 115
Dihydroquercetin-7,4-dimethylether 0.015552662 0.1953125 162
5,7,3,5-Tetrahydroxyflavanone 0.014685239 0.155706727 423
Blumeatin 0.013429752 0.164600551 624
Quercetin 0.018407213 0.177341598 96
Rhamnetin 0.016931043 0.187775677 107
Tamarixetin 0.016931043 0.193761815 144
Luteolin 0.015117158 0.197656841 258
Luteolin-7-methyl ether 0.013899324 0.202479339 350

Figure 2.

Figure 2

Molecular graphs of isolated flavonoids, Source1.

In Table 1, we present D(G) and ζ(G) of the flavonoids, together with their anti-tyrosine activities, which were determined in1.

Given that the activity is driven by two invariants, D(G) and ζ(G), we propose a multivariable function to model the effects of the two invariants on the IC50 values. We propose a model of the form

ic50(G)=f([D(G)],[ζ(G)])=α1+α2[D(G)]+α3[ζ(G)]+α4[D(G)]2+α5[D(G)][ζ(G)]+α6[ζ(G)]2+α7[D(G)]2[ζ(G)]+α8[D(G)][ζ(G)]2+α9[ζ(G)]3. 2

Fitting (2) to data given in Table 1, we obtain the values of the constants as

α1=4.571×105,α2=-9.612×107,α3=1.027×106,α4=1.197×109,α5=8.704×108,α6=-4.565×107,α7=-7.457×109,α8=-1.715×109,α9=1.363×108.

The given parameter values αi,i=1,,9, produce a perfect fit of the model to the data. It is important to note that we resorted to the multivariable polynomial since it gives the best goodness of fit value measured by R2 or R-square. The strength of the relationship between a model and the dependent variables is measured by R2[0,1] or on a scale of 0-100%. We also note that the Curve Fitting Toolbox that we used in fitting our data to the model, gives a number of goodness of fit statistics for parametric models and in particular, the sum of squares due to error (SSE) and R-square. The summed square of residuals (SSE) is a measure of the total deviation of the response values from the fit to the experimental data. Our fit gives: SSE =2.609×10-19 and R2=1.

A consideration of the contour map of Fig. 3 gives the results in Fig. 4. All the data points lie in the same contour colour indicating a perfect fit of the polynomial to the data.

Figure 3.

Figure 3

Show the multivariable polynomial fit to the data.

Figure 4.

Figure 4

Shows the contour plot of D(G) and ζ(G).

A plot of the residuals is shown in Fig. 5. The residuals are found by looking at the difference between the data and the fit. The smaller the difference, the better the fit. The graph shows that the data points lie (almost) on the D(G)ζ(G)-trace, indicating very low values of the residuals.

Figure 5.

Figure 5

Show the plot of the residuals, where residuals=data-fit.

Discussion and applications

Our model has been developed, not only to rationalise existing data, but also to predict new or unknown anti-tyrosinase activity in flavonoids. However, for the sake of brevity and with such a very high goodness of fit, we use the newly developed model to rank the order of inhibition for 26 flavonoids. Additional flavonoids can, nevertheless, be added to the ranking list by using the model. The results are presented in Table 2.

Table 2.

26 Flavonoids - rank order of inhibition.

NO COMPOUND D(G) ζ(G) PREDICTED IC50(μM)
1 Kojic acid 0.028 0.238333 - 112089.011
2 Chrysin 0.004082228 0.202216066 - 40655.55419
3 Shikonin 0.028074722 0.193499615 - 33061.25049
4 Baicalein 0.00625 0.19833325 - 26435.10902
5 Galangin 0.00725 0.18 - 14014.27835
6 Dihydromyricetin 0.022355552 0.149968431 - 9665.559405
7 Naphthazarin 0.020408163 0.193877551 - 3748.192859
8 Xanthoxylin 0.036443149 0.147959184 - 2455.072672
9 Tropolone 0.008230453 0.222222222 - 1193.329897
10 Morin 0.017843727 0.170110186 35.71991087
11 Quercetin 0.018407213 0.177341598 93.33756038
12 Quercetin-3-rutinoside 0.018407213 0.177341591 93.33764064
13 Taxfolin 0.018407213 0.177341529 93.33836295
14 Rhamnetin 0.016931043 0.187775677 105.5945071
15 Dihydroquercetin-4-methylether 0.016848853 0.187145558 113.0227843
16 Tamarixetin 0.016931043 0.193761815 147.3433049
17 Dihydroquercetin-7,4-dimethylether 0.015552662 0.1953125 162.703958
18 Luteolin 0.015117158 0.197656841 259.6199685
19 Luteolin-7-methyl ether 0.013899324 0.202479339 352.5847798
20 5,7,3,5-Tetrahydroxyflavanone 0.014685239 0.155706727 427.3532608
21 Blumeatin 0.013429752 0.164600551 617.4170072
22 Apigenin 0.0105 0.219166675 4333.366225
23 Fisetin 0.01425332 0.212018141 4757.002259
24 Rosmarinic acid 0.020197997 0.250493047 6636.14232
25 3,7,4-Trihydroxyflavone 0.01025 0.235 27875.46695
26 Isoeugenol 0.020833333 0.275462986 33581.06508

Quite exciting is the fact that whilst the second compound, among the ranked flavonoids, chrysin, is commonly being used for bodybuilding, treating anxiety, inflammation, gout, erectile dysfunction, to mention but a few, it is appearing on the list as an excellent anti-tyrosinase agent which is only 3-fold weaker, and 34-fold stronger than the standard tyrosinase inhibitors, kojic acid and tropolone4, respectively. Whilst these results on chrysin confirm Wu et al.18’s findings that chrysin has potential use in skin photoprotection, Xie et al.19 found, in experimental work, that ‘chrysin had no effects on the (tyrosinase) enzyme’.

It is interesting to note that, through expanding the ranking list, our model can be used as a tool for identifying flavonoids that can be candidates for anti skin-cancer drug agents. This is critical in drug development as new drug agents get identified. Any identified agents can be tested using the model to determine their likelihood of use in fighting skin-cancer in the absence of laboratory tests.

Finally, we note here that it will be a worthwhile exercise to investigate the mathematical properties of the external, and internal activities of a graph, especially lower and upper bounds, and characterising the extremal graphs. Whilst external vertices play a significant role in influencing the anti-tyrosinase activity, the most dominant factor is the contribution made by the internal vertices. Each internal vertex v’s contribution is based on the distance between v and a vertex furthest away from v. This distance is then amplified by the irregularity of the neighbours of v.

The work presented in this paper is limited by the data set that is small. Predictive models depend a lot on the size of the data set. This model only predicts for flavonoids yet there may be other compounds that could be used as anti-cancer drugs. In reality, however, the class of flavonoids is much bigger than presented here. Despite these short comings, the model presented here forms a very good basis of predictive modelling in drug design. The model managed to confirm the effectiveness of some of the most used anticancer drugs such as tropolone and kojic acid.

Acknowledgements

The authors are very grateful to the anonymous reviewers for their careful reading and constructive comments. The authors are also grateful for GES 4.0 funding from the University of Johannesburg towards the production of the manuscript.

Author contributions

S.M. and F.N. wrote the main manuscript. S.M. wrote the graph-theoretic section and F.N. wrote the modelling section. All authors reviewed the manuscript.

Data availability

All data generated or analysed during this study are included in this published article.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Saewan N, Koysomboon S, Chantrapromma K. Anti-tyrosinase and anti-cancer activities of flavonoids from Blumea balsamifera DC. J. Med. Plants Res. 2011;5(6):1018–1025. [Google Scholar]
  • 2.Thach BD, Dao VQ, Giang TTL, Cang DT, Linh LNT, Ben TT, Uyen NPA, Suong NK. Antioxidant and antityrosinase activities of flavonoid from Blumea balsamifera (L.) DC. leaves extract. Eur. J. Res. Med. Sci. 2017;5(1):1–6. [Google Scholar]
  • 3.de Freitas MM, Fontes PR, Souza PM, Fagg CW, Guerra ENS, de MedeirosNobrega YK, Silveira D, Fonseca-Bazzo Y, Simeoni LA, de Mello M. Homem, Magalhaes PO. Extracts of Morus nigra L. leaves standardized in chlorogenic acid, Rutin and Isoquercitrin: Tyrosinase inhibition and cytotoxicity. PLoS One. 2016;1:1–24. doi: 10.1371/journal.pone.0163130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Chang TS. An updated review of tyrosinase inhibitors. Int. J. Mol. Sci. 2009;10:2440–2475. doi: 10.3390/ijms10062440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ryu YB, Ha TJ, Curtis-Long MJ, Ryu HW, Gal SW, Park KH. Inhibitory effects on mushroom tyrosinase by flavones from the stem barks of Morus lhou (S.) Koidz. J. Enzyme Inhib. Med. Chem. 2008;23(6):922–930. doi: 10.1080/14756360701810207. [DOI] [PubMed] [Google Scholar]
  • 6.Mulder C, Jan endriks A. Half-saturation constants in functional responses. Glob. Ecol. Conserv. 2014;2:161–169. doi: 10.1016/j.gecco.2014.09.006. [DOI] [Google Scholar]
  • 7.Obaid RJ, Mughal EU, Naeem N, Sadiq A, Alsantali RI, Jassas RS, Moussa Z, Ahmed SA. Natural and synthetic flavonoid derivatives as new potential Tyrosinase inhibitors: A systematic review. RSC Adv. 2021;11:22159–22198. doi: 10.1039/D1RA03196A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Hughes JP, Rees S, Kalindjian SB, Philpott KL. Principles of early drug discovery. Br. J. Pharmacol. 2011;162:1239–1249. doi: 10.1111/j.1476-5381.2010.01127.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Burch KJ, Wakefield DK, Whitehead EG., Jr Boiling point models of alkanes. MATCDY. 2003;47:25–52. [Google Scholar]
  • 10.Mukwembi S. A note on diameter and the degree sequence of a graph. Appl. Math. Lett. 2012;25:175–178. doi: 10.1016/j.aml.2011.08.010. [DOI] [Google Scholar]
  • 11.Gutman I. Irregularity of molecular graphs. Kragujevac J. Sci. 2016;38:71–81. doi: 10.5937/KgJSci1638071G. [DOI] [Google Scholar]
  • 12.Dankelmann P, Mukwembi S. The distance concept and distance in graphs. In: Gutman I, Furtula B, editors. Distance in Molecular Graphs Theory. Univ. Kragujevac; 2012. pp. 3–48. [Google Scholar]
  • 13.Sharma V, Goswami R, Madan AK. Eccentric connectivity index: A novel highly discriminating topological descriptor for structure-property and structure-activity studies. J. Chem. Inf. Model. 1997;37:273–282. [Google Scholar]
  • 14.Randic M, Balaban AT, Basak SC. On structural interpretation of several distance related topological indices. J. Chem. Inf. Comput. Sci. 2001;41:593–601. doi: 10.1021/ci000105l. [DOI] [PubMed] [Google Scholar]
  • 15.Sardana S, Madan AK. Application of graph theory: Relationship of antimycobacterial activity of quinolone derivatives with eccentric connectivity index and Zagreb group parameters. Match Commun. Math. Comput. Chem. 2002;45:35–53. [Google Scholar]
  • 16.Schultz HP. Topological organic chemistry. 1. Graph theory and topological indices of alkanes. J. Chem. Inf. Comput. Sci. 1989;29:227–228. doi: 10.1021/ci00063a012. [DOI] [PubMed] [Google Scholar]
  • 17.Wiener H. Structural determination of paraffin boiling points. J. Am. Chem. Soc. 1947;69:17–20. doi: 10.1021/ja01193a005. [DOI] [PubMed] [Google Scholar]
  • 18.Wu NL, Fang JY, Chen M, Wu CJ, Huang CC, Hung CF. Chrysin protects epidermal keratinocytes from UVA- and UVB-induced damage. J. Agric. Food Chem. 2011;59(15):391–400. doi: 10.1021/jf200931t. [DOI] [PubMed] [Google Scholar]
  • 19.Xie LP, Chen QX, Huang H, Wang HZ, Zhang RQ. Inhibitory effects of some flavonoids on the activity of mushroom tyrosinase. Biochem. Mosc. 2003;68(4):487–491. doi: 10.1023/A:1023620501702. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All data generated or analysed during this study are included in this published article.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES