Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2023 Feb 24;13:3239. doi: 10.1038/s41598-023-28416-3

Curvilinear regression analysis of benzenoid hydrocarbons and computation of some reduced reverse degree based topological indices for hyaluronic acid-paclitaxel conjugates

Vignesh Ravi 1, Kalyani Desikan 1,
PMCID: PMC9958057  PMID: 36828838

Abstract

Graph theoretical molecular descriptors alias topological indices are a convenient means for expressing in numerical form the chemical structure encoded in a molecular graph. The structure descriptors derived from molecular graphs are widely used in quantitative structure-property relationship (QSPR) and quantitative structure-activity relationship (QSAR) studies. The reason for introducing new indices is to obtain predictions of target properties of considered molecules that are better than the predictions obtained using already known indices. In this paper, we apply the reduced reverse degree based indices introduced in 2021 by Vignesh et al. In the QSPR analysis, we first compute the reduced reverse degree based indices for a family of benzenoid hydrocarbon molecules and then we obtain the correlation with the Physico-chemical properties of the considered molecules. We show that all the properties taken into consideration for the benzenoid hydrocarbons can be very effectively predicted by the reduced reverse degree based indices. Also, we have compared the predictive capability of reduced reverse degree based topological descriptors against 16 existing degree based indices. Further, we compute the defined reduced reverse degree based topological indices for Hyaluronic Acid-Paclitaxel Conjugates (HAP)n, n1.

Subject terms: Applied mathematics, Cheminformatics

Introduction

The most important part of the drug discovery process is finding and improving promising lead compounds as quickly and cheaply as possible. Drug design methodologies can be classified into two primary groups: ligand based and ligand-receptor interaction based approaches. Ligand based apporaches depend only on the structure of ligands. Ligand-receptor interaction based approaches are often referred to as structure-based methods (docking, molecular dynamics, etc.).

The past decade has seen tremendous progress in computational approaches to drug design, and this has played a crucial part in the creation of several currently available medications. Molecular topology1 is a new approach used to design and select new molecules, particularly new pharmaceuticals. It uses topological factors rather than physical or geometric dimensions. Molecular topology not only provides an alternate method but also a different paradigm compared to conventional drug design methodologies. Molecular topology has proven to be useful in molecular design despite its inability to account for the physical or chemical properties of ligands and receptors or the ligand-receptor interaction. In drug development and design, molecular topology has been used to find new hits and leads. The discovery of novel lead compounds using molecular topology has enabled the interpretation of results in terms of their structural and physicochemical properties. The toxicity profile of a drug can be affected by its physicochemical properties, which are thought to play a role in its absorption, distribution, metabolism, and elimination. Physicochemical descriptors form the basis of most current approaches to molecular and drug design2,3. In this manner, a direct connection can be established between the experimental properties and chemical structures.

A common critique of molecular topology, however, is that it functions like a “black box,” bypassing the mechanism of action that offers neither a physical explanation nor a chemical explanation of the processes4. However, this critique may be answered by the fact that topology determines physical and geometrical magnitudes, a claim backed by both experimental and theoretical physicists. This results in molecular topology becoming a self-consistent and independent paradigm describing the molecular behavior. The results of molecular topology can be understood both in terms of structure and in terms of physicochemical characteristics This leads to both the mechanisms of action and the underlying physical chemistry to be deduced from the mathematical topological pattern. This is the most rational explanation, as it is the physical variable that is dependent on mathematics, not the other way around.

Molecular topology is independent of physicochemical molecular descriptors. This fact explains why it is feasible to find new lead compounds using only the knowledge gleaned from mathematical-topological patterns and to interpret the findings in terms of their structural and physicochemical properties. Molecular Topology has transformed from a potential intriguing option to a cornerstone of the drug discovery process. One of them is Quantitative Structure Analysis Relationship (QSAR), which is still effective and common and is also coupled with machine learning58. Additionally, molecular topology has become more relevant recently in terms of novel or unexpected uses. For instance, Speck-Planche et al. developed the multi-scale de novo drug design paradigm, which provides instructions to create novel compounds with desirable drug-like properties9.

In general, descriptors are classified into two types, namely, Experimental (logP, aqueous solubility etc.,) and Theoretical (assessed in Silico from 1D, 2D or 3D molecular structures) descriptors1. Based on the parameterization type, the descriptors are sorted into distinct categories: geometric, quantum-chemical, electrostatic, constitutional, and topological10. Descriptors based on the molecular graph representation are widely used because they incorporate precious chemical information like size, degree of branching, neighbourhood of atoms, flexibility and overall shape. Molecular topological indices can be broken down into structure-explicit descriptors like quantum chemical ones and structure-implicit ones like hydrophobicity or electronic constants ones11. Incorporating ideas from molecular topology makes computing these indices straightforward. In fact, many physicochemical and biological properties can be quickly and accurately estimated using molecular topology12.

Based on the known features of existing pharmaceuticals, topological indices can be used to design new lead compounds. These medications have superior pharmacokinetic, pharmacodynamic and toxicological qualities compared to those already available. This can be accomplished at a very low computational cost and economic cost within a short period of time. Due to the need to reduce the costs associated with both the synthesis and clinical testing of pharmaceuticals, the pharmaceutical industry has contributed to the increased interest in molecular descriptors. The development of the predictive quantitative structure-property relationship (QSPR) and quantitative structure-activity relationship13 (QSAR) models is particularly important for the design of fine chemicals and pharmaceuticals that are made for a specific purpose. For Structure Activity Relationship (SAR) connections to be developed for medicinal compounds using computational or theoretical methods, accurate representations of molecular structure are essential. The molecular descriptor is the final outcome of applying a logical and mathematical procedure that enables the transformation of chemical information encoded in the symbolic representation of a molecule into a meaningful number.

Lead discovery and lead optimization are only two examples of how topological indices are put to use in the drug design and discovery process. Topological indices play an important part in the process of developing any QSPR or QSAR14 model because they quantitatively represent the chemical information that is encoded. They allow researchers to delve into the mechanical aspects of a biochemical process and aid in uncovering the mathematical correlation that exists between chemical structure information and the response of interest. For instance, natural bio-polymer monomers (nucleotides and amino acids) can act as nodes in a graph, with the edges representing covalent bonds, hydrogen bridges, electrostatic interactions, and van der Waals bonds. As a result, the structure of complex biopolymers can be reduced to the topology of a graph, revealing useful information about these molecular systems. Topological indices can be used to characterize graphs that represent molecular systems. Topological indices store information about the connections between molecular atoms and the properties of those atoms. Natural biopolymers such as DNA, RNA, and protein sequences can also be characterized in this manner.

Considering that a theoretical physics starting hypothesis was proved for the first time in experimental physics, the following result is pertinent4. Traditionally it has been believed that topology is dependent on energy. This false assumption gave rise to the criticism that topological descriptors serve as a “black box” since they lack a physical interpretation. Now, if topology (a mathematical magnitude) is what determines energy (a physical magnitude) rather than the other way around, as advanced physicists and chemists claim, then mathematical descriptors can be used to predict the energy of any system along with other physical or experimental properties dependent on it. This is precisely what happens when novel active molecules are created or found utilising topological indices.

Quantitative structure-property relationship (QSPR) and Quantitative structure-activity relationship (QSAR)15 studies rely heavily on molecular descriptors, many of which are based on topological indices.

In a chemical graph, nodes represent atoms or molecules and the links denote the chemical bonding between the atoms or molecules. Graph theoretical molecular descriptors are topological indices. These are graph invariants that play an important role in pharmaceutical science, chemistry, materials science and engineering, etc. The value of a molecular descriptor is not dependent on the particular molecular representation, such as atom numbering or labeling. Molecules of hydrocarbons are modeled using the corresponding molecular graph. Here the vertices represent the carbon atoms and the edges represent the bonds between them. The degree of a vertex represents the number of edges that are incident on that vertex and it is denoted by du or d(u)16.

Benzenoid hydrocarbons are represented through benzenoid graphs comprising of hexagons. Here there are only two types of vertices, (i.e.), vertices of degree 2 or 317,18. In benzenoids, there are only 3 types of edges, (i.e.), (2, 2), (2, 3) and (3, 3) where the numbers in each tuple denote the degree of the end vertices of the corresponding edge. Twelve physico-chemical properties of benzenoid hydrocarbons have been selected based on the availability of data: Boiling Point (BP), Critical Temperature (CT), Critical Volume (CV), Critical Pressure (CP), Exact Mass, Heavy Atom Count, HL, GE, Log P, MR, PI and Molecular Weight (MW). The data for the benzenoid hydrocarbon molecules is presented in Table 1. Energy of benzenoid hydrocarbons are taken from19. The experimental values of boiling points were taken from Basak et al.20.

Table 1.

Data for benzenoid hydrocarbons.

S. No. BP CT CP CV GE LOG P MR HL PI MW EM HAC
1 80.1 323.79 47.69 263.5 121.68 2.03 25.28 0.66 8 78.11 78.04695 6
2 218 484.95 38.97 409.5 252.38 3.03 42.45 1.67 13.6832 128.17 128.0626 10
3 340 586.11 32.43 555.5 383.08 4.03 59.62 2.68 19.3137 178.23 178.0783 14
4 340 586.11 32.43 555.5 383.08 4.03 59.62 2.68 19.4483 178.23 178.0783 14
5 432.02 650.8 27.41 701.5 513.78 5.9 76.79 3.69 24.9308 228.3 228.0909 18
6 436.7 650.8 27.41 701.5 513.78 5.03 76.79 3.69 25.1875 228.3 228.0909 18
7 438 650.8 27.41 701.5 513.78 5.03 76.79 3.69 25.1012 228.3 228.0909 18
8 448 650.8 27.41 701.5 513.78 5.03 76.79 3.69 25.1922 228.3 228.0909 18
9 438 650.8 27.41 701.5 513.78 5.03 76.79 3.69 25.2745 228.3 228.0909 18
10 404 625.65 30.73 619.5 491.18 5.08 68.36 3.47 22.5055 202.25 202.0783 16
11 524.6 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.544 278.3 278.1096 22
12 547.5 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.7255 278.3 278.1096 22
13 524 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.8805 278.3 278.1096 22
14 524.7 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.8795 278.3 278.1096 22
15 547.5 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.7627 278.3 278.1096 22
16 525 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.999 278.3 278.1096 22
17 524.7 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.9386 278.3 278.1096 22
18 520 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.9432 278.3 278.1096 22
19 524.7 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.839 278.3 278.1096 22
20 518 714.53 23.47 847.5 644.48 6.02 93.96 4.7 30.9418 278.3 278.1096 22
21 400 689.17 26.08 765.5 621.88 5.34 85.53 4.48 28.2453 252.3 252.0939 20
22 467.5 689.17 26.08 765.5 621.88 5.34 85.53 4.48 28.3361 252.3 252.0939 20
23 495 689.17 26.08 765.5 621.88 5.34 85.53 4.48 28.222 252.3 252.0939 20
24 500 728.06 24.85 829.5 729.98 5.66 94.28 5.27 31.4251 276.3 276.0939 22
25 604 779.67 20.33 993.5 775.18 7.02 111.13 5.71 36.1557 328.4 328.1252 26
26 525 767.68 23.7 893.5 838.08 5.98 103.02 6.06 34.5718 300.4 300.0939 24

Vignesh et al.21 defined the reduced reverse degree as

RR(v)=Δ(G)-d(v)+2 1

  Vignesh et al.21 proposed some reduced reverse degree-based topological indices. These indices are

RRM1(G)=uvE[RR(u)+RR(v)] 2
RRM2(G)=uvE[RR(u)RR(v)] 3
RRHM1(G)=uvE[RR(u)+RR(v)]2 4
RRHM2(G)=uvE[RR(u)RR(v)]2 5
RRF(G)=uvE[RR(u)2+RR(v)2] 6
RRABC(G)=uvE[RR(u)+RR(v)-2RR(u)RR(v)] 7
RRGA(G)=uvE[2RR(u)RR(v)RR(u)+RR(v)] 8
RRRα(G)=uvE[RR(u)RR(v)]α 9

In Eq. (9) we consider α=1,-1,12,-12.

In this proposed work, we consider 26 benzenoid hydrocarbon molecules and obtain the values for the above-mentioned reduced reverse degree based topological indices to perform curvilinear analysis and ascertain the predictive capability of the indices against the physico-chemical properties of the benzenoid hydrocarbons.

Curvilinear regression analysis of proposed indices

In this section, we analyze the aforesaid reduced reverse degree based topological indices with respect to the following physico-chemical properties of the benzenoid hydrocarbon molecules: Boiling Point (BP), Critical Pressure (CP), Critical Temperature (CT), Critical Volume (CV), Exact Mass, Heavy Atom Count, HL, GE, Log P, MR, PI and Molecular Weight (MW). The experimental values of physico-chemical properties of benzenoid hydrocarbon molecules are given in Table 1.

We analyze the topological indices via-a-vis the physico-chemical properties using the following regression models

P=α1(TI)+γ 10
P=α1(TI)2+α2(TI)+γ 11

where P is the physical property, TI is the topological descriptor, αi, i=1,2 and γ represent the coefficients and constant, respectively. For the twelve physico-chemical properties we found the correlation between the properties and the twenty indices proposed by us. Based on the recommendations of the International Academy of Mathematical Chemistry (IAMC), we have only considered the topological indices for which R20.8. We now present the analysis of the linear and quadratic regression models based on the R2 value and Root Mean Square Error value.

Results and discussion

Using Eq. (10), we obtained the linear regression models for the physico-chemical properties vis-a-vis the index for which the R2 value is maximum for the property. We observe that

  • RRR-1 is best suited for predicting Boiling Point (BP), Critical Pressure (CP) and Log P with corresponding R2 values 0.9587, 0.9539 and 0.9637, respectively.

  • RRR-12 is best suited for predicting Critical Volume (CV), Exact Mass (EM), Heavy Atom Count (HAC), Molar Refraction (MR) and Molecular Weight (MW) with corresponding R2 values 0.9967, 0.9979, 0.9979, 0.9979 and 0.9979, respectively.

  • RRHM1 is best suited for predicting (GE) with R2 value 0.9986.

  • RRGA is best suited for predicting Critical Temperature (CT) with R2 value 0.9602.

  • RRABC is best suited for predicting (PI) with R2 value 0.9976.

  • RRF is best suited for predicting Henry’s Law (HL) with R2 value 0.9995.

Hence the linear regression models for predicting Boiling Point (BP), Critical Pressure (CP), Critical Temperature (CT), Critical Volume (CV), Exact Mass, Heavy Atom Count, HL, GE, Log P, MR, PI and Molecular Weight (MW) are:

BP^=129(RRR-1)-75.26 12
LogP^=1.228(RRR-1)+0.296 13
CT^=16.62(RRGA)+291.4 14
GE^=0.9654(RRHM1)+26.46 15
PI^=1.63(RRABC)+1.197 16
HL^=0.01448(RRF)-0.03858 17
CP^=-6.6085(RRR-1)+54.162 18
CV^=73.21(RRR-12)+46.92 19
EM^=25.23(RRR-12)+3.109 20
HAC^=2.024(RRR-12)-0.022 21
MR^=8.699(RRR-12)-0.6273 22
MW^=25.25(RRR-12)+3.143 23

  We use Eq. (12) to compute the predicted boiling point BP^ for each of the benzenoid hydrocarbon molecules, refer Table 2. Among the linear models, we have listed the best predictive models for the remaining physical properties in Eqs. (13) to (23).

Table 2.

Predicted boiling points for 26 benzenoid hydrocarbon molecules.

S. No. RRR-1 BP BP^ RES RES2 RES2/BP RES %
1 1.5 80.1 118.24 38.14 1454.66 18.16054 47.61548
2 2.2778 218 218.5762 0.5762 0.332006 0.001523 0.264312
3 3.0556 340 318.9124 21.0876 444.6869 1.307903 6.202235
4 3.0833 340 322.4857 17.5143 306.7507 0.902208 5.151265
5 3.8333 432.02 419.2357 12.7843 163.4383 0.378312 2.959192
6 3.8889 436.7 426.4081 10.2919 105.9232 0.242554 2.356744
7 3.8611 438 422.8219 15.1781 230.3747 0.52597 3.46532
8 3.8889 448 426.4081 21.5919 466.2101 1.040648 4.819621
9 3.9167 438 429.9943 8.0057 64.09123 0.146327 1.827785
10 3.8889 404 426.4081 22.4081 502.1229 1.242879 5.546559
11 4.6111 524.6 519.5719 5.0281 25.28179 0.048193 0.958464
12 4.6389 547.5 523.1581 24.3419 592.5281 1.082243 4.446009
13 4.6667 524 526.7443 2.7443 7.531182 0.014372 0.523721
14 4.6667 524.7 526.7443 2.0443 4.179162 0.007965 0.389613
15 4.6389 547.5 523.1581 24.3419 592.5281 1.082243 4.446009
16 4.7222 525 533.9038 8.9038 79.27765 0.151005 1.695962
17 4.6944 524.7 530.3176 5.6176 31.55743 0.060144 1.070631
18 4.6944 520 530.3176 10.3176 106.4529 0.204717 1.984154
19 4.6667 524.7 526.7443 2.0443 4.179162 0.007965 0.389613
20 4.6944 518 530.3176 12.3176 151.7233 0.292902 2.377915
21 4.2222 400 469.4038 69.4038 4816.887 12.04222 17.35095
22 3.9444 467.5 433.5676 33.9324 1151.408 2.462904 7.258267
23 4.1944 495 465.8176 29.1824 851.6125 1.720429 5.895434
24 4.5278 500 508.8262 8.8262 77.90181 0.155804 1.76524
25 5.3889 604 619.9081 15.9081 253.0676 0.418986 2.633791
26 4.8333 525 548.2357 23.2357 539.8978 1.028377 4.425848

We obtain the predicted values for all properties using the proposed linear models and we summarize the results in Table 3. From Table 3 we see that the average residual error of BP is 5.3008% with R2 value 0.9587.

Table 3.

Statistical summary of linear models for 26 benzenoid hydrocarbons.

Property Model number R2 χ2 Avg. residual % RMSE
BP (12) 0.9587 44.7293 5.3008 23.2957
Log P (13) 0.9637 0.18095 1.9591 0.2074
CT (14) 0.9608 20.3819 2.3567 18.9922
GE (15) 0.9986 1.3062 0.7285 5.8237
PI (16) 0.9976 0.0855 0.8416 0.3184
HL (17) 0.9995 0.0035 0.2915 0.0256
CP (18) 0.9539 1.2389 3.1242 1.2647
CV (19) 0.9967 2.6089 0.7031 9.3284
EM (20) 0.9979 0.6239 0.4748 2.5671
HAC (21) 0.9979 0.0505 0.4404 0.2046
MR (22) 0.9978 0.2287 0.4379 0.9010
MW (23) 0.9979 5.0938 0.4796 2.5666

The hypotheses which we considered for checking the goodness of fit of the regression models are:

H0:

Proposed regression model is a good fit.

H1:

Proposed regression model is not a good fit.

The table value for goodness of fit (χ2) with d.o.f. 25 is 37.652 for 5% level of significance.

From Table 3, we observe that the calculated χ2 values for all the properties except BP are lesser than the table value with d.o.f. 25 for 5% level of significance. Thus we accept all the models except model (12).

Though the R2 values are high in the linear models, some of the models have high residual error and thus we explore the predictive capability of quadratic regression models.

Quadratic regression models

Using Eq. (11), we obtained the quadratic regression models for the physico-chemical properties vis-a-vis each of the proposed indices. We observe that

  • RRR-1 is best suited for predicting Boiling Point (BP) and Log P with corresponding R2 values 0.9673 and 0.9651, respectively.

  • RRR-12 is best suited for predicting Critical Pressure (CP), Critical Volume (CV), Exact Mass (EM), Heavy Atom Count (HAC), Molar Refraction (MR) and Molecular Weight (MW) with corresponding R2 values 0.9916, 0.9968, 0.9980, 0.9979, 0.9980 and 0.9980, respectively.

  • RRHM1 is best suited for predicting (GE) with R2 value 0.9988.

  • RRGA is best suited for predicting Critical Temperature (CT) with R2 value 0.9918.

  • RRABC is best suited for predicting (PI) with R2 value 0.9978.

  • RRF is best suited for predicting Henry’s Law (HL) with R2 value 0.9996.

Among the quadratic models, we listed the best predictive models from Eqs. (24) to (35) and the quadratic regression models for Boiling Point (BP), Critical Pressure (CP), Critical Temperature (CT), Critical Volume (CV), Exact Mass, Heavy Atom Count, HL, GE, Log P, MR, PI and Molecular Weight (MW) are:

BP^=-10.47(RRR-1)2+202.4(RRR-1)-192.6 24
LogP^=-0.0407(RRR-1)2+1.513(RRR-1)-0.1598 25
CT^=-0.3939(RRGA)2+31.57(RRGA)-166.2 26
GE^=0.000047(RRHM1)2+0.9208(RRHM1)+35.4 27
PI^=-0.004891(RRABC)2+1.761(RRABC)+0.4245 28
HL^=8.815(10-8)(RRF)2+0.01444(RRF)-0.03427 29
CP^=0.1774(RRR-12)2-5.46(RRR-12)+62.12 30
CV^=-0.2004(RRR-12)2+76.46(RRR-12)+35.08 31
EM^=-0.05377(RRR-12)2+26.1(RRR-12)-0.06767 32
HAC^=-0.003732(RRR-12)2+2.084(RRR-12)-0.2424 33
MR^=-0.0148(RRR-12)2+8.939(RRR-12)-1.503 34
MW^=-0.0085(RRR-12)2+3.042(RRR-12)-1.157 35

   We use Eq. (24) to compute the predicted boiling point BP^ for each of the every benzenoid hydrocarbon molecules, refer Table 4. From Table 5 we see that the average error is 3.0962% with the R2 value is 0.9673. Similarly, we predict all properties with the help of the quadratic models which we proposed and we summarize the result in Table 5.

Table 4.

Predicted boiling points for 26 benzenoid hydrocarbon molecules using quadratic model.

S. No. RRR-1 BP BP^ RES RES2 RES2/BP RES %
1 1.5 80.1 87.4425 7.3425 53.91231 0.673063 9.166667
2 2.2778 218 214.1045 3.895544 15.17526 0.069611 1.786947
3 3.0556 340 328.0983 11.90172 141.6509 0.41662 3.500505
4 3.0833 340 331.9244 8.075636 65.2159 0.191811 2.375187
5 3.8333 432.02 429.4118 2.608238 6.802904 0.015747 0.603731
6 3.8889 436.7 436.1699 0.530137 0.281046 0.000644 0.121396
7 3.8611 438 432.7989 5.201096 27.0514 0.061761 1.187465
8 3.8889 448 436.1699 11.83014 139.9522 0.312393 2.640656
9 3.9167 438 439.5246 1.524638 2.32452 0.005307 0.348091
10 3.8889 404 436.1699 32.16986 1034.9 2.561634 7.962837
11 4.6111 524.6 518.071 6.529046 42.62845 0.081259 1.244576
12 4.6389 547.5 521.0053 26.49469 701.9684 1.282134 4.839212
13 4.6667 524 523.9235 0.076511 0.005854 1.12E-05 0.014601
14 4.6667 524.7 523.9235 0.776511 0.602969 0.001149 0.147991
15 4.6389 547.5 521.0053 26.49469 701.9684 1.282134 4.839212
16 4.7222 525 529.7009 4.70094 22.09884 0.042093 0.895417
17 4.6944 524.7 526.8151 2.115072 4.473532 0.008526 0.403101
18 4.6944 520 526.8151 6.815072 46.44521 0.089318 1.310591
19 4.6667 524.7 523.9235 0.776511 0.602969 0.001149 0.147991
20 4.6944 518 526.8151 8.815072 77.7055 0.150011 1.701751
21 4.2222 400 475.3249 75.32487 5673.837 14.18459 18.83122
22 3.9444 467.5 442.8512 24.64875 607.5609 1.299596 5.27246
23 4.1944 495 472.1479 22.85206 522.2166 1.054983 4.616578
24 4.5278 500 509.1815 9.181534 84.30057 0.168601 1.836307
25 5.3889 604 594.062 9.937986 98.76357 0.163516 1.645362
26 4.8333 525 541.0725 16.07246 258.324 0.492046 3.061421

Table 5.

Statistical summary of quadratic models for 26 benzenoid hydrocarbons.

Property Model number R2 χ2 Avg. residual % RMSE
BP (24) 0.9673 24.6097 3.0962 21.1934
Log P (25) 0.9651 0.1703 1.9449 0.2077
CT (26) 0.9874 3.3935 1.0299 10.9277
GE (27) 0.9988 1.2211 0.6948 5.6693
PI (28) 0.9978 0.0814 0.8564 0.3075
HL (29) 0.9996 0.0035 0.2806 0.0261
CP (30) 0.9916 0.2814 1.5133 0.5521
CV (31) 0.9968 2.5356 0.6829 9.4338
EM (32) 0.9980 0.6086 0.4829 2.5974
HAC (33) 0.9980 0.0496 0.4394 0.2075
MR (34) 0.9979 0.2255 0.4630 0.9150
MW (35) 0.9980 0.6076 04819 2.5963

The table value for the goodness of fit (χ2) with d.o.f. 25 is 37.652 for 5% level of significance. The same hypothesis which we considered in linear models apply for the quadratic models too.

From Table 5, we see that the calculated χ2 value of boiling point is less than the table value with d.o.f. 25 in 5% level of significances and thus we accept all the models.

The quadratic models (second order) obtained by us are a good fit for predicting all the properties. Hence we stop with the quadratic regression models.

Figures 1, 2, 3, 4, 5, 6 show the plots of the quadratic regression models that give the best predicted values for the properties.

Figure 1.

Figure 1

Quadratic regression curves for RRR-1 against BP and LogP.

Figure 2.

Figure 2

Quadratic regression curves for RRR-12 against CP and CV.

Figure 3.

Figure 3

Quadratic regression curves for RRR-12 against EM and HAC.

Figure 4.

Figure 4

Quadratic regression curves for RRR-12 against MR and MW.

Figure 5.

Figure 5

Quadratic regression curves for RRHM1 against GE and RRGA. against CT

Figure 6.

Figure 6

Quadratic regression curves for RRABC against PI and RRF against HL.

Comparison of our indices against existing indices

In this section, we present the comparison of our proposed indices against 16 well known degree based indices.

The degree based indices that we have taken into consideration include Zagreb (M1, M2)22,23 and Hyper Zagreb (HM1, HM2)24,25 , Forgotten index F26, Atom Bond Connectivity Index ABC27, Randić R28, Geometric Arithmetic index GA29, Harmonic H30, Randić version indices R-1, R12 and R-12, Gourava indices31 (GO1, GO2) and Hyper Gourava indices32 (HGO1, HGO2).

We formulated the linear regression models for the physico-chemical properties vis-a-vis these 16 degree based indices. Among these indices, we selected the best models based on the R2 value. For instance, the property Boiling Point (BP) is predicted well by the index R-12. It has a maximum R2 value 0.8679.

In Table 6, we present the comparison of 16 degree based indices and 10 reduced reverse degree based indices against the properties of 26 benzenoid hydrocarbons along with their corresponding R2 values. Based on the R2 values corresponding to both the linear and quadratic regression models, it can be observed that the reduced reverse degree based indices are well correlated with the physico-chemical properties than the degree based indices. This implies that the reduced reverse degree-based descriptors are more suited for predicting the physico-chemical properties of benzenoid hydrocarbons compared to the existing degree based indices.

Table 6.

Comparison of reduced reverse degree indices against 16 degree based indices.

Property Existing index Existing index R2 RR index/R2(linear) RR index /R2 (quadratic)
BP R-12 0.8679 RRR-1 / 0.9587 RRR-1 / 0.9673
Log P R-12 0.8629 RRR-1 / 0.9637 RRR-1 / 0.9651
CT ABC 0.8902 RRGA / 0.9608 RRGA / 0.9874
GE GO2 0.9401 RRHM1 / 0.9986 RRHM1 / 0.9988
PI ABC 0.9247 RRABC /0.9976 RRABC / 0.9978
HL F 0.9392 RRF / 0.9995 RRF / 0.9996
CP R-1 0.8717 RRR-1 /0.9539 RRR-12 / 0.9916
CV R-12 0.9095 RRR-12 / 0.9967 RRR-12 / 0.9968
EM ABC 0.9145 RRR-12 / 0.9979 RRR-12 /0.9980
HAC ABC 0.9167 RRR-12 / 0.9979 RRR-12 / 0.9980
MR ABC 0.9176 RRR-12 / 0.9978 RRR-12 / 0.9979
MW ABC 0.9146 RRR-12 / 0.9979 RRR-12 / 0.9980

Application to hyaluronic acid-paclitaxel conjugates

Cancer is one of the leading factors of mortality in the world, and its fatality rate is on the rise, with breast, stomach, lung, and colon cancers responsible for the bulk of deaths. Although there have been significant advancements in cancer biology and treatments. There are still challenges in the treatment of primary and metastatic disease. Furthermore, present anticancer medications cause limited selectivity and significant toxicity, greatly restricting their usefulness. In recent years, there have been some advancements in molecularly targeted anticancer therapy.

Hyaluronic acid (HA) is a compound that occurs naturally. It is a glycosaminoglycan polymer composed of a linear structure of units of D-glucuronic acid and N-acetyl-D-glucosamine. These are linked via alternating β-1,3- and β-1,4-glycosidic bonds. Disaccharide, HA’s primary structure is energetically stable33. Because of its unique biodegradable, biocompatible, harmless, hydrophilic, and non-immutable properties, HA is a prospective cancer treatment compound; also, HA receptors are over-expressed on numerous tumour cells. HA intends to improve antitumor therapeutics by targeting CD44-overexpressing cells, which is a rapidly developing platform nowadays3436. HA is an excellent medication transporter and target.37. Paclitaxel (PTX) is an effective medication that is prescribed for a variety of malignancies, notably ovarian, breast, lung, bladder, prostate, and esophageal tumors. Although PTX treatment has its own set of drawbacks, such as poor solubility and associated adverse effects, as well as the excipients commonly utilised in its formulation. The key benefits of HA-PTX conjugates include increased water solubility and activity retention, as well as the possibility of using it as a drug carrier to boost anti-tumor potency3840. Figure 7 shows the structure of hyaluronic acid-paclitaxel conjugates. Numerous degree-based indices of the Hyaluronic acid (HA) conjugates41 and Hyaluronic Acid-Paclitaxel conjugates4244 have been proposed by researchers.

Figure 7.

Figure 7

Molecular graph of (HAP)n, n=1.

There are still no reduced reverse degree based topological results on the molecular structures of HA-Paclitaxel conjugates. As a result of the immense pharmaceutical significance of HA-Paclitaxel conjugates, present study intends to investigate the reduced reverse degree based topological indices of the chemical structure of HA-Paclitaxel conjugates. Furthermore, these findings may serve as a theoretical base for pharmacological engineering.

Methodology and results

Figure 7 depicts the structure of the HA-Paclitaxel conjugates for the values n=1. Edge partitioning based on the reduced reverse degree sum counting is the mechanism employed here. By observing the graph structure, we get V((HAP)n) = 87n and E((HAP)n) = 96n.

Results on reduced reverse degree based descriptors of hyaluronic acid-paclitaxel conjugates

Here we compute the reduced reverse degree based topological indices of (HAP)n, n1.

Applying the reduced reverse degree based edge partitions as given in Table 7, we get

RRM1((HAP)n)=uvE1[5+4]+uvE2[5+3]+uvE3[5+2]+uvE4[4+4]+uvE5[4+3]+uvE6[4+2]+uvE7[3+3]+uvE8[3+2]+uvE9[2+2]=664n+4.
RRM2((HAP)n)=uvE1[54]+uvE2[53]+uvE3[52]+uvE4[44]+uvE5[43]+uvE6[42]+uvE7[33]+uvE8[32]+uvE9[22]=1133n+15.
RRHM1((HAP)n)=uvE1[5+4]2+uvE2[5+3]2+uvE3[5+2]2+uvE4[4+4]2+uvE5[4+3]2+uvE6[4+2]2+uvE7[3+3]2+uvE8[3+2]2+uvE9[2+2]2=4684n+60.
RRHM2((HAP)n)=uvE1[54]2+uvE2[53]2+uvE3[52]2+uvE4[44]2+uvE5[43]2+uvE6[42]2+uvE7[33]2+uvE8[32]2+uvE9[22]2=14335n+431.
RRF((HAP)n)=uvE1[52+42]+uvE2[52+32]+uvE3[52+22]+uvE4[42+42]+uvE5[42+32]+uvE6[42+22]+uvE7[32+32]+uvE8[32+22]+uvE9[22+22]=2418n+30.
RRABC((HAP)n)=uvE1[5+4-254]+uvE2[5+3-253]+uvE3[5+2-252]+uvE4[4+4-244]+uvE5[4+3-243]+uvE6[4+2-242]+uvE7[3+3-233]+uvE8[3+2-232]+uvE9[2+2-222]=62.6009177191n-0.1082.
RRGA((HAP)n)=uvE1[2545+4]+uvE2[2535+3]+uvE3[2525+2]+uvE4[2444+4]+uvE5[2434+3]+uvE6[2424+2]+uvE7[2333+3]+uvE8[2323+2]+uvE9[2222+2]=93.9919881117n+1.9836.
RRRα((HAP)n)=uvE1[54]α+uvE2[53]α+uvE3[52]α+uvE4[44]α+uvE5[43]α+uvE6[42]α+uvE7[33]α+uvE8[32]α+uvE9[22]α.

In the above RRRα((HAP)n),

  • If α=12, then RRRα((HAP)n) = 326.571942n+2.00803434.

  • If α=-1, then RRRα((HAP)n) = 8.8986n-0.0819.

  • If α=-12, then RRRα((HAP)n) = 26.1823n-0.0075.

Table 7.

Reduced reverse degree based edge partitions.

Ei (RRR(u),RRR(v)) Number of edges
E1 (5, 4) n+1
E2 (5, 3) 16n
E3 (5, 2) 4n
E4 (4, 4) 13n+1
E5 (4, 3) 32n-1
E6 (4, 2) 3n
E7 (3, 3) 19n-1
E8 (3, 2) 7n
E9 (2, 2) n

Conclusion

In this article, novel topological descriptors RRM1, RRM2, RRHM1, RRHM2, RRF, RRABC, RRGA and RRRα have been analyzed with respect to 26 Benzenoid Hydrocarbons. From the QSPR analysis, it is evident that these descriptors are useful molecular descriptors. We tested the predictive capability of the indices with respect to 26 Benzenoid Hydrocarbons.

QSPR study using curvilinear models reveals that both linear and quadratic regression models provide good estimates for the physico-chemical properties of the 26 Benzenoid Hydrocarbons. From quadratic regression models, we observe that our proposed indices have high correlation with all the physico-chemical properties considered in the above sections.

The best models that predict the physico-chemical properties are as follows:

  1. RRR-1 is best suited for predicting the properties, BP, LogP.

  2. RRR-12 is best suited for predicting the properties, CP, CV, EM, HAC, MR and MW.

  3. RRHM1 is best suited for predicting GE.

  4. RRGA is best suited for predicting CT.

  5. RRABC is best suited for predicting PI.

  6. RRF is best suited for predicting HL.

On comparing the proposed reduced reverse degree-based indices with the existing degree-based topological indices, we found that our proposed indices are well correlated with all the physico-chemical properties of 26 benzenoid hydrocarbons. The errors are significantly reduced for our proposed indices. This demonstrates the significance of the reduced reverse degree-based descriptors in predicting the physico-chemical properties of benzenoid hydrocarbons over the existing degree-based descriptors.

The defined reduced reverse degree based topological indices have been determined for Hyaluronic Acid-Paclitaxel Conjugates (HAP)n, n1.

In future, these indices can be applied to various transformations of graphs and to analyze different chemical networks.

Acknowledgements

The authors would like to acknowledge the editor and reviewers for their valuable comments towards the improvement of this paper. Also, the authors sincerely thank the management of Vellore Institute of Technology, Chennai, India, for their support and encouragement to carry out this research work.

Author contributions

V.R. was involved in conceptualization and writing-original draft preparation; K.D. was involved in supervision and validation.

Funding

This research work is financially supported by Vellore Institute of Technology, Chennai, India.

Data availability

All data generated or analysed during this study are included in this published article. The experimental data for benzenoid hydrocarbons were taken from https://pubchem.ncbi.nlm.nih.gov. MS-Excel 2019 was used for statistical analyses.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Todeschini R, Consonni V. Handbook of Molecular Descriptors. New York: Wiley; 2008. [Google Scholar]
  • 2.Azam, F. Ginger components as anti-alzheimer drugs: focus on drug design. Neuroprotect. Effect. Phytochem. Neurol. Disorders, 149 (2017).
  • 3.Azam F, Mohamed N, Alhussen F. Molecular interaction studies of green tea catechins as multitarget drug candidates for the treatment ofparkinson’s disease: computational and structural insights. Netw. Comput. Neural Syst. 2015;26(3–4):97–115. doi: 10.3109/0954898X.2016.1146416. [DOI] [PubMed] [Google Scholar]
  • 4.Zanni R, Galvez-Llompart M, Garcia-Domenech R, Galvez J. What place does molecular topology have in today’s drug discovery? Expert Opin. Drug Discov. 2020;15(10):1133–1144. doi: 10.1080/17460441.2020.1770223. [DOI] [PubMed] [Google Scholar]
  • 5.Lavecchia A. Machine-learning approaches in drug discovery: methods and applications. Drug Discovery Today. 2015;20(3):318–331. doi: 10.1016/j.drudis.2014.10.012. [DOI] [PubMed] [Google Scholar]
  • 6.Gertrudes JC, Maltarollo VG, Silva R, Oliveira PR, Honorio KM, Da Silva A. Machine learning techniques and drug design. Curr. Med. Chem. 2012;19(25):4289–4297. doi: 10.2174/092986712802884259. [DOI] [PubMed] [Google Scholar]
  • 7.Lo Y-C, Rensi SE, Torng W, Altman RB. Machine learning in chemoinformatics and drug discovery. Drug Discovery Today. 2018;23(8):1538–1546. doi: 10.1016/j.drudis.2018.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zhang L, Tan J, Han D, Zhu H. From machine learning to deep learning: progress in machine intelligence for rational drug discovery. Drug Discovery Today. 2017;22(11):1680–1685. doi: 10.1016/j.drudis.2017.08.010. [DOI] [PubMed] [Google Scholar]
  • 9.Speck-Planche A. Recent advances in fragment-based computational drug design: Tackling simultaneous targets/biological effects. Fut. Med. Chem. 2018;10(17):2021–2024. doi: 10.4155/fmc-2018-0213. [DOI] [PubMed] [Google Scholar]
  • 10.Mauri, A., Consonni, V. & Todeschini, R. Handbook of computational chemistry. ed j leszczynski. A Kaczmarek-Kedziera, T Puzyn, MG Papadopoulos, H Reis and MK Shukla (Cham: Springer Int. Publishing) Molecular descriptors, 2065–93 (2017).
  • 11.Estrada, E., Patlewicz, G. & Uriarte, E. From molecular graphs to drugs. a review on the use of topological indices in drug design and discovery (2003).
  • 12.Galvez J, Galvez-Llompart M, Zanni R, Garcia-Domenech R. Molecular topology-dissimilar similarities. Drug Discov. Today Technol. 2013;10(4):475–481. doi: 10.1016/j.ddtec.2013.05.001. [DOI] [PubMed] [Google Scholar]
  • 13.Katritzky AR, Lobanov VS, Karelson M. Qspr: the correlation and quantitative prediction of chemical and physical properties from structure. Chem. Soc. Rev. 1995;24(4):279–287. doi: 10.1039/cs9952400279. [DOI] [Google Scholar]
  • 14.Katritzky AR, Mu L, Lobanov VS, Karelson M. Correlation of boiling points with molecular structure. 1. A training set of 298 diverse organics and a test set of 9 simple inorganics. J. Phys. Chem. 1996;100(24):10400–10407. doi: 10.1021/jp953224q. [DOI] [Google Scholar]
  • 15.Doley A, Buragohain J, Bharali A. Inverse sum indeg status index of graphs and its applications to octane isomers and benzenoid hydrocarbons. Chemom. Intell. Lab. Syst. 2020;203:104059. doi: 10.1016/j.chemolab.2020.104059. [DOI] [Google Scholar]
  • 16.West DB, et al. Introduction to Graph Theory. Upper Saddle River: Prentice Hall; 2001. [Google Scholar]
  • 17.Lučić B, Trinajstić N, Zhou B. Comparison between the sum-connectivity index and product-connectivity index for benzenoid hydrocarbons. Chem. Phys. Lett. 2009;475(1–3):146–148. doi: 10.1016/j.cplett.2009.05.022. [DOI] [Google Scholar]
  • 18.Randic M. Quantitative structure-property relationship. Boiling points of planar benzenoids. New J. Chem. 1996;20(10):1001–1009. [Google Scholar]
  • 19.Dias, J.R. Handbook of polycyclic hydrocarbons. Part a: Benzenoid hydrocarbons (1987).
  • 20.Basak, S.C., Grunwald, G.D. & Niemi, G.J. Use of graph-theoretic and geometrical molecular descriptors in structure-activity relationships. In: From Chemical Topology to Three-dimensional Geometry, pp. 73–116. Springer, Berlin (2002).
  • 21.Ravi, V., Siddiqui, M. K., Chidambaram, N. & Desikan, K. On topological descriptors and curvilinear regression analysis of antiviral drugs used in covid-19 treatment. Polycyclic Aromatic Compound. 1–14 (2021).
  • 22.Gutman, I., Ru?? Ić, B., Trinajstić, N. & Wilcox Jr, C.F. Graph theory and molecular orbitals. xii. acyclic polyenes. J. Chem. Phys.62(9), 3399–3405 (1975).
  • 23.Balaban, A.T., Motoc, I., Bonchev, D. & Mekenyan, O. Topological indices for structure-activity correlations. In: Steric Effects in Drug Design, pp. 21–55. Springer, Berlin (1983).
  • 24.Došlic T, Furtula B, Graovac A, Gutman I, Moradi S, Yarahmadi Z. On vertex-degree-based molecular structure descriptors. MATCH Commun. Math. Comput. Chem. 2011;66(2):613–626. [Google Scholar]
  • 25.Došlić T, Réti T, Vukičević D. On the vertex degree indices of connected graphs. Chem. Phys. Lett. 2011;512(4–6):283–286. doi: 10.1016/j.cplett.2011.07.040. [DOI] [Google Scholar]
  • 26.Furtula B, Gutman I. A forgotten topological index. J. Math. Chem. 2015;53(4):1184–1190. doi: 10.1007/s10910-015-0480-z. [DOI] [Google Scholar]
  • 27.Estrada, E., Torres, L., Rodriguez, L. & Gutman, I. An atom-bond connectivity index: Modelling the enthalpy of formation of alkanes (1998).
  • 28.Randic M. Characterization of molecular branching. J. Am. Chem. Soc. 1975;97(23):6609–6615. doi: 10.1021/ja00856a001. [DOI] [Google Scholar]
  • 29.Vukičević D, Furtula B. Topological index based on the ratios of geometrical and arithmetical means of end-vertex degrees of edges. J. Math. Chem. 2009;46(4):1369–1376. doi: 10.1007/s10910-009-9520-x. [DOI] [Google Scholar]
  • 30.Fajtlowicz S. On conjectures of graffiti-ii. Congr. Numer. 1987;60:187–197. [Google Scholar]
  • 31.Kulli V. The gourava indices and coindices of graphs. Ann. Pure Appl. Math. 2017;14(1):33–38. doi: 10.22457/apam.v14n1a4. [DOI] [Google Scholar]
  • 32.Kulli V. On hyper-gourava indices and coindices. Int. J. Math. Arch. 2017;8(12):116–120. [Google Scholar]
  • 33.Mráček A, Varhaníková J, Lehockỳ M, Gřundělová L, Pokopcová A, Velebnỳ V. The influence of hofmeister series ions on hyaluronan swelling and viscosity. Molecules. 2008;13(5):1025–1034. doi: 10.3390/molecules13051025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Arpicco S, Milla P, Stella B, Dosio F. Hyaluronic acid conjugates as vectors for the active targeting of drugs, genes and nanocomposites in cancer treatment. Molecules. 2014;19(3):3193–3230. doi: 10.3390/molecules19033193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Dosio F, Arpicco S, Stella B, Fattal E. Hyaluronic acid for anticancer drug and nucleic acid delivery. Adv. Drug Deliv. Rev. 2016;97:204–236. doi: 10.1016/j.addr.2015.11.011. [DOI] [PubMed] [Google Scholar]
  • 36.Zhang X, He F, Xiang K, Zhang J, Xu M, Long P, Su H, Gan Z, Yu Q. Cd44-targeted facile enzymatic activatable chitosan nanoparticles for efficient antitumor therapy and reversal of multidrug resistance. Biomacromol. 2018;19(3):883–895. doi: 10.1021/acs.biomac.7b01676. [DOI] [PubMed] [Google Scholar]
  • 37.Markman M, Mekhail TM. Paclitaxel in cancer therapy. Expert Opin. Pharmacother. 2002;3(6):755–766. doi: 10.1517/14656566.3.6.755. [DOI] [PubMed] [Google Scholar]
  • 38.Leonelli F, La Bella A, Migneco LM, Bettolo RM. Design, synthesis and applications of hyaluronic acid-paclitaxel bioconjugates. Molecules. 2008;13(2):360–378. doi: 10.3390/molecules13020360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Xin D, Wang Y, Xiang J. The use of amino acid linkers in the conjugation of paclitaxel with hyaluronic acid as drug delivery system: Synthesis, self-assembled property, drug release, and in vitro efficiency. Pharm. Res. 2010;27(2):380–389. doi: 10.1007/s11095-009-9997-9. [DOI] [PubMed] [Google Scholar]
  • 40.Chen Y, Peng F, Song X, Wu J, Yao W, Gao X. Conjugation of paclitaxel to c-6 hexanediamine-modified hyaluronic acid for targeted drug delivery to enhance antitumor efficacy. Carbohyd. Polym. 2018;181:150–158. doi: 10.1016/j.carbpol.2017.09.017. [DOI] [PubMed] [Google Scholar]
  • 41.Wang J, Wang Y, Wang Y, Zheng L. Computation on the topological indices of hyaluronic acid. J. Appl. Anal. Comput. 2020;10(3):1193–1198. [Google Scholar]
  • 42.Zheng L, Wang Y, Gao W. Topological indices of hyaluronic acid-paclitaxel conjugates’ molecular structure in cancer treatment. Open Chem. 2019;17(1):81–87. doi: 10.1515/chem-2019-0009. [DOI] [Google Scholar]
  • 43.Jahanbani A, Shao Z, Sheikholeslami SM. Calculating degree based multiplicative topological indices of hyaluronic acid-paclitaxel conjugates’ molecular structure in cancer treatment. J. Biomol. Struct. Dyn. 2021;39(14):5304–5313. doi: 10.1080/07391102.2020.1800512. [DOI] [PubMed] [Google Scholar]
  • 44.Rauf, A., Naeem, M., Maqbool, S. & Siddiqui, M. K. Irregularity indices for certain anti-tumor and anti-covid drugs. Polycyclic Aromatic Compounds 1–14 (2021).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All data generated or analysed during this study are included in this published article. The experimental data for benzenoid hydrocarbons were taken from https://pubchem.ncbi.nlm.nih.gov. MS-Excel 2019 was used for statistical analyses.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES