Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2026 Jan 8;16:3839. doi: 10.1038/s41598-025-33948-x

Exploring the properties of benzenoid hydrocarbons through QSPR modeling and domination-based energy parameters

Shanmugavelan Sankaran 1, Natarajan Chidambaram 1,
PMCID: PMC12852721  PMID: 41507301

Abstract

This article aims to explore two emerging areas of graph theory: chemical graph theory and domination theory. We specifically focus on a graph parameter that combines aspects of graph energy and domination, known as the dominating energy of a simple connected graph. This concept refers to the sum of the absolute values of the eigenvalues of the corresponding dominating matrix. Additionally, we discuss several variants of dominating energy, including total dominating energy, connected dominating energy, and a distance-based variant called hop dominating energy. We also introduce the Estrada version and Resolvent version of these dominating energies and examine their predictive capabilities in relation to a set of 12 physico-chemical properties and the Inline graphic-electron energy of 35 benzenoid hydrocarbons through linear, quadratic, cubic, and multiple linear regression analyses. As a result, we identify the best predictive models, which have applications in various fields, including health science, pharmaceuticals, production, and design engineering. These best predictive models are tested using k-fold cross-validation to assess their stability. Furthermore, a comparative study is conducted to evaluate how the regression models based on domination-based energy parameters perform against models derived from well-known degree-based topological indices, such as the first and second Zagreb indices, the Forgotten index, the Randic index, and the Wiener index. This analysis aims to demonstrate that the domination-based energy parameters are more effective in predicting the physicochemical properties of the selected benzenoid hydrocarbons. Additionally, an external validation is also performed using another set of 5 benzenoid hydrocarbons to ensure the efficacy of the best-fitting models based on domination parameters.

Keywords: Inline graphic-electron energy, Dominating energy, Total dominating energy, Connected dominating energy, Hop dominating energy, Estrada index, Resolvent energy

Subject terms: Chemistry, Mathematics and computing

Introduction

Domination in graphs is one of the prominent areas of graph theory that has been well studied. A subset Inline graphic of vertices of a graph Inline graphic is said to form a dominating(total dominating) set of G if each vertex in Inline graphic (in G) is adjacent to a vertex in S. The least cardinality of a minimal dominating set (total dominating set) is called the domination number (total domination number) of G and is indicated by Inline graphic (Inline graphic), respectively. A dominating set S of a graph G is a connected dominating set if Inline graphic, the subgraph induced by S, is connected. The minimum cardinality of the connected dominating set is the connected domination number and is denoted by Inline graphic. A subset Inline graphic of a graph G is called a hop dominating set if for every Inline graphic, there exists a vertex u in S such that Inline graphic in G. The least cardinality of a minimum hop dominating set is called the hop domination number of G and denoted by Inline graphic. In literature, there are several works done effectively in this area, and newer versions of dominations have been introduced as and when a necessity arises in areas related to Engineering, Medical Science, Optimization Problems, Social Media network analysis, etc., For a detailed study on dominations and its variants, one may refer1,2.

The energy of a graph G is defined as the sum of absolute values of the eigenvalues of the (0, 1)-adjacency matrix corresponding to a graph G with n vertices, namely as

Inline graphic. If the eigenvalues of a molecular graph (of a conjugated Inline graphic-electron system) are labeled as Inline graphic, then the total Inline graphic-electron energy Inline graphic (of the underlying molecule in its ground electronic State), as calculated within the Hückel molecular orbital (HMO) approximation (Refer 1, 2), is equal to

graphic file with name d33e298.gif 1

where Inline graphic and Inline graphic are constants, n is an odd number. If n is an even number, then

graphic file with name d33e318.gif 2

(See Ref3).

Composed solely of six-membered rings, polycyclic aromatic hydrocarbons (PAHs) compounds-also referred to as benzenoid hydrocarbons (BHCs)-are fully conjugated condensed polycyclic unsaturated hydrocarbons that need a high level of chemical and thermodynamic stability. Numerous benzenoid hydrocarbons, including phenanthrene, anthracene, chrysene, pyrene, perylene, benzo[e]pyrene, benzo[a]pyrene, coronene, benzo[ghi]perylene, and anthanthrene, are found in soils and the earliest marine sediments. Certain chemicals, including phenanthrene, perylene, and anthracene, were discovered in a 150-million-year-old fossil sea lily. The chemical industry uses anthracene and naphthalene extensively. Catalytic gas-phase oxidation transforms about 65% of it into phthalic anhydride, which is utilised in the manufacturing of plastics. Zone refining is used to create anthracene, which is employed as a scintillation counter in nuclear physics. Additionally, anthracene has been proposed for use as a semiconductor and photoconductor. Benzo[a]pyrene and numerous other identified carcinogens that are not PAHs are among the more than 3800 chemicals found in tobacco smoke. When tobacco smoke is inhaled, these chemicals instantly invade the lungs. Pyrene is used to make dyes. Chrysene has been used in UV filters and as a photosensitizer. For looking into the applications of Various PAHs (See Ref4).

QSPR (Quantitative Structure-Property Relationship) analysis of drugs and chemical compounds has garnered significant interest from researchers worldwide. Consequently, there are over 500 research articles available on this topic in the literature. Recently, domination-based or domination degree-based QSPR analysis has attracted attention from researchers in the field of chemical graph theory. The following provides a brief survey of this topic.

Hayat et al.5 compared the Inline graphic-electron energy and the well-performing valency-based topological descriptors of lower benzenoid hydrocarbons. Also, Hayat et al.6 examined topological descriptors based on degree to estimate the thermodynamic properties of benzenoid hydrocarbons. Shiv Kumar et al.7 analyzed energy and Estarda index of benzenoid hydrocarbons with several topological indices. Khan et al.8 conducted a comparative study of seven domination parameters alongside the physico-chemical properties of benzenoid hydrocarbons. Kuriachan et al.9 examined the chemical significance of domination and power domination numbers through QSPR analysis. Shashidhara et al.10 investigated the role of domination-based indices in predicting the physico-chemical properties of butane derivatives. Bommahalli Jayaraman et al.11 explored the properties of anti-tuberculosis drugs using QSPR graph modeling and various domination-based topological indices. Lastly, Kuriachan et al.12 predicted the Inline graphic-electron energy and a set of physico-chemical properties of benzenoid hydrocarbons by employing domination degree-based entropies.

In this work, we have considered several variants of dominating energies like the total dominating energy arising from total dominating sets (introduced by Cockayne, Dawes, and Hedetniemi13), connected dominating energy arising from connected dominating sets (introduced by Sampathkumar and Walikar14), and hop dominating energy arising from hop dominating sets (Natarajan and Ayyaswamy15) for a selected set of 35 benzenoid hydrocarbons.

The dominating matrix Inline graphic with respect to a minimum dominating set D of a graph G is defined as a square matrix Inline graphic of order n, where

graphic file with name d33e412.gif

The energy obtained from this matrix Inline graphic is referred to as dominating energy, which was introduced by Kamal Kumar16. The total dominating energy introduced by Malathy17 is a variant of dominating energy with respect to a minimum total dominating set of G. A distance-based dominating energy parameter known as the hop dominating energy was introduced by Palani18, who explored several bounds related to it.

In the area of topological indices, one of the well-studied indices is the Estrada index of a graph or network G, denoted by EE(G), which is defined as the sum of the exponentials of the eigenvalues of the adjacency matrix of G. Ernesto Estrada introduced this parameter as a measure of the degree of folding of a protein19. Ivan Gutman et al.20 introduced an energy variant named resolvent energy of a graph, defined as Inline graphic and established some bounds on it. At the very outset, we introduce the Estrada version and Resolvent version of the four aforesaid dominating energies and perform a QSPR analysis to identify the best predictors for 12 chosen physico-chemical properties and the Inline graphic-electron energy of 35 benzenoid hydrocarbons.

Molecular structure of some benzenoid hydrocarbons

This section presents the molecular structure graph of a set of 35 benzenoid hydrocarbons (Refer Fig. 1) considered.

Fig. 1.

Fig. 1

Fig. 1

Chemical structure of selected BHCs.

Methodology

A collection of benzenoid hydrocarbons is selected based on a literature review, and their molecular structure are converted into a simple graph G. Then, we obtain various types of matrices associated with molecular graphs, including a dominating matrix Inline graphic, a total dominating matrix Inline graphic, a connected dominating matrix Inline graphic, and a hop dominating matrix Inline graphic, each defined by its minimal set. Next, we calculate the eigenvalues, or latent values, of these matrices, which allows us to compute their respective energies: Dominating Energy (DE), Total Dominating Energy (TDE), Connected Dominating Energy (CDE), and Hop Dominating Energy (HDE). It is important to note that for any simple and connected graph G, there can be multiple minimal dominating sets (as well as variants of these sets), and determining them is an NP-complete problem.

The energy parameters based on domination used in this study are derived from the molecular graph representation of each compound. This representation captures both the topological and energetic characteristics of the molecular structure. These parameters quantify the dominant influence that specific atoms or substructures have on the molecular energy framework, reflecting the balance among connectivity, bond strength, and electronic distribution. Since some physico-chemical properties are determined by these same structural and electronic factors, the correlations observed between domination-based energy parameters and physicochemical properties are meaningful. Therefore, the Quantitative Structure-Property Relationship (QSPR) relationships developed in this study are not merely empirical; they are grounded in the intrinsic connection between molecular topology, electronic interactions, and macroscopic physico-chemical behavior. This provides a mechanistic rationale for the predictive capability of domination-based descriptors within the QSPR framework.

Let us consider a molecular graph G of Naphthalene (Ref. Fig. 2) having domination number of 3 with a minimal dominating set as Inline graphic whose dominating adjacency matrix as

graphic file with name d33e541.gif

using this matrix, the respective dominating energy (DE) is calculated as 14.14553. Similarly, its total domination number is 6 with a minimal total dominating set

Fig. 2.

Fig. 2

Molecular graph of Naphthalene.

Inline graphic having TDE as 14.29253, Inline graphic with a minimal connected dominating set as Inline graphic having Inline graphic and Inline graphic with a minimal hop dominating set as Inline graphic having Inline graphic. With the help of MATLAB Software, we compute the eigenvalues and their respective energies of a 35 benzenoid hydrocarbons mentioned in Section 2.

Further to the computation of dominating energies, motivated by the idea of Estrada index, we introduce the Estrada-type energy variants, namely

Estrada Dominating Energy(EDE), Estrada Total Dominating Energy(ETDE), Estrada Connected Dominating Energy(ECDE), and Estrada Hop Dominating Energy(EHDE). For the sake of brevity, let us see the precise definition of the Estrada dominating energy of a graph as follows:

The Estrada dominating energy (EDE) of a graph G is the sum of the exponential of the eigenvalues of a dominating matrix of G concerning a minimal dominating set. That is, Inline graphic, where Inline graphic, Inline graphic are the eigen values of the dominating matrix of G. Similarly, the Estarda version of total dominating energy, connected dominating energy, and hop dominating energy can be defined. Emboldened by the idea of resolvent energy given by Ivan Gutman, we introduce a resolvent version of these dominating energies. For instance, we give the definition of resolvent dominating energy as follows: Inline graphic, where Inline graphic, Inline graphic are the eigenvalues of the dominating matrix of G. Similarly, the resolvent total dominating energy (TDER), resolvent connected dominating energy (CDER), and resolvent hop dominating energy (HDER) can be defined.

Finding all eigenvalues (respectively energies) in a graph with more than 5 nodes manually is challenging. Therefore, the following algorithm  1 is used to compute various domination-based energies of 35 BHCs, which has a total running time of Inline graphic, where n is the order of a graph, which includes plotting of the graph and computing various domination-based energies.

After obtaining the computed values of the domination-based energies for 35 benzenoid hydrocarbons, we will proceed to regression analysis to identify the most effective predictors for the selected physico-chemical properties and the Inline graphic-electron energy of these benzenoid hydrocarbons. To achieve this, we will utilize the linear, quadratic, cubic, and multiple linear regression models given in Eqs. (3), (4),(5), (6).

graphic file with name d33e683.gif 3
graphic file with name d33e687.gif 4
graphic file with name d33e691.gif 5
graphic file with name d33e695.gif 6

We perform linear, quadratic, cubic, and multiple linear regression analyses employing the data analysis tools in Microsoft Excel. To assess the effectiveness of the parameters, we measure the statistical metrics, the squared correlation coefficient Inline graphic with minimum RMSE, and the level of significance p-value (Inline graphic.

Algorithm 1.

Algorithm 1

Computation of dominating, Estrada dominating and Resolvent dominating energy variants.

Results and discussions

Dominating energies, Estrada versions, Resolvent versions and physico-chemical Properties

This subsection contains the experimental values of some physico-chemical properties of benzenoid hydrocarbons such as total Inline graphic electron energy Inline graphic, Boiling Point (BP), Enthalpy of Vapourization (EV), Flash Point (FP), Molar Refractivity (MR), LogP, Polarizability(P), Molar Volume (MV), Molecular Weight(MW), Inline graphic, Exact Mass(EM), Complexity(C) and Index of Refraction(IR) as shown in Table 4. Dominating Energy(DE), Total Dominating Energy(TDE), Connected Dominating Energy(CDE), and Hop Dominating Energy (HDE) of aforesaid 35 benzenoid hydrocarbons, their Estrada and Resolvent versions are given in Table 1, Table 2, and Table 3.

Table 4.

Experimental values of physico-chemical properties of benzenoid hydrocarbons.

Benzenoid Hydrocarbons Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Naphthalene 13.68324 221.5 43.8 78.9 44.1 3.45 17.5 123.5 128.17 3.3 128.063 80.6 1.632
Phenanthrene 19.44825 337.4 55.8 146.6 61.9 4.68 24.6 157.7 178.23 4.5 178.078 174 1.715
Anthracene 19.31371 337.4 55.8 146.6 61.9 4.68 24.6 157.7 178.23 4.4 178.078 154 1.715
Chrysene 25.19226 448 67.9 209.1 79.8 5.91 31.6 191.8 228.3 5.7 228.094 264 1.771
Tetraphene 25.10124 436.7 66.7 209.1 79.8 5.91 31.6 191.8 228.3 5.8 228.094 294 1.771
Triphenylene 25.27447 425 65.3 209.1 79.8 5.91 31.6 191.8 228.3 4.9 228.094 217 1.771
Naphthacene 24.93082 436.7 66.7 209.1 79.8 5.91 31.6 191.8 228.3 5.9 228.094 236 1.771
Benzo[a]pyrene 28.222 495 73.4 228.6 90.3 6.4 35.8 196.1 252.3 6 252.094 372 1.887
Benzo[e]pyrene 28.33605 467.5 70.2 228.6 90.3 6.4 35.8 196.1 252.3 6.4 252.094 336 1.887
Perylene 28.24531 467.5 70.2 228.6 90.3 6.4 35.8 196.1 252.3 5.8 252.094 304 1.887
Anthanthrene 31.25883 497.1 73.6 247.2 100.8 6.89 40 200.4 276.3 6.7 276.094 411 2.009
Benzo[ghi]perylene 31.42507 501 74.1 247.2 100.8 6.89 40 200.4 276.3 6.6 276.094 411 2.009
Dibenz[a,h]anthracene 30.88051 524.7 76.9 264.5 97.6 7.14 38.7 225.9 278.3 6.5 278.11 361 1.812
Dibenz[a,j]anthracene 30.87948 524.7 76.9 264.5 97.6 7.14 38.7 225.9 278.3 6.5 278.11 363 1.812
Picene 30.94323 519 76.2 264.5 97.6 7.14 38.7 225.9 278.3 7 278.11 361 1.812
Coronene 34.57184 525.6 77 265.2 111.4 7.38 44.1 204.7 300.4 7.2 300.094 376 2.14
Benzo[c]phenanthrene 25.18749 436.7 66.7 209.1 79.8 5.91 31.6 191.8 228.3 5.7 228.094 266 1.771
Pyrene 22.50546 404 63 168.8 72.5 5.17 28.7 162 202.25 4.9 202.078 217 1.852
Dibenzo[a,e]pyrene 34.06463 552.3 80.2 282 108.1 7.63 42.9 230.2 302.4 7.3 302.11 480 1.913
Dibenzo[a,h]pyrene 35.01609 552.3 80.2 282 108.1 7.63 42.9 230.2 302.4 7 302.11 436 1.913
Dibenzo[a,i]pyrene 33.95415 552.3 80.2 282 108.1 7.63 42.9 230.2 302.4 7.3 302.11 436 1.913
Dibenzo[a,l]pyrene 34.03074 552.3 80.2 282 108.1 7.63 42.9 230.2 302.4 7.2 302.11 480 1.913
Peropyrene 37.08962 579 83.4 298.8 118.7 8.12 47 234.5 326.4 7.9 326.1096 470 2.018
Ovalene 46.49743 641.1 94.6 341.5 150.3 5.23 59.6 247.5 398.5 10 398.1096 696 2.374
Heptacene 41.76752 677 95.8 360.6 133.3 9.6 52.8 294.1 378.5 8.6 378.1409 516 1.867
Bisanthene 40.07786 604.8 86.6 315.2 129.2 8.62 51.2 238.8 350.4 8 350.1096 542 2.13
Dibenzo[a,l]pentacene 42.13737 677 95.8 360.6 133.3 9.6 52.8 294.1 378.5 9 378.1409 568 1.867
Dibenzo[a,l]tetracene 36.5169 604.1 86.5 314.6 115.5 8.37 45.8 260 328.4 7.7 328.1252 464 1.843
Naphtho[2,3-a]pyrene 33.86343 552.3 80.2 282 108.1 7.63 42.9 230.2 302.4 7.3 302.1096 480 1.913
Naphtho[8,1,2-bcd]perylene 37.07919 579 83.4 298.8 118.7 8.12 47 234.5 326.4 7.6 326.1096 564 2.018
Tribenzo[b,n,pqr]perylene 43.06532 653.8 92.8 346 136.5 9.35 54.1 268.6 376.4 7.5 376.1252 622 2.025
Tribenzo[a,fg,op]tetracene 39.8883 629.3 89.7 330.5 126 8.86 49.9 264.3 352.4 8.9 352.1252 593 1.932
Pyreno[4,5-e]pyrene 42.91389 653.8 92.8 346 136.5 9.35 54.1 268.6 376.4 8.9 376.1252 566 2.025
Dibenzo[c,p]chrysene 36.72852 604.1 86.5 314.6 115.5 8.37 45.8 260 328.4 8.2 328.1252 508 1.843
Dibenzo[h,rst]pentaphene 39.80515 629.3 89.7 330.5 126 8.86 49.9 264.3 352.4 8.3 352.1252 541 1.932

Table 1.

Various dominating energies of benzenoid hydrocarbons.

Benzenoid Hydrocarbons Inline graphic Inline graphic Inline graphic Inline graphic
Naphthalene 14.14553 14.29253 14.21946 13.93802
Phenanthrene 20.02900 20.00899 20.00899 19.48162
Anthracene 19.96785 19.97509 20.21946 19.92484
Chrysene 25.65454 26.05585 25.90934 25.70146
Tetraphene 24.48823 24.00401 26.04673 24.05752
Triphenylene 26.23916 26.11814 26.01396 25.9844
Naphthacene 25.70894 26.02029 26.19696 25.46996
Benzo[a]pyrene 29.17721 29.54633 29.15970 28.75643
Benzo[e]pyrene 29.26951 29.33715 29.27007 29.12181
Perylene 29.22411 29.64711 29.53962 28.74621
Anthanthrene 32.17127 32.81920 32.40955 31.79821
Benzo[ghi]perylene 32.25418 32.70785 32.41700 32.27813
Dibenz[a,h]anthracene 31.73627 31.88252 31.80070 31.38626
Dibenz[a,j]anthracene 31.93031 32.50403 31.88053 31.50966
Picene 32.03420 31.95476 32.29014 31.45075
Coronene 35.36022 35.91710 31.81428 35.39362
Benzo[c]phenanthrene 25.92649 26.44518 25.92478 25.72919
Pyrene 23.36092 23.75894 23.17634 23.06131
Dibenzo[a,e]pyrene 35.09856 35.15906 35.19604 34.91243
Dibenzo[a,h]pyrene 35.01609 35.14323 35.13200 34.86737
Dibenzo[a,i]pyrene 35.04356 35.54709 35.11893 34.76311
Dibenzo[a,l]pyrene 35.11977 35.19661 35.20418 34.82597
Peropyrene 38.44123 38.57915 38.56967 37.94622
Ovalene 47.84893 48.37911 48.10101 47.49065
Heptacene 42.87842 43.91203 43.26110 42.98885
Bisanthene 41.37296 42.00453 41.83565 41.09883
Dibenzo[a,l]pentacene 43.29852 43.67487 44.03088 42.97023
Dibenzo[a,l]tetracene 37.52532 37.76280 37.87964 37.35005
Naphtho[2,3-a]pyrene 34.94097 34.96789 35.10226 34.57653
Naphtho[8,1,2-bcd]perylene 38.40593 38.58730 38.49761 37.88938
Tribenzo[b,n,pqr]perylene 44.39448 44.62701 44.52155 43.87773
Tribenzo[a,fg,op]tetracene 41.08300 41.26112 41.34210 40.95246
Pyreno[4,5-e]pyrene 44.27678 44.62139 44.53261 44.38665
Dibenzo[c,p]chrysene 37.94690 38.02896 38.02535 37.55958
Dibenzo[h,rst]pentaphene 40.99070 41.01379 41.10991 40.58322

Table 2.

Dominating energies with respect to the Estrada version.

Benzenoid Hydrocarbons Inline graphic Inline graphic Inline graphic Inline graphic
Naphthalene 36.56473 49.59962 50.80898 34.66148
Phenanthrene 52.38371 72.23319 72.23319 52.53371
Anthracene 52.38722 70.29463 74.82321 52.01804
Chrysene 68.82106 93.44150 93.84929 66.87952
Tetraphene 61.01630 87.88976 97.74443 61.94124
Triphenylene 70.79450 89.19469 100.64750 74.96603
Naphthacene 68.18121 90.63103 96.56156 66.71444
Benzo[a]pyrene 78.49290 98.26882 107.02520 73.49382
Benzo[e]pyrene 77.37778 99.54656 108.51760 78.55569
Perylene 77.37321 95.22381 111.85470 73.58340
Anthanthrene 86.13822 104.19570 120.25160 80.10396
Benzo[ghi]perylene 86.14930 105.18170 120.88580 86.09410
Dibenz[a,h]anthracene 84.01095 113.02970 119.39000 77.95799
Dibenz[a,j]anthracene 87.73030 109.41620 114.28300 83.22039
Picene 86.66966 112.52800 123.57920 77.96853
Coronene 94.93797 120.73110 115.41620 94.07581
Benzo[c]phenanthrene 68.29182 89.42645 92.58968 65.64847
Pyrene 57.55509 77.39763 81.46004 59.65623
Dibenzo[a,e]pyrene 94.33057 121.87510 131.51900 89.48429
Dibenzo[a,h]pyrene 94.31972 121.30650 132.59050 89.64652
Dibenzo[a,i]pyrene 93.18162 114.73530 132.71560 92.02295
Dibenzo[a,l]pyrene 93.19735 118.94410 137.32910 90.86382
Peropyrene 102.35170 129.05950 140.74280 98.70259
Ovalene 130.22780 153.93550 176.18180 118.45890
Heptacene 110.87530 154.64850 163.20040 107.28670
Bisanthene 111.15200 140.96510 162.08420 102.17040
Dibenzo[a,l]pentacene 115.62740 156.87900 164.30530 108.91700
Dibenzo[a,l]tetracene 99.81919 135.86920 139.72700 96.60709
Naphtho[2,3-a]pyrene 94.30381 122.80540 127.35850 88.10015
Naphtho[8,1,2-bcd]perylene 104.55080 128.60190 140.77120 98.86585
Tribenzo[b,n,pqr]perylene 118.18930 145.61020 169.17950 112.35050
Tribenzo[a,fg,op]tetracene 109.03160 138.75400 155.90940 110.45310
Pyreno[4,5-e]pyrene 119.30470 143.05100 163.80830 115.16590
Dibenzo[c,p]chrysene 102.43490 131.97460 143.88860 95.27327
Dibenzo[h,rst]pentaphene 109.01620 140.03690 157.05990 104.41130

Table 3.

Dominating energies with respect to the Resolvent version.

Benzenoid Hydrocarbons Inline graphic Inline graphic Inline graphic Inline graphic
Naphthalene 1.05873 1.09482 1.09497 1.047341
Phenanthrene 1.03476 1.05770 1.05770 1.03411
Anthracene 1.03476 1.05760 1.06329 1.03469
Chrysene 1.02699 1.04081 1.04081 1.020745
Tetraphene 1.02001 1.04589 1.04415 1.02001
Triphenylene 1.02734 1.04071 1.04745 1.02741
Naphthacene 1.02404 1.04074 1.04411 1.02074
Benzo[a]pyrene 1.02216 1.03290 1.03564 1.01684
Benzo[e]pyrene 1.02214 1.03292 1.03827 1.02214
Perylene 1.022136 1.03285 1.03831 1.01684
Anthanthrene 1.01832 1.02711 1.03157 1.01394
Benzo[ghi]perylene 1.01832 1.02717 1.03157 1.01831
Dibenz[a,h]anthracene 1.01813 1.03133 1.03354 1.01375
Dibenz[a,j]anthracene 1.02032 1.03130 1.03134 1.01596
Picene 1.02031 1.03132 1.03574 1.01375
Coronene 1.01542 1.02642 1.03135 1.02071
Benzo[c]phenanthrene 1.02404 1.04074 1.04078 1.02401
Pyrene 1.02655 1.04380 1.04392 1.02661
Dibenzo[a,e]pyrene 1.01708 1.02628 1.02628 1.01523
Dibenzo[a,h]pyrene 1.01708 1.02628 1.02996 1.01524
Dibenzo[a,i]pyrene 1.01707 1.02624 1.02996 1.01525
Dibenzo[a,l]pyrene 1.01707 1.02627 1.03179 1.01524
Peropyrene 1.01609 1.02236 1.02548 1.01301
Ovalene 1.01166 1.01674 1.01979 1.00860
Heptacene 1.01079 1.02124 1.02463 1.00964
Bisanthene 1.01389 1.02191 1.02594 1.00989
Dibenzo[a,l]pentacene 1.01196 1.02124 1.02355 1.00964
Dibenzo[a,l]tetracene 1.01445 1.02536 1.02690 1.01289
Naphtho[2,3-a]pyrene 1.017081 1.02629 1.028122 1.01342
Naphtho[8,1,2-bcd]perylene 1.01611 1.02236 1.02548 1.01301
Tribenzo[b,n,pqr]perylene 1.01318 1.01890 1.02360 1.00972
Tribenzo[a,fg,op]tetracene 1.01379 1.02182 1.02583 1.01380
Pyreno[4,5-e]pyrene 1.01318 1.01898 1.02247 1.01202
Dibenzo[c,p]chrysene 1.01599 1.02534 1.02846 1.01289
Dibenzo[h,rst]pentaphene 1.01379 1.02182 1.02584 1.01114

Linear regression analysis

Using the linear regression model given in Eq. (3), we determine the linear relation between 12 physico-chemical properties as well as the Inline graphic-electron energy and 12 dominating energy parameters.

Based on the data provided in Tables 1, 2, 3, and 4, we obtain Inline graphic values for the linear regression model, and the values are listed in Table 5. Note that the maximum Inline graphic obtained against each property is displayed in bold.

Table 5.

The square of the correlation coefficient obtained by the linear regression model.

Predictor/Property Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
DE 0.99839 0.95588 0.97014 0.9622 0.99781 0.99783 0.8699 0.99544 0.93907 0.99544 0.94857
TDE 0.99663 0.95369 0.96831 0.95869 0.99721 0.99721 0.86482 0.99381 0.93621 0.99381 0.94237
CDE 0.99181 0.95811 0.97355 0.96485 0.98986 0.98999 0.88613 0.99271 0.93948 0.99273 0.95512
HDE 0.99814 0.95539 0.96972 0.96161 0.99761 0.99762 0.86984 0.99527 0.93995 0.99527 0.94449
EDE 0.99219 0.94328 0.95754 0.95252 0.99137 0.99155 0.85376 0.98665 0.93367 0.98666 0.94742
ETDE 0.97526 0.9715 0.98202 0.97927 0.96916 0.96898 0.93118 0.98592 0.93532 0.98589 0.90887
ECDE 0.98089 0.95164 0.96443 0.96213 0.97685 0.97708 0.89041 0.98234 0.91879 0.98327 0.94322
EHDE 0.98122 0.94076 0.9502 0.94885 0.97905 0.97889 0.85352 0.97671 0.91835 0.97671 0.92405
DER 0.77086 0.84416 0.81458 0.82654 0.76606 0.76582 0.72311 0.77649 0.75093 0.77648 0.7128
TDER 0.79413 0.84794 0.81827 0.82985 0.79031 0.79039 0.70179 0.79135 0.75188 0.79138 0.74675
CDER 0.78483 0.85198 0.82284 0.82823 0.7802 0.78018 0.70997 0.78502 0.75853 0.78503 0.74981
HDER 0.79701 0.873 0.8498 0.85709 0.79117 0.79126 0.77015 0.80701 0.77315 0.80708 0.77074

In the linear regression model, the properties Inline graphic-electron energy Inline graphic, MR, P, MW, and EM are notably well predicted by DE , whereas properties like BP, EV, FP and MV are predicted by Estarda version dominating parameter ETDE. The most suitable predictor for the properties Inline graphic and C is the dominating energy parameter HDE and CDE. The other two properties, LogP and IR, are not predicted by any of these 12 dominating parameters in the linear regression model.

graphic file with name d33e3725.gif 7
graphic file with name d33e3729.gif 8
graphic file with name d33e3733.gif 9
graphic file with name d33e3737.gif 10
graphic file with name d33e3741.gif 11
graphic file with name d33e3745.gif 12
graphic file with name d33e3749.gif 13
graphic file with name d33e3753.gif 14
graphic file with name d33e3757.gif 15
graphic file with name d33e3762.gif 16
graphic file with name d33e3766.gif 17

Quadratic regression analysis

Using the quadratic regression model described in Eq.  4, we conduct a comprehensive analysis of 12 physico-chemical properties alongside the Inline graphic-electron energy of 35 benzenoid hydrocarbons. Table 7 presents the Inline graphic values for each dominating energy parameter, illustrating their predictive accuracy for the properties. The parameters with the highest Inline graphic values are highlighted.

Table 7.

The square of the correlation coefficient obtained by the quadratic regression model.

Predictor/Property Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
DE 0.99839 0.97237 0.97508 0.97465 0.99789 0.99791 0.87529 0.99551 0.93925 0.99552 0.94861
TDE 0.99664 0.9703 0.97331 0.97109 0.99727 0.99727 0.87008 0.99389 0.93641 0.99389 0.94237
CDE 0.9921 0.97782 0.98036 0.98032 0.9899 0.99003 0.89294 0.99323 0.94032 0.99325 0.95522
HDE 0.99816 0.97202 0.97478 0.97422 0.99766 0.99766 0.87509 0.99537 0.94013 0.99537 0.9445
EDE 0.99225 0.95893 0.96193 0.96475 0.99174 0.99192 0.86107 0.98665 0.93367 0.98666 0.948
ETDE 0.97573 0.98355 0.98531 0.98845 0.96395 0.96919 0.93148 0.98614 0.93613 0.98611 0.90999
ECDE 0.98102 0.96763 0.96955 0.97507 0.97685 0.97709 0.89489 0.98348 0.91987 0.98351 0.94323
EHDE 0.98162 0.9511 0.95239 0.95621 0.97988 0.97971 0.8555 0.9769 0.91861 0.97689 0.92532
DER 0.94063 0.94705 0.94304 0.94507 0.93916 0.93891 0.86027 0.94605 0.90829 0.94602 0.88893
TDER 0.95418 0.93063 0.92438 0.9265 0.95485 0.95514 0.79875 0.94425 0.88981 0.94427 0.93053
CDER 0.94245 0.93797 0.93345 0.92731 0.94178 0.942 0.81558 0.93786 0.90182 0.93786 0.93494
HDER 0.86917 0.90319 0.92438 0.89651 0.866 0.86636 0.82946 0.88033 0.8315 0.88041 0.85729

In the quadratic regression model, the parameter DE is a better predictor for the properties Inline graphic, MR, P, MW and EM. Additionally, the parameter ETDE serves as the best predictor for the properties BP, EV, FP, and MV. Meanwhile, the parameter CDE provides a better prediction for the properties Inline graphic and C. The other two properties, LogP and IR, are not predicted in quadratic analysis by any of these 12 dominating parameters.

graphic file with name d33e4569.gif 18
graphic file with name d33e4574.gif 19
graphic file with name d33e4578.gif 20
graphic file with name d33e4582.gif 21
graphic file with name d33e4586.gif 22
graphic file with name d33e4590.gif 23
graphic file with name d33e4594.gif 24
graphic file with name d33e4598.gif 25
graphic file with name d33e4602.gif 26
graphic file with name d33e4606.gif 27
graphic file with name d33e4610.gif 28

Cubic regression analysis

Using the cubic regression model described in Eq.  5, we present an extensive analysis of the physico-chemical characteristics of 12 and the electron energy Inline graphic of 35 benzenoid hydrocarbons. The Inline graphic values for each dominating energy parameter are shown in Table 9, demonstrating the precision with which they predict properties. The parameters with the highest Inline graphic values are highlighted.

Table 9.

The square of the correlation coefficient obtained by the cubic regression model.

Predictor/Property Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
DE 0.99841 0.97237 0.97521 0.97493 0.99807 0.84393 0.99808 0.87981 0.99553 0.93951 0.99553 0.94930
TDE 0.99665 0.97030 0.97345 0.97133 0.9974 0.84129 0.99739 0.87420 0.99391 0.93669 0.99391 0.94335
CDE 0.99210 0.97793 0.98036 0.98106 0.99000 0.85179 0.99012 0.89983 0.99332 0.94044 0.99335 0.95713
HDE 0.99818 0.97203 0.97498 0.97440 0.99784 0.83995 0.99784 0.87879 0.99538 0.94051 0.99538 0.94541
EDE 0.99227 0.95895 0.96196 0.96515 0.99194 0.84721 0.99211 0.86811 0.98670 0.93436 0.98671 0.94833
ETDE 0.97589 0.98489 0.98687 0.98875 0.96940 0.80256 0.96925 0.93243 0.98614 0.93614 0.98611 0.91577
ECDE 0.98133 0.96785 0.97011 0.97509 0.97763 0.81802 0.97783 0.8976 0.98361 0.91990 0.98364 0.94428
EHDE 0.98165 0.95123 0.95285 0.95628 0.98002 0.79765 0.97983 0.85685 0.97690 0.91940 0.97690 0.92567
DER 0.95362 0.95331 0.95479 0.95189 0.95336 0.75343 0.95315 0.87737 0.96121 0.91845 0.96117 0.89242
TDER 0.97851 0.93739 0.93756 0.93420 0.98181 0.72344 0.98221 0.80690 0.96697 0.91070 0.96698 0.95393
CDER 0.96298 0.94395 0.9453 0.93500 0.96445 0.71371 0.96484 0.82514 0.95785 0.92037 0.95784 0.95162
HDER 0.88853 0.91408 0.91341 0.90755 0.88746 0.69216 0.88783 0.84295 0.90047 0.84146 0.90054 0.86509
graphic file with name d33e5393.gif 29
graphic file with name d33e5398.gif 30
graphic file with name d33e5403.gif 31
graphic file with name d33e5408.gif 32
graphic file with name d33e5413.gif 33
graphic file with name d33e5419.gif 34
graphic file with name d33e5424.gif 35
graphic file with name d33e5429.gif 36
graphic file with name d33e5434.gif 37
graphic file with name d33e5439.gif 38
graphic file with name d33e5444.gif 39
graphic file with name d33e5450.gif 40

In cubic regression analysis, the parameter DE is a better predictor for the properties Inline graphic, MR, P, MW and EM. Additionally, the parameter ETDE serves as the best predictor for the properties BP, EV, FP and MV. Meanwhile, the parameters HDE and CDE provide better predictions for the properties Inline graphic and C. Notably, the property LogP is predicted only on this cubic regression model.

Based on the results from the linear, quadratic and cubic regression analyses presented in Tables  6,  8 and  10, we observe that the parameter DE effectively predicts the properties Inline graphic, MR, P, MW and EM while CDE serves as the best predictor for the property C in the linear, quadratic and cubic regression model. Additionally, in the linear and cubic regression model, HDE is the best predictor for the property Inline graphic, whereas LogP is predicted in the cubic regression model alone by a dominating energy parameter CDE. Furthermore, the dominating energy parameter ETDE most accurately predicts the properties BP, EV, FP, and MV. These optimal predictive fits resulting from our analyses are illustrated in Figs.  3, 4, and 5.

Table 6.

Best predictive fits from linear regression model.

Property Curve equation Predictor Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Inline graphic (7) DE 0.99839 0.31183 0.17633 20404.94018 Significant
BP (8) ETDE 0.97151 17.83241 12.84763 1125.07983 Significant
EV (9) ETDE 0.98202 1.66777 1.27705 1802.62206 Significant
FP (10) ETDE 0.97927 9.67293 7.42094 1559.14398 Significant
MR (11) DE 0.99781 1.15489 0.76539 15049.21482 Significant
P (12) DE 0.99783 0.45522 0.30005 15194.28505 Significant
MV (13) ETDE 0.93118 10.4247 7.65217 446.52845 Significant
MW (14) DE 0.99544 4.44527 3.09808 7203.47938 Significant
Inline graphic (15) HDE 0.93995 0.36619 0.24470 516.54263 Significant
EM (16) DE 0.99544 4.43945 3.09724 7209.75157 Significant
C (17) CDE 0.95512 31.43366 24.84534 702.31417 Significant

Table 8.

Best predictive fits from quadratic regression model.

Property Curve equation Predictor Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Inline graphic (18) DE 0.99839 0.31620 0.18622 9922.25197 Significant
BP (19) ETDE 0.98355 13.75895 10.69668 956.65258 Significant
EV (20) ETDE 0.98531 1.53093 1.22181 1073.06986 Significant
FP (21) ETDE 0.98845 7.33133 5.50324 1369.80396 Significant
MR (22) DE 0.99789 1.15219 0.75936 7560.51507 Significant
P (23) DE 0.99791 0.45434 0.30069 7627.28634 Significant
MV (24) ETDE 0.93148 10.56373 7.54743 217.49485 Significant
MW (25) DE 0.99551 4.47754 3.00560 3550.28906 Significant
Inline graphic (26) CDE 0.94032 0.37072 0.24192 252.09312 Significant
EM (27) DE 0.99552 4.47135 2.99176 3553.88481 Significant
C (28) CDE 0.95522 31.88665 25.04629 341.28532 Significant

Table 10.

Best predictive fits from cubic regression model.

Property Curve equation Predictor Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Inline graphic (29) DE 0.99841 0.31916 9.05864 6492.90693 Significant
BP (30) ETDE 0.98489 13.39839 84.37971 673.47040 Significant
EV (31) ETDE 0.98687 1.47056 32.17270 776.54949 Significant
FP (32) ETDE 0.98875 7.35354 75.24060 908.21434 Significant
MR (33) DE 0.99807 1.11914 2.18840 5343.41646 Significant
LogP (34) CDE 0.85179 0.61365 22.17382 59.38852 Significant
P (35) DE 0.99808 0.44223 1.08478 5368.01209 Significant
MV (36) ETDE 0.93243 10.65802 80.56005 142.58791 Significant
MW (37) DE 0.99553 4.54259 24.80174 2299.58485 Significant
Inline graphic (38) HDE 0.94051 0.37606 1.69507 163.35636 Significant
EM (39) DE 0.99553 4.53621 3.38478 2302.01627 Significant
C (40) CDE 0.95713 31.69644 24.55696 230.72418 Significant

Fig. 3.

Fig. 3

Linear regression curves for MW and EM against DE, C against CDE, Inline graphic against HDE and MV against ETDE.

Fig. 4.

Fig. 4

Quadratic regression curves for Inline graphic against DE and FP against ETDE.

Fig. 5.

Fig. 5

Cubic regression curves for MR and P against DE, BP and EV against ETDE and LogP against CDE.

Multiple linear regression analysis

In this subsection, we extend our analysis from linear regression to multiple linear regression, examining the effectiveness of the selected parameters in predicting the 13 properties under consideration.

graphic file with name d33e6055.gif 41
graphic file with name d33e6059.gif 42
graphic file with name d33e6063.gif 43
graphic file with name d33e6067.gif 44
graphic file with name d33e6071.gif 45
graphic file with name d33e6075.gif 46
graphic file with name d33e6079.gif 47
graphic file with name d33e6083.gif 48
graphic file with name d33e6087.gif 49
graphic file with name d33e6091.gif 50
graphic file with name d33e6096.gif 51
graphic file with name d33e6100.gif 52

The statistical measures obtained from the multiple linear regression analysis are summarized in Table 11.

Table 11.

Multiple linear regression model.

Property Curve equation Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphicvalue
Inline graphic (41) 0.9991 0.9990 0.2412 0.1412 11378.5012 Significant
BP (42) 0.9919 0.9909 9.9503 7.3470 922.3792 Significant
EV (43) 0.9932 0.9921 1.0907 65.3728 852.531 Significant
FP (44) 0.9937 0.9926 5.7055 3.7977 909.4549 Significant
MR (45) 0.9988 0.9986 0.9019 0.8050 6174.9556 Significant
P (46) 0.9988 0.9986 0.3584 0.2757 6133.3447 Significant
MV (47) 0.9620 0.9570 8.1194 6.0297 190.1198 Significant
MW (48) 0.9996 0.9994 1.53337 1.0020 8684.3976 Significant
Inline graphic (49) 0.9395 0.9377 0.3676 0.2392 512.3178 Significant
EM (50) 0.9996 0.9994 1.5357 0.9963 8642.3407 Significant
C (51) 0.9657 0.9598 29.2988 20.4776 163.4755 Significant
IR (52) 0.8404 0.8063 0.0622 0.8337 24.5819 Significant

Regression analysis using topological indices

In this section, we perform a QSPR analysis using well-known topological indices, namely the first and second Zagreb indices, forgotten index (F index), RandiInline graphic index and Wiener index. The topological indices considered in our study are presented in the following Table 12.

Table 12.

List of topological indices.

S.no Name of the Index Formula
1 First Zagreb Index (Inline graphic) Inline graphic
2 Second Zagreb Index (Inline graphic) Inline graphic
3 Forgotten Index (F-index) Inline graphic
4 Randić Index (R) Inline graphic
5 Wiener Index (W) Inline graphic

For brevity, only the results from the quadratic and cubic regression models are presented in the following discussion, despite conducting linear, quadratic, and cubic regression analyses (Refer Table 13, Table 15).

Table 13.

The square of the correlation coefficient obtained by the quadratic regression model for topological indices.

Predictor/Property Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Inline graphic 0.99131 0.95179 0.95178 0.95066 0.99598 0.99602 0.82623 0.98078 0.93067 0.98077 0.94808
Inline graphic 0.97925 0.93313 0.93016 0.93022 0.98564 0.98572 0.79394 0.96347 0.91355 0.96346 0.94371
F 0.98231 0.93806 0.93601 0.93512 0.98886 0.98894 0.80134 0.96815 0.91923 0.96813 0.943
R 0.99822 0.98141 0.988447 0.98473 0.99614 0.99609 0.89731 0.99961 0.94432 0.99961 0.94475
W 0.97228 0.9708 0.97909 0.98124 0.96413 0.96422 0.94338 0.98368 0.93005 0.98369 0.92166

Table 15.

The square of the correlation coefficient obtained by the cubic regression model for topological indices.

Predictor/Property Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Inline graphic 0.99197 0.95272 0.95226 0.95269 0.99627 0.88295 0.99632 0.8392 0.98223 0.93071 0.98222 0.94969
Inline graphic 0.98052 0.93418 0.93089 0.93241 0.98646 0.88601 0.98653 0.8078 0.96577 0.91355 0.96576 0.94564
F 0.98355 0.93916 0.93676 0.93737 0.98964 0.8891 0.98971 0.81535 0.97041 0.91925 0.97039 0.94491
R 0.99823 0.98146 0.9848 0.98484 0.99625 0.8324 0.99618 0.89919 0.99961 0.94442 0.99961 0.94647
W 0.97228 0.98262 0.98525 0.99036 0.96413 0.80957 0.96417 0.9532 0.98408 0.93088 0.98409 0.92514

The properties C and MV are better predicted by the parameters Inline graphic and W, while the parameter R predicts all other properties in quadratic regression analysis (Refer Tabel 14).

graphic file with name d33e6788.gif 53
graphic file with name d33e6792.gif 54
graphic file with name d33e6796.gif 55
graphic file with name d33e6800.gif 56
graphic file with name d33e6804.gif 57
graphic file with name d33e6808.gif 58
graphic file with name d33e6812.gif 59
graphic file with name d33e6816.gif 60
graphic file with name d33e6820.gif 61
graphic file with name d33e6825.gif 62
graphic file with name d33e6829.gif 63

Table 14.

Best predictive fits from quadratic regression model for topological indices.

Property Curve equation Predictor Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Inline graphic (53) R 0.99822 0.31775 0.21352 8982.03992 Significant
BP (54) R 0.98141 13.98473 10.74680 944.80009 Significant
EV (55) R 0.98845 1.50501 1.18610 1014.31181 Significant
FP (56) R 0.98473 8.06259 5.98945 1031.61281 Significant
MR (57) R 0.99614 1.48862 1.10645 4133.87508 Significant
P (58) R 0.99609 0.59311 0.43983 4072.52056 Significant
MV (59) W 0.94338 9.18168 71.40028 266.58515 Significant
MW (60) R 0.99961 1.26598 0.91693 40771.15712 Significant
Inline graphic (61) R 0.94432 0.34240 0.22437 271.33784 Significant
EM (62) R 0.99961 1.26329 0.91472 40873.34864 Significant
C (63) Inline graphic 0.94808 32.83100 24.18711 292.13897 Significant

In cubic regression analysis, the parameter Inline graphic is a better predictor for the properties MR, P, and C. Additionally, the parameter W serves as the best predictor for the properties BP, EV, FP and MV. Meanwhile, the parameter R provides better prediction for the properties Inline graphic, MW, Inline graphic , and EM. Notably, the property LogP is predicted only on this cubic regression model by the parameter F (Refer Table 16).

Table 16.

Best predictive fits from cubic regression model for topological indices.

Property Curve equation Predictor Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
Inline graphic (64) R 0.99823 0.31744 0.23098 5812.23611 Significant
BP (65) W 0.98262 13.52456 49.71794 584.07454 Significant
EV (66) W 0.98525 1.46698 24.91537 690.03105 Significant
FP (67) W 0.99036 6.40680 19.20082 1061.15736 Significant
MR (68) Inline graphic 0.99627 1.46341 13.94508 2762.90645 Significant
LogP (69) F 0.88910 0.49957 26.08164 82.844430 Significant
P (70) Inline graphic 0.99632 0.57626 220.33429 2795.14761 Significant
MV (71) W 0.95320 8.35644 73.29516 209.99591 Significant
MW (72) R 0.99961 1.26357 0.92537 26431.8255 Significant
Inline graphic (73) R 0.94442 0.34210 0.22241 175.572110 Significant
EM (74) R 0.99961 1.26076 0.91823 26503.2185 Significant
C (75) Inline graphic 0.94969 32.31805 111.97250 195.040420 Significant
graphic file with name d33e7793.gif 64
graphic file with name d33e7797.gif 65
graphic file with name d33e7801.gif 66
graphic file with name d33e7805.gif 67
graphic file with name d33e7809.gif 68
graphic file with name d33e7813.gif 69
graphic file with name d33e7817.gif 70
graphic file with name d33e7821.gif 71
graphic file with name d33e7825.gif 72
graphic file with name d33e7829.gif 73
graphic file with name d33e7833.gif 74
graphic file with name d33e7838.gif 75

In comparing our results from the cubic regression model and the quadratic regression model, we found that the cubic regression model is the best fit for predicting nearly all the selected physico-chemical properties of BHCs. We will validate these best-fit results using k-fold cross-validation in the following sub section.

k-fold cross-validation

In this section, we perform a k-fold cross-validation using Algorithm 2 to examine the stability of the best predictive fits identified in regression models resulting from domination-based energy parameters as well as the topological indices, whose results are tabulated in Table 17 and Table 18.

Table 17.

5-fold cross-validation based on the best predictive model for Dominating energies.

Property Parameter Model Model Equation Inline graphic RMSE Inline graphic Inline graphic
Inline graphic DE Quadratic (18) 0.99839 0.31621 0.99820 0.31900
BP ETDE Cubic (30) 0.98489 13.39839 0.97950 14.6765
EV ETDE Cubic (31) 0.98687 1.47056 0.97320 1.97600
FP ETDE Quadratic (21) 0.98845 7.33133 0.98620 7.66240
MR DE Cubic (33) 0.99807 1.11914 0.99720 1.27260
LogP CDE Cubic (34) 0.85179 0.61365 0.42440 1.13810
P DE Cubic (35) 0.99808 0.44223 0.99740 0.48060
MV ETDE Linear (13) 0.93118 10.4247 0.91230 11.4261
MW DE Linear (14) 0.99544 4.44527 0.99440 4.79450
Inline graphic HDE Linear (15) 0.93995 0.36619 0.93370 0.37360
EM DE Linear (16) 0.99544 4.43945 0.99470 4.66660
C CDE Linear (17) 0.95512 31.43366 0.94890 32.5835

Table 18.

5-fold cross-validation based on the best predictive model for degree-based topological indices.

Property Parameter Model Model Equation Inline graphic RMSE Inline graphic Inline graphic
Inline graphic R Cubic (64) 0.99823 0.31744 0.99673 0.43123
BP R Cubic (65) 0.98262 13.52456 0.96924 17.99113
EV W Quadratic (55) 0.98447 1.50501 0.97622 1.86227
FP W Cubic (67) 0.99036 6.40680 0.98141 8.89551
MR Inline graphic Cubic (68) 0.99627 1.46341 0.99217 2.12117
LogP F Cubic (69) 0.88910 0.49957 0.43439 1.12821
P Inline graphic Cubic (70) 0.98632 0.57626 0.99362 0.75845
MV W Cubic (71) 0.95310 8.35644 0.94007 9.44609
MW R Cubic (72) 0.99610 1.26357 0.99916 1.85604
Inline graphic R Quadratic (61) 0.94442 0.34240 0.94075 0.35319
EM R Cubic (74) 0.99961 1.26076 0.99951 1.41159
C Inline graphic Cubic (75) 0.94969 32.31805 0.93360 37.12756

Algorithm 2.

Algorithm 2

Polynomial Regression with k-fold cross-validation.

From the 5-fold cross-validation summarised in Table 17 and Table 18, the risk of overfitting is observed for the property LogP, since Inline graphic.

External validation

To test the effectiveness of the best predictive models obtained in this study, we have done an external validation using 5 other benzenoid hydrocarbons, namely Benzo[b]chrysene(Inline graphic), Benzo[e]naphtho[2,3-a]pyrene(Inline graphic), Benzo[a]tetracene(Inline graphic), Dibenzo[a,c]anthracene(Inline graphic), and Dibenzopentaphene(Inline graphic). Their actual and predicted values, with their absolute residual error, are presented in Tables 1920 and  21.

Table 19.

Comparison of actual and predicted values of selected benzenoid hydrocarbons.

Benzenoid Hydrocarbons Actual Inline graphic Pred. Inline graphic Abs. Error Actual BP Pred. BP Abs. Error Actual EV Pred. EV Abs. Error Actual FP Pred. FP Abs. Error
Inline graphic 30.83900 31.00745 0.16846 524.70000 449.43641 75.26359 76.90000 50.82943 26.07057 264.50000 288.06040 23.56040
Inline graphic 39.70891 39.76686 0.16846 629.30000 481.43186 147.86814 89.70000 39.09981 50.60019 330.50000 332.66860 2.16860
Inline graphic 30.72564 31.25787 0.53223 557.50000 451.36267 106.13733 83.90000 50.56561 33.33439 204.10000 290.18360 86.08360
Inline graphic 30.94181 31.11381 0.17200 518.00000 455.38970 62.61030 76.10000 49.93258 26.16742 264.50000 279.05560 14.55560
Inline graphic 42.31638 42.21380 0.10258 677.00000 485.31288 191.68712 95.80000 27.03131 68.76869 360.60000 279.49600 81.10400

Table 20.

Comparison of actual and predicted values of selected benzenoid hydrocarbons(Cont.)

Benzenoid Hydrocarbons Actual MR Pred. MR Abs. Error Actual Log P Pred. Log P Abs. Error Actual P Pred. P Abs. Error Actual MV Pred. MV Abs. Error
Inline graphic 97.60000 99.91385 2.31385 7.14000 24.24906 17.10906 38.70000 39.77884 1.07884 225.90000 186.3995 39.5005
Inline graphic 126.00000 129.48627 3.48627 8.86000 44.68115 35.82115 49.90000 51.70556 1.80556 264.30000 115.7081 148.5919
Inline graphic 94.40000 100.72858 6.32858 5.85000 24.22746 18.37746 37.40000 40.10630 2.70630 227.30000 186.0331 41.2669
Inline graphic 97.60000 100.25976 2.65976 7.14000 24.32161 17.18161 38.70000 39.91786 1.21786 225.90000 187.3531 38.5469
Inline graphic 133.30000 138.31108 5.01108 9.60000 51.79594 42.19594 52.80000 55.27872 2.47872 294.10000 -40.6273 334.7273

Table 21.

Comparison of actual and predicted values of selected benzenoid hydrocarbons(Cont.)

Benzenoid Hydrocarbons Actual MW Pred. MW Abs. Error Pred. Inline graphic Actual Inline graphic Abs. Error Actual EM Pred. EM Abs. Error Actual C Pred. C Abs. Error
Inline graphic 278.30000 277.31214 0.98786 6.50000 6.65608 0.15608 278.10950 277.04566 1.06384 399.00000 381.64190 17.35810
Inline graphic 352.4000 351.87253 0.52747 8.00000 8.27919 0.27919 352.12520 351.50421 0.62099 593.00000 546.24136 46.75864
Inline graphic 278.30000 277.73344 1.41168 6.70000 6.64676 0.05324 278.10950 276.62234 1.48716 399.00000 381.43080 17.56920
Inline graphic 278.30000 277.73344 0.56656 6.70000 6.66533 0.03467 278.10960 277.46646 0.64309 361.00000 382.35000 21.35000
Inline graphic 378.50000 375.87197 2.62803 9.00000 8.79623 0.20377 378.14080 375.46354 2.67726 568.00000 593.15660 25.15660

Conclusion

Dominating energy and its variants are emerging energy parameters that many researchers are currently investigating. Although calculating the domination number of a graph is an NP-complete problem, domination-based energies play a crucial role in predicting the physico-chemical properties of chemical compounds.

In this study, we conducted a regression analysis to identify the best predictors for twelve selected physico-chemical properties, as well as the Inline graphic-electron energy, of a set of 35 benzenoid hydrocarbons (BHCs). We utilized a set of 12 domination-based energy parameters for this analysis. Additionally, we performed another regression analysis using 5 well-known topological indices to compare the effectiveness of the domination-based energy parameters in predicting the physico-chemical properties of BHCs. To ensure the reliability of the regression models resulting from these analyses, we implemented 5-fold cross-validation, achieving the following results:

  • (i)

    The parameter Inline graphic showed good correlation with the property Inline graphic Inline graphic in linear regression.

  • (ii)

    In quadratic regression, the energy parameters DE was found to be the most accurate predictor for the property Inline graphic. The topological index R served as the best predictor for the property Inline graphic.

  • (iii)

    In cubic regression, the dominating energy parameter DE demonstrated better certainty with the properties Inline graphic and Inline graphic, where the energy parameter ETDE outperformed all other selected parameters to predict the properties Inline graphic and Inline graphic. The index W was identified as the most accurate predictor for the properties Inline graphic and Inline graphic. Meanwhile, the index R served as a best predictor for the properties Inline graphic and Inline graphic

  • (iv)

    In Multiple linear regression, we observe that all other properties have been well predicted except the property LogP.

The limitation of having a unique minimum dominating set and its variant is extremely challenging because multiple minimum dominating sets are possible due to its NP-complete nature, resulting in little variations in their respective eigenvalues and their energy values too. However, a sufficient attempt is initiated in this study by making use of a minimum dominating set (and its variants) for a precise regression analysis. The methods employed in this work will help chemists and researchers in making a rational inference on predicting physico-chemical properties of other classes of chemical compounds or drugs to provide efficient treatment for diagnosing diseases, through different dominating energies. The outcomes of this study offer an excellent foundation for future developments and aid in the development of realistic models. This study effectively illustrated how well different regression models predict key physico-chemical properties of various benzenoid hydrocarbons, which is validated for 5 other external benzenoid hydrocarbons, namely Benzo[b]chrysene, Benzo[e]naphtho[2,3-a]pyrene, Benzo[a]tetracene, Dibenzo[a,c]anthracene, and Dibenzopentaphene. Further research could investigate the incorporation of a wider range of molecular descriptors beyond these dominating energies in order to further improve the accuracy of these predictions. This may lead to deeper understandings of QSPR analysis and possibly contribute to more accurate and reliable predictive models.

Author contributions

N.C conceptualized the key idea of this research work. S.S wrote the manuscript and did the analysis part. N.C revised the manuscript and fine-tuned it in its present form, and also N.C supervised the overall progress of the work.

Funding

There is no funding for this research work.

Data availability

The data used in this article are taken from PubChem (https://pubchem.ncbi.nlm.nih.gov/) and ChemSpider (https://www.chemspider.com/). Our new data is available in the GitHub web interface (https://github.com/shanmugavelan512-prog/Data-sets-for-domination-based-energies).

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Haynes, T. W., Hedetniemi, S. T. & Slater, P. J. Domination in Graphs: Advanced Topics (Marcel Dekker, New York, 1998).
  • 2.Haynes, T. W., Hedetniemi, S. T. & Henning, M. A. (Eds.) Topics in domination in graphs (Vol. 64). (Springer Nature, 2020).
  • 3.Gutman, I. & Furtula, B. Survey of graph energies. Math. Interdiscip. Res.2(2), 85–129 (2017). [Google Scholar]
  • 4.Gutman, I. & Cyvin, S. J. Introduction to the theory of benzenoid hydrocarbons. (Springer Science & Business Media, 2012).
  • 5.Hayat, S., Khan, S., Khan, A. & Liu, J. B. Valency-based molecular descriptors for measuring the -electronic energy of lower polycyclic aromatic hydrocarbons. Polycycl. Aromat. Compd.42(4), 1113–1129 (2022). [Google Scholar]
  • 6.Hayat, S., Suhaili, N. & Jamil, H. Statistical significance of valency-based topological descriptors for correlating thermodynamic properties of benzenoid hydrocarbons with applications. Comput. Theor. Chem.1227, 114259 (2023). [Google Scholar]
  • 7.Kumar, S., Sarkar, P. & Pal, A. A study on the energy of graphs and its applications. Polycycl. Aromat. Compd.44(6), 4127–4136 (2024). [Google Scholar]
  • 8.Khan, S., Malik, M. Y. H., Colakoglu, O., Ishaq, M., & Faryad, U. A comparative study: Domination parameters and physico-chemical properties of benzenoid hydrocarbons. Polycycl. Aromat. Compd.45(5), 793–809 (2025). 10.1080/10406638.2024.2417716. [Google Scholar]
  • 9.Kuriachan, G. & Pathiban, A. Domination and power domination numbers of dendrimers: Chemical significance of domination variants through QSPR analysis. J. Appl. Math. Inform.43(3), 805–819 (2025). [Google Scholar]
  • 10.Shashidhara, A. A., Ahmed, H., Nandappa, D. S. & Cancan, M. Domination version: Sombor index of graphs and its significance in predicting physicochemical properties of butane derivatives. Eurasian Chem. Commun.5(1), 91–102 (2023). [Google Scholar]
  • 11.Bommahalli Jayaraman, B. & Siddiqui, M. K. Exploring the properties of antituberculosis drugs through QSPR graph models and domination-based topological descriptors. Sci. Rep.14(1), 1–25 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Kuriachan, G. & Parthiban, A. Prediction of -electronic energy and physical properties of benzenoid hydrocarbons using domination degree-based entropies. Sci. Rep.15(1), 11359 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Cockayne, E. J., Dawes, R. M. & Hedetniemi, S. T. Total domination in graphs. Networks10(3), 211–219 (1980). [Google Scholar]
  • 14.Sampathkumar, E. & Walikar, H. B. The connected domination number of a graph. J. Math. Phys. Sci.13(6), 607–613 (1979). [Google Scholar]
  • 15.Natarajan, C. & Ayyaswamy, S. Hop domination in graphs-II. Versita23(2), 187–199 (2015).
  • 16.Kumar, M. K. Domination energy of some well-known graphs. Int. J. Appl. Math.3(1), 417–424 (2012). [Google Scholar]
  • 17.Malathy, K. & Meenakshi, S. Minimum total dominating energy of some standard graphs. Int. J. Recent Technol. Eng. (IJRTE)8, 272–276 (2019). [Google Scholar]
  • 18.Palani, K. & Kumari, M. L. Minimum hop dominating energy of a graph. Advan. Appl. Math. Sci21(3), 2022 (2022). [Google Scholar]
  • 19.De La Peña, J. A., Gutman, I. & Rada, J. Estimating the Estrada index. Linear Algebra Appl.427(1), 70–76 (2007). [Google Scholar]
  • 20.Gutman, I., Furtula, B., Zogic, E. & Glogić, E. Resolvent energy of graphs.MATCH Commun. Math. Comput. Chem.75, 279–290 (2016). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data used in this article are taken from PubChem (https://pubchem.ncbi.nlm.nih.gov/) and ChemSpider (https://www.chemspider.com/). Our new data is available in the GitHub web interface (https://github.com/shanmugavelan512-prog/Data-sets-for-domination-based-energies).


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES