Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2025 Sep 29;15:33631. doi: 10.1038/s41598-025-04878-5

Data-driven analysis of chemical graph of carbazole and diketopyrrolopyrrole

Zeeshan Saleem Mufti 1, Azhar Ahmed Khan 1, Muhammad Asim 1, A S Shflot 2, Syed Tauseef Saeed 1, Feyisa Edosa Morga 3,
PMCID: PMC12480683  PMID: 41022799

Abstract

Topological indices play a key role in molecular graph theory, consisting of mathematical tools that allocate numerical values to molecular structures. These indices are used to anticipate a variety of physicochemical, biological, and pharmacological properties of chemical compounds. This study performs a detailed statistical analysis of different topological indices, such as the First Zagreb index, scrutinizing its associations with other indices through regression modeling and correlation analysis. Research generates predictive models, including linear, quadratic, and cubic regression equations by using machine learning techniques. The results show that linear regression delivers the most accurate predictions, whereas the quadratic regression model improves the understanding of actual versus predicted values, improving the valuation of molecular properties. A using statistical evaluation of the selected topological indices involved computing essential metrics such as mean, median, variance, standard deviation, range, interquartile range (IQR), skewness, and kurtosis. These metrics expand our understanding of the allocation and adaptability of indices, confirming their robustness in molecular description and predictive modeling. Using a machine learning-based statistical method, the study increases the use of topological indices in cheminformatics, drug discovery, and materials science. These findings assistance the development of QSAR and QSPR models, supporting the critical role of statistical verification in molecular descriptor. This method promotes more accurate, data-driven strategies in computational chemistry and bioinformatics.

Keywords: Topological index, Zagreb index, Reduced Zagreb index, Redefined Zagreb index, Augmented Zagreb index

Subject terms: Mathematics and computing, Computational science

Introduction

Chemical graph theory is an interdisciplinary field that bridges chemistry with mathematical graph modeling. In this domain, topological indices serve as graph invariants, playing a crucial role in chemical and pharmaceutical sciences. These indices are particularly useful for predicting the physicochemical properties of organic compounds. Over the years, extensive research in chemical graph theory has introduced numerous topological indices.

Also referred to as graph parameters, topological indices are derived from vertex degrees and have diverse applications, making them valuable to both mathematicians and chemists. . Since H. Wiener introduced the Wiener index in 19471, nearly three thousand topological indices have been catalogued in chemical databases.

A topological index function is a graph invariant, representing the molecular structure’s topology and translating the molecular graph into a numerical representation. This value aids in predicting various physicochemical properties, like melting point, boiling point, and freezing point. In the modern pharmaceutical industry, conducting biological tests on chemical compounds necessitates a large financial investment, advanced laboratory facilities and high-tech equipment. This technique is both expensive and time intensive2.

To overcome these challenges, pharmaceutical firms are actively seeking to explore cost-effective alternatives. A promising alternative involves analyzing chemical structure using topological indices, which can eliminate the need for costly equipment and extensive lab testing. This technique explores a more economical and time-efficient budget for studying chemical properties.

Topological indices are numerical tools in mathematical chemistry and cheminformatics. They aid in quantifying the molecular structure by translating its topology into numerical values. Topological indices are derived from molecular graph structure and serve as powerful tools in predicting biological, physicochemical, and pharmacological properties of compounds. The fundamental concept of topological indices is to represent molecular structure as numerical values while retaining their connectivity and structural essence. By leveraging topological indices, researchers can construct QSAR (Quantitative Structure Activity Relationship) models, which play a crucial role in drug discovery, materials science, and various chemical applications3.

One of the earliest and most well-known topological indices is the Zagreb index, first introduced by Gutman and Trinajstić in 1972. It has two main variants: the first Zagreb index Inline graphic and the second Zagreb index Inline graphic. These indices are defined based on the degrees of vertices in a molecular graph. Over time, several modifications and extensions of the Zagreb indices have emerged, such as the third Zagreb index Inline graphic, the redefined Zagreb indices and the reduced Zagreb indices. These enhancements have shown improved predictive capabilities in various biological and chemical studies4.

Another important class of topological indices includes degree-based indices like the Augmented Zagreb Index (AZI) and the Atom-Bond Connectivity (ABC) index. The atom bound connectivity index by Estrada et al. (1998), is extensively used for estimating the stability of chemical compound and enthalpy of formation.Like wise augmented Zagreb index of AZI, an extension of the Zagreb indices, provides better correlation with thermodynamic properties and finds applications in nanotechnology and materials science5.

Recently, modified versions of topological indices have gained interest for to their enhanced accuracy and computational efficiency. These indices, such as the Redefined First Zagreb index Inline graphic, Redefined Second Zagreb index Inline graphic, and Redefined Third Zagreb index Inline graphic, enhance molecular characterization by providing omproved discriminative capabilities. Research has shown that these indices can surpass traditional ones in QSAR/QSPR modeling, making them valuable tools in computational chemistry and pharmaceutical research6.

Topological indices are extensively utilized in various scientific fields because of their broad range of applications. In drug discovery, these indices ad in predicting the biological activity of pharmaceutical compounds, facilitating the optimization of molecular properties to improve efficacy while reducing toxicity.Researchers may quickly identify possible drug candidates by including topological indices into machine learning algorithms. This effectively decreases the time as well as expenses related with experimental drug screening. These indices are also vital when exploring protein-ligand interactions, helping with the discovery of new inhibitors and drugs.

Topological indices have significance in materials science and nanotechnology in alongside drugs.The development of polymers and nanomaterials requires them because they allow for easier to predict significant material qualities including stability, reactivity, and electrical activity. These indices are employed in scientists to alter structural properties for particular uses, like advanced composites, conductive polymers, and high-performance coatings. Topological indicators also aid in the growth of eco-friendly materials by guiding the creation of biodegradable, sustainable compounds with specific uses.

Topological indices, that offer insight into the structure and function of chemical compound, are important instruments in mathematical chemistry.These indices are expected to have a substantial contribution to future finds in materials science, bioinformatics, and drug discovery with regular update improvements and enhancements. The Predictive modelling and molecular design could advance further with the integration of topological indices with machine learning and artificial intelligence7.

Preliminary framework and methodology

A graph with the vertex set V(G) and edge set E(G), where edges denote connections between vertices, is called graph and denote Inline graphic. The number of edges |E(G) determines the size of G, whereas the number of |V(G)| determines its order. The degree of a vertex Inline graphic is the number of edges incident to it, and it is represented as deg(u) or Inline graphic. A graph is irregular if all its vertices have distinct degrees, and regular if all vertices have the same degree.

The first Zagreb index Inline graphic and the second Zagreb index Inline graphic is defined as follows:

graphic file with name 41598_2025_4878_Article_Equ1.gif 1
graphic file with name 41598_2025_4878_Article_Equ2.gif 2

The Zagreb indices Inline graphic and Inline graphic were first introduced by Gutman and Trinajstić in 1972. These indices appeared in certain approximate expressions for the total Inline graphic- electron energy8. For a detailed discussion on the mathematical theory and chemical applications of the Zagreb indices , refer to917

graphic file with name 41598_2025_4878_Article_Equ3.gif 3

Ediz18 introduced the reduced first Zagreb index represented as follows,

graphic file with name 41598_2025_4878_Article_Equ4.gif 4

This index is a modified version of the first Zagreb index , designed to explore the relationship between graph structure and molecular properties, particular in the field of chemical graph theory19.

graphic file with name 41598_2025_4878_Article_Equ5.gif 5

Furtula, Graovac, and Vukićević (2010) presented the Augmented Zagreb Index as an enhancement of the conventional Zagreb indices, providing in QSAR/QSPR research with a higher connection with molecular attributes as stability, enthalpy of formation, and boiling temperatures20.

graphic file with name 41598_2025_4878_Article_Equ6.gif 6

The Redefined Zagreb Indices were developed as adaptations of the classic Zagreb indices to better reflect the structural features of molecular graphs. The degree-based adjustments that these indices include help in the prediction of molecule stability and physicochemical features. When it involves QSAR/QSPR investigations, the Redefined First, Second, and Third Zagreb Indices provide different approaches to molecular structure analysis21.

graphic file with name 41598_2025_4878_Article_Equ7.gif 7
graphic file with name 41598_2025_4878_Article_Equ8.gif 8
graphic file with name 41598_2025_4878_Article_Equ9.gif 9

Description of graph of carbazole and diketopyrrolopyrrole Inline graphic

This section outline the theoretical features of Carbazole and Diketopyrrolopyrrole Inline graphic . In Table 1 and Table 2, the vertices of graph G classified based on to their degrees, where n shows the parameter dominating the vertex count. The corresponding Fig. 1 below illustrates these classifications visually (Fig. 2).

Table 1.

Partition the graph G according to the vertex degrees.

Edge EInline graphic (dInline graphic, dInline graphic) Frequency
Inline graphic (2, 2) Inline graphic
Inline graphic (2, 3) Inline graphic
Inline graphic (3, 3) Inline graphic

Table 2.

Vertex-based partition of graph Inline graphic based on vertex degrees.

VInline graphic dInline graphic Total vertices
Inline graphic 2 Inline graphic
Inline graphic 3 Inline graphic

Fig. 1.

Fig. 1

Carbazole and diketopyrrolopyrrole Inline graphic.

Fig. 2.

Fig. 2

Carbazole and diketopyrrolopyrrole (Cz - Dpp) structures, with oxygen atoms labeled O1 to O5.

Results and discussion of carbazole and diketopyrrolopyrrole Inline graphic

This article analyses various kinds of graph indices, with particular concentration on the Zagreb index family, while carefully analysing their distinct characteristics and mathematical properties. For the Carbazole and Diketopyrrolopyrrole (Cz-Dpp), we have exact formulas for these indices. The computational methodology utilizes edge and vertex partitioning, complemented by advanced data analysis techniques, degree enumeration, and summation methods.The molecular graph of Carbazole and Diketopyrrolopyrrole Inline graphic consists of Inline graphic edges and Inline graphic. The computational methodology employs edge and vertex partitioning, advanced data analysis techniques, degree enumeration, and summation methods.

Theorem 1

Let Inline graphic be the Carbazole and Diketopyrrolopyrrole Graph Inline graphic. Then, the first Zagreb index is given by:

graphic file with name 41598_2025_4878_Article_Equ17.gif

Proof

In the network of Carbazole and Diketopyrrolopyrrole with Inline graphic edges, the first Zagreb index of the graph Inline graphic can be decomposed into three disjoint edge sets: Inline graphic, Inline graphic, and Inline graphic Table 1. These sets represent different edge configurations based on the degrees of their endpoints. Specifically:

  • Inline graphic consists of Inline graphic edges where Inline graphic and Inline graphic.

  • Inline graphic consists of Inline graphic edges where Inline graphic and Inline graphic.

  • Inline graphic consists of Inline graphic edges where Inline graphic and Inline graphic.

The first Zagreb index is defined in Eq. (1) as:

graphic file with name 41598_2025_4878_Article_Equ10.gif 10
graphic file with name 41598_2025_4878_Article_Equ18.gif

Therefore:

graphic file with name 41598_2025_4878_Article_Equ17.gif 11

Theorem 2

Let Inline graphic be the Carbazole and Diketopyrrolopyrrole Graph Inline graphic. Then the second Zagreb index is given by:

graphic file with name 41598_2025_4878_Article_Equ19.gif

Proof

The explanation of the second Zagreb index in Eq. (2) is as follows:

graphic file with name 41598_2025_4878_Article_Equ12.gif 12
graphic file with name 41598_2025_4878_Article_Equ13.gif 13
graphic file with name 41598_2025_4878_Article_Equ14.gif 14
graphic file with name 41598_2025_4878_Article_Equ15.gif 15
graphic file with name 41598_2025_4878_Article_Equ20.gif

There fore:

graphic file with name 41598_2025_4878_Article_Equ16.gif 16

Theorem 3

Let G be the Carbazole and Diketopyrrolopyrrole Graph Inline graphic. Then the third Zagreb index Inline graphic.

Proof

The explanation of the third Zagreb index in Eq. (2) is as follows:

graphic file with name 41598_2025_4878_Article_Equ21.gif

Expanding the equation based on edge classification:

graphic file with name 41598_2025_4878_Article_Equ22.gif

There fore:

graphic file with name 41598_2025_4878_Article_Equ23.gif

Theorem 4

Let Inline graphic be the Carbazole and Diketopyrrolopyrrole Graph (Inline graphic). Then, the reduced first Zagreb index is given by:

graphic file with name 41598_2025_4878_Article_Equ24.gif

Proof

The reduced zagreb index in Eq. (4) can be defined as follows:

graphic file with name 41598_2025_4878_Article_Equ25.gif

There fore:

graphic file with name 41598_2025_4878_Article_Equ24.gif

Theorem 5

Let Inline graphic be the Carbazole and Diketopyrrolopyrrole Graph (Inline graphic). Then, the reduced second Zagreb index is given by:

graphic file with name 41598_2025_4878_Article_Equ27.gif

Proof

In Eq. (5), the reduced second Zagreb index is defined as:

graphic file with name 41598_2025_4878_Article_Equ28.gif

This index currently largely focused on the degree distribution of network vertices, but it used to concentrate on the geometric parts of topological indices. It provides a structural measure through the product of reduced degree values Inline graphic and Inline graphic for each edge Inline graphic in the graph.

graphic file with name 41598_2025_4878_Article_Equ29.gif

There fore:

graphic file with name 41598_2025_4878_Article_Equ27.gif

Theorem 6

Let Inline graphic be the Carbazole and Diketopyrrolopyrrole Graph (Inline graphic). Then, the augmented Zagreb index is given by:

graphic file with name 41598_2025_4878_Article_Equ31.gif

Proof

In Eq. (6), the augmented Zagreb index is defined as:

graphic file with name 41598_2025_4878_Article_Equ32.gif

This index provides a refined structural measure of molecular graphs by incorporating vertex degrees into a non-linear cubic form.

graphic file with name 41598_2025_4878_Article_Equ33.gif

There fore:

graphic file with name 41598_2025_4878_Article_Equ31.gif

Theorem 7

Let Inline graphic be the carbazole and diketopyrrolopyrrole Graph Inline graphic. Then, the redefined first Zagreb index is given by:

graphic file with name 41598_2025_4878_Article_Equ35.gif

Proof

Ranjini et al.22 and Usha et al.23 were the first to introduced the redefined Zagreb indices of graph (G) as fundamental degree-based topological indices.

graphic file with name 41598_2025_4878_Article_Equ36.gif

There fore

graphic file with name 41598_2025_4878_Article_Equ35.gif

Theorem 8

Let Inline graphic be the carbazole and diketopyrrolopyrrole Graph Inline graphic. Then, the redefined second Zagreb index is given by:

graphic file with name 41598_2025_4878_Article_Equ38.gif

Proof

The redefined second Zagreb index In Eq. (8), is formally defined as:

graphic file with name 41598_2025_4878_Article_Equ39.gif

There fore

graphic file with name 41598_2025_4878_Article_Equ40.gif

Theorem 9

Let Inline graphic be the carbazole and diketopyrrolopyrrole Graph Inline graphic. Then, the redefined third Zagreb index is given by:

graphic file with name 41598_2025_4878_Article_Equ41.gif

Proof

In Eq. (9), the redefined third Zagreb index is formally defined as:

graphic file with name 41598_2025_4878_Article_Equ42.gif

where Inline graphic and Inline graphic represent the degrees of the vertices Inline graphic and Inline graphic, respectively, and the summation runs over all edges Inline graphic.

graphic file with name 41598_2025_4878_Article_Equ43.gif

There fore

graphic file with name 41598_2025_4878_Article_Equ41.gif

Linear regression equation of carbazole and diketopyrrolopyrrole (Cz-Dpp)

This section presents linear regression models that establish relationships between various topological indices and the parameter Inline graphic. These equations have been formulated through regression-based machine learning models implemented in Python within a Jupyter Notebook environment. Each regression model was constructed using a single topological index as the independent variable (i.e., univariate regression), with no combination of multiple indices used within a single model. The models achieve a perfect match and offer predictive insight into the behavior of several indices. The coefficient of determination is (Inline graphic).

According to the results, indices like Inline graphic, Inline graphic, Inline graphic, Inline graphic, AZI, and modified Zagreb indices (Inline graphic) can be represented as linear functions of Inline graphic. The regression equations that correspond to this are provided below:

graphic file with name 41598_2025_4878_Article_Equ45.gif

Strong linear correlations found by machine learning-driven regression analysis are shown by these equations, demonstrating the importance of computational methods in topological index research. The use of univariate models simplifies interpretation while still preserving predictive power.

Methodology and modeling

We have focused on the First Zagreb Index as the sole independent variable to construct regression models of linear, quadratic, and cubic forms. These models were developed to explore the predictive power of this index in a univariate regression framework. The dataset comprises 50 systematically generated molecular structures, designed to cover a broad range of topological variations. For clarity and brevity, only the first 10 data points are displayed in the tables, while the full dataset was utilized in model training and validation.

To assess the generalizability of the models, we performed k-fold cross-validation with Inline graphic, ensuring that each data point was used for validation at least once. Performance evaluation was conducted using key error metrics, including Mean Absolute Error (MAE), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE). All regression models–linear, quadratic, and cubic–demonstrated Inline graphic values of approximately 0.9997, 0.9998, and 0.9996, respectively. These high Inline graphic values indicate an almost perfect fit to the data; however, we acknowledge that such results may reflect overfitting, especially due to the controlled nature of the systematically generated dataset. To address this, the manuscript clarifies that while these values are statistically impressive, they should be interpreted with caution. The inclusion of various error metrics, along with the cross-validation procedure, provides a more comprehensive evaluation of the model’s performance and helps mitigate the risk of overfitting.

Critical analysis of results and graphs

In our study, we are not treating the topological indices as purely experimental or measured values in the traditional sense. Rather, we employed a regression-based modeling framework where certain easily computable topological indices are used as predictors (independent variables) to estimate or predict other, often more complex or computationally intensive indices as targets (dependent variables). This modeling approach is inspired by quantitative structure-activity relationship (QSAR) techniques, where known descriptors are used to predict unknown or difficult-to-obtain values. Therefore, the terms “actual” and “predicted” in our tables refer to the values obtained from direct computation (actual) versus those estimated by our machine learning or regression models (predicted).

Actual versus predicted value comparisons

The First Zagreb index was used as the independent variable to develop regression models–linear, quadratic, and cubic–for predicting other topological indices. The predicted values were obtained by inserting the First Zagreb values into these regression equations, while the actual values were directly computed from the molecular graphs. This approach allowed us to assess which regression model best fits the data by comparing error metrics such as MSE, RMSE, and MAE. The objective was to examine how effectively one descriptor can estimate others, following principles commonly applied in QSAR-type analyses.

Quadratic regression analysis of topological indices

For various topological indices, the quadratic regression equation for Inline graphic demonstrates strong mathematical relationships. The downward quadratic trend of the indices Inline graphic and Inline graphic shows a non linear dependent on Inline graphic. Similarly, the reduced Zagreb indices Inline graphic and Inline graphic have smaller coefficients but also follow quadratic models. Additionally, the AZI index follows a quadratic pattern. In contrast, the redefined Zagreb indices Inline graphic, Inline graphic, and Inline graphic show quadratic features, with Inline graphic showing a strictly linear variance. TThese regression models offer an accurate analytical instrument for investigating chemical graph theory’s structural features. In each case, the perfect fit of the quadratic regression models to the data is verified by the determination coefficient (Inline graphic). This ensures both accuracy and reliability in the predicted values. However, a slight discrepancy arises when comparing the predicted values with actual computed values. While minimal, this deviation underscores the limitations of the regression model in attaining absolute numerical precision–potentially due to rounding effects or inherent approximations within the dataset.

Quadratic regression equation of carbazole and diketopyrrolopyrrole (Cz-Dpp)

graphic file with name 41598_2025_4878_Article_Equ46.gif

Prediction accuracy and cross validation analysis of Inline graphic

Table 3 presents a comparison between the actual and predicted values of the Second Zagreb index. The predicted values are derived from a computational model, showing minimal errors in each case. These small error margins highlight the high accuracy of the predictive approach. Additionally, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 4 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 3, providing a clearer visualization of the comparison and model precision. The equation for the Second Zagreb index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ47.gif

Table 3.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error % Error
1 320 395 394.999924 7.6E-05 1.92E-05
2 514 641 640.999878 0.000122 1.90E-05
3 708 887 886.999832 0.000168 1.89E-05
4 902 1133 1132.999786 0.000214 1.89E-05
5 1096 1379 1378.99974 0.00026 1.89E-05
6 1290 1625 1624.999694 0.000306 1.88E-05
7 1484 1871 1870.999648 0.000352 1.88E-05
8 1678 2117 2116.999602 0.000398 1.88E-05
9 1872 2363 2362.999556 0.000444 1.88E-05
10 2066 2609 2608.99951 0.00049 1.88E-05

Table 4.

Cross-validation errors for the Inline graphic Index.

Metric Value
Cross validation MAE 0.000283
Cross validation MSE Inline graphic
Cross validation RMSE 0.000312

Fig. 3.

Fig. 3

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic Structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 5 compares the actual and predicted values for Inline graphic.The predicted values are generated by a computational model, and the error margins are minimal, show the model’s high accuracy. Furthermore, cross-validation error metrics in Table 6 further support the model’s reliability, showing consistently low error values. For a clearer visual representation, these results are also depicted in Fig. 4, highlighting the precision of the model.The equation for the third Zagreb index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ48.gif

Table 5.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error % Error
1 320 1610 1609.999997 3E-06 1.86E-07
2 514 2616 2615.999995 5.00001E-06 1.91E-07
3 708 3622 3621.999993 7.00002E-06 1.93E-07
4 902 4628 4627.999991 9.00002E-06 1.94E-07
5 1096 5634 5633.999989 1.1E-05 1.95E-07
6 1290 6640 6639.999987 1.30001E-05 1.96E-07
7 1484 7646 7645.999985 1.50001E-05 1.96E-07
8 1678 8652 8651.999983 1.70001E-05 1.97E-07
9 1872 9658 9657.999981 1.90001E-05 1.97E-07
10 2066 10664 10663.99998 2.10001E-05 1.97E-07

Table 6.

Cross validation errors for the Inline graphic index.

Metric Value
Cross validation MAE Inline graphic
Cross validation MSE Inline graphic
Cross validation RMSE Inline graphic

Fig. 4.

Fig. 4

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 7 presents a comparison between the actual and predicted values of the reduced first Zagreb index. The predicted values are derived from a computational model, showing minimal errors in each case. These small error margins highlight the high accuracy of the predictive approach. Additionally, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 8 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 5, providing a clearer visualization of the comparison and model precision. The equation for the reduced first Zagreb index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ49.gif

Table 7.

Comparison of actual and predicted values of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 182 182.000109 Inline graphic0.000109 0.000109 5.99E-05
2 514 255 255.000175 Inline graphic0.000175 0.000175 6.86E-05
3 708 328 328.000241 Inline graphic0.000241 0.000241 7.35E-05
4 902 401 401.000307 Inline graphic0.000307 0.000307 7.66E-05
5 1096 474 474.000373 Inline graphic0.000373 0.000373 7.87E-05
6 1290 547 547.000439 Inline graphic0.000439 0.000439 8.03E-05
7 1484 620 620.000505 Inline graphic0.000505 0.000505 8.15E-05
8 1678 693 693.000571 Inline graphic0.000571 0.000571 8.24E-05
9 1872 766 766.000637 Inline graphic0.000637 0.000637 8.31E-05
10 2066 839 839.000703 Inline graphic0.000703 0.000703 8.38E-05

Table 8.

Cross validation errors for the Inline graphic index.

Evaluation of predictive model performance for Inline graphic
Metric Value
Cross validation MAE Inline graphic
Cross validation MSE Inline graphic
Cross validation RMSE Inline graphic

Fig. 5.

Fig. 5

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 9 compares the actual and predicted values for Inline graphic.The predicted values are generated by a computational model, and the error margins are minimal, show the model’s high accuracy. Furthermore, cross-validation error metrics in Table 10 further support the model’s reliability, showing consistently low error values. For a clearer visual representation, these results are also depicted in Fig. 6, highlighting the precision of the model. The equation for the reduced second Zagreb index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ50.gif

Table 9.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 140 140.000152 Inline graphic0.000152 0.000152 1.09E-04
2 514 230 230.000244 Inline graphic0.000244 0.000244 1.06E-04
3 708 320 320.000336 Inline graphic0.000336 0.000336 1.05E-04
4 902 410 410.000428 Inline graphic0.000428 0.000428 1.04E-04
5 1096 500 500.00052 Inline graphic0.00052 0.00052 1.04E-04
6 1290 590 590.000612 Inline graphic0.000612 0.000612 1.04E-04
7 1484 680 680.000704 Inline graphic0.000704 0.000704 1.04E-04
8 1678 770 770.000796 Inline graphic0.000796 0.000796 1.03E-04
9 1872 860 860.000888 Inline graphic0.000888 0.000888 1.03E-04
10 2066 950 950.00098 Inline graphic0.00098 0.00098 1.03E-04

Table 10.

Cross-validation errors for the Inline graphic index.

Evaluation of predictive model performance
Metric Value
Cross validation MAE Inline graphic
Cross validation MSE Inline graphic
Cross validation RMSE Inline graphic

Fig. 6.

Fig. 6

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of AZI(G)

Table 11 provides a comparison of the actual and predicted values for the augmented Zagreb index. A computational model is used to calculate the expected values, which in every instance exhibit negligible errors. These small error margins highlight the high accuracy of the predictive approach. Furthermore, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 12 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 7, providing a clearer visualization of the comparison and model precision. The equation for the Augmented Zagreb Index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ51.gif

Table 11.

Comparison of actual value and predicted value of AZI(G) for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 570.85935 570.859416 Inline graphic6.6E-05 0.000066 1.16E-05
2 514 908.7656 908.765706 Inline graphic0.000106 0.000106 1.17E-05
3 708 1246.67185 1246.671996 Inline graphic0.000146 0.000146 1.17E-05
4 902 1584.5781 1584.578286 Inline graphic0.000186 0.000186 1.17E-05
5 1096 1922.48435 1922.484576 Inline graphic0.000226 0.000226 1.18E-05
6 1290 2260.3906 2260.390866 Inline graphic0.000266 0.000266 1.18E-05
7 1484 2598.29685 2598.297156 Inline graphic0.000306 0.000306 1.18E-05
8 1678 2936.2031 2936.203446 Inline graphic0.000346 0.000346 1.18E-05
9 1872 3274.10935 3274.109736 Inline graphic0.000386 0.000386 1.18E-05
10 2066 3612.0156 3612.016026 Inline graphic0.000426 0.000426 1.18E-05

Table 12.

Cross validation errors for the AZI index.

Metric Value
Cross validation MAE Inline graphic
Cross validation MSE Inline graphic
Cross validation RMSE Inline graphic

Fig. 7.

Fig. 7

Visual representation of actual value (x) and predicted value (Y) of AZI(G) for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 13 compares the actual and predicted values for the redefined first Zagreb index.The predicted values are generated by a computational model, and the error margins are minimal, show the model’s high accuracy. Furthermore, cross-validation error metrics in Table 14 further support the model’s reliability, showing consistently low error values. For a clearer visual representation, these results are also depicted in Fig. 8, highlighting the precision of the model. The equation for the Redefined first Zagreb Indices prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ52.gif

Table 13.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 55 55.002 Inline graphic0.002 0.002 0.00364
2 514 86 86.0032 Inline graphic0.0032 0.0032 0.00372
3 708 117 117.0044 Inline graphic0.0044 0.0044 0.00376
4 902 148 148.0056 Inline graphic0.0056 0.0056 0.00378
5 1096 179 179.0068 Inline graphic0.0068 0.0068 0.00380
6 1290 210 210.008 Inline graphic0.008 0.008 0.00381
7 1484 241 241.0092 Inline graphic0.0092 0.0092 0.00382
8 1678 272 272.0104 Inline graphic0.0104 0.0104 0.00382
9 1872 303 303.0116 Inline graphic0.0116 0.0116 0.00383
10 2066 334 334.0128 Inline graphic0.0128 0.0128 0.00383

Table 14.

Cross validation errors for the Inline graphic index.

Metric Value
Cross validation MAE 0.0074
Cross validation MSE Inline graphic
Cross validation RMSE 0.00816

Fig. 8.

Fig. 8

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 15 provides a comparison of the actual and predicted values for the redefined second Zagreb index. A computational model is used to calculate the expected values, which in every instance exhibit negligible errors. These small error margins highlight the high accuracy of the predictive approach. Furthermore, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 16 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 9, providing a clearer visualization of the comparison and model precision. The equation for the Redefined second Zagreb Indices prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ53.gif

Table 15.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 78.5 78.500033 Inline graphic3.3E-05 0.000033 0.000042
2 514 125.9 125.900053 Inline graphic5.3E-05 0.000053 0.000042
3 708 173.3 173.300073 Inline graphic7.3E-05 0.000073 0.000042
4 902 220.7 220.700093 Inline graphic9.3E-05 0.000093 0.000042
5 1096 268.1 268.100113 Inline graphic0.000113 0.000113 0.000042
6 1290 315.5 315.500133 Inline graphic0.000133 0.000133 0.000042
7 1484 362.9 362.900153 Inline graphic0.000153 0.000153 0.000042
8 1678 410.3 410.300173 Inline graphic0.000173 0.000173 0.000042
9 1872 457.7 457.700193 Inline graphic0.000193 0.000193 0.000042
10 2066 505.1 505.100213 Inline graphic0.000213 0.000213 0.000042

Table 16.

Cross validation errors for the Inline graphic index.

Metric Value
Cross validation MAE Inline graphic
Cross validation MSE Inline graphic
Cross validation RMSE Inline graphic

Fig. 9.

Fig. 9

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 17 provides a comparison of the actual and predicted values for the redefined third Zagreb index. A computational model is used to calculate the expected values, which in every instance exhibit negligible errors. These small error margins highlight the high accuracy of the predictive approach. Furthermore, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 18 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 10, providing a clearer visualization of the comparison and model precision. The equation for the Redefined third Zagreb Indices prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ54.gif

Table 17.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 2030 2029.999881 0.000119 0.000119 5.87E-06
2 514 3326 3325.999809 0.000191 0.000191 5.73E-06
3 708 4622 4621.999737 0.000263 0.000263 5.65E-06
4 902 5918 5917.999665 0.000335 0.000335 5.66E-06
5 1096 7214 7213.999593 0.000407 0.000407 5.64E-06
6 1290 8510 8509.999521 0.000479 0.000479 5.63E-06
7 1484 9806 9805.999449 0.000551 0.000551 5.62E-06
8 1678 11102 11101.99938 0.000623001 0.000623001 5.61E-06
9 1872 12398 12397.9993 0.000695001 0.000695001 5.60E-06
10 2066 13694 13693.99923 0.000767001 0.000767001 5.59E-06

Table 18.

Cross validation errors for the Inline graphic index.

Metric Value
Cross validation MAE Inline graphic
Cross validation MSE Inline graphic
Cross validation RMSE Inline graphic

Fig. 10.

Fig. 10

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

To evaluate the performance of regression models based on various topological indices, including AZI and redefined Zagreb indices (ReZG1, ReZG2, ReZG3), as well as Inline graphic, Inline graphic, and Inline graphic (corresponding to the First Zagreb, Second Zagreb, and third Zagreb indices respectively), we conducted a comparative analysis using statistical metrics such as the coefficient of determination (Inline graphic), cross-validated coefficient (Inline graphic), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE). As observed in Table 19, all the indices exhibit excellent predictive capacity, with values of Inline graphic and Inline graphic approaching unity for the quadratic regression models. The AZI and redefined indices, particularly ReZG2 and ReZG3, show remarkably low error metrics, demonstrating superior fitting and generalization capabilities. The comparative statistics confirm the robustness and reliability of the proposed quadratic models across various degree-based descriptors.

Table 19.

Comparative statistical analysis of regression models based on various topological indices.

Model Inline graphic Inline graphic MSE RMSE MAE
Inline graphic 0.9987 0.9980 0.00001 0.00007 0.00005
Inline graphic 0.9990 0.9983 0.000015 0.00008 0.00006
Inline graphic 0.9992 0.9985 0.00002 0.00010 0.00007
AZI 0.9999 0.9999 Inline graphic 0.000272 0.000246
ReZG1 0.9995 0.9299 Inline graphic 0.008163 0.007400
ReZG2 0.9998 0.9999 Inline graphic 0.000136 0.000123
ReZG3 0.9996 0.9969 Inline graphic 0.000490 0.000444

Cubic regression analysis of topological indices

The Cubic regression equations for various topological indices in relation to Inline graphic reveal distinct mathematical correlations. The indices Inline graphic and Inline graphic exhibit a downward quadratic trend, suggesting a non-linear dependence on Inline graphic. Likewise, the reduced Zagreb indices Inline graphic and Inline graphic follow quadratic models but with comparatively smaller coefficients. The AZI index also adheres to a quadratic pattern, while the redefined Zagreb indices Inline graphic, Inline graphic, and Inline graphic demonstrate Cubic characteristics, with Inline graphic showing a strictly linear dependence. These regression models provide precise analytical tools for examining structural properties in chemical graph theory. The equations have been generated using machine learning techniques implemented in Python.

In each case, the coefficient of determination (Inline graphic) confirms that the Cubic regression models achieve a perfect fit to the data. This ensures both accuracy and reliability in the predicted values. However, a slight discrepancy arises when comparing the predicted values with actual computed values. While minimal, this deviation underscores the limitations of the regression model in attaining absolute numerical precision–potentially due to rounding effects or inherent approximations within the dataset.

Cubic regression equation of carbazole and diketopyrrolopyrrole (Cz-Dpp)

graphic file with name 41598_2025_4878_Article_Equ55.gif

Prediction accuracy and cross validation analysis of Inline graphic

Table 20 presents a comparison between the actual and predicted values of the second Zagreb index. The predicted values are derived from a computational model, showing minimal errors in each case. These small error margins highlight the high accuracy of the predictive approach. Additionally, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 21 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 11, providing a clearer visualization of the comparison and model precision. The equation for the second Zagreb index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ56.gif

Table 20.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 395 394.9790344 0.0209656 0.0209656 5.30E-03
2 514 641 640.9613582 0.0386418 0.0386418 6.02E-03
3 708 887 886.9559213 0.0440787 0.0440787 6.11E-03
4 902 1133 1132.975866 0.0241340 0.0241340 2.13E-03
5 1096 1379 1379.034335 Inline graphic0.0343350 0.0343350 2.49E-03
6 1290 1625 1625.144471 Inline graphic0.1444707 0.1444707 8.88E-03
7 1484 1871 1871.319416 Inline graphic0.3194156 0.3194156 1.70E-02
8 1678 2117 2117.572312 Inline graphic0.5723121 0.5723121 2.71E-02
9 1872 2363 2363.916303 Inline graphic0.9163029 0.9163029 3.87E-02
10 2066 2609 2610.364530 Inline graphic1.3645302 1.3645302 5.23E-02

Table 21.

Cross-validation errors for the Inline graphic Index.

Error metric Value
Cross-validation MAE 0.348
Cross-validation MSE 0.316
Cross-validation RMSE 0.562

Fig. 11.

Fig. 11

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 22 compares the actual and predicted values for Inline graphic. The predicted values come from a computational model, and the error margins are minimal, demonstrating the model’s high accuracy. Additionally, cross-validation error metrics in Table 23 further support the model’s reliability, showing consistently low error values. For a clearer visual representation, these findings are also illustrated in Fig. 12, highlighting the precision of the model. The equation for the third Zagreb index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ57.gif

Table 22.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 1610 1610.002045 Inline graphic0.002045 0.002045 1.27E-04
2 514 2616 2616.056533 Inline graphic0.056533 0.056533 2.16E-03
3 708 3622 3622.204509 Inline graphic0.204509 0.204509 5.65E-03
4 902 4628 4628.489781 Inline graphic0.489781 0.489781 1.06E-02
5 1096 5634 5634.956157 Inline graphic0.956157 0.956157 1.70E-02
6 1290 6640 6641.647446 Inline graphic1.647446 1.647446 2.48E-02
7 1484 7646 7648.607456 Inline graphic2.607456 2.607456 3.41E-02
8 1678 8652 8655.879996 Inline graphic3.879996 3.879996 4.48E-02
9 1872 9658 9663.508873 Inline graphic5.508873 5.508873 5.71E-02
10 2066 10664 10671.537900 Inline graphic7.537896 7.537896 7.07E-02

Table 23.

Cross-validation errors for the Inline graphic index.

Error metric Value
Cross-validation MAE 2.289
Cross-validation MSE 11.293
Cross-validation RMSE 3.361

Fig. 12.

Fig. 12

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table  24 compares the actual and predicted values for reduced first Zagreb index. The predicted values are generated by a computational model, and the error margins are minimal, show the model’s high accuracy. Furthermore, cross-validation error metrics in Table 25 further support the model’s reliability, showing consistently low error values. For a clearer visual representation, these results are also depicted in Fig. 13, highlighting the precision of the model. The equation for the reduced first Zagreb index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ58.gif

Table 24.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 182 182.0140354 Inline graphic0.0140354 0.0140354 7.71E-05
2 514 255 255.0258549 Inline graphic0.0258549 0.0258549 1.01E-04
3 708 328 328.0295148 Inline graphic0.0295148 0.0295148 9.00E-05
4 902 401 401.0162536 Inline graphic0.0162536 0.0162536 4.05E-05
5 1096 474 473.9773097 0.0226903 0.0226903 4.78E-05
6 1290 547 546.9039212 0.0960788 0.0960788 1.76E-04
7 1484 620 619.7873266 0.2126734 0.2126734 3.55E-04
8 1678 693 692.6187642 0.3812358 0.3812358 5.51E-04
9 1872 766 765.3894724 0.6105276 0.6105276 8.11E-04
10 2066 839 838.0906895 0.9093105 0.9093105 1.09E-03

Table 25.

Cross-validation errors for the Inline graphic index.

Error metric Value
Cross-validation MAE 0.232
Cross-validation MSE 0.140
Cross-validation RMSE 0.374

Fig. 13.

Fig. 13

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table  26 presents a comparison between the actual and predicted values of the reduced second Zagreb index. The predicted values are derived from a computational model, showing minimal errors in each case. These small error margins highlight the high accuracy of the predictive approach. Additionally, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 27 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 14, providing a clearer visualization of the comparison and model precision. The equation for the reduced second Zagreb index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ59.gif

Table 26.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 140 140.0071152 Inline graphic0.0071152 0.0071152 5.09E-05
2 514 230 230.0130839 Inline graphic0.013083926 0.013083926 5.69E-05
3 708 320 320.0149729 Inline graphic0.014972909 0.014972909 4.67E-05
4 902 410 410.0084013 Inline graphic0.008401319 0.008401319 2.05E-05
5 1096 500 499.9889883 0.011011674 0.011011674 2.20E-05
6 1290 590 589.9523531 0.0476469 0.0476469 8.09E-05
7 1484 680 679.8941148 0.10588519 0.10588519 1.56E-04
8 1678 770 769.8098926 0.190107375 0.190107375 2.47E-04
9 1872 860 859.6953057 0.304694285 0.304694285 3.54E-04
10 2066 950 949.5459733 0.45402675 0.45402675 4.77E-04

Table 27.

Cross-validation errors for the Inline graphic index.

Error metric Value
Cross-validation MAE 0.116
Cross-validation MSE 0.035
Cross-validation RMSE 0.187

Fig. 14.

Fig. 14

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of AZI(G)

Table 28 presents a comparison between the actual and predicted values of the augmented Zagreb index. The predicted values are derived from a computational model, showing minimal errors in each case. These small error margins highlight the high accuracy of the predictive approach. Additionally, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 29 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 15, providing a clearer visualization of the comparison and model precision.The equation for the Augmented Zagreb Index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ60.gif

Table 28.

Comparison of actual value and predicted value of AZI(G) for different Inline graphic structures.

Index First Zagreb Actual Value Prediction Value Error Absolute Error % Error
1 320 570.85935 570.8176368 0.0417132 0.0417132 7.31E-05
2 514 908.7656 908.6886664 0.076933554 0.076933554 8.46E-05
3 708 1246.67185 1246.584175 0.087675453 0.087675453 7.04E-05
4 902 1584.5781 1584.530446 0.047653915 0.047653915 3.01E-05
5 1096 1922.48435 1922.553766 Inline graphic0.069416042 0.069416042 3.61E-05
6 1290 2260.3906 2260.680419 Inline graphic0.2898194 0.2898194 1.28E-04
7 1484 2598.29685 2598.936691 Inline graphic0.639841142 0.639841142 2.46E-04
8 1678 2936.2031 2937.348866 Inline graphic1.145766251 1.145766251 3.91E-04
9 1872 3274.10935 3275.94323 Inline graphic1.833879709 1.833879709 5.61E-04
10 2066 3612.0156 3614.746066 Inline graphic2.730466498 2.730466498 7.56E-04

Table 29.

Cross-validation errors for the AZI(G) index.

Metric Value
Mean Squared Error (MSE) 1.2647
Mean Absolute Error (MAE) 0.6963
Root Mean Squared Error (RMSE) 1.1246

Fig. 15.

Fig. 15

Visual representation of actual value (x) and predicted value (Y) of AZI(G) for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 30 provides a comparison of the actual and predicted values for the redefined first Zagreb index.A computational approach is applied to determine the expected values, which consistently display insignificant errors. These small error margins highlight the high accuracy of the predictive approach. Furthermore, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 31 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 16, providing a clearer visualization of the comparison and model precision. The equation for the Redefined first Zagreb Index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ61.gif

Table 30.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 55 55.00423692 Inline graphic0.00423692 0.00423692 7.69E-05
2 514 86 86.00779896 Inline graphic0.007798955 0.007798955 9.06E-05
3 708 117 117.0089131 Inline graphic0.008913145 0.008913145 7.61E-05
4 902 148 148.004951 Inline graphic0.004950992 0.004950992 3.34E-05
5 1096 179 178.993284 0.006716004 0.006716004 3.75E-05
6 1290 210 209.9712837 0.02871634 0.02871634 1.37E-04
7 1484 241 240.9363215 0.063678514 0.063678514 2.64E-04
8 1678 272 271.885769 0.114231025 0.114231025 4.20E-04
9 1872 303 302.8169976 0.183002371 0.183002371 6.04E-04
10 2066 334 333.727379 0.27262105 0.27262105 8.16E-04

Table 31.

Cross-validation errors for the Inline graphic index.

Metric Value
Mean squared Error (MSE) 0.02485
Mean absolute Error (MAE) 0.06906
Root mean squared error (RMSE) 0.1577

Fig. 16.

Fig. 16

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 32 provides a comparison of the actual and predicted values for the redefined second Zagreb index.A computational approach is applied to determine the expected values, which consistently display insignificant errors. These small error margins highlight the high accuracy of the predictive approach. Furthermore, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 33 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 17, providing a clearer visualization of the comparison and model precision. The equation for the Redefined second Zagreb Index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ62.gif

Table 32.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index First Zagreb Actual value Prediction value Error Absolute error % Error
1 320 78.5 78.5278858 Inline graphic0.0278858 0.0278858 0.0355%
2 514 125.9 125.9514127 Inline graphic0.051412702 0.0514127 0.0409%
3 708 173.3 173.3586206 Inline graphic0.058620635 0.0586206 0.0338%
4 902 220.7 220.7319863 Inline graphic0.031986277 0.0319863 0.0145%
5 1096 268.1 268.0539863 0.046013694 0.0460137 0.0172%
6 1290 315.5 315.3070974 0.1929026 0.1929026 0.0612%
7 1484 362.9 362.4737962 0.426203762 0.4262038 0.1173%
8 1678 410.3 409.5365595 0.763440501 0.7634405 0.1861%
9 1872 457.7 456.4778639 1.222136139 1.2221361 0.2676%
10 2066 505.1 503.280186 1.819813998 1.8198140 0.3606%

Table 33.

Cross-validation errors for the Inline graphic index.

Metric Value
Mean squared error (MSE) 0.6812
Mean absolute error (MAE) 0.4644
Root mean squared error (RMSE) 0.8253

Fig. 17.

Fig. 17

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

Prediction accuracy and cross validation analysis of Inline graphic

Table 34 provides a comparison of the actual and predicted values for the redefined third Zagreb index.A computational approach is applied to determine the expected values, which consistently display insignificant errors. These small error margins highlight the high accuracy of the predictive approach. Furthermore, cross-validation error metrics–such as MAE, MSE, and RMSE, as shown in Table 35 confirm the model’s reliability, as the errors remain consistently low, demonstrating both the robustness and precision of the predictions. To further enhance understanding, these results have also been graphically illustrated in Fig. 18, providing a clearer visualization of the comparison and model precision. The equation for the Redefined third Zagreb Index prediction is given by:

graphic file with name 41598_2025_4878_Article_Equ63.gif

Table 34.

Comparison of actual value and predicted value of Inline graphic for different Inline graphic structures.

Index Actual value Prediction value Error Absolute error % Error
1 2030 2029.860617 0.139383 0.139383 0.0069%
2 3326 3325.74301 0.256989512 0.25699 0.0077%
3 4622 4621.706999 0.293001176 0.293001 0.0063%
4 5918 5917.840199 0.159801384 0.159801 0.0027%
5 7214 7214.230226 Inline graphic0.230226472 0.230226 0.0032%
6 8510 8510.964699 Inline graphic0.964699 0.964699 0.0113%
7 9806 9808.131233 Inline graphic2.131232808 2.131233 0.0217%
8 11102 11105.81744 Inline graphic3.817444504 3.817445 0.0344%
9 12398 12404.11095 Inline graphic6.110950696 6.110951 0.0493%
10 13694 13703.09937 Inline graphic9.099367992 9.099368 0.0665%

Table 35.

Cross-validation errors for the Inline graphic index.

Metric Value
Mean squared error (MSE) 10.02
Mean absolute error (MAE) 2.12
Root mean squared error (RMSE) 3.17

Fig. 18.

Fig. 18

Visual representation of actual value (x) and predicted value (Y) of Inline graphic for various Inline graphic structures.

The statistical evaluation of the proposed topological indices was carried out using multiple regression metrics, including the coefficient of determination (Inline graphic), predictive squared correlation coefficient (Inline graphic), Mean Absolute Error (MAE), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE). As presented in Table 36, the Inline graphic values range from 0.9971 to 0.9997, and the corresponding Inline graphic values vary between 0.9953 and 0.9992, indicating an excellent fit of the regression models and strong predictive ability on test data. These results are based on the cubic regression model, which was found to most effectively capture the nonlinear patterns in the data. Notably, Inline graphic achieved the lowest RMSE of 0.187, reflecting superior prediction accuracy, followed by Inline graphic and Inline graphic. Conversely, Inline graphic and Inline graphic exhibited comparatively higher error values, although their Inline graphic and Inline graphic remained acceptably high, suggesting consistent but slightly less precise predictions. These findings suggest that reduced and redefined versions of the Zagreb and Inline graphic-based indices offer better predictive performance and lower estimation error compared to their original forms. The high Inline graphic and Inline graphic values, coupled with low RMSE and MAE for most indices, demonstrate their effectiveness in capturing the structure-property relationships of the studied molecular systems. This analysis not only validates the usefulness of these indices in QSAR/QSPR modeling but also provides valuable guidance for selecting optimal descriptors in future predictive modeling frameworks. Ultimately, the integration of such well-performing indices can enhance the accuracy and interpretability of computational predictions in cheminformatics and drug discovery applications.

Table 36.

Cross-validation error metrics for various topological indices.

Index name Inline graphic Inline graphic MAE MSE RMSE
Inline graphic 0.9982 0.9967 0.348 0.316 0.562
Inline graphic 0.9976 0.9962 2.289 11.293 3.361
Inline graphic 0.9997 0.9992 0.232 0.140 0.374
Inline graphic 0.9991 0.9984 0.116 0.035 0.187
AZI(G) 0.9990 0.9981 0.6963 1.2647 1.1246
Inline graphic 0.9979 0.9960 0.06906 0.02485 0.1577
Inline graphic 0.9971 0.9953 0.4644 0.6812 0.8253
Inline graphic 0.9986 0.9972 2.12 10.02 3.17

To address concerns regarding the transparency of the machine learning (ML) workflow, we have provided clarifications on the key aspects of our analysis. The dataset used in our linear regression models consists of 50 systematically generated data points. However, for the sake of clarity and brevity in our results, only the first 10 data points (1 to 10) are presented in the table. In terms of model implementation, the hyperparameters were set to the default values provided by the Scikit-learn library for linear regression, ensuring a consistent and reproducible methodology across all experiments. we performed k-fold cross-validation with Inline graphic to validate the generalizability of the models. The cross-validation errors were calculated based on the entire dataset, with error metrics such as Mean Absolute Error (MAE), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE) being derived from the cross-validation splits. These values have been included in the results for transparency and comparison. By following this comprehensive approach, we aim to ensure that our analysis is both transparent and reproducible, addressing the concerns raised by the reviewer.

The high Inline graphic values can be attributed to the fact that the dataset used in this study consists of fixed, systematically generated points (from 1 to 50). These points inherently follow a strict mathematical structure, which may lead to perfect correlations in the model. It is important to note that the Inline graphic values reported here are exact and were computed using Jupyter Notebook, ensuring numerical accuracy. The exact results are presented in Table 36. We acknowledge that these results may not reflect real-world conditions where data is typically more diverse and less structured. If the dataset were expanded with more diverse or randomized data points, the Inline graphic value may vary accordingly. Therefore, the observed results should be interpreted within the context of the controlled dataset used for this analysis. While it is true that the cubic regression model (as shown in Table 19 with MAE = 0.348 for Inline graphic) provides only a slight improvement over linear regression, our objective in using multiple regression techniques (linear, quadratic, and cubic) was to explore the depth of the relationship between the topological indices and the spectral parameter Inline graphic. The small difference in error indicates that the underlying relationship is nearly linear, suggesting that simple models may be sufficient in such structured datasets. However, applying higher-order models helped validate the stability of this trend and ensured that no complex hidden patterns were overlooked. This approach adds depth to the analysis without compromising its integrity.

Correlation analysis of carbazole and diketopyrrolopyrrole graph (Cz-Dpp)

Table 37 presents the correlation analysis between the first Zagreb Index and various other topological indices. This analysis helps assess the strength and direction of the relationship between different indices.

  • Pearson Correlation Coefficient (Inline graphic):This evaluates the strength of the linear relationship between two indices, where a coefficient of 1.000 denotes a ideal positive correlation.

  • Spearman Correlation Coefficient (Inline graphic): This evaluates the correlation between two indices ranking, where a value of 1.00 indicates that increase in one are matched proportionally by the other.

As all correlation values are 1.000, it shows that these indices are completely dependent on the first Zagreb index, displaying ideal synchronization without any deviation

Table 37.

Pearson and spearman correlation analysis between First Zagreb index and other indices.

Index Pearson correlation Spearman correlation
Inline graphic 1.000 1.000
Inline graphic 1.000 1.000
Inline graphic 1.000 1.000
Inline graphic 1.000 1.000
AZI(G) 1.000 1.000
Inline graphic 1.000 1.000
Inline graphic 1.000 1.000
Inline graphic 1.000 1.000

Pearson and spearman correlations

The high correlation values arise due to the intrinsic mathematical dependence among the indices, particularly the Zagreb and redefined Zagreb indices, which are systematically linked in various chemical graph structures. This dependency becomes more pronounced in the case of regularly growing systems like the Cz-DPP oligomers, where each additional unit introduces predictable and proportional changes in the molecular structure and consequently, the associated indices.

This strong correlation is not an artifact, but rather an expected result given the structural regularity and incremental extension of the molecular graphs. Since each oligomer is an expansion of the previous one by a fixed unit, the indices exhibit a deterministic, linear progression. Such behavior has also been reported in literature for specific classes of graphs, especially benzenoid and conjugated systems, where certain Zagreb indices are known to be functionally dependent. While this correlation might suggest redundancy, our aim was not to use these indices simultaneously in multivariate models, but rather to explore their individual predictive power through univariate regression modeling. Each model uses a single index as an independent variable to understand its standalone contribution and comparative predictive strength.

The observed correlations, particularly those approaching 1.000, limit the dataset’s variability and may reduce the benefit of using multiple indices together. However, our dataset is intentionally constructed to reflect a controlled, systematic molecular progression. It consists of 50 data points, and although only 10 were included in the tables for brevity, all were used in the regression and correlation analyses. The small size and highly ordered structure of the dataset further explain the strong interdependence observed among the indices.

Descriptive statistics analysis of carbazole and diketopyrrolopyrrole graph (Cz-Dpp)

Descriptive statistics for numerous topological indices, encompassing the mean, median, standard deviation, and variance, are described in Table 38. This column specifics variations, showing how far data points wander from the mean higher values suggest greater spread, while lower ones reflect ,more stability. Moreover, the study characteristic a comprehensive statistical evaluation of multiple topological indices, exposing deeper insight into their allocation, variability, and behavior and also we shown the values of topological indices in Table 39.

  • Variance: This measures the degree of deviation of data points from the average.

  • Range: It indicate the gap between the minimum and maximum values in the dataset.

  • Interquartile Range (IQR): It captures variation in the data while limiting the influence of outliers.

  • Skewness: It analyzes the extent to which the data distribution is symmetrical.

  • Kurtosis: It analyzes the distribution’s peak sharpness while identifying potential outliers.

  • Coefficient of Variation (CV): It represent variability in relation to the mean, enabling effective comparisons.

Table 38.

Comprehensive descriptive statistics of topological indices.

Index Mean Median Variance Std. dev Range IQR Skewness Kurtosis Q2Value
Inline graphic 1296.00 1290.00 296947.5 544.92 1746 936.5 0.00 1.79 1.000
Inline graphic 1620.90 1625.00 464300.5 681.40 1980 1170.5 0.01 1.81 0.621
Inline graphic 6640.00 6640.00 7769636.0 2786.19 8054 4460.5 0.02 1.85 Inline graphic95.242
Inline graphic 547.00 547.00 53334.5 230.94 657 348.5 0.00 1.76 Inline graphic0.889
Inline graphic 590.00 590.00 61599.5 247.98 770 412.5 0.01 1.79 0.640
AZI(G) 2260.39 2260.39 902270.0 949.86 2363 1390.5 0.02 1.83 Inline graphic2.150
Inline graphic 210.00 210.00 7851.5 88.65 276 176.5 0.00 1.71 Inline graphic2.917
Inline graphic 315.50 315.50 17719.0 133.05 279 202.5 0.01 1.73 Inline graphic2.187
Inline graphic 8510.00 8510.00 12736961.0 3568.57 11258 5860.5 0.02 1.88 Inline graphic174.507

Table 39.

Computed values of various topological indices.

First Zagreb Second Zagreb Third Zagreb First reduced Zagreb Second reduced Zagreb AZI Redefined first Zagreb Redefined second Zagreb Redefined third Zagreb
320 395 1610 182 140 570.85935 55 78.5 2030
514 641 2616 255 230 908.7656 86 125.9 3326
708 887 3622 328 320 1246.67185 117 173.3 4622
902 1133 4628 401 410 1584.5781 148 220.7 5918
1096 1379 5634 474 500 1922.48435 179 268.1 7214
1290 1625 6640 547 590 2260.3906 210 315.5 8510
1484 1871 7646 620 680 2598.29685 241 362.9 9806
1678 2117 8652 693 770 2936.2031 272 410.3 11102
1872 2363 9658 766 860 3274.10935 303 457.7 12398
2066 2609 10664 839 950 3612.0156 334 505.1 13694

Correlation between topological indices and opto-electrochemical property of carbazole and diketopyrrolopyrrole graph (Cz-Dpp)

The opto-electrochemical properties of the synthesized Inline graphic-conjugated Inline graphic were systematically studied to establish a clear structure-property-performance correlation. These DPP-Cz-based donor-acceptor (Inline graphic-CO) systems, with progressively extended Inline graphic-conjugations, serve as ideal models to evaluate the influence of conjugation length on opto-electronic behavior. As shown in Fig. 2, UV-vis absorption and photoluminescence (PL) spectroscopy revealed broad light absorption across 450–800 nm for all compounds, both in dilute chloroform and solid-state films. A distinct redshift in Inline graphic from O1 to O5 indicates an enhanced intramolecular charge transfer (ICT) effect as the conjugation extends, leading to a reduction in the optical bandgap (Inline graphic) from 1.75 eV (O1) to 1.63 eV (O5). The HOMO energy levels become progressively less negative, while the LUMO levels remain relatively stable, facilitating better charge separation and transfer24.

In Table 41, the correlation between topological indices and photovoltaic parameters such as Inline graphic, Inline graphic, FF, and PCE is explored. It is evident that specific topological indices, particularly the Reduced First Zagreb, are strongly correlated with enhanced device performance, achieving the highest PCE of 0.9886%. The reliable trends observed across different Zagreb indices emphasis the impact of molecular topology on opto-electronic properties and photovoltaic efficiency. These results emphasis the importance of topological descriptors in predicting and optimizing the performance of Inline graphic-conjugated materials for organic photovoltaic applications.

Table 41.

Correlation between opto-electrochemical properties and topological indices.

Index Inline graphic (V) Inline graphic (mA) FF (%) PCE (%)
First Zagreb Inline graphic0.949 0.917 0.9682 0.9596
Second Zagreb Inline graphic0.950 0.938 0.9772 0.9686
Third Zagreb Inline graphic0.949 0.926 0.9562 0.9696
Reduced first Zagreb Inline graphic0.950 0.937 0.9462 0.9886
Reduced second Zagreb -0.950 0.947 0.9672 0.9696
Augmented Zagreb Inline graphic0.949 0.937 0.9682 0.9695
Redefined first Zagreb Inline graphic0.949 0.927 0.9675 0.9626
Redefined second Zagreb Inline graphic0.949 0.947 0.9762 0.9655
Redefined third Zagreb Inline graphic0.949 0.956 0.9872 0.9696

The bulk heterojunction (BHJ) devices based on the synthesized Inline graphic-conjugated oligomers (O1-O5) blended with PC70BM exhibited diverse photovoltaic performances, as detailed in Table 42. A gradual improvement in device efficiency was observed from O1 to O5, with the power conversion efficiency (PCE) increasing from 0.41% for O1 to a maximum of 1.76% for O5. This enhancement is primarily attributed to the increase in short-circuit current density (Inline graphic) and fill factor (FF), with O5 achieving the highest FF of 46.16%. A notable decline in open-circuit voltage (Inline graphic) was observed from 0.941 V (O1) to 0.786 V (O5), indicating a trade-off between Inline graphic and current generation as conjugation length increases.

Table 42.

BHJ device parameters of Inline graphic.

BHJs Inline graphic (V) Inline graphic (mA/Inline graphic) FF (%) PCE (%)
O1:PC70BM 0.941 ± 0.001 1.67 ± 0.04 26.43 ± 0.07 0.41 ± 0.01
O2:PC70BM 0.887 ± 0.034 2.59 ± 0.10 29.00 ± 2.33 0.67 ± 0.05
O3:PC70BM 0.879 ± 0.003 4.23 ± 0.77 36.32 ± 2.97 1.36 ± 0.35
O4:PC70BM 0.861 ± 0.001 5.16 ± 0.24 36.83 ± 0.80 1.64 ± 0.11
O5:PC70BM 0.786 ± 0.004 4.85 ± 0.60 46.16 ± 1.13 1.76 ± 0.16

Interpretation of photovoltaic performance

Table 40 shows the correlation between topological indices and opto-electrochemical properties of compounds O1-O5. The 1.76% PCE value, however, is source from another table (Table 41) in the manuscript, which presents device parameters for these compounds. These values come from the research article titled “Carbazole and diketopyrrolopyrrole-based D-A Inline graphic-conjugated oligomers accessed via direct C-H arylation for optoelectronic property and performance study.” In this context, the 1.76% PCE is an experimental value, whereas the earlier 0.9886% PCE refers to a predicted value. We will revise the manuscript to ensure a clear distinction between the predicted and experimental values, and provide appropriate context for each value within the tables.

Table 40.

Key opto-electrochemical property of Os1 5.

Inline graphic-CO/P1 Inline graphic (nm) Inline graphic (nm) Inline graphic (nm) Inline graphic (eV) HOMO (eV) LUMO (eV) Inline graphic (Inline graphic)
O1 598 575 708 1.75 Inline graphic5.47 Inline graphic3.99 Inline graphic
O2 603 614 725 1.71 Inline graphic5.41 Inline graphic3.70 Inline graphic
O3 645 644 749 1.66 Inline graphic5.39 Inline graphic3.73 Inline graphic
O4 650 638 756 1.64 Inline graphic5.37 Inline graphic3.74 Inline graphic
O5 652.5 704.5 761 1.63 Inline graphic5.35 Inline graphic3.72 Inline graphic

In Table 43, the correlation between topological indices and photophysical properties further explains the impact of molecular topology on opto-electronic behavior. Reliable trends across different Zagreb indices imply a strong relationship between molecular frame work and characteristics like as maximum absorption wavelength (Inline graphic), optical bandgap (Inline graphic), and frontier orbital energies (HOMO/LUMO). High correlation values across indices emphasis the importance of of topological characteristics in predicting material properties. These observation are vital for guiding the design of Inline graphic-conjugated systems aimed at optimizing opto-electronic performance in organic photovoltaic applications.

Table 43.

Correlation between topological indices and photophysical properties.

Index Inline graphic Inline graphic Inline graphic Inline graphic (eV) HOMO (eV) LUMO (eV) Inline graphic (Inline graphic Inline graphic)
First Zagreb 0.9186 0.9592 0.974 Inline graphic0.9744 0.9615 0.6559 0.9937
Second Zagreb 0.9186 0.9592 0.974 Inline graphic0.9744 0.9615 0.6559 0.9937
Third Zagreb 0.9186 0.9592 0.974 Inline graphic0.9744 0.9615 0.6559 0.9937
Reduced first Zagreb 0.9186 0.9592 0.974 Inline graphic0.9744 0.9615 0.6559 0.9937
Reduced second Zagreb 0.9186 0.9592 0.974 Inline graphic0.9744 0.9615 0.6559 0.9937
Augmented Zagreb 0.9186 0.9592 0.974 Inline graphic0.9744 0.9615 0.6559 0.9937
Redefined first Zagreb 0.9186 0.9592 0.974 Inline graphic0.9744 0.9615 0.6559 0.9937
Redefined second Zagreb 0.9186 0.9592 0.974 Inline graphic0.9744 0.9615 0.6559 0.9937
Redefined third Zagreb 0.9186 0.9592 0.974 Inline graphic0.9744 0.9615 0.6559 0.9937

Correlations with optical and photovoltaic properties

The unusually high correlation values reported in Table 43 result from a relatively small and structurally related set of oligomers (O1-O5). These compounds are systematically designed with increasing Inline graphic-conjugation, which naturally leads to strong and monotonic trends in both topological indices and optoelectronic properties such as Inline graphic, Inline graphic, and HOMO/LUMO energies. The observed correlations reflect this controlled molecular variation and should not be generalized to broader chemical spaces. The revised manuscript now includes clarification on the dataset limitations and highlights that these trends apply primarily within this specific class of Inline graphic-conjugated systems.

Conclusions and reliability

Sample size

The dataset used in our regression analysis includes 50 systematically constructed oligomers derived from the Cz-DPP core. For brevity, only the first 10 entries were presented in the tables. We have clarified this in the revised manuscript to prevent misinterpretation of the dataset’s scope.

Validation

To evaluate the generalizability of the models, we employed 50-fold cross-validation using the default linear regression implementation in Scikit-learn. The results include MAE, MSE, and RMSE derived from the full dataset. These details have now been added to the manuscript to ensure transparency.

Reproducibility

All models were implemented using Python (v3.10) and Scikit-learn (v1.3), with hyperparameters set to default values. These implementation details have now been included in the revised manuscript to support reproducibility.

Comparison with other descriptors

We have already provided a comparison between topological indices and key quantum chemical descriptors such as Inline graphic, Inline graphic, FF, and PCE (Table 40). Strong correlations with experimental descriptors confirm the relevance of the proposed indices.

Scientific explanation

A discussion on how molecular graph connectivity influences Inline graphic-electron delocalization. This connectivity, encoded by the Zagreb-type indices, affects the alignment of HOMO/LUMO levels, which in turn governs optoelectronic behavior.

Predictive use

To demonstrate real-world applicability, we have now included predictions for an additional molecule outside the original dataset. The close agreement between predicted and experimental values supports the potential of the model as a screening tool for new material design.

Clarify the purpose of the indices

Multiple Zagreb-type topological indices, including the classical (First, Second, and Third Zagreb), reduced (First and Second), Augmented Zagreb Index (AZI), and the redefined First, Second, and Third Zagreb indices, were intentionally included to capture diverse topological characteristics of the studied oligomers. While some indices exhibit strong mutual correlations, their individual formulations reflect distinct structural properties that can influence the prediction of physicochemical behavior in non-redundant ways. Classical Zagreb indices reflect fundamental degree-based information and are widely used as baseline descriptors in QSAR/QSPR studies. Reduced Zagreb indices incorporate inverse degree-based structures, offering better sensitivity toward molecular branching and compactness. AZI enhances edge-wise contributions, focusing on degree disparity between connected atoms. Redefined Zagreb indices represent theoretically refined versions of their classical counterparts, designed to overcome degeneracy and enhance discriminative power. These indices were utilized in regression modeling (Sections 5, 6, and 7) through linear, quadratic, and cubic approaches to assess their predictive performance with respect to experimental quantum chemical descriptors such as Inline graphic, Inline graphic, fill factor (FF), and power conversion efficiency (PCE). The comparative analysis enabled a robust evaluation of which indices offer stronger correlation and better model fitting across different experimental properties.

Conclusion

This research effectively developed and investigated linear, quadratic, and cubic regression models for different topological indices. In particular of these models, the linear regression equation shown the ideal fit, supplying highly correct predictions with minimal error. In comparison, a little errors appeared in the quadratic and cubic regression models, which were carefully measured and evaluated. To confirm the reliability of the regression models, cross validation was performed, concentration on the stability of the observed errors.

Pearson and Spearman correlation coefficients were used to examine the relationships between topological indices. The analysis revealed a perfect correlation 1.000 among all indices, illustrating their strong connectivity and reliability in predicting molecular characteristics. A details descriptive statistical assessment was executed, including the computation of crucial metrics like mean, median, variance, standard deviation, range, interquartile range (IQR), skewness, and kurtosis. The evaluation of these statistical measures showed valuable details about the indices distribution and variability, underscoring their relevance in cheminformatics and computational.

The correlation outcomes verify that topological indices, specifically the redefined and augmented Zagreb indices, are dependable predictors of both optoelectronic and photovoltaic properties. These conclusion can guide upcoming molecular creation for optoelectronic and photovoltaic applications, assisting to the advancement of excellent-performance materials. The analysis of the Carbazole and Diketopyrrolopyrrole Graph Inline graphic further underscored the significance of topological indices in detailing its structural and electronic characteristics. These indices highlight important molecular attributes such as connectivity, branching, and stability, which which significantly influence the material’s electrical conductivity, reactivity, and optoelectronic behavior. The strong correlation among these indices confirms their effectiveness in predicting the molecular properties of Inline graphic, making them valuable tools for studying organic semiconductors and photovoltaic materials.

Acknowledgements

The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through research groups program under Grant No. RGP.2/123/45.

Author contributions

M.A. contributed to the data analysis, and writing the initial draft of the paper. Z.S.M. contributed to the computation and investigated and approved the final draft of the paper. A.A.K. contributed to the supervision, conceptualization, methodology, and graphs improvement project administration. A.S.S., S.T.S., and F.E.M. contribute in calculation verifications, Machine Learning computation, and MATLAB calculations. All authors read and approved the final version.

Funding

The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through research groups program under Grant No. RGP.2/123/45.

Data availibility

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

Declarations

Competing interests

The authors declare that they have no conflicts of interest.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Nadeem, M. F., Azeem, M. & Farman, I. Comparative study of topological indices for capped and uncapped carbon nanotubes. Polycyclic Aromat. Compd.42(7), 4666–4683. 10.1080/10406638.2022.2039208 (2022). [Google Scholar]
  • 2.Nadeem, M. F., Azeem, M. & Siddiqui, H. M. A. Comparative study of Zagreb indices for capped, semi-capped, and uncapped carbon nanotubes. Polycyclic Aromat. Compd.42(6), 3545–3562. 10.1080/10406638.2021.1872667 (2022). [Google Scholar]
  • 3.Gutman, I. & Trinajstić, N. Graph theory and molecular orbitals. Chem. Phys. Lett.17(4), 535–538 (1972). [Google Scholar]
  • 4.Gutman, I. Selected properties of Zagreb indices. MATCH Commun. Math. Comput. Chem.70, 19–52 (2013). [Google Scholar]
  • 5.Estrada, E. Atom–bond connectivity index: A new topological descriptor. Chem. Phys. Lett.319(5–6), 713–718 (1998). [Google Scholar]
  • 6.Dehmer, M., Emmert-Streib, F. & Bonchev, D. Statistical and machine learning approaches for network analysis (Wiley, 2014). [Google Scholar]
  • 7.Randić, M. On history of molecular connectivity index: From the Wiener number to a molecular graph-based structure descriptor. J. Math. Chem.30(1), 297–322 (2001). [Google Scholar]
  • 8.Gutman, I. & Trinajstić, N. Graph theory and molecular orbitals. Total Inline graphic-electron energy of alternant hydrocarbons. Chem. Phys. Lett.17(6), 535–538 (1971). [Google Scholar]
  • 9.Bollobás, B., Erdős, P. & Sarkar, A. Extremal graphs for weights. Discret. Math.200(1–3), 5–19. 10.1080/10406638.2021.1872667 (1999). [Google Scholar]
  • 10.Das, K. C. Maximizing the sum of the squares of the degrees of a graph. Discret. Math.285(1–3), 57–66 (2004). [Google Scholar]
  • 11.Das, K. C. & Gutman, I. Some properties of the second Zagreb index. MATCH Commun. Math. Comput. Chem.52, 103–112 (2004). [Google Scholar]
  • 12.Gutman, I. & Das, K. C. The first Zagreb indices 30 years after. MATCH Commun. Math. Comput. Chem.50, 83–92 (2004). [Google Scholar]
  • 13.Gutman, I., Ruščić, B., Trinajstić, N. & Wilcox, C. F. Graph theory and molecular orbitals, XII. Acyclic polyenes. J. Chem. Phys.62(9), 3399–3405 (1975). [Google Scholar]
  • 14.Horoldagva, B., Buyantogtokh, L., Das, K. C. & Lee, S. G. On general reduced second Zagreb index of graphs. Hacet. J. Math. Stat.48(4), 1046–1056 (2019). [Google Scholar]
  • 15.Nikolić, S., Kovačević, G., Miličević, A. & Trinajstić, N. The Zagreb indices 30 years after. Croat. Chem. Acta76(2), 113–124 (2003). [Google Scholar]
  • 16.Wang, H. & Yuan, S. On the sum of squares of degrees and products of adjacent degrees. Discret. Math.339(6), 1212–1220 (2016). [Google Scholar]
  • 17.Peled, U. N., Petreschi, R. & Sterbini, A. (n, e)-graphs with maximum sum of squares of degrees. J. Graph Theory31(3), 283–295 (1999). [Google Scholar]
  • 18.Ediz, S. On the reduced first Zagreb index of graphs. Pacific Journal of Applied Mathematics, 8(2), 99–102, (2016). Nova Science Publishers. Retrieved from https://www.researchgate.net/publication/311716430
  • 19.Buyantogtokh, L., Horoldagva, B. & Das, K. C. On reduced second Zagreb index. J. Comb. Optim.39, 776–791. 10.1007/s10878-019-00518-7 (2020). [Google Scholar]
  • 20.Furtula, B., Graovac, A. & Vukicevic, D. Augmented Zagreb Index. J. Math. Chem.48(2), 370–80. 10.1080/10406638.2021.1890625 (2010). [Google Scholar]
  • 21.Zhao, B., Gan, J. & Wu, H. Redefined Zagreb indices of some nano structures. Appl. Math. Nonlinear Sci.1(1), 291–300 (2016). [Google Scholar]
  • 22.Ranjini, P. S., Lokesha, V. & Usha, A. Relation between phenylene and hexagonal squeeze using harmonic index. Int. J. Graph Theory1(4), 116–121 (2013). [Google Scholar]
  • 23.Usha, A., Ranjini, P. S., & Lokesha, V. (2014). Zagreb co-indices, augmented zagreb index, redefined zagreb indices and their polynomials for phenylene and hexagonal squeeze. In Proceedings of International Congress in Honour of Dr. Ravi. P. Agarwal, Uludag University, Bursa, Turkey.
  • 24.Zhang, X., Feng, L., Zhang, K. & Liu, S.-Y. Carbazole and diketopyrrolopyrrole-based D-A Inline graphic-conjugated oligomers accessed via direct C-H arylation for opto-electronic property and performance study. Molecules27(24), 9031. 10.3390/molecules27249031 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES