Abstract
This paper proposes an optimized wavelet transform Hausdorff multivariate grey model (OWTHGM(1,N)) that addresses some of the weaknesses of the conventional GM(1,N) model such as inaccurate prediction and poor stability. Three improvements have been made: First, all inputs are filtered using a wavelet transform; second, a new time response function is established using the Hausdorff derivative; and finally, the use of Rao's algorithm to optimise the model's parameters as well as the -order accumulated value of the observation data described by the Hausdorff derivative. In order to demonstrate the effectiveness of OWTHGM(1,N), it is applied to predict CO2 emissions from road fuel combustion. The new model scores 1.27% MAPE and 79.983 RMSE, and is therefore more accurate than competing models. OWTHGM(1,N) could therefore serve a reliable forecasting tool and used to monitor the evolution of CO2 emissions in Cameroon. The forecast results also serve as a sound foundation for the formulation of energy consumption strategies and environmental policies.
• Modification, extension and optimization of grey multivariate model is done.
• The model is very generic can be applied to a wide variety of energy sectors.
• OWTHGM(1,N) is a valid forecasting tool that can be used to track CO2 emissions.
Keywords: CO2 emission forecasting, Grey prediction, Hausdorff derivative, Optimization
Method name: Optimized wavelet transform hausdorff multivariate grey model
Graphical abstract

Specifications table
| Subject area: | Environmental Science |
| more specific subject area: | modelling and forecasting |
| Name of your method: | Optimized wavelet transform hausdorff multivariate grey model |
| Name and reference of original method: | Y. Wang, R. Nie, X. Ma, Z. Liu, P. Chi, W. Wu, B. Guo, X. Yang, L. Zhang, A novel Hausdorff fractional NGMC(p,n) grey prediction model with Grey Wolf Optimizer and its applications in forecasting energy production and conversion of China, Applied Mathematical Modelling. 97 (2021) 381–397. 10.1016/j.apm.2021.03.047. R.V. Rao, R.B. Pawar, Constrained design optimization of selected mechanical system components using Rao algorithms, Applied Soft Computing. 89 (2020) 106,141. 10.1016/j.asoc.2020.106141. Z. Zhang, M.A. Kon, Wavelet matrix operations and quantum transforms, Applied Mathematics and Computation. 428 (2022) 127,179. 10.1016/j.amc.2022.127179. |
| Resource availability: |
World Bank statistics (https://data.worldbank.org/) International Energy Agency (https://www.iea.org/) Ministry of Energy (http://www.minee.cm/) |
Introduction
When predicting the evolution of a system and there is very little information, a major difficulty arises, which is that of accurately extracting the system's characteristic without making too strong assumptions. Deng's grey theory offers a way out [1,2]. However, when conventional multivariate grey models (GM(1,N)) are used, the results are very often inconsistent with reality, as Tien [3] pointed out. One way out is to optimise GM parameters with heuristic optimization procedures [4,5]. In some real-world scenarios, these methods have enhanced the GM(1,N) model's prediction capabilities. Despite these efforts, there is still room to improve its flexibility and forecasting performance, which is summarized as follows:
-
•
As demonstrated by Tien [3], the conventional GM(1,N) model is a factor-based model that reflects the behavioural effects of the driving variables on the system. Unfortunately, the fact that information contained in the set of driving variables is incomplete renders the conventional GM(1,N) model, and even some improved versions, unsuitable for forecasting [6,7].
-
•
The input variables might have irrelevant information (called noise) that corrupts the prediction outcomes [8].
-
•
The least squares method is used to solve grey's difference equation in order to estimate the GM(1,N) model's parameters. However, the first-order differential equation resulting from greys model is used to create the time response function. Although the differential equation and the difference equation approximate each other, they are fundamentally distinct [9]. The GM(1,N) model may become unstable as a result of this discrepancy between parameter estimation and parameter application.
Thus, in order to address the above deficiencies, this paper puts forward a new grey multivariate forecasting model based on the Hausdorff's derivative [10] and optimized by Rao's algorithm (abbreviated OWTHGM(1,N)). In order to demonstrate the efficiency and reliability of OWTHGM(1,N), a practical case is considered, which is the forecasting of the annual electricity consumption.
Methods
The conventional GM(1,N) model
The conventional GM(1,N) model [3,11,6,12] can be implemented as follows:
Step 1: Input raw data
Assume that the observation or raw data is given by the sequence where represents CO2 emissions (the dependant variable), and are the independent variables (fuel demand, prices, GDP, urban population and vehicle fleet). Also, . should be highly correlated to .
Step 2: Generate accumulated data with 1-AGO
The first-order accumulation generating operation (1-AGO) of the observation data can be defined by Eqs. (1) and (2):
| (1) |
where,
| (2) |
The superscripts (0) and (1) respectively indicate the original sequences and 1-AGO sequences.
Step 3: Design mean sequences
The following definitions (Eq. (3)) represent the mean sequences produced by consecutive terms of :
| (3) |
where:
Step 4: Establish grey's differential equation
The basic GM(1,N) model's differential equation can be written as in Eq. (4):
| (4) |
Consequently, if the matrix is invertible, the parameters can be calculated using the least-squares approach (Eq. (5)). Which is:
| (5) |
where the matrices and are given by Eqs. (5.1) and (5.2) respectively,
| (5.1) |
| (5.2) |
Step 5: Solve the system's differential equation
The initial condition can be used to find the solution to Eq. (5):
| (6) |
The fitting value of is obtained by substituting into Eqs. (4) and (6). Eq. (7) can be used to determine the fitting value for the original sequence .
| (7) |
Data filtration using wavelet transform
If there is noise or useless information in the raw data, this will corrupt the GM(1,N) model resulting in poor forecasts [8]. To prevent this, data is cleaned using a Wavelet transform (WT) which is a mathematical function that filters various scale components from a continuous-time signal. Basically, WT is a band-pass filter with its bandwidth scaled to half at each level [13]. The scaling function filters out the lowest point of the transform and allows the entire spectrum to be taken into account. Eq. (8) defines a continuous wavelet transform (CWT) of a signal :
| (8) |
where the scale and translation parameters, are denoted by and respectively.
The discrete wavelet transform (DWT) of a signal is calculated using Eq. (9):
| (9) |
where and are the sampling time and scale factor respectively. is the number of samples. The most crucial element of the signal is its low order component. The signal's identity is clearified in this component. The signal's high order component, on the other hand, is a representation of the signal's specifics.
The first step in filtering data is to determine the wavelet to be applied. This depends on the nature of the raw sample data (i.e. the signal). Although there are several criteria for choosing the appropriate wavelet [14], a simple way to do this is to perform a correlation analysis. The waveform of the signal is examined along with the shape of a wavelet (DWT or CWT). If the two match, that particular wavelet is be used as the mother wavelet for the signal. Thus, the choice will vary from one sample data to another.
Conceptually, it operates as follows: In order to produce lowpass (A1, generally known as the approximation level) and highpass (D1, also known as the detail level) subbands from a signal X, the signal is first filtered with specialized lowpass and high pass filters (see Fig. A1, in Appendix A). After filtering according to the Nyquist criterion1 [15], half of the samples are deleted. The filters often produce strong computational performance and have a limited number of coefficients.
Fig. A1.
Multilevel wavelet decomposition for a fine scale analysis.
These filters can help eliminate any aliasing brought on by down sampling when reconstructing the subbands. The lowpass subband (A1) is iteratively filtered by the same method to produce narrower subbands (A2, D2, and so on) for the following level of decomposition. Each subband's length of coefficients is divided by the total number of coefficients in the stage before it. In this way, the signal of interest can be captured by a few DWT coefficients of great size, and the signal noise is represented by smaller DWT coefficients. In this manner, DWT aids in the analysis of signals at various resolutions and narrower subbands. Additionally, it aids in signal compression and denoise.
Practically, the overall process of data filtration with DWT is done in Matlab (or other programming languages) based on the following three steps:
Step 1: Obtain the approximation and detail coefficients.
To do this, a multilevel wavelet decomposition is used. For a fine scale study, the approximation subband is broken down at several levels or scales (as in Fig. A1).
Step 2: analyse the details and identify a suitable thresholding technique.
This is done with Matlab (or other programming languages). Hard thresholding and soft thresholding are the two thresholding operations. The coefficients with magnitudes below the threshold in either operation are set to zero. The way these two methods handle coefficients with magnitudes greater than the threshold is what distinguishes them from one another. In the case of soft thresholding, the coefficients larger than the threshold are kept unaltered, whereas in the case of hard thresholding, they are decreased towards zero by subtracting the threshold value from the coefficient value. In this paper, we used the sure shrink with soft thresholding technique to denoise the data.2
A single function is used to perform the thresholding of the coefficients as well as the reconstruction of the signal using the new coefficients (see the Matlab code in Appendix B for further information). In this code, sure shrink is the thresholding approach. The first parameter ‘f’ refers to the noisy signal. ‘S’ stands for soft thresholding, and the parameter ‘sln’ represents threshold rescaling with a single noise estimate based on first level coefficients. Level indicates the wavelet decomposition level and the last parameter specifies the wavelet, which is sym6 in this case. The wden function decomposes the input signal into many levels, computes and applies a threshold to the detail coefficients, reconstructs the signal using the updated detail coefficients, and outputs it.
Step 3: Threshold the detail coefficients and reconstruct the signal.
To begin, use the wavedec function to conduct a multilevel wavelet decomposition. The noisy signals are decomposed to five levels. Along with the detail coefficients from the first to the last levels, the function also outputs the fifth level approximation coefficients. The signal's high frequencies are captured by the first level of detail coefficients. The noise in the signal makes up the majority of the high-frequency content, but abrupt signal shifts make up a portion of the high frequency.3
The details subband deserves a careful examination. The detcoef function is used to extract the coefficients, and the coefficients for each level are plotted. With increasing scale/level, the noise is significantly reduced. If we pay attention to level 1 specifics. The objective in this case is to keep abrupt transitions while removing noise. In order to accomplish this, a threshold is applied to the detail coefficients. The universal threshold is the simplest to compute and is computed using Eq. (10):
| (10) |
where is the signal and D is set of first level detail coefficients.
The novel Hausdorff GM(1,N) model
Step 1: Determine the -order accumulation
The following introduces the idea of the fractal derivative of a function with regard to a fractal measure [10]:
| (11.1) |
Eq. (11) is also known as the Hausdorff derivative. is the fractal time with scale index . If for a given function both its derivative and its fractal derivative exists, one can find an analogue to the chain rule:
| (11.2) |
Thus, from Eq. (11.2), it is possible to deduce the -order accumulated value of the observation data which then allows to define a new raw data set represented by in Eqs. (12) and (13):
| (12) |
| (13) |
Step 2: Establish grey's differential equation
The proposed model's differential equation has the following form (Eq. (14)):
| (14) |
Step 3: Solve the system's differential equation
Taking integrals over the interval on both sides of the above equation, we get Eq. (15):
| (15) |
We know that,
and from Eq. (13)
Therefore, Eq. (15) is as follows:
| (16) |
Then, Eq. (15) can be written as in Eq. (17.1):
| (17.1) |
Where the matrices and are defined as in Eqs. (17.2), (17.3) and (17.4) respectively.
| (17.2) |
| (17.3) |
| (17.4) |
If , then matrix can be estimated with Eq. (17.5):
| (17.5) |
The solution to Eq. (14) is given by Eq. (18):
| (18) |
where the function is expressed as in Eq. (19),
| (19) |
Step 4: Generate forecast values
Appendix B provides a demonstration of how Eq. (18) is obtained. The following formula can be used to find the fitted value of the original sequence :
| (20) |
Eq. (20) allows to calculate the forecast values. It is known as the wavelet transform Hausdorff grey multivariate model (abbreviated WTHGM(1,N)).
Optimizing wavelet transform Hausdorff GM(1,N) parameters
Eq. (18) can be expressed using the modified trapezoidal integral formula (given by Eq. (21)):
| (21) |
where is a parameter.
Still with modified trapezoidal integral formula, Eq. (17.3) can be rewritten as in Eq. (22):
| (22) |
where and with represents the coefficient.
An optimization procedure with the mean absolute percentage error (MAPE) set as the objective function can be used to find and . Eq. (23) gives a definition of MAPE.
| (23) |
where , and is the data size. The minimum MAPE value is then determined using a meta-heuristic approach. The ideal values of and are chosen so as to minimize the difference between the predicted CO2 and the emission over a of years. By solving the following optimization problem (Eqs. (24.1) and (24.2)), the optimal values of and are ascertained for this purpose:
| (24.1) |
| (24.2) |
Fig. 1 represents the flowchart of the WTHGM(1,N) model. A meta-heuristic algorithm is employed to solve the suggested optimization problem, as shown on Fig. 1. The condition set on the MAPE depends on the different cases. Thus, the threshold can be set to a certain , the value of which will be chosen according to the precision that we wish to achieve.
Fig. 1.
Flow chart of wavelet transform Hausdorff grey multivariate model.
Selection of meta-heuristic algorithm
Other meta-heuristic algorithms such as genetic algorithms (GA), ABC and particle swarm optimization (PSO) need parameters that are algorithm-specific in addition to usual parameters like iterations and population size. For instance, while PSO needs inertia weight and elements related to social and cognitive learning, GA have special operators such as selection, mutation probability and crossover probability, whereas ABC requires number of employed bees, onlooker bees, scout bees and limits, etc.
Incorrectly setting algorithm-specific parameters can lead to undesired results, such as increasing convergence time or falling into local optimization. However, tuning the specific parameters is a very laborious process. The optimal specific parameters may vary from model to model and may vary from dataset to dataset. The selection of optimal parameters itself is an optimization problem.
Rao's algorithms, which have been developed recently, are straightforward and relatively simple to implement as they do not rely on the use of any algorithm-specific parameters. In particular, the improved Rao algorithm [16] is an algorithm that reinforces both exploitation and exploration, and has a fast convergence speed and a strong ability to deviate from local optimization.
Improved Rao algorithm for the WTHGM(1,N) model
Here, the population consists of individuals . Initially, each individual is randomly generated while satisfying the constraints of Eq. (24.2). Population updates are carried out on local exploitation phase or global exploration phase with a probability of 0.5 to enhance both exploitation and exploration abilities.
In local exploitation phase, is sorted in incremental order of MAPE values. The first half population with the smallest MAPE values is considered as the best individuals while the worst individuals is the second half. Following that, Eqs. (25) and (26) are used to compute the local best-mean () and the local worst-mean () vectors respectively:
| (25) |
| (26) |
Here and are uniformly distributed random vectors in the range . and are the best and worst individuals in respectively. and are the mean vectors of the best and worst population respectively. The new individual is generated with Eq. (27):
| (27) |
Here and are uniformly distributed random vectors in the range . is the randomly selected other individual in . If is better than , means , otherwise .
In local exploration phase, the global population is considered. In the first iteration, is set as and from the second iteration, the new is the old or the updated with a probability of 0.5. Then the individuals in are shuffled within itself. The new individual is generated with Eq. (28):
| (28) |
Here and are the -th individual in and respectively. is the normally distributed random vector in the range .
After generating new individual , it is compared with the old . If is better than , is replaced with in . This process is repeated until the MAPE of the best individual is smaller than the criterion () or until the current number of iterations exceeds the maximum number of iterations (). The algorithmic process of improved Rao algorithm for the WTHGM(1,N) model is shown in the pseudocode below:
Pseudocode: Improved Rao algorithm for the WTHGM(1,N) model
|
Input: dataset for model , parameters for algorithm (, , ) Output: optimal values of and |
|
| 01: Decompose using function of MATLAB 02: Obtain the denoised data using function of MATLAB |
/* Data filtration |
|
03: For = 1 to 04: Randomly generate and to satisfy Eq. (24.2), and construct -th individual of the population 05: Calculate of with using Eq. (23) 06: End for 07: Construct the population 08: Initialize the global population 09: Identify the best individual with the smallest MAPE and the worst individual with the largest MAPE in the population |
/* Population initialization |
| 10: 11: While> 12: Calculate the mean vector of the best population 13: Calculate the mean vector of the worst population 14: If < 0.5 then 15: For = 1 to 16: If < 0.5 then /* Local exploitation phase 17: Calculate the local best-mean vector using Eq. (A1) 18: Calculate the local worst-mean vector using Eq. (A2) 19: Randomly select the other individual in population 20: Generate new individual using Eq. (A3) 21: Else/* Global exploration phase 22: Randomly shuffle the individuals of the global population 23: Generate new individual using Eq. (A4) 24: End if 25: Change to satisfy Eq. (24.2) 26: Calculate with using Eq. (23) 27: If< then 28: End for 29: Identify the best individual with the smallest MAPE and the worst individual with the largest MAPE in the population 30: 31: If then break while loop 32: End while |
/* Population update |
| 33: Output | /* Output |
Applications and numerical simulation
Data selection, filtering and performance measurement
In order to verify the performance of the OWTHGM(1,N) model, we implemented it to forecast CO2 emissions from Cameroon's road fuel combustion. In the OWTHGM(1,N) model implemented in this simulation, the filtering technique applied is DWT (i.e. Eq. (9)). This choice follows from the approach explained at the end of the section entitled «Data filtration using wavelet transform».
The selected drivers are: CO2 emissions, urban population (UP), GDP, road fuel consumption (RFC), fuel prices (PR) and vehicle fleet (VF) [17,18]. Data used in this study cover the period from 1995 to 2020. A prerequisite for good modelling is the selection of appropriate variables, i.e. those that have a significant influence on CO2 emissions. So the first step is to ensure that these variables are highly correlated with CO2 emissions before including them as independent variables in the forecasting model.
Correlation analysis (presented in Table 1), shows that CO2 emissions are significantly correlated with selected variables. Therefore, it can be concluded that they can be used as inputs to model and forecast CO2 emissions. However, it is better to filter the data with WT before the forecasting step. This precaution is crucial because the data may have noise that could introduce data matrix ill-conditioning problems in the estimation of grey model parameters.
Table 1.
Correlation results between the variables used in this study.
| CO2 | GDP | UP | RFC | PR | VF | |
|---|---|---|---|---|---|---|
| CO2 | 1 | 0.99a | 0.82a | 0.94a | 0.90b | 0.92a |
| GDP | 0.99a | 1 | 0.87b | 0.93b | 0.93a | 0.96a |
| UP | 0.82a | 0.87b | 1 | 0.70a | 0.90a | 0.93a |
| RFC | 0.94a | 0.93b | 0.70a | 1 | 0.85a | 0.86a |
| PR | 0.90b | 0.93a | 0.90a | 0.85a | 1 | 0.86b |
| VF | 0.92a | 0.96a | 0.93a | 0.86a | 0.86b | 1 |
respectively denotes significantivity at 5%, 1% and 0.1% thresholds.
Forecast accuracies are assessed based on: Root mean square error (RMSE), fitting degree (FD), Absolute percentage error (APE) and mean APE (MAPE), defined in Eqs. (29)–(31):
| (29) |
| (30) |
| (31) |
where represents the th simulation error between actual CO2 emissions and predicted outcomes . is the total number of predictions and is the mean of .
Scores of MAPE and RMSE closest to zero indicate the best accuracy. Table 2 shows the threshold values for MAPE. Coming to FD, a score greater than 0 indicates a very high precision, and a score between and indicates a good forecasting accuracy. Overall, the more score is close to 1.0, the more accurate the forecasting model is.
Table 2.
| MAPE(%) | Accuracy level | MAPE(%) | Accuracy level |
|---|---|---|---|
| ]0;5] | Class I: Very high precision | ]10;20] | Class III: Average precision |
| ]5;10] | Class II: Good precision | ]20;+∞[ | Class IV: Low precision |
The forecasting model may overfit or underfit in case of data leakage. To ensure that this does not happen, the data set is divided into two (training and test sets). These two sets are concealed from each other so that there is no leakage.4
Modelling CO2 emissions
Prediction of CO2 emissions from Cameroon's road fuel combustion are estimated with WTHGM(1,N) optimized by Rao's algorithm (denoted OWTHGM). For validation purposes, we also compare the estimates with GM(1,N) and WTHGM(1,N) models, as well as multilinear regression (MLR) as applied in the work of Karakurt and Aydin [21]. Training data spans from 1995 to 2017 while test data cover the period or this, the reference and new models are implemented using data collected over the period 1995–2017 to train them (i.e. parameterise the models), while data from 2018 to 2020 are used to test the forecasting performance of each model. In all, 24 simulations were performed on Matlab R2021a using a PC with 8.0 GB RAM. To go systematically, here is how to proceed:
Stage 1: Determining the filtering technique
We start by creating a copy of the raw data. One copy is filtered to remove any potential disturbances while the other copy is used as is. As mentioned above, we examined the shape of the wavelet and it matched with the DWT. Thus we apply Eq. (9) to denoise the input data. The next stage is modelling.
Stage 2: Modelling the WTHGM(1,N)
At this stage the unfiltered copy is used to establish the WTHGM(1,N) model. To do this, Eqs. (11) to (20) are applied.
Stage 3: Optimisation of the WTHGM(1,N)
Here, we use the copy of the raw data that has been filtered and build the OWTHGM(1,N) model given by Eqs. (11) to (28) (see pseudocode). With Rao's algorithm, the population size was fixed at 50 for 105 iterations. In the specific context of our data, we obtained the optimum parameters shown in Table 3:
Table 3.
Optimum parameters obtained with data on CO2 emissions from road transport in Cameroon.
| GM(1,N) | 1 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 |
| WTHGM(1,N) | 0.884 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 |
| OWTHGM(1,N) | 0.148 | 0.376 | 0.425 | 8.9E-4 | 0.159 | 3.7E-4 | 0.026 | 0.001 |
Stage 4: Generating all results
Finally, we generate all the forecast results given by the WTHGM(1,N) and OTHGM(1,N) models. From these results, we calculate the performance of both models to determine their MAPE, FD and RMSE. This allows us to see which model was more accurate.
As shown in Table 4, it can be seen that with the training data (1995–2017), the conventional GM(1,N) model, WTHGM(1,N), its optimised version (OWTHGM), and MLR have similar performance in terms of MAPE. They all score a MAPE below 5%, which allows concluding that they behave like class I models according to Table 2. However, we note that the classical GM(1,N), WTHGM(1,N) and MLR models slightly outperform the new model in terms of RMSE, FD and MAPE. Fig. 2a shows the fit curves of the new OWTHGM(1,N) model and the competing models. It is obvious that the OWTHGM(1,N) model (red curve) needs a short fitting time (represented by the first three predictions in the training phase) before it can efficiently extract the systems’ information. This delay is confirmed by its FD, which is only 95% (see Table 4), the lowest of the four models. This explains why in Fig. 2b the peak of APEs is observed on the column of OWTHGM(1,N) model, indicating a lower forecasting accuracy in the training phase.
Table 4.
Fitted and predicted values of CO2 emissions (in kt) with training data.
| Year | Real data | GM(1,N) | MLR | WTHGM(1,N) | OWTHGM(1,N) |
|---|---|---|---|---|---|
| 1995 | 1668 | 1668 | 1606.71 | 1668 | 1668 |
| 1996 | 1761 | 1721.73 | 1671.70 | 1722.76 | 1291.66 |
| 1997 | 1708 | 1730.46 | 1704.97 | 1754.76 | 1641.11 |
| 1998 | 1730 | 1756.61 | 1791.29 | 1790.21 | 1727.09 |
| 1999 | 1821 | 1781.10 | 1864.94 | 1813.34 | 1756.08 |
| 2000 | 1876 | 1802.23 | 1862.31 | 1824.51 | 1744.61 |
| 2001 | 1637 | 1827.01 | 1862.94 | 1852.04 | 1818.67 |
| 2002 | 1856 | 1887.73 | 1849.80 | 1960.19 | 2027.60 |
| 2003 | 1859 | 1940.24 | 1820.09 | 2012.40 | 1951.53 |
| 2004 | 2065 | 1967.04 | 1919.09 | 2011.08 | 1927.45 |
| 2005 | 2099 | 2003.21 | 2051.08 | 2039.87 | 2052.74 |
| 2006 | 2165 | 2039.29 | 2049.09 | 2070.10 | 2107.53 |
| 2007 | 2107 | 2075.13 | 2085.56 | 2098.51 | 2148.74 |
| 2008 | 1869 | 2095.80 | 2133.20 | 2086.26 | 2070.28 |
| 2009 | 2128 | 2134.90 | 2103.10 | 2101.08 | 2040.63 |
| 2010 | 2184 | 2246.84 | 2265.30 | 2230.97 | 2188.46 |
| 2011 | 2514 | 2404.99 | 2420.01 | 2414.69 | 2357.28 |
| 2012 | 2675 | 2600.24 | 2633.33 | 2649.13 | 2624.88 |
| 2013 | 2786 | 2822.20 | 2828.60 | 2900.19 | 2846.55 |
| 2014 | 2906 | 3027.95 | 2978.35 | 3104.76 | 3019.03 |
| 2015 | 3183 | 3183.27 | 3143.31 | 3222.14 | 3125.21 |
| 2016 | 3287 | 3289.80 | 3291.79 | 3285.38 | 3269.09 |
| 2017 | 3413 | 3363.69 | 3360.44 | 3309.34 | 3317.46 |
| MAPE | 3.26% | 3.43% | 3.65% | 4.97% | |
| FD | 0.97 | 0.97 | 0.97 | 0.95 | |
| RMSE | 90.61 | 96.05 | 102.60 | 142.67 |
Fig. 2.
Illustration of the fitting curves and APE during the training phase.
When data is hidden from each model to check whether they are able to predict CO2 emissions over the period 2018–2020 (test phase), OWTHGM(1,N) model significantly outperforms the competing models. It appears that only MLR can cope with the novel model. It can be seen in Fig. 3a how the predictive curve of OWTHGM(1,N) comes closest to real data. Moreover, FD criteria shows that OWTHGM(1,N) model has the best fit with a score of 99% while its MAPE and RMSE are the closest to zero with scores of 1.27% and 79.99 respectively (see Table 5). Fig. 3b confirms this performance as the novel model actually has the smallest APE distribution.
Fig. 3.
Illustration of the fitting curves and APE during the training phase.
Table 5.
Fitted and predicted values of CO2 (in kt) emissions with test data.
| Year | Real data | GM(1,N) | MLR | WTHGM(1,N) | OWTHGM(1,N) |
|---|---|---|---|---|---|
| 2018 | 3494 | 3432.09 | 3440.75 | 3354.21 | 3475.86 |
| 2019 | 3645 | 3491.73 | 3673.12 | 3409.59 | 3655.95 |
| 2020 | 3779 | 3498.25 | 3970.02 | 3423.04 | 3891.08 |
| MAPE | 4.46% | 2.45% | 6.62% | 1.27% | |
| FD | 0.95 | 0.98 | 0.93 | 0.99 | |
| RMSE | 225.863 | 136.875 | 301.472 | 79.983 |
The model proposed in this paper generates low forecasting errors (less than 5%), leading to the conclusion that OWTHGM(1,N) is a reliable forecasting tool just like its competitors. The novel model can forecast CO2 emissions more precisely as well.
As for MLR model, the regression of (i.e. CO2 emissions) on independent variables (i.e. ) reveals that GDP, UP and VF are the most significant determinants of CO2 with contributions of 37.1%, 39.8% and 21.6% respectively. R-squared value is 0.97, indicating a strong relationship between CO2 emissions and the independent variables. The implication is that the independent variables used in this study manage to explain 97% of Cameroon's road transport CO2 emissions. Only 3% of variation remains unexplained, which is certainly due to insufficient data. However, even if more data were collected and other independent variables included, it would never be possible to explain all variations in CO2 emissions [22]. Given the performance of OWTHGM(1,N), it can be concluded that the proposed model is better at predicting future CO2 emissions without the need to know the underlying functional relationship between the different variables.
In order to strengthen the validation of the new model, we also applied it to other datasets, namely the annual CO2 emissions from the Nigerian transport sector obtained from reference [23]. We also divided the data into two groups (training and test sets). The results from the training set are shown in Table 6.
Table 6.
Adjusted and predicted values of CO2 emissions (in Mt) from Nigeria (training data).
| Year | Real data | GM(1,N) | MLR | WTHGM(1,N) | OWTHGM(1,N) |
|---|---|---|---|---|---|
| 1995 | 15.04 | 17.35 | 16.12 | 14.00 | 14.95 |
| 1996 | 16.56 | 15.07 | 16.48 | 17.99 | 17.00 |
| 1997 | 18.45 | 19.57 | 18.37 | 17.24 | 18.63 |
| 1998 | 16.99 | 16.97 | 18.59 | 17.87 | 17.01 |
| 1999 | 17.82 | 19.83 | 17.80 | 16.99 | 17.80 |
| 2000 | 35.47 | 37.26 | 35.72 | 33.97 | 32.99 |
| 2001 | 39.02 | 40.02 | 38.20 | 41.01 | 37.91 |
| 2002 | 42.44 | 44.43 | 42.90 | 41.10 | 42.76 |
| 2003 | 45.00 | 45.79 | 42.30 | 45.15 | 46.91 |
| 2004 | 44.41 | 45.08 | 44.56 | 43.50 | 43.99 |
| 2005 | 47.59 | 46.32 | 56.67 | 48.03 | 47.26 |
| 2006 | 44.01 | 44.02 | 48.11 | 44.99 | 44.90 |
| 2007 | 42.42 | 42.23 | 43.14 | 40.61 | 41.71 |
| 2008 | 42.65 | 42.00 | 39.21 | 40.53 | 41.32 |
| 2009 | 33.70 | 43.00 | 35.73 | 34.03 | 33.20 |
| 2010 | 50.54 | 51.25 | 48.27 | 50.14 | 49.01 |
| 2011 | 58.31 | 57.88 | 59.33 | 58.87 | 58.80 |
| 2012 | 49.08 | 49.69 | 47.97 | 48.74 | 50.75 |
| 2013 | 52.36 | 52.12 | 50.60 | 51.19 | 51.98 |
| 2014 | 55.35 | 54.00 | 55.00 | 56.26 | 55.30 |
| 2015 | 49.15 | 50.03 | 48.12 | 51.96 | 48.79 |
| 2016 | 52.63 | 52.02 | 54.77 | 50.64 | 52.30 |
| MAPE | 4.70% | 4.23% | 3.59% | 1.86% | |
| FD | 0.95 | 0.96 | 0.96 | 0.98 | |
| RMSE | 2.34 | 2.62 | 1.35 | 0.99 |
The results shown in Table 6 reveal that the OWTHGM(1,N) model outperforms all competing models on all criteria of comparison. For the forecasts made with the test data (see Table 7), the OWTHGM(1,N) model manages to obtain the best MAPE (1.74%) and therefore the best FD (0.953). When we come to the RMSE criterion, the WTHGM(1,N) model wins with a score of 1.58.
Table 7.
Adjusted and predicted values of CO2 emissions (in Mt) from Nigeria using test data.
| Year | Real data | GM(1,N) | MLR | WTHGM(1,N) | OWTHGM(1,N) |
|---|---|---|---|---|---|
| 2017 | 50.75 | 53.72 | 51.10 | 49.14 | 50.02 |
| 2018 | 47.35 | 47.71 | 47.98 | 48.01 | 48.00 |
| 2019 | 57.29 | 59.70 | 54.02 | 56.10 | 59.20 |
| 2020 | 58.58 | 58.56 | 58.36 | 57.01 | 59.00 |
| MAPE | 2.71% | 1.95% | 2.35% | 1.74% | |
| FD | 0.973 | 0.980 | 0.976 | 0.983 | |
| RMSE | 1.92 | 3.13 | 1.58 | 1.89 |
These results and the previous ones allow us to conclude that the new OWTHGM(1,N) model is capable of making very accurate forecasts.
Forecasting Cameroon's road transport CO2 emissions
CO2 emissions from the Cameroonian road transport sector over the period 2021–2030 are predicted using the new model proposed in this study and its competitors. In this respect, we used the projected values of the independent variables (RFC, PR, GDP, UP and VF) by a simple regression. These values are confirmed by forecasts of the National Development Strategy [24]. Forecasts results show with great certainty that substantial increases in total CO2 emissions from the Cameroonian road transport sector are to be expected in the coming years. In other words, CO2 emissions from road transports will increase from 3779 kt in 2020 to 4600 kt in 2030 (see Table 8). Thus, current CO2 emissions will increase by 120% in less than ten years. These results are in contrast to the Cameroon government's projections which aim to reduce total CO2 emissions by 32% before 2030 [18].
Table 8.
Projection results of road fuel CO2 emissions in Cameroun.
| Models |
||||
|---|---|---|---|---|
| Year | GM(1,1) | MLR | WTHGM(1,N) | OWTHGM(1,N) |
| 2021 | 3548 | 3563 | 3490 | 3832 |
| 2022 | 3668 | 3650 | 3606 | 3819 |
| 2023 | 3789 | 3736 | 3717 | 3896 |
| 2024 | 3912 | 3822 | 3824 | 3992 |
| 2025 | 4036 | 3908 | 3929 | 4093 |
| 2026 | 4161 | 3995 | 4032 | 4194 |
| 2027 | 4286 | 4081 | 4133 | 4295 |
| 2028 | 4413 | 4167 | 4234 | 4397 |
| 2029 | 4540 | 4253 | 4333 | 4498 |
| 2030 | 4668 | 4339 | 4432 | 4600 |
Conclusions
This paper proposes an optimized wavelet transform Hausdorff grey multivariate forecasting model (abbreviated OWTHGM(1,N)). In order to validate the model, we implemented it to forecast CO2 emissions from road fuel combustion. For this purpose, we started by showing that GDP, urban population, road transport fuel consumption, fuel prices and size of vehicle fleet could be used as determinants of CO2 emissions. Forecasts results are compared with the classical GM(1,N), WTHGM(1,N) and MLR, allowing to conclude that:
-
•
GDP, size of vehicle fleet and urban population are the most significant determinants of CO2 emissions as confirmed by Refs. [17,18,25]. With these determinants, the proposed OWTHGM(1,N) model manages to completely extract the existing connections they have with CO2 emissions.
-
•
Forecasts of CO2 emissions produced with OWTHGM(1,N) are more precise than competing models. Thus OWTHGM(1,N) is a reliable tool for monitoring CO2 emissions from Cameroon's road transport sector.
These achievements stem from three improvements made in the classical GM(1,N) model. Initially, only significant drivers are considered. Then, all selected drivers are filtered using WT, thereby demonising all series that could hamper modelling. More so, a new time response function has been established using the Hausdorff derivative. Lastly, Rao's algorithm is used to optimise all parameters. The OWTHGM(1,N) model therefore proves to be a reliable forecasting tool and can be used to monitor the evolution of CO2 emissions but can also be applied in other forecasting fields characterised by insufficient information.
CRediT author statement
Flavian Emmanuel Sapnken: Conceptualization, Methodology, Software, Writing - original draft preparation. Hermann Chopkap Noume: Validity tests, Data curation, Visualization, Investigation. Jean Gaston Tamba: Supervision, Validation, Writing-Reviewing and Editing.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
Acknowledgments
We are grateful to Departments of Logistics and Transport Engineering and the Department of Energetics and Thermal Engineering, University Institute of Technology, for providing the necessary facilities and support.
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Footnotes
In order not to lose the information in the signal, the Nyquist criterion specifies that the sampling frequency must be at least twice as high as the highest frequency present in the signal. An aliasing phenomenon occurs if the sampling frequency is less than twice the maximum frequency of the analogue signal.
Soft thresholding is always a good starting point if in doubt about which technique to choose.
There are times when these abrupt changes carry meaning, and it might be desirable to retain this information while denoising the signal.
Performance problems with predictive models are mainly due to over-fitting or under-fitting. Therefore, splitting the data in two, as suggested, allows testing the generalization of the predictive model on data that it has not yet encountered during training stage.
Appendix A. Multilevel decomposition of the approximation subband
Appendix B. DWT Matlab code for filtering noisy data
%% Load a signal
%% The entire process should be repeated for each data until all the input data are filtered
load noisysig.mat
figure;
subplot(2,1,1); plot(f0); grid on; title(‘Original signal’); axis tight;
subplot(2,1,2); plot(f); title (‘Original signal with noise’);
axis tight; grid on;
%% Decompose signal using Discrete Wavelet Transform
dwtmode(‘per’,’noisplay’);
wname =’sym6’;
[C, L]=wavedec(f,level,wname);
plotDetCoefHelper(f,C,L);%helperFunction to plot the coefficients at every level
%% analyse the subbands and determine the threshold
%% Denoise the signal
fd=wden(f,’rigrsure’,’s’,’sln’,level,wname);
figure;
subplot(2,1,1);
plot(f); axis tight; grid on; title (‘Noisy Signal’);
subplot(2,1,2);
plot(fd); axis tight; grid on;
title(sprint(‘Denoised Signal SNR;%0.2f dB’,−20*log10(norm(abs(f0-fd))/norm(f0))));
Appendix B. Proof of the derivation of Eq. (17)
From Eq. (13), if we let , then Eq. (13) can be expressed as:
| (A.1) |
We get the following results by multiplying both sides of Eq. (A.1) by :
| (A.2) |
Expressing the left hand side of Eq. (A.2) as a derivative, we get:
| (A.3) |
Taking integrals over the interval on both sides of Eq. (A.3) we get:
| (A.4) |
Solving integrals on the left hand side of Eq. (A.4) yields:
| (A.5) |
Multiplying each term of Eq. (A.5) by leads to:
| (A.6) |
Recall that , Eq. (A.6) can be rewritten as:
| (A.7) |
The second term on the right hand side of Eq. (A.7) can be expressed as follows using the trapezoidal integral formula:
| (A.8) |
When we substitute Eq. (A.8) in Eq. (A.7), we obtain:
| (A.9) |
Data availability
Data will be made available on request.
References
- 1.Li S.H., Zhu L., Wu Y., Lei X.Q. A novel grey multivariate model for forecasting landslide displacement. Eng. Appl. Artif. Intell. 2021;103 doi: 10.1016/j.engappai.2021.104297. [DOI] [Google Scholar]
- 2.Nie W., Duan H. A novel multivariable grey differential dynamic prediction model with new structures and its application to carbon emissions. Eng. Appl. Artif. Intell. 2023;122 doi: 10.1016/j.engappai.2023.106174. [DOI] [Google Scholar]
- 3.Tien T.L. A research on the grey prediction model GM (1,n) Appl. Math. Comput. 2012;218:4903–4916. [Google Scholar]
- 4.Ding S., Xu N., Ye J., Zhou W., Zhang X. Estimating Chinese energy-related CO2 emissions by employing a novel discrete grey prediction model. J. Clean. Prod. 2020;259 doi: 10.1016/j.jclepro.2020.120793. [DOI] [Google Scholar]
- 5.Sapnken F.E. A new hybrid multivariate grey model based on genetic algorithms optimization and its application in forecasting oil products demand. Grey Syst. Theory Appl. 2023;13:406–420. [Google Scholar]
- 6.Sapnken F.E., Ahmat K.A., Boukar M., Biobiongono Nyobe S.L., Tamba J.G. Forecasting petroleum products consumption in Cameroon's household sector using a sequential GMC(1,n) model optimized by genetic algorithms. Heliyon. 2022;8:e12138. doi: 10.1016/j.heliyon.2022.e12138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Sapnken F.E., Acyl A.K., Boukar M., Nyobe S.L.B., Tamba J.G. A technique for improving petroleum products forecasts using grey convolution models and genetic algorithms. MethodsX. 2023;10 doi: 10.1016/j.mex.2023.102097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bahrami S., Hooshmand R.A., Parastegari M. Short term electric load forecasting by wavelet transform and grey model improved by PSO (particle swarm optimization) algorithm. Energy. 2014;72:434–442. [Google Scholar]
- 9.Shen Q.Q., Cao Y., Yao L.Q., Zhu Z.K. An optimized discrete grey multi-variable convolution model and its applications. Comp. Appl. Math. 2021;40:58. doi: 10.1007/s40314-021-01448-z. [DOI] [Google Scholar]
- 10.Wang Y., Nie R., Ma X., Liu Z., Chi P., Wu W., Guo B., Yang X., Zhang L. A novel Hausdorff fractional NGMC(p,n) grey prediction model with Grey Wolf Optimizer and its applications in forecasting energy production and conversion of China. Appl. Math. Model. 2021;97:381–397. doi: 10.1016/j.apm.2021.03.047. [DOI] [Google Scholar]
- 11.Ding S. A novel discrete grey multivariable model and its application in forecasting the output value of China's high-tech industries. Comput. Ind. Eng. 2019;127:749–760. [Google Scholar]
- 12.Yin C., Mao S. Fractional multivariate grey Bernoulli model combined with improved grey wolf algorithm: application in short-term power load forecasting. Energy. 2023;269 doi: 10.1016/j.energy.2023.126844. [DOI] [Google Scholar]
- 13.Zhang Z., Kon M.A. Wavelet matrix operations and quantum transforms. Appl. Math. Comput. 2022;428 doi: 10.1016/j.amc.2022.127179. [DOI] [Google Scholar]
- 14.Ngui W.K., Leong M.S., Hee L.M., Abdelrhman A.M. Wavelet analysis: mother wavelet selection methods. AMM. 2013;393:953–958. doi: 10.4028/www.scientific.net/AMM.393.953. [DOI] [Google Scholar]
- 15.Navarro-Rodríguez Á., García P., Guerrero J.M. Encyclopedia of Electrical and Electronic Power Engineering. Elsevier; Oxford: 2023. Introduction to digital sampled systems and nyquist sampling theorem; pp. 652–656. J. García. [DOI] [Google Scholar]
- 16.Rao R.V., Pawar R.B. Constrained design optimization of selected mechanical system components using Rao algorithms. Appl. Soft Comput. 2020;89 doi: 10.1016/j.asoc.2020.106141. [DOI] [Google Scholar]
- 17.Tamba J.G., Njomo D., Nsouandele J.L., Bonoma B., Dongue S.B. Vol. 2. Universal Journal of Environmental Research & Technology; 2012. (Assessment of Greenhouse Gas Emissions in Cameroon's Road Transport Sector). [Google Scholar]
- 18.Engo J. Decoupling analysis of CO2 emissions from transport sector in Cameroon. Sustain. Cities Soc. 2019;51 doi: 10.1016/j.scs.2019.101732. [DOI] [Google Scholar]
- 19.Sapnken F.E., Tamba J.G. Petroleum products consumption forecasting based on a new structural auto-adaptive intelligent grey prediction model. Expert Syst. Appl. 2022;203 doi: 10.1016/j.eswa.2022.117579. [DOI] [Google Scholar]
- 20.Sapnken F.E., Ahmat K.A., Boukar M., Nyobe S.L.B., Tamba J.G. Learning latent dynamics with a grey neural ODE prediction model and its application. Grey Syst. Theory Appl. 2023 [Google Scholar]
- 21.Karakurt I., Aydin G. Development of regression models to forecast the CO2 emissions from fossil fuels in the BRICS and MINT countries. Energy. 2023;263 doi: 10.1016/j.energy.2022.125650. [DOI] [Google Scholar]
- 22.Wooldridge J.M. Cengage Learning; 2015. Introductory Econometrics: a Modern Approach, [Google Scholar]
- 23.H. Ritchie, M. Roser, P. Rosado, CO2 and greenhouse gas emissions, Our World in Data. (2020). https://ourworldindata.org/co2-emissions (accessed May 23, 2022).
- 24.SND30 . 1st ed. MINEPAT; Yaoundé: 2020. Stratégie Nationale de Développement 2020-2030 : Pour la Transformation Structurelle Et Le Développement Inclusif. [Google Scholar]
- 25.Hossain M.A., Engo J., Chen S. The main factors behind Cameroon's CO2 emissions before, during and after the economic crisis of the 1980s. Environ. Dev. Sustain. 2021;23:4500–4520. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data will be made available on request.




