Forecasting CO2 emissions from road fuel combustion using grey prediction models: A novel approach

Flavian Emmanuel Sapnken; Hermann Chopkap Noume; Jean Gaston Tamba

doi:10.1016/j.mex.2023.102271

. 2023 Jun 28;11:102271. doi: 10.1016/j.mex.2023.102271

Forecasting CO₂ emissions from road fuel combustion using grey prediction models: A novel approach

Flavian Emmanuel Sapnken ^a,^b,^c,^⁎, Hermann Chopkap Noume ^d, Jean Gaston Tamba ^a,^b,^c

PMCID: PMC10345331 PMID: 37457434

Abstract

This paper proposes an optimized wavelet transform Hausdorff multivariate grey model (OWTHGM(1,N)) that addresses some of the weaknesses of the conventional GM(1,N) model such as inaccurate prediction and poor stability. Three improvements have been made: First, all inputs are filtered using a wavelet transform; second, a new time response function is established using the Hausdorff derivative; and finally, the use of Rao's algorithm to optimise the model's parameters as well as the $ξ$ -order accumulated value of the observation data described by the Hausdorff derivative. In order to demonstrate the effectiveness of OWTHGM(1,N), it is applied to predict CO₂ emissions from road fuel combustion. The new model scores 1.27% MAPE and 79.983 RMSE, and is therefore more accurate than competing models. OWTHGM(1,N) could therefore serve a reliable forecasting tool and used to monitor the evolution of CO₂ emissions in Cameroon. The forecast results also serve as a sound foundation for the formulation of energy consumption strategies and environmental policies.

• Modification, extension and optimization of grey multivariate model is done.

• The model is very generic can be applied to a wide variety of energy sectors.

• OWTHGM(1,N) is a valid forecasting tool that can be used to track CO₂ emissions.

Keywords: CO₂ emission forecasting, Grey prediction, Hausdorff derivative, Optimization

Method name: Optimized wavelet transform hausdorff multivariate grey model

Graphical abstract

Specifications table

Subject area:	Environmental Science
more specific subject area:	modelling and forecasting
Name of your method:	Optimized wavelet transform hausdorff multivariate grey model
Name and reference of original method:	Y. Wang, R. Nie, X. Ma, Z. Liu, P. Chi, W. Wu, B. Guo, X. Yang, L. Zhang, A novel Hausdorff fractional NGMC(p,n) grey prediction model with Grey Wolf Optimizer and its applications in forecasting energy production and conversion of China, Applied Mathematical Modelling. 97 (2021) 381–397. 10.1016/j.apm.2021.03.047. R.V. Rao, R.B. Pawar, Constrained design optimization of selected mechanical system components using Rao algorithms, Applied Soft Computing. 89 (2020) 106,141. 10.1016/j.asoc.2020.106141. Z. Zhang, M.A. Kon, Wavelet matrix operations and quantum transforms, Applied Mathematics and Computation. 428 (2022) 127,179. 10.1016/j.amc.2022.127179.
Resource availability:	World Bank statistics (https://data.worldbank.org/) International Energy Agency (https://www.iea.org/) Ministry of Energy (http://www.minee.cm/)

Open in a new tab

Introduction

When predicting the evolution of a system and there is very little information, a major difficulty arises, which is that of accurately extracting the system's characteristic without making too strong assumptions. Deng's grey theory offers a way out [1,2]. However, when conventional multivariate grey models (GM(1,N)) are used, the results are very often inconsistent with reality, as Tien [3] pointed out. One way out is to optimise GM parameters with heuristic optimization procedures [4,5]. In some real-world scenarios, these methods have enhanced the GM(1,N) model's prediction capabilities. Despite these efforts, there is still room to improve its flexibility and forecasting performance, which is summarized as follows:

•
As demonstrated by Tien [3], the conventional GM(1,N) model is a factor-based model that reflects the behavioural effects of the driving variables on the system. Unfortunately, the fact that information contained in the set of driving variables is incomplete renders the conventional GM(1,N) model, and even some improved versions, unsuitable for forecasting [6,7].
•
The input variables might have irrelevant information (called noise) that corrupts the prediction outcomes [8].
•
The least squares method is used to solve grey's difference equation in order to estimate the GM(1,N) model's parameters. However, the first-order differential equation resulting from greys model is used to create the time response function. Although the differential equation and the difference equation approximate each other, they are fundamentally distinct [9]. The GM(1,N) model may become unstable as a result of this discrepancy between parameter estimation and parameter application.

Thus, in order to address the above deficiencies, this paper puts forward a new grey multivariate forecasting model based on the Hausdorff's derivative [10] and optimized by Rao's algorithm (abbreviated OWTHGM(1,N)). In order to demonstrate the efficiency and reliability of OWTHGM(1,N), a practical case is considered, which is the forecasting of the annual electricity consumption.

Methods

The conventional GM(1,N) model

The conventional GM(1,N) model [3,11,6,12] can be implemented as follows:

Step 1: Input raw data

Assume that the observation or raw data is given by the sequence $X^{(0)} = {X_{1}^{(0)}; X_{2}^{(0)}, \dots, X_{N}^{(0)}}$ where $X_{1}^{(0)}$ represents CO₂ emissions (the dependant variable), and $X_{2}^{(0)}, \dots, X_{N}^{(0)}$ are the independent variables (fuel demand, prices, GDP, urban population and vehicle fleet). Also, $X_{i}^{(0)} = {x_{i}^{(0)} (1), x_{i}^{(0)} (2), \dots, x_{i}^{(0)} (n)}, i = 1, 2, \dots, N$ . $X_{1}^{(0)}$ should be highly correlated to $X_{2}^{(0)}, \dots, X_{N}^{(0)}$ .

Step 2: Generate accumulated data with 1-AGO

The first-order accumulation generating operation (1-AGO) of the observation data can be defined by Eqs. (1) and (2):

X^{(1)} = {X_{1}^{(1)}; X_{2}^{(1)}, \dots, X_{N}^{(1)}}

(1)

where,

X_{i}^{(1)} = {x_{i}^{(1)} (1), x_{i}^{(1)} (2), \dots, x_{i}^{(1)} (n)}, i = 1, 2, \dots, N

x_{i}^{(1)} (k) = \sum_{p = 1}^{k} x_{i}^{(0)} (p), k = 1, 2, \dots, n

(2)

The superscripts (0) and (1) respectively indicate the original sequences and 1-AGO sequences.

Step 3: Design mean sequences

The following definitions (Eq. (3)) represent the mean sequences produced by consecutive terms of $X_{i}^{(1)}$ :

Z_{i}^{(1)} = {z_{i}^{(1)} (2), z_{i}^{(1)} (3), \dots, z_{i}^{(1)} (n)}, i = 1, 2, \dots, N .

(3)

where:

z_{i}^{(1)} (k) = 0.5 (x_{i}^{(1)} (k - 1) + x_{i}^{(1)} (k)), k = 2, 3, \dots, n; i = 1, 2, \dots, N

Step 4: Establish grey's differential equation

The basic GM(1,N) model's differential equation can be written as in Eq. (4):

\frac{d x_{1}^{(1)} (t)}{d t} + α_{1} x_{1}^{(1)} (t) = α_{N + 1} + \sum_{i = 2}^{N} α_{i} x_{i}^{(1)} (t)

(4)

Consequently, if the matrix $B^{T} B$ is invertible, the parameters ${[α_{1}, α_{2}, \dots, α_{i}, \dots, α_{N + 1}]}^{T}$ can be calculated using the least-squares approach (Eq. (5)). Which is:

{[α_{1}, α_{2}, \dots, α_{i}, \dots, α_{N + 1}]}^{T} = {(B^{T} B)}^{- 1} B^{T} Y

(5)

where the matrices $B$ and $Y$ are given by Eqs. (5.1) and (5.2) respectively,

B = (\begin{matrix} - z_{1}^{(1)} (2) & z_{2}^{(1)} (2) & \dots & z_{N}^{(1)} (2) & 1 \\ - z_{1}^{(1)} (3) & z_{2}^{(1)} (3) & \dots & z_{N}^{(1)} (3) & 1 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ - z_{1}^{(1)} (n) & z_{2}^{(1)} (n) & \dots & z_{N}^{(1)} (n) & 1 \end{matrix}) \in R^{(n - 1) \times (N + 1)}

(5.1)

Y = (\begin{matrix} x_{1}^{(0)} (2) \\ x_{1}^{(0)} (3) \\ ⋮ \\ x_{1}^{(0)} (n) \end{matrix}) \in R^{(n - 1) \times 1}

(5.2)

Step 5: Solve the system's differential equation

The initial condition can be used to find the solution to Eq. (5):

{\hat{x}}_{1}^{(1)} (t = 1) = {\hat{x}}_{1}^{(0)} (1) = x_{1}^{(0)} (1)

(6)

The fitting value of $X^{(1)}$ is obtained by substituting $α_{i}$ into Eqs. (4) and (6). Eq. (7) can be used to determine the fitting value for the original sequence $X_{1}^{(0)}$ .

{\hat{x}}_{1}^{(0)} (t) = {\hat{x}}_{1}^{(1)} (t) - {\hat{x}}_{1}^{(1)} (t - 1), t = 2, 3, \dots, N

(7)

Data filtration using wavelet transform

If there is noise or useless information in the raw data, this will corrupt the GM(1,N) model resulting in poor forecasts [8]. To prevent this, data is cleaned using a Wavelet transform (WT) which is a mathematical function that filters various scale components from a continuous-time signal. Basically, WT is a band-pass filter with its bandwidth scaled to half at each level [13]. The scaling function filters out the lowest point of the transform and allows the entire spectrum to be taken into account. Eq. (8) defines a continuous wavelet transform (CWT) of a signal $x (t)$ :

C W T_{φ} (u, v) = \frac{1}{\sqrt{| u |}} \int_{- \infty}^{+ \infty} x (t) φ^{*} (\frac{t - v}{u}) d t

(8)

where the scale and translation parameters, are denoted by $u$ and $v$ $(u, v \in R)$ respectively.

The discrete wavelet transform (DWT) of a signal $x_{j}$ is calculated using Eq. (9):

D W T_{x} (p, q) = \frac{1}{\sqrt{2^{p}}} \sum_{j} x_{j} φ^{*} (\frac{j - q}{2^{p}})

(9)

where $q = 1, 2, \dots N$ and $p$ are the sampling time and scale factor respectively. $N$ is the number of samples. The most crucial element of the signal is its low order component. The signal's identity is clearified in this component. The signal's high order component, on the other hand, is a representation of the signal's specifics.

The first step in filtering data is to determine the wavelet to be applied. This depends on the nature of the raw sample data (i.e. the signal). Although there are several criteria for choosing the appropriate wavelet [14], a simple way to do this is to perform a correlation analysis. The waveform of the signal is examined along with the shape of a wavelet (DWT or CWT). If the two match, that particular wavelet is be used as the mother wavelet for the signal. Thus, the choice will vary from one sample data to another.

Conceptually, it operates as follows: In order to produce lowpass (A1, generally known as the approximation level) and highpass (D1, also known as the detail level) subbands from a signal X, the signal is first filtered with specialized lowpass and high pass filters (see Fig. A1, in Appendix A). After filtering according to the Nyquist criterion1 [15], half of the samples are deleted. The filters often produce strong computational performance and have a limited number of coefficients.

Fig A1 — Multilevel wavelet decomposition for a fine scale analysis.

These filters can help eliminate any aliasing brought on by down sampling when reconstructing the subbands. The lowpass subband (A1) is iteratively filtered by the same method to produce narrower subbands (A2, D2, and so on) for the following level of decomposition. Each subband's length of coefficients is divided by the total number of coefficients in the stage before it. In this way, the signal of interest can be captured by a few DWT coefficients of great size, and the signal noise is represented by smaller DWT coefficients. In this manner, DWT aids in the analysis of signals at various resolutions and narrower subbands. Additionally, it aids in signal compression and denoise.

Practically, the overall process of data filtration with DWT is done in Matlab (or other programming languages) based on the following three steps:

Step 1: Obtain the approximation and detail coefficients.

To do this, a multilevel wavelet decomposition is used. For a fine scale study, the approximation subband is broken down at several levels or scales (as in Fig. A1).

Step 2: analyse the details and identify a suitable thresholding technique.

This is done with Matlab (or other programming languages). Hard thresholding and soft thresholding are the two thresholding operations. The coefficients with magnitudes below the threshold in either operation are set to zero. The way these two methods handle coefficients with magnitudes greater than the threshold is what distinguishes them from one another. In the case of soft thresholding, the coefficients larger than the threshold are kept unaltered, whereas in the case of hard thresholding, they are decreased towards zero by subtracting the threshold value from the coefficient value. In this paper, we used the sure shrink with soft thresholding technique to denoise the data.2

A single function is used to perform the thresholding of the coefficients as well as the reconstruction of the signal using the new coefficients (see the Matlab code in Appendix B for further information). In this code, sure shrink is the thresholding approach. The first parameter ‘f’ refers to the noisy signal. ‘S’ stands for soft thresholding, and the parameter ‘sln’ represents threshold rescaling with a single noise estimate based on first level coefficients. Level indicates the wavelet decomposition level and the last parameter specifies the wavelet, which is sym6 in this case. The wden function decomposes the input signal into many levels, computes and applies a threshold to the detail coefficients, reconstructs the signal using the updated detail coefficients, and outputs it.

Step 3: Threshold the detail coefficients and reconstruct the signal.

To begin, use the wavedec function to conduct a multilevel wavelet decomposition. The noisy signals are decomposed to five levels. Along with the detail coefficients from the first to the last levels, the function also outputs the fifth level approximation coefficients. The signal's high frequencies are captured by the first level of detail coefficients. The noise in the signal makes up the majority of the high-frequency content, but abrupt signal shifts make up a portion of the high frequency.3

The details subband deserves a careful examination. The detcoef function is used to extract the coefficients, and the coefficients for each level are plotted. With increasing scale/level, the noise is significantly reduced. If we pay attention to level 1 specifics. The objective in this case is to keep abrupt transitions while removing noise. In order to accomplish this, a threshold is applied to the detail coefficients. The universal threshold is the simplest to compute and is computed using Eq. (10):

U n i v e r s a l t h r e s h o l d = \frac{2 \cdot \log (l e n g t h (x)) \cdot m e d i a n (a b s (D))}{0.6745}

(10)

where $x$ is the signal and D is set of first level detail coefficients.

The novel Hausdorff GM(1,N) model

Step 1: Determine the $ξ$ -order accumulation

The following introduces the idea of the fractal derivative of a function $Ψ (t)$ with regard to a fractal measure $t$ [10]:

\frac{d Ψ (t)}{d t^{ξ}} = \lim_{t^{'} \to t} \frac{Ψ (t) - Ψ (t^{'})}{t^{ξ} - t^{^{'} ξ}}, ξ > 0

(11.1)

Eq. (11) is also known as the Hausdorff derivative. $t^{ξ}$ is the fractal time with scale index $ξ$ . If for a given function $Ψ (t)$ both its derivative $D Ψ (t)$ and its fractal derivative $D_{ξ} Ψ (t)$ exists, one can find an analogue to the chain rule:

\frac{d Ψ (t)}{d t^{ξ}} = \frac{d Ψ (t)}{d t} \frac{d t}{d t^{ξ}} = \frac{1}{ξ} t^{(1 - ξ)} \frac{d Ψ (t)}{d t}

(11.2)

Thus, from Eq. (11.2), it is possible to deduce the $ξ$ -order accumulated value of the observation data which then allows to define a new raw data set represented by $X_{i}^{(ξ)}$ in Eqs. (12) and (13):

X_{i}^{(ξ)} = {x_{i}^{(ξ)} (1), x_{i}^{(ξ)} (2), \dots, x_{i}^{(ξ)} (n)}, i = 1, 2, \dots, N

(12)

{\begin{matrix} x_{i}^{(ξ)} (1) = x_{i}^{(0)} (1) \\ x_{i}^{(ξ)} (k) = (k^{ξ} - {(k - 1)}^{ξ}) x_{i}^{(0)} (k) + x_{i}^{(ξ)} (k - 1); k = 2, 3, \dots, n; i = 1, 2, \dots, N \end{matrix}

(13)

Step 2: Establish grey's differential equation

The proposed model's differential equation has the following form (Eq. (14)):

\frac{d x_{1}^{(ξ)} (t)}{d t} + α_{1} x_{1}^{(ξ)} (t) = α_{N + 1} + \sum_{i = 2}^{N} α_{i} x_{i}^{(ξ)} (t)

(14)

- α_{1} x_{1}^{(ξ)} (t) + \sum_{i = 2}^{N} α_{i} x_{i}^{(ξ)} (t) + α_{N + 1} = \frac{d x_{1}^{(ξ)} (t)}{d t}

Step 3: Solve the system's differential equation

Taking integrals over the interval $[k - 1, k]$ on both sides of the above equation, we get Eq. (15):

- α_{1} \int_{k - 1}^{k} x_{1}^{(ξ)} (t) d t + \sum_{i = 2}^{N} α_{i} \int_{k - 1}^{k} x_{i}^{(ξ)} (t) d t + α_{N + 1} = \int_{k - 1}^{k} d x_{1}^{(ξ)} (t)

(15)

We know that,

\int_{k - 1}^{k} x_{i}^{(ξ)} (t) d t = z_{i}^{(ξ)} (k); k = 2, 3, . . ., n

and from Eq. (13)

\int_{k - 1}^{k} d x_{1}^{(ξ)} (t) = x_{1}^{(ξ)} (k) - x_{1}^{(ξ)} (k - 1) = (k^{ξ} - {(k - 1)}^{ξ}) x_{1}^{(0)} (k); k = 2, 3, . . ., n

Therefore, Eq. (15) is as follows:

- α_{1} z_{1}^{(ξ)} (k) + \sum_{i = 2}^{N} α_{i} z_{i}^{(ξ)} (k) + α_{N + 1} = (k^{ξ} - {(k - 1)}^{ξ}) x_{1}^{(0)} (k); k = 2, 3, . . ., n

(16)

Then, Eq. (15) can be written as in Eq. (17.1):

B A = Y

(17.1)

Where the matrices $A, B$ and $Y$ are defined as in Eqs. (17.2), (17.3) and (17.4) respectively.

A = (\begin{matrix} α_{1} \\ α_{2} \\ \begin{matrix} ⋮ \\ α_{N + 1} \end{matrix} \end{matrix}) \in R^{(N + 1) \times 1}

(17.2)

B = (\begin{matrix} - z_{1}^{(ξ)} (2) & z_{2}^{(ξ)} (2) & z_{N}^{(ξ)} (2) & \dots & 1 \\ - z_{1}^{(ξ)} (3) & z_{2}^{(ξ)} (3) & \dots & z_{N}^{(ξ)} (3) & 1 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ - z_{1}^{(ξ)} (n) & z_{2}^{(ξ)} (n) & \dots & z_{N}^{(ξ)} (n) & 1 \end{matrix}) \in R^{(n - 1) \times (N + 1)}

(17.3)

Y = (\begin{matrix} (2^{ξ} - 1^{ξ}) x_{1}^{(0)} (2) \\ (3^{ξ} - 2^{ξ}) x_{1}^{(0)} (3) \\ \begin{matrix} ⋮ \\ (n^{ξ} - {(n - 1)}^{ξ}) x_{1}^{(0)} (n) \end{matrix} \end{matrix}) \in R^{(n - 1) \times 1}

(17.4)

If $det (B^{T} B) \neq 0$ , then matrix $A$ can be estimated with Eq. (17.5):

A = {(B^{T} B)}^{- 1} B^{T} Y

(17.5)

The solution to Eq. (14) is given by Eq. (18):

{\hat{x}}_{1}^{(ξ)} (t) = x_{1}^{(0)} (1) e^{α_{1} (1 - t)} + 0.5 \sum_{τ = 2}^{t} (f (τ) e^{α_{1} (τ - t)} + f (τ - 1) e^{α_{1} (τ - t - 1)}); t \geq 2

(18)

where the function $f (τ)$ is expressed as in Eq. (19),

f (τ) = α_{N + 1} + \sum_{i = 2}^{N} α_{i} x_{i}^{(ξ)} (τ)

(19)

Step 4: Generate forecast values

Appendix B provides a demonstration of how Eq. (18) is obtained. The following formula can be used to find the fitted value of the original sequence $X_{1}^{(0)}$ :

{\begin{matrix} {\hat{x}}_{1}^{(0)} (1) = x_{1}^{(0)} (1) \\ {\hat{x}}_{1}^{(0)} (k) = \frac{{\hat{x}}_{1}^{(ξ)} (k) - {\hat{x}}_{1}^{(ξ)} (k - 1)}{(k^{ξ} - {(k - 1)}^{ξ})}; k = 2, 3, \dots, n \end{matrix}

(20)

Eq. (20) allows to calculate the forecast values. It is known as the wavelet transform Hausdorff grey multivariate model (abbreviated WTHGM(1,N)).

Optimizing wavelet transform Hausdorff GM(1,N) parameters

Eq. (18) can be expressed using the modified trapezoidal integral formula (given by Eq. (21)):

{\hat{x}}_{1}^{(ξ)} (t) = x_{1}^{(0)} (1) e^{α_{1} (1 - t)} + \sum_{τ = 2}^{t} (ω_{1} f (τ) e^{α_{1} (τ - t)} + (1 - ω_{1}) f (τ - 1) e^{α_{1} (τ - t - 1)})

(21)

where $ω_{1} (0 \leq ω_{1} \leq 1)$ is a parameter.

Still with modified trapezoidal integral formula, Eq. (17.3) can be rewritten as in Eq. (22):

B = (\begin{matrix} - {\tilde{z}}_{1}^{(ξ)} (2) & {\tilde{z}}_{2}^{(ξ)} (2) & \dots & {\tilde{z}}_{N}^{(ξ)} (2) & 1 \\ - {\tilde{z}}_{1}^{(ξ)} (3) & {\tilde{z}}_{2}^{(ξ)} (3) & \dots & {\tilde{z}}_{N}^{(ξ)} (3) & 1 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ - {\tilde{z}}_{1}^{(ξ)} (n) & {\tilde{z}}_{2}^{(ξ)} (n) & \dots & {\tilde{z}}_{N}^{(ξ)} (n) & 1 \end{matrix}) \in R^{(n - 1) \times (N + 1)}

(22)

where ${\tilde{z}}_{i}^{(ξ)} (k) = (ω_{i + 1} x_{i}^{(ξ)} (k - 1) + (1 - ω_{i + 1}) x_{i}^{(ξ)} (k)), k = 2, 3, \dots, n; i = 1, 2, \dots, N$ and $ω_{i} (i = 2, 3, . . ., N + 1)$ with $(0 \leq ω_{i} \leq 1)$ represents the coefficient.

An optimization procedure with the mean absolute percentage error (MAPE) set as the objective function can be used to find $ξ$ and $ω_{i}$ . Eq. (23) gives a definition of MAPE.

M A P E = \frac{1}{n} \sum_{k = 1}^{n} | \frac{e_{k}}{x_{1}^{(0)} (k)} | \cdot 100 %

(23)

where $e_{k} = x_{1}^{(0)} (k) - {\hat{x}}_{1}^{(0)} (k)$ , and $n$ is the data size. The minimum MAPE value is then determined using a meta-heuristic approach. The ideal values of $ξ$ and $ω_{i}$ are chosen so as to minimize the difference between the predicted CO₂ and the emission over a of $m$ years. By solving the following optimization problem (Eqs. (24.1) and (24.2)), the optimal values of $ξ$ and $ω_{i}$ are ascertained for this purpose:

m i n i m i z e M A P E = \sum_{j = 1}^{m} M A P E_{j}

(24.1)

s . t . {\begin{matrix} 0 \leq ω_{i} \leq 1, i = 1, 2, 3, . . ., N + 1 \\ ξ > 0 \end{matrix}

(24.2)

Fig. 1 represents the flowchart of the WTHGM(1,N) model. A meta-heuristic algorithm is employed to solve the suggested optimization problem, as shown on Fig. 1. The condition set on the MAPE depends on the different cases. Thus, the threshold can be set to a certain $ε = MAP E_{\min}$ , the value of which will be chosen according to the precision that we wish to achieve.

Selection of meta-heuristic algorithm

Other meta-heuristic algorithms such as genetic algorithms (GA), ABC and particle swarm optimization (PSO) need parameters that are algorithm-specific in addition to usual parameters like iterations and population size. For instance, while PSO needs inertia weight and elements related to social and cognitive learning, GA have special operators such as selection, mutation probability and crossover probability, whereas ABC requires number of employed bees, onlooker bees, scout bees and limits, etc.

Incorrectly setting algorithm-specific parameters can lead to undesired results, such as increasing convergence time or falling into local optimization. However, tuning the specific parameters is a very laborious process. The optimal specific parameters may vary from model to model and may vary from dataset to dataset. The selection of optimal parameters itself is an optimization problem.

Rao's algorithms, which have been developed recently, are straightforward and relatively simple to implement as they do not rely on the use of any algorithm-specific parameters. In particular, the improved Rao algorithm [16] is an algorithm that reinforces both exploitation and exploration, and has a fast convergence speed and a strong ability to deviate from local optimization.

Improved Rao algorithm for the WTHGM(1,N) model

Here, the population $P$ consists of $n p$ individuals $p = (ξ, ω_{1}, \dots, ω_{N + 1}) \in R^{N + 2}$ . Initially, each individual is randomly generated while satisfying the constraints of Eq. (24.2). Population updates are carried out on local exploitation phase or global exploration phase with a probability of 0.5 to enhance both exploitation and exploration abilities.

In local exploitation phase, $P$ is sorted in incremental order of MAPE values. The first half population with the smallest MAPE values is considered as the best individuals while the worst individuals is the second half. Following that, Eqs. (25) and (26) are used to compute the local best-mean ( ${\bar{L}}_{B M}$ ) and the local worst-mean ( ${\bar{L}}_{W M}$ ) vectors respectively:

{\bar{L}}_{B M} = {\bar{M}}_{b e s t} + r_{1} \cdot (p_{b e s t} - {\bar{M}}_{b e s t})

(25)

{\bar{L}}_{W M} = {\bar{M}}_{w o r s t} + r_{2} \cdot (p_{w o r s t} - {\bar{M}}_{w o r s t})

(26)

Here $r_{1}$ and $r_{2}$ are uniformly distributed random vectors in the range ${[0, 1]}^{N + 2}$ . $p_{b e s t}$ and $p_{w o r s t}$ are the best and worst individuals in $P$ respectively. ${\bar{M}}_{b e s t}$ and ${\bar{M}}_{w o r s t}$ are the mean vectors of the best and worst population respectively. The new individual $p {^{'}}_{c u r}$ is generated with Eq. (27):

p {^{'}}_{c u r} = p_{c u r} + r_{3} \cdot ({\bar{L}}_{B M} - {\bar{L}}_{W M}) + r_{4} \cdot [(p_{c u r} or p_{s e l}) - (p_{s e l} or p_{c u r})

(27)

Here $r_{3}$ and $r_{4}$ are uniformly distributed random vectors in the range ${[0, 1]}^{N + 2}$ . $p_{s e l}$ is the randomly selected other individual in $P$ . If $p_{c u r}$ is better than $p_{s e l}$ , $(p_{c u r} or p_{s e l})$ means $p_{c u r}$ , otherwise $p_{s e l}$ .

In local exploration phase, the global population $Q$ is considered. In the first iteration, $Q$ is set as $Q = P$ and from the second iteration, the new $Q$ is the old $Q$ or the updated $P$ with a probability of 0.5. Then the individuals in $Q$ are shuffled within itself. The new individual $p {^{'}}_{c u r}$ is generated with Eq. (28):

p {^{'}}_{c u r} = p_{c u r} + r n \cdot (q_{c u r} - p_{c u r})

(28)

Here $p_{c u r}$ and $q_{c u r}$ are the $c u r$ -th individual in $P$ and $Q$ respectively. $r n$ is the normally distributed random vector in the range ${[0, 1]}^{N + 2}$ .

After generating new individual $p {^{'}}_{c u r}$ , it is compared with the old $p_{c u r}$ . If $p {^{'}}_{c u r}$ is better than $p_{c u r}$ , $p_{c u r}$ is replaced with $p {^{'}}_{c u r}$ in $P$ . This process is repeated until the MAPE of the best individual $p_{b e s t}$ is smaller than the criterion ( $M A P E_{m i n}$ ) or until the current number of iterations exceeds the maximum number of iterations ( $I t e r_{m a x}$ ). The algorithmic process of improved Rao algorithm for the WTHGM(1,N) model is shown in the pseudocode below:

Pseudocode: Improved Rao algorithm for the WTHGM(1,N) model

Input: dataset for model $X^{(0)} = {X_{C O_{2}}^{(0)}; X_{GDP}^{(0)}; X_{Urb}^{(0)}; X_{Pri}^{(0)}; X_{Veh}^{(0)}; X_{Fuel}^{(0)}}$ , parameters for algorithm ( $np$ , $MAP E_{\min}$ , $Ite r_{\max}$ ) Output: optimal values of $ξ$ and $ω_{i}$

01: Decompose $X^{(0)}$ using $mdwtdec$ function of MATLAB 02: Obtain the denoised data $X D^{(0)}$ using $mswden$ function of MATLAB	/* Data filtration
03: For $cur$ = 1 to $np$ 04: Randomly generate $ξ$ and $ω_{i}$ to satisfy Eq. (24.2), and construct $cur$ -th individual $p_{cur} = (ξ, ω_{1}, \dots, ω_{N + 1})$ of the population 05: Calculate $MAP E_{cur}$ of $p_{cur}$ with $X D^{(0)}$ using Eq. (23) 06: End for 07: Construct the population $P = {p_{1}, p_{2}, \dots, p_{np}}$ 08: Initialize the global population $Q = P$ 09: Identify the best individual $p_{best}$ with the smallest MAPE and the worst individual $p_{worst}$ with the largest MAPE in the population $P$	/* Population initialization
10: $iter = 0$ 11: While $MAP E_{best}$ > $MAP E_{\min}$ 12: Calculate the mean vector ${\bar{M}}_{best}$ of the best population 13: Calculate the mean vector ${\bar{M}}_{worst}$ of the worst population 14: If $rand$ < 0.5 then $Q = P$ 15: For $cur$ = 1 to $np$ 16: If $rand$ < 0.5 then /* Local exploitation phase 17: Calculate the local best-mean vector ${\bar{L}}_{BM}$ using Eq. (A1) 18: Calculate the local worst-mean vector ${\bar{L}}_{WM}$ using Eq. (A2) 19: Randomly select the other individual $p_{sel}$ in population $P$ 20: Generate new individual ${p^{'}}_{cur}$ using Eq. (A3) 21: Else/* Global exploration phase 22: Randomly shuffle the individuals of the global population $Q$ 23: Generate new individual ${p^{'}}_{cur}$ using Eq. (A4) 24: End if 25: Change ${p^{'}}_{cur}$ to satisfy Eq. (24.2) 26: Calculate $MAP {E^{'}}_{cur}$ with $X D^{(0)}$ using Eq. (23) 27: If $MAP {E^{'}}_{cur}$ < $MAP E_{cur}$ then $p_{cur} = {p^{'}}_{cur}$ 28: End for 29: Identify the best individual $p_{best}$ with the smallest MAPE and the worst individual $p_{worst}$ with the largest MAPE in the population $P$ 30: $iter = iter + 1$ 31: If $iter \geq Ite r_{\max}$ then break while loop 32: End while	/* Population update
33: Output $p_{best}$	/* Output

Open in a new tab

Applications and numerical simulation

Data selection, filtering and performance measurement

In order to verify the performance of the OWTHGM(1,N) model, we implemented it to forecast CO₂ emissions from Cameroon's road fuel combustion. In the OWTHGM(1,N) model implemented in this simulation, the filtering technique applied is DWT (i.e. Eq. (9)). This choice follows from the approach explained at the end of the section entitled «Data filtration using wavelet transform».

The selected drivers are: CO₂ emissions, urban population (UP), GDP, road fuel consumption (RFC), fuel prices (PR) and vehicle fleet (VF) [17,18]. Data used in this study cover the period from 1995 to 2020. A prerequisite for good modelling is the selection of appropriate variables, i.e. those that have a significant influence on CO₂ emissions. So the first step is to ensure that these variables are highly correlated with CO₂ emissions before including them as independent variables in the forecasting model.

Correlation analysis (presented in Table 1), shows that CO₂ emissions are significantly correlated with selected variables. Therefore, it can be concluded that they can be used as inputs to model and forecast CO₂ emissions. However, it is better to filter the data with WT before the forecasting step. This precaution is crucial because the data may have noise that could introduce data matrix ill-conditioning problems in the estimation of grey model parameters.

Table 1.

Correlation results between the variables used in this study.

	CO₂	GDP	UP	RFC	PR	VF
CO₂	1	0.99^a	0.82^a	0.94^a	0.90^b	0.92^a
GDP	0.99^a	1	0.87^b	0.93^b	0.93^a	0.96^a
UP	0.82^a	0.87^b	1	0.70^a	0.90^a	0.93^a
RFC	0.94^a	0.93^b	0.70^a	1	0.85^a	0.86^a
PR	0.90^b	0.93^a	0.90^a	0.85^a	1	0.86^b
VF	0.92^a	0.96^a	0.93^a	0.86^a	0.86^b	1

Open in a new tab

respectively denotes significantivity at 5%, 1% and 0.1% thresholds.

Forecast accuracies are assessed based on: Root mean square error (RMSE), fitting degree (FD), Absolute percentage error (APE) and mean APE (MAPE), defined in Eqs. (29)–(31):

A P E = | \frac{e_{i}}{x_{1}^{(0)} (i)} |

(29)

F D = 1 - M A P E

(30)

R M S E = \sqrt{\frac{1}{h - 1} \sum_{i = 2}^{h} {(e_{i})}^{2}}

(31)

where $e_{i} = x_{1}^{(0)} (i) - {\hat{x}}_{1}^{(0)} (i)$ represents the $i$ ^th simulation error between actual CO₂ emissions $x_{1}^{(0)} (i)$ and predicted outcomes ${\hat{x}}_{1}^{(0)} (i)$ . $h$ is the total number of predictions and ${\bar{x}}_{1}^{(0)} (i)$ is the mean of $x_{1}^{(0)} (i)$ .

Scores of MAPE and RMSE closest to zero indicate the best accuracy. Table 2 shows the threshold values for MAPE. Coming to FD, a score greater than 0 $. 95$ indicates a very high precision, and a score between $0.85$ and $0.95$ indicates a good forecasting accuracy. Overall, the more $F D$ score is close to 1.0, the more accurate the forecasting model is.

Table 2.

Threshold values for MAPE error metric [19,20].

MAPE(%)	Accuracy level	MAPE(%)	Accuracy level
]0;5]	Class I: Very high precision	]10;20]	Class III: Average precision
]5;10]	Class II: Good precision	]20;+∞[	Class IV: Low precision

Open in a new tab

The forecasting model may overfit or underfit in case of data leakage. To ensure that this does not happen, the data set is divided into two (training and test sets). These two sets are concealed from each other so that there is no leakage.4

Modelling CO₂ emissions

Prediction of CO₂ emissions from Cameroon's road fuel combustion are estimated with WTHGM(1,N) optimized by Rao's algorithm (denoted OWTHGM). For validation purposes, we also compare the estimates with GM(1,N) and WTHGM(1,N) models, as well as multilinear regression (MLR) as applied in the work of Karakurt and Aydin [21]. Training data spans from 1995 to 2017 while test data cover the period or this, the reference and new models are implemented using data collected over the period 1995–2017 to train them (i.e. parameterise the models), while data from 2018 to 2020 are used to test the forecasting performance of each model. In all, 24 simulations were performed on Matlab R2021a using a PC with 8.0 GB RAM. To go systematically, here is how to proceed:

Stage 1: Determining the filtering technique

We start by creating a copy of the raw data. One copy is filtered to remove any potential disturbances while the other copy is used as is. As mentioned above, we examined the shape of the wavelet and it matched with the DWT. Thus we apply Eq. (9) to denoise the input data. The next stage is modelling.

Stage 2: Modelling the WTHGM(1,N)

At this stage the unfiltered copy is used to establish the WTHGM(1,N) model. To do this, Eqs. (11) to (20) are applied.

Stage 3: Optimisation of the WTHGM(1,N)

Here, we use the copy of the raw data that has been filtered and build the OWTHGM(1,N) model given by Eqs. (11) to (28) (see pseudocode). With Rao's algorithm, the population size was fixed at 50 for 10⁵ iterations. In the specific context of our data, we obtained the optimum parameters shown in Table 3:

Table 3.

Optimum parameters obtained with data on CO₂ emissions from road transport in Cameroon.

	$ξ$	$ω_{1}$	$ω_{2}$	$ω_{3}$	$ω_{4}$	$ω_{5}$	$ω_{6}$	$ω_{7}$
GM(1,N)	1	0.5	0.5	0.5	0.5	0.5	0.5	0.5
WTHGM(1,N)	0.884	0.5	0.5	0.5	0.5	0.5	0.5	0.5
OWTHGM(1,N)	0.148	0.376	0.425	8.9E-4	0.159	3.7E-4	0.026	0.001

Open in a new tab

Stage 4: Generating all results

Finally, we generate all the forecast results given by the WTHGM(1,N) and OTHGM(1,N) models. From these results, we calculate the performance of both models to determine their MAPE, FD and RMSE. This allows us to see which model was more accurate.

As shown in Table 4, it can be seen that with the training data (1995–2017), the conventional GM(1,N) model, WTHGM(1,N), its optimised version (OWTHGM), and MLR have similar performance in terms of MAPE. They all score a MAPE below 5%, which allows concluding that they behave like class I models according to Table 2. However, we note that the classical GM(1,N), WTHGM(1,N) and MLR models slightly outperform the new model in terms of RMSE, FD and MAPE. Fig. 2a shows the fit curves of the new OWTHGM(1,N) model and the competing models. It is obvious that the OWTHGM(1,N) model (red curve) needs a short fitting time (represented by the first three predictions in the training phase) before it can efficiently extract the systems’ information. This delay is confirmed by its FD, which is only 95% (see Table 4), the lowest of the four models. This explains why in Fig. 2b the peak of APEs is observed on the column of OWTHGM(1,N) model, indicating a lower forecasting accuracy in the training phase.

Table 4.

Fitted and predicted values of CO₂ emissions (in kt) with training data.

Year	Real data	GM(1,N)	MLR	WTHGM(1,N)	OWTHGM(1,N)
1995	1668	1668	1606.71	1668	1668
1996	1761	1721.73	1671.70	1722.76	1291.66
1997	1708	1730.46	1704.97	1754.76	1641.11
1998	1730	1756.61	1791.29	1790.21	1727.09
1999	1821	1781.10	1864.94	1813.34	1756.08
2000	1876	1802.23	1862.31	1824.51	1744.61
2001	1637	1827.01	1862.94	1852.04	1818.67
2002	1856	1887.73	1849.80	1960.19	2027.60
2003	1859	1940.24	1820.09	2012.40	1951.53
2004	2065	1967.04	1919.09	2011.08	1927.45
2005	2099	2003.21	2051.08	2039.87	2052.74
2006	2165	2039.29	2049.09	2070.10	2107.53
2007	2107	2075.13	2085.56	2098.51	2148.74
2008	1869	2095.80	2133.20	2086.26	2070.28
2009	2128	2134.90	2103.10	2101.08	2040.63
2010	2184	2246.84	2265.30	2230.97	2188.46
2011	2514	2404.99	2420.01	2414.69	2357.28
2012	2675	2600.24	2633.33	2649.13	2624.88
2013	2786	2822.20	2828.60	2900.19	2846.55
2014	2906	3027.95	2978.35	3104.76	3019.03
2015	3183	3183.27	3143.31	3222.14	3125.21
2016	3287	3289.80	3291.79	3285.38	3269.09
2017	3413	3363.69	3360.44	3309.34	3317.46
MAPE		3.26%	3.43%	3.65%	4.97%
FD		0.97	0.97	0.97	0.95
RMSE		90.61	96.05	102.60	142.67

Open in a new tab

Fig 2 — Illustration of the fitting curves and APE during the training phase.

When data is hidden from each model to check whether they are able to predict CO₂ emissions over the period 2018–2020 (test phase), OWTHGM(1,N) model significantly outperforms the competing models. It appears that only MLR can cope with the novel model. It can be seen in Fig. 3a how the predictive curve of OWTHGM(1,N) comes closest to real data. Moreover, FD criteria shows that OWTHGM(1,N) model has the best fit with a score of 99% while its MAPE and RMSE are the closest to zero with scores of 1.27% and 79.99 respectively (see Table 5). Fig. 3b confirms this performance as the novel model actually has the smallest APE distribution.

Fig 3 — Illustration of the fitting curves and APE during the training phase.

Table 5.

Fitted and predicted values of CO₂ (in kt) emissions with test data.

Year	Real data	GM(1,N)	MLR	WTHGM(1,N)	OWTHGM(1,N)
2018	3494	3432.09	3440.75	3354.21	3475.86
2019	3645	3491.73	3673.12	3409.59	3655.95
2020	3779	3498.25	3970.02	3423.04	3891.08
MAPE		4.46%	2.45%	6.62%	1.27%
FD		0.95	0.98	0.93	0.99
RMSE		225.863	136.875	301.472	79.983

Open in a new tab

The model proposed in this paper generates low forecasting errors (less than 5%), leading to the conclusion that OWTHGM(1,N) is a reliable forecasting tool just like its competitors. The novel model can forecast CO₂ emissions more precisely as well.

As for MLR model, the regression of $X_{1}^{(0)}$ (i.e. CO₂ emissions) on independent variables (i.e. $X_{2}^{(0)}, \dots, X_{N}^{(0)}$ ) reveals that GDP, UP and VF are the most significant determinants of CO₂ with contributions of 37.1%, 39.8% and 21.6% respectively. R-squared value is 0.97, indicating a strong relationship between CO₂ emissions and the independent variables. The implication is that the independent variables used in this study manage to explain 97% of Cameroon's road transport CO₂ emissions. Only 3% of variation remains unexplained, which is certainly due to insufficient data. However, even if more data were collected and other independent variables included, it would never be possible to explain all variations in CO₂ emissions [22]. Given the performance of OWTHGM(1,N), it can be concluded that the proposed model is better at predicting future CO₂ emissions without the need to know the underlying functional relationship between the different variables.

In order to strengthen the validation of the new model, we also applied it to other datasets, namely the annual CO₂ emissions from the Nigerian transport sector obtained from reference [23]. We also divided the data into two groups (training and test sets). The results from the training set are shown in Table 6.

Table 6.

Adjusted and predicted values of CO₂ emissions (in Mt) from Nigeria (training data).

Year	Real data	GM(1,N)	MLR	WTHGM(1,N)	OWTHGM(1,N)
1995	15.04	17.35	16.12	14.00	14.95
1996	16.56	15.07	16.48	17.99	17.00
1997	18.45	19.57	18.37	17.24	18.63
1998	16.99	16.97	18.59	17.87	17.01
1999	17.82	19.83	17.80	16.99	17.80
2000	35.47	37.26	35.72	33.97	32.99
2001	39.02	40.02	38.20	41.01	37.91
2002	42.44	44.43	42.90	41.10	42.76
2003	45.00	45.79	42.30	45.15	46.91
2004	44.41	45.08	44.56	43.50	43.99
2005	47.59	46.32	56.67	48.03	47.26
2006	44.01	44.02	48.11	44.99	44.90
2007	42.42	42.23	43.14	40.61	41.71
2008	42.65	42.00	39.21	40.53	41.32
2009	33.70	43.00	35.73	34.03	33.20
2010	50.54	51.25	48.27	50.14	49.01
2011	58.31	57.88	59.33	58.87	58.80
2012	49.08	49.69	47.97	48.74	50.75
2013	52.36	52.12	50.60	51.19	51.98
2014	55.35	54.00	55.00	56.26	55.30
2015	49.15	50.03	48.12	51.96	48.79
2016	52.63	52.02	54.77	50.64	52.30
MAPE		4.70%	4.23%	3.59%	1.86%
FD		0.95	0.96	0.96	0.98
RMSE		2.34	2.62	1.35	0.99

Open in a new tab

The results shown in Table 6 reveal that the OWTHGM(1,N) model outperforms all competing models on all criteria of comparison. For the forecasts made with the test data (see Table 7), the OWTHGM(1,N) model manages to obtain the best MAPE (1.74%) and therefore the best FD (0.953). When we come to the RMSE criterion, the WTHGM(1,N) model wins with a score of 1.58.

Table 7.

Adjusted and predicted values of CO₂ emissions (in Mt) from Nigeria using test data.

Year	Real data	GM(1,N)	MLR	WTHGM(1,N)	OWTHGM(1,N)
2017	50.75	53.72	51.10	49.14	50.02
2018	47.35	47.71	47.98	48.01	48.00
2019	57.29	59.70	54.02	56.10	59.20
2020	58.58	58.56	58.36	57.01	59.00
MAPE		2.71%	1.95%	2.35%	1.74%
FD		0.973	0.980	0.976	0.983
RMSE		1.92	3.13	1.58	1.89

Open in a new tab

These results and the previous ones allow us to conclude that the new OWTHGM(1,N) model is capable of making very accurate forecasts.

Forecasting Cameroon's road transport CO₂ emissions

CO₂ emissions from the Cameroonian road transport sector over the period 2021–2030 are predicted using the new model proposed in this study and its competitors. In this respect, we used the projected values of the independent variables (RFC, PR, GDP, UP and VF) by a simple regression. These values are confirmed by forecasts of the National Development Strategy [24]. Forecasts results show with great certainty that substantial increases in total CO₂ emissions from the Cameroonian road transport sector are to be expected in the coming years. In other words, CO₂ emissions from road transports will increase from 3779 kt in 2020 to 4600 kt in 2030 (see Table 8). Thus, current CO₂ emissions will increase by 120% in less than ten years. These results are in contrast to the Cameroon government's projections which aim to reduce total CO₂ emissions by 32% before 2030 [18].

Table 8.

Projection results of road fuel CO₂ emissions in Cameroun.

	Models
Year	GM(1,1)	MLR	WTHGM(1,N)	OWTHGM(1,N)
2021	3548	3563	3490	3832
2022	3668	3650	3606	3819
2023	3789	3736	3717	3896
2024	3912	3822	3824	3992
2025	4036	3908	3929	4093
2026	4161	3995	4032	4194
2027	4286	4081	4133	4295
2028	4413	4167	4234	4397
2029	4540	4253	4333	4498
2030	4668	4339	4432	4600

Open in a new tab

Conclusions

This paper proposes an optimized wavelet transform Hausdorff grey multivariate forecasting model (abbreviated OWTHGM(1,N)). In order to validate the model, we implemented it to forecast CO₂ emissions from road fuel combustion. For this purpose, we started by showing that GDP, urban population, road transport fuel consumption, fuel prices and size of vehicle fleet could be used as determinants of CO₂ emissions. Forecasts results are compared with the classical GM(1,N), WTHGM(1,N) and MLR, allowing to conclude that:

•
GDP, size of vehicle fleet and urban population are the most significant determinants of CO₂ emissions as confirmed by Refs. [17,18,25]. With these determinants, the proposed OWTHGM(1,N) model manages to completely extract the existing connections they have with CO₂ emissions.
•
Forecasts of CO₂ emissions produced with OWTHGM(1,N) are more precise than competing models. Thus OWTHGM(1,N) is a reliable tool for monitoring CO₂ emissions from Cameroon's road transport sector.

These achievements stem from three improvements made in the classical GM(1,N) model. Initially, only significant drivers are considered. Then, all selected drivers are filtered using WT, thereby demonising all series that could hamper modelling. More so, a new time response function has been established using the Hausdorff derivative. Lastly, Rao's algorithm is used to optimise all parameters. The OWTHGM(1,N) model therefore proves to be a reliable forecasting tool and can be used to monitor the evolution of CO₂ emissions but can also be applied in other forecasting fields characterised by insufficient information.

CRediT author statement

Flavian Emmanuel Sapnken: Conceptualization, Methodology, Software, Writing - original draft preparation. Hermann Chopkap Noume: Validity tests, Data curation, Visualization, Investigation. Jean Gaston Tamba: Supervision, Validation, Writing-Reviewing and Editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

We are grateful to Departments of Logistics and Transport Engineering and the Department of Energetics and Thermal Engineering, University Institute of Technology, for providing the necessary facilities and support.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Footnotes

In order not to lose the information in the signal, the Nyquist criterion specifies that the sampling frequency must be at least twice as high as the highest frequency present in the signal. An aliasing phenomenon occurs if the sampling frequency is less than twice the maximum frequency of the analogue signal.

Soft thresholding is always a good starting point if in doubt about which technique to choose.

There are times when these abrupt changes carry meaning, and it might be desirable to retain this information while denoising the signal.

⁴

Performance problems with predictive models are mainly due to over-fitting or under-fitting. Therefore, splitting the data in two, as suggested, allows testing the generalization of the predictive model on data that it has not yet encountered during training stage.

Appendix A. Multilevel decomposition of the approximation subband

Fig. A1

Appendix B. DWT Matlab code for filtering noisy data

%% Load a signal

%% The entire process should be repeated for each data until all the input data are filtered

load noisysig.mat
figure;
subplot(2,1,1); plot(f0); grid on; title(‘Original signal’); axis tight;
subplot(2,1,2); plot(f); title (‘Original signal with noise’);
axis tight; grid on;

%% Decompose signal using Discrete Wavelet Transform

dwtmode(‘per’,’noisplay’);
wname =’sym6’;
[C, L]=wavedec(f,level,wname);
plotDetCoefHelper(f,C,L);%helperFunction to plot the coefficients at every level

%% analyse the subbands and determine the threshold

%% Denoise the signal

fd=wden(f,’rigrsure’,’s’,’sln’,level,wname);
figure;
subplot(2,1,1);
plot(f); axis tight; grid on; title (‘Noisy Signal’);
subplot(2,1,2);
plot(fd); axis tight; grid on;
title(sprint(‘Denoised Signal SNR;%0.2f dB’,−20*log10(norm(abs(f0-fd))/norm(f0))));

Appendix B. Proof of the derivation of Eq. (17)

From Eq. (13), if we let $f (t) = α_{N + 1} + \sum_{i = 2}^{N} α_{i} x_{i}^{(ξ)} (t)$ , then Eq. (13) can be expressed as:

\frac{d x_{1}^{(ξ)} (t)}{d t} + α_{1} x_{1}^{(ξ)} (t) = f (t)

(A.1)

We get the following results by multiplying both sides of Eq. (A.1) by $e^{α_{1} t}$ :

(\frac{d x_{1}^{(ξ)} (t)}{d t} + α_{1} x_{1}^{(ξ)} (t)) e^{α_{1} t} = f (t) e^{α_{1} t}

(A.2)

Expressing the left hand side of Eq. (A.2) as a derivative, we get:

\frac{d}{d t} (x_{1}^{(ξ)} (t) \cdot e^{α_{1} t}) = f (t) e^{α_{1} t}

(A.3)

Taking integrals over the interval $[1, t]$ on both sides of Eq. (A.3) we get:

\int_{τ = 1}^{τ = t} d (x_{1}^{(ξ)} (τ) \cdot e^{α_{1} τ}) = \int_{τ = 1}^{τ = t} f (τ) e^{α_{1} τ} d τ

(A.4)

Solving integrals on the left hand side of Eq. (A.4) yields:

x_{1}^{(ξ)} (t) \cdot e^{α_{1} t} - x_{1}^{(ξ)} (1) \cdot e^{α_{1}} = \int_{τ = 1}^{τ = t} f (τ) e^{α_{1} τ} d τ

(A.5)

Multiplying each term of Eq. (A.5) by $e^{- α_{1} t}$ leads to:

x_{1}^{(ξ)} (t) = x_{1}^{(ξ)} (1) \cdot e^{α_{1} (1 - t)} + \int_{τ = 1}^{τ = t} f (τ) e^{α_{1} (τ - t)} d τ

(A.6)

Recall that $x_{1}^{(ξ)} (t = 1) = x_{1}^{(0)} (1)$ , Eq. (A.6) can be rewritten as:

x_{1}^{(ξ)} (t) = x_{1}^{(0)} (1) \cdot e^{α_{1} (1 - t)} + \int_{τ = 1}^{τ = t} f (τ) e^{α_{1} (τ - t)} d τ

(A.7)

The second term on the right hand side of Eq. (A.7) can be expressed as follows using the trapezoidal integral formula:

{\begin{matrix} \int_{τ = 1}^{τ = t} f (τ) e^{α_{1} (τ - t)} d τ = \sum_{j = 2}^{t} \int_{τ = j - 1}^{τ = j} f (τ) e^{α_{1} (τ - t)} d τ \\ = \sum_{j = 2}^{t} 0.5 (f (j) e^{α_{1} (j - t)} + f (j - 1) e^{α_{1} (j - t - 1)}) \end{matrix}

(A.8)

When we substitute Eq. (A.8) in Eq. (A.7), we obtain:

{\hat{x}}_{1}^{(ξ)} (t) = x_{1}^{(0)} (1) e^{α_{1} (1 - t)} + 0.5 \sum_{τ = 2}^{t} (f (τ) e^{α_{1} (τ - t)} + f (τ - 1) e^{α_{1} (τ - t - 1)})

(A.9)

Data availability

Data will be made available on request.

References

1.Li S.H., Zhu L., Wu Y., Lei X.Q. A novel grey multivariate model for forecasting landslide displacement. Eng. Appl. Artif. Intell. 2021;103 doi: 10.1016/j.engappai.2021.104297. [DOI] [Google Scholar]
2.Nie W., Duan H. A novel multivariable grey differential dynamic prediction model with new structures and its application to carbon emissions. Eng. Appl. Artif. Intell. 2023;122 doi: 10.1016/j.engappai.2023.106174. [DOI] [Google Scholar]
3.Tien T.L. A research on the grey prediction model GM (1,n) Appl. Math. Comput. 2012;218:4903–4916. [Google Scholar]
4.Ding S., Xu N., Ye J., Zhou W., Zhang X. Estimating Chinese energy-related CO2 emissions by employing a novel discrete grey prediction model. J. Clean. Prod. 2020;259 doi: 10.1016/j.jclepro.2020.120793. [DOI] [Google Scholar]
5.Sapnken F.E. A new hybrid multivariate grey model based on genetic algorithms optimization and its application in forecasting oil products demand. Grey Syst. Theory Appl. 2023;13:406–420. [Google Scholar]
6.Sapnken F.E., Ahmat K.A., Boukar M., Biobiongono Nyobe S.L., Tamba J.G. Forecasting petroleum products consumption in Cameroon's household sector using a sequential GMC(1,n) model optimized by genetic algorithms. Heliyon. 2022;8:e12138. doi: 10.1016/j.heliyon.2022.e12138. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Sapnken F.E., Acyl A.K., Boukar M., Nyobe S.L.B., Tamba J.G. A technique for improving petroleum products forecasts using grey convolution models and genetic algorithms. MethodsX. 2023;10 doi: 10.1016/j.mex.2023.102097. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Bahrami S., Hooshmand R.A., Parastegari M. Short term electric load forecasting by wavelet transform and grey model improved by PSO (particle swarm optimization) algorithm. Energy. 2014;72:434–442. [Google Scholar]
9.Shen Q.Q., Cao Y., Yao L.Q., Zhu Z.K. An optimized discrete grey multi-variable convolution model and its applications. Comp. Appl. Math. 2021;40:58. doi: 10.1007/s40314-021-01448-z. [DOI] [Google Scholar]
10.Wang Y., Nie R., Ma X., Liu Z., Chi P., Wu W., Guo B., Yang X., Zhang L. A novel Hausdorff fractional NGMC(p,n) grey prediction model with Grey Wolf Optimizer and its applications in forecasting energy production and conversion of China. Appl. Math. Model. 2021;97:381–397. doi: 10.1016/j.apm.2021.03.047. [DOI] [Google Scholar]
11.Ding S. A novel discrete grey multivariable model and its application in forecasting the output value of China's high-tech industries. Comput. Ind. Eng. 2019;127:749–760. [Google Scholar]
12.Yin C., Mao S. Fractional multivariate grey Bernoulli model combined with improved grey wolf algorithm: application in short-term power load forecasting. Energy. 2023;269 doi: 10.1016/j.energy.2023.126844. [DOI] [Google Scholar]
13.Zhang Z., Kon M.A. Wavelet matrix operations and quantum transforms. Appl. Math. Comput. 2022;428 doi: 10.1016/j.amc.2022.127179. [DOI] [Google Scholar]
14.Ngui W.K., Leong M.S., Hee L.M., Abdelrhman A.M. Wavelet analysis: mother wavelet selection methods. AMM. 2013;393:953–958. doi: 10.4028/www.scientific.net/AMM.393.953. [DOI] [Google Scholar]
15.Navarro-Rodríguez Á., García P., Guerrero J.M. Encyclopedia of Electrical and Electronic Power Engineering. Elsevier; Oxford: 2023. Introduction to digital sampled systems and nyquist sampling theorem; pp. 652–656. J. García. [DOI] [Google Scholar]
16.Rao R.V., Pawar R.B. Constrained design optimization of selected mechanical system components using Rao algorithms. Appl. Soft Comput. 2020;89 doi: 10.1016/j.asoc.2020.106141. [DOI] [Google Scholar]
17.Tamba J.G., Njomo D., Nsouandele J.L., Bonoma B., Dongue S.B. Vol. 2. Universal Journal of Environmental Research & Technology; 2012. (Assessment of Greenhouse Gas Emissions in Cameroon's Road Transport Sector). [Google Scholar]
18.Engo J. Decoupling analysis of CO2 emissions from transport sector in Cameroon. Sustain. Cities Soc. 2019;51 doi: 10.1016/j.scs.2019.101732. [DOI] [Google Scholar]
19.Sapnken F.E., Tamba J.G. Petroleum products consumption forecasting based on a new structural auto-adaptive intelligent grey prediction model. Expert Syst. Appl. 2022;203 doi: 10.1016/j.eswa.2022.117579. [DOI] [Google Scholar]
20.Sapnken F.E., Ahmat K.A., Boukar M., Nyobe S.L.B., Tamba J.G. Learning latent dynamics with a grey neural ODE prediction model and its application. Grey Syst. Theory Appl. 2023 [Google Scholar]
21.Karakurt I., Aydin G. Development of regression models to forecast the CO2 emissions from fossil fuels in the BRICS and MINT countries. Energy. 2023;263 doi: 10.1016/j.energy.2022.125650. [DOI] [Google Scholar]
22.Wooldridge J.M. Cengage Learning; 2015. Introductory Econometrics: a Modern Approach, [Google Scholar]
23.H. Ritchie, M. Roser, P. Rosado, CO2 and greenhouse gas emissions, Our World in Data. (2020). https://ourworldindata.org/co2-emissions (accessed May 23, 2022).
24.SND30 . 1st ed. MINEPAT; Yaoundé: 2020. Stratégie Nationale de Développement 2020-2030 : Pour la Transformation Structurelle Et Le Développement Inclusif. [Google Scholar]
25.Hossain M.A., Engo J., Chen S. The main factors behind Cameroon's CO2 emissions before, during and after the economic crisis of the 1980s. Environ. Dev. Sustain. 2021;23:4500–4520. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data will be made available on request.

[bib0001] 1.Li S.H., Zhu L., Wu Y., Lei X.Q. A novel grey multivariate model for forecasting landslide displacement. Eng. Appl. Artif. Intell. 2021;103 doi: 10.1016/j.engappai.2021.104297. [DOI] [Google Scholar]

[bib0002] 2.Nie W., Duan H. A novel multivariable grey differential dynamic prediction model with new structures and its application to carbon emissions. Eng. Appl. Artif. Intell. 2023;122 doi: 10.1016/j.engappai.2023.106174. [DOI] [Google Scholar]

[bib0003] 3.Tien T.L. A research on the grey prediction model GM (1,n) Appl. Math. Comput. 2012;218:4903–4916. [Google Scholar]

[bib0004] 4.Ding S., Xu N., Ye J., Zhou W., Zhang X. Estimating Chinese energy-related CO2 emissions by employing a novel discrete grey prediction model. J. Clean. Prod. 2020;259 doi: 10.1016/j.jclepro.2020.120793. [DOI] [Google Scholar]

[bib0005] 5.Sapnken F.E. A new hybrid multivariate grey model based on genetic algorithms optimization and its application in forecasting oil products demand. Grey Syst. Theory Appl. 2023;13:406–420. [Google Scholar]

[bib0006] 6.Sapnken F.E., Ahmat K.A., Boukar M., Biobiongono Nyobe S.L., Tamba J.G. Forecasting petroleum products consumption in Cameroon's household sector using a sequential GMC(1,n) model optimized by genetic algorithms. Heliyon. 2022;8:e12138. doi: 10.1016/j.heliyon.2022.e12138. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0007] 7.Sapnken F.E., Acyl A.K., Boukar M., Nyobe S.L.B., Tamba J.G. A technique for improving petroleum products forecasts using grey convolution models and genetic algorithms. MethodsX. 2023;10 doi: 10.1016/j.mex.2023.102097. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0008] 8.Bahrami S., Hooshmand R.A., Parastegari M. Short term electric load forecasting by wavelet transform and grey model improved by PSO (particle swarm optimization) algorithm. Energy. 2014;72:434–442. [Google Scholar]

[bib0009] 9.Shen Q.Q., Cao Y., Yao L.Q., Zhu Z.K. An optimized discrete grey multi-variable convolution model and its applications. Comp. Appl. Math. 2021;40:58. doi: 10.1007/s40314-021-01448-z. [DOI] [Google Scholar]

[bib0010] 10.Wang Y., Nie R., Ma X., Liu Z., Chi P., Wu W., Guo B., Yang X., Zhang L. A novel Hausdorff fractional NGMC(p,n) grey prediction model with Grey Wolf Optimizer and its applications in forecasting energy production and conversion of China. Appl. Math. Model. 2021;97:381–397. doi: 10.1016/j.apm.2021.03.047. [DOI] [Google Scholar]

[bib0011] 11.Ding S. A novel discrete grey multivariable model and its application in forecasting the output value of China's high-tech industries. Comput. Ind. Eng. 2019;127:749–760. [Google Scholar]

[bib0012] 12.Yin C., Mao S. Fractional multivariate grey Bernoulli model combined with improved grey wolf algorithm: application in short-term power load forecasting. Energy. 2023;269 doi: 10.1016/j.energy.2023.126844. [DOI] [Google Scholar]

[bib0013] 13.Zhang Z., Kon M.A. Wavelet matrix operations and quantum transforms. Appl. Math. Comput. 2022;428 doi: 10.1016/j.amc.2022.127179. [DOI] [Google Scholar]

[bib0014] 14.Ngui W.K., Leong M.S., Hee L.M., Abdelrhman A.M. Wavelet analysis: mother wavelet selection methods. AMM. 2013;393:953–958. doi: 10.4028/www.scientific.net/AMM.393.953. [DOI] [Google Scholar]

[bib0015] 15.Navarro-Rodríguez Á., García P., Guerrero J.M. Encyclopedia of Electrical and Electronic Power Engineering. Elsevier; Oxford: 2023. Introduction to digital sampled systems and nyquist sampling theorem; pp. 652–656. J. García. [DOI] [Google Scholar]

[bib0016] 16.Rao R.V., Pawar R.B. Constrained design optimization of selected mechanical system components using Rao algorithms. Appl. Soft Comput. 2020;89 doi: 10.1016/j.asoc.2020.106141. [DOI] [Google Scholar]

[bib0017] 17.Tamba J.G., Njomo D., Nsouandele J.L., Bonoma B., Dongue S.B. Vol. 2. Universal Journal of Environmental Research & Technology; 2012. (Assessment of Greenhouse Gas Emissions in Cameroon's Road Transport Sector). [Google Scholar]

[bib0018] 18.Engo J. Decoupling analysis of CO2 emissions from transport sector in Cameroon. Sustain. Cities Soc. 2019;51 doi: 10.1016/j.scs.2019.101732. [DOI] [Google Scholar]

[bib0019] 19.Sapnken F.E., Tamba J.G. Petroleum products consumption forecasting based on a new structural auto-adaptive intelligent grey prediction model. Expert Syst. Appl. 2022;203 doi: 10.1016/j.eswa.2022.117579. [DOI] [Google Scholar]

[bib0020] 20.Sapnken F.E., Ahmat K.A., Boukar M., Nyobe S.L.B., Tamba J.G. Learning latent dynamics with a grey neural ODE prediction model and its application. Grey Syst. Theory Appl. 2023 [Google Scholar]

[bib0021] 21.Karakurt I., Aydin G. Development of regression models to forecast the CO2 emissions from fossil fuels in the BRICS and MINT countries. Energy. 2023;263 doi: 10.1016/j.energy.2022.125650. [DOI] [Google Scholar]

[bib0022] 22.Wooldridge J.M. Cengage Learning; 2015. Introductory Econometrics: a Modern Approach, [Google Scholar]

[bib0023] 23.H. Ritchie, M. Roser, P. Rosado, CO2 and greenhouse gas emissions, Our World in Data. (2020). https://ourworldindata.org/co2-emissions (accessed May 23, 2022).

[bib0024] 24.SND30 . 1st ed. MINEPAT; Yaoundé: 2020. Stratégie Nationale de Développement 2020-2030 : Pour la Transformation Structurelle Et Le Développement Inclusif. [Google Scholar]

[bib0025] 25.Hossain M.A., Engo J., Chen S. The main factors behind Cameroon's CO2 emissions before, during and after the economic crisis of the 1980s. Environ. Dev. Sustain. 2021;23:4500–4520. [Google Scholar]

PERMALINK

Forecasting CO2 emissions from road fuel combustion using grey prediction models: A novel approach

Flavian Emmanuel Sapnken

Hermann Chopkap Noume

Jean Gaston Tamba

Abstract

Graphical abstract

Introduction

Methods

The conventional GM(1,N) model

Data filtration using wavelet transform

Fig. A1.

The novel Hausdorff GM(1,N) model

Optimizing wavelet transform Hausdorff GM(1,N) parameters

Fig. 1.

Selection of meta-heuristic algorithm

Improved Rao algorithm for the WTHGM(1,N) model

Applications and numerical simulation

Data selection, filtering and performance measurement

Table 1.

Table 2.

Modelling CO2 emissions

Table 3.

Table 4.

Fig. 2.

Fig. 3.

Table 5.

Table 6.

Table 7.

Forecasting Cameroon's road transport CO2 emissions

Table 8.

Conclusions

CRediT author statement

Declaration of Competing Interest

Acknowledgments

Acknowledgments

Funding

Footnotes

Appendix A. Multilevel decomposition of the approximation subband

Appendix B. DWT Matlab code for filtering noisy data

Appendix B. Proof of the derivation of Eq. (17)

Data availability

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Forecasting CO₂ emissions from road fuel combustion using grey prediction models: A novel approach

Modelling CO₂ emissions

Forecasting Cameroon's road transport CO₂ emissions