Skip to main content
Journal of Environmental and Public Health logoLink to Journal of Environmental and Public Health
. 2023 Jan 31;2023:5903362. doi: 10.1155/2023/5903362

Pakistan CO2 Emission Modelling and Forecasting: A Linear and Nonlinear Time Series Approach

Kassim Tawiah 1,2,, Muhammad Daniyal 3, Moiz Qureshi 4
PMCID: PMC9904910  PMID: 36761238

Abstract

Pakistan is considered among the top five countries with the highest CO2 emissions globally. This calls for pragmatic policy implementation by all stakeholders to bring finality to this alarming situation since it contributes greatly to global warming, thereby leading to climate change. This study is an attempt to make a comparative analysis of the linear time series models with nonlinear time series models to study CO2 emission data in Pakistan. These linear and nonlinear time series models were used to model and forecast future values of CO2 emissions for a short period. To assess and select the best model among these linear and nonlinear time series models, we used the root mean square error (RMSE) and the mean absolute error (MAE) as performance indicators. The outputs showed that the nonlinear machine learning models are the best among all other models, having the lowest RMSE and MAE values. Based on the forecasted value of the nonlinear machine learning neural network autoregressive model, Pakistan's CO2 emissions will be 1.048 metric tons per capita by 2028. The increasing trend in emissions is a frightening and clear warning, suggesting that innovative policies must be initiated to reduce the trend. We encourage the Pakistan government to price CO2 emissions by companies and entities per ton, adapt electricity production from hydro, wind, and different sources with no emissions of CO2, initiate rigorous planting of more trees in the populated areas of Pakistan as forest covers, provide incentives to companies, organisations, institutions, and households to come out with clean technologies or use technologies with no CO2 emissions or those with lower ones, and fund more studies to develop clean and innovative technologies with less or no CO2 emissions.

1. Introduction

The collection of greenhouse gas (GHG) emissions in airspace is creating chaos across the globe and destroying life, economy, health, and food. Carbon (IV) oxide (CO2) is a major component of GHG [1]. The world is nowhere near securing a global temperature rise of less than two degrees Celsius as proposed in the Paris Agreement of the United Nations (UN) Framework Convention on Climate Change (UNFCCC). Using the 1990 criterion, most countries are still releasing more GHG into the atmosphere, with some at a standstill and others releasing less [1]. The UN Environmental Program (UNEP) Climate Action Note indicated that the world is in a state of climate emergency [1].

Pakistan is labelled among the countries with the highest GHG emissions in the world by the UNEP [1]. It contributes to 1.02% of global GHG emissions. Pakistan emitted an estimated 504.59 million tons of GHG in 2018. It is one of the most impacted nations from unfriendly effects of environmental change as well as air contamination. GHG emissions are the leading cause of global warming, thereby becoming the maximum essential research issues within the fields of technological know-how and politics [1].

In Southern Asia, Pakistan is ranked among the developing economies with swift economic positive growth. This is expected to continue for a long time. Agriculture is the principal dominant area in Pakistan's economic growth irrespective of the growing industrial sector. However, this industrial drive, coupled with high population growth, has caused massive deforestation, resulting in less absorption of CO2 by plants, leading to a higher CO2 concentration in the atmosphere [2, 3].

Pakistan is ranked first among countries in Asia facing mass deforestation as a result of bulk utilization of natural energy from the environment. This is causing serious degradation of the environment. This degradation of the environment, together with deforestation, has brought about global warming, leading to climate change. The critical factor of Pakistan's commercial revolution is reworking its economy from organic economies based totally on human and animal energy to inorganic economies, which are completely dependent on fossil fuels. Interestingly, the by-product of fossil fuels is CO2, thereby contributing to the concentration of CO2 in the environment [2, 3].

The quest to be an industrialized giant definitely comes at a price. Pakistan is geared towards a full-industrial revolution which encourages the use of fossil fuels to power the growing industry in the country. The growth in population means expanding cities by destroying green vegetation, which absorbs most of the CO2 in the atmosphere. The growing population also means increasing cars with carbon emissions being used in the country [2, 3].

The Government of Pakistan has put in place a lot of programmes to reduce GHG emissions in the country. Great progress has been chalked up in climate action that ranges from policies, programmes, and nature-based solutions (NBS) to technology-based innovations [4]. Pakistan, recognizing the role of nature in climate modification and diminution, has put forward robust natural capital restoration plans including the Ten Billion Tree Tsunami Program as well as the Protected Drive. These programmes have contributed to enhancing livelihoods and giving opportunities to the vulnerable, including youth and women. In addition, Pakistan has introduced a number of policy actions focused on mitigating GHG emissions from high-emission sectors, such as energy and industry [4].

Notwithstanding, this available information indicates a rise in CO2 emissions in Pakistan. This needs to be scientifically substantiated and verified to assist the government to restrategize in an attempt to achieve the targeted goal in the fight of reducing these emissions in order to achieve the Paris Agreement of the UNFCCC. This study provides a statistical and machine learning framework to delve deep into the discourse of CO2 emissions in Pakistan from 1960 to 2018. It provides CO2 emission forecasts for future and possible policy initiatives and directions to combat this issue based on scientific results to meet the UNFCCC target.

1.1. Literature Review

The world is being confronted with significant difficulties connected with an unnatural weather change and discharge of ozone-depleting substances in large amounts. In 2017, energy businesses represented 46% of all CO2 discharges worldwide [5]. Bokde et al. [5] proposed two-time series disintegration techniques for transient determination of CO2 outflow with a bunch of cutting-edge models for a 48-hour skyline power market. By estimating benchmarks for France, they showed that their new technique has a mean outright rate blunder, that is, 25% lower than that of existing models. Furthermore, they illustrated the use of the conjecture for booking adaptable power utilization in Germany, Norway, Denmark, and Poland. Their results showed that booking an adaptable block of 4 hour of power utilization at a 24-hour stretch can on average diminish subsequent CO2 discharge in all countries [5].

The carbon exchanging market has turned into a powerful weapon in easing fossil fuel by-products in China, and carbon cost is at the centre of its activity. Thus, the carbon exchanging market fills in as an imperative part in gauging carbon cost precisely ahead of time [6]. Sun and Li [6] imaginatively investigated a group of driven long short-term memory network (LSTM) models in light of reciprocal outfit experimental mode decay (ROEMD) for carbon cost gauging, applying it to each of the eight carbon exchanging piloting centres in China. ROEMD was first executed for mode change to disintegrate the first convoluted mode into a bunch of straightforward modes. Accordingly, LSTM was utilized to demonstrate the planning between time-slacking factors and every model's objective quality, thereby constructing numerous LSTM models for comparison. At last, converse ROEMD computation was acquainted with the coordinates' expected consequences of the multimode on eventual outcomes. Its viable application at the same time embraced every one of the eight carbon piloting centres in China, covering their compared carbon cost information over a significantly extended period. The acquired outcomes showed that the proposed model had adequate precision in carbon cost anticipation in China in contrast to the single LSTM model as well as other ordinary machine learning models. Moreover, as indicated by the extent of its application, the creative model displayed solid dependability and comprehensiveness [6].

Wu and Liu [7] proposed a multiresolution combo-predicting model that uses ensemble empirical mode decomposition (EEMD-ADD) to improve the predicting accuracy of the cost of carbon. They achieved it by decomposing carbon price sequencing using EEMD into numerous intrinsic framework functions (IFFs) by dividing these IFFs into several high-frequency compartments, low-frequency compartments, and drift compartments after which the least square vector support machine (LSSVM), particle swarm optimization LSSVM (PSO-LSSVM), and bat algorithm-LSSVM (BA-LSSVM) were utilized to forecast three compartments, respectively. Similarly, Yun et al. [8] built a robust carbon price-predicting model of complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN)-long short-term memory (LSTM) by blending the merits of CEEMDAN in decaying multiresolution time-frequency carbon price waves and the LSTM model by installing monetary waves.

A report by Eckstein et al. [9] positioned Pakistan as the fifth generally impacted country because of environmental changes throughout the course of recent years, while the author in [10] positioned Pakistan's air as the second generally contaminated in 2020. To diminish these effects, the nation needs to take broad variation and alleviation measures while changing the energy area towards dirtying and carbon impartial choices [11].

Existing studies imply that greater than two-thirds of GHG emissions in Pakistan come from fossil power-related CO2 emissions. Consequently, forecasting CO2 emissions is crucial for adjusting regulations to mitigate weather changes. The forecasting of CO2 emissions has been mentioned dramatically in [12].

Pao and Tsai [13] implemented the grey model to obtain CO2 emissions in Brazil between 1980 and 2007. A combination of models based on the optimization algorithms of quantum harmony and discounted mean square has been established and employed in CO2 emission forecasting for the world's top five countries with high CO2 emissions [14]. Auffhammer and Carson [15] came up with China's CO2 emission path using a set of panel data in a provincial-level framework [1619]. The application of the traditional grey model and the upgraded grey model has been implemented to forecast CO2 emissions [20, 21].

From qualitative and quantitative perspectives, numerous discussions have been conducted on the relationship between consumption of energy, economic growth, and CO2 emissions [22]. Al-mulali and Sab [23] investigated the influence of the consumption of basic energy and emissions of CO2 on sixteen developing countries, focusing on the impact on the development of their economies. Wang et al. [24] examined the significant factors contributing to CO2 emissions in China's road freight transport. Zhang and Nian [25] computed the emissions of CO2 and examined the factors contributing to emissions in China's transport sector. In China, Lin et al. [26] developed a new bootstrap ARDL bounding test for CO2 emissions and industrial development. Pao and Tsai [27] conducted an examination of the dynamism of the causal relationship, linking the emission of pollutants, consumption of energy, and their output in Britain, Russia, India, China, and other countries over a definite period [2834].

Little work has been conducted on providing mathematical and statistical modelling techniques for modelling and predicting CO2 emissions in Pakistan and where the country stands in the future concerning this important issue. It is against this backdrop that this paper provides a modelling and forecasting technique based on linear and nonlinear time series models. This study predicts the amount of CO2 emissions in Pakistan for almost a decade to assist the Pakistan government and other stakeholders to plan ahead to devise ways of mitigating the effects of CO2 emissions or possibly curtailing them.

Section 2 of the paper presents the source of data, data characteristics, and the modelling and forecasting techniques applied. Section 3 presents the discussion and the results, while the last section presents the conclusions and recommendations of the study.

2. Materials and Methods

2.1. Data

The data are made up of CO2 emissions in Pakistan from 1960 to 2018. The amount of CO2 was measured in metric tons per capita according to international standards. The data were obtained from the World Bank data indicators of Pakistan (https://databank.worldbank.org/). Figure 1 shows CO2 emissions (metrics tons per capita) from 1960 to 2018 as obtained from the World Bank. It can be observed from the figure that there is an increasing trend, which is an indication of a seasonal trend in the series. Seasonality must therefore be removed from the series before applying our modelling and forecasting techniques. The descriptive statistics in Table 1 show that the average CO2 emissions from 1960 to 2018 were 0.572 metric tons per capita, with a minimum CO2 emission of 0.314 metric tons per capita and a maximum of 0.981 metric tons per capita.

Figure 1.

Figure 1

Time plot of CO2 in Pakistan from 1960 to 2018 (https://databank.worldbank.org/).

Table 1.

Descriptive statistics of CO2 in Pakistan from 1960 to 2018.

Statistic Estimate
Mean 0.572
Median 0.567
Maximum 0.982
Minimum 0.308
Standard deviation 0.191
First quartile 0.390
Third quartile 0.735
Skewness 0.259
Kurtosis 1.845

2.2. Methods

We present a brief outline of all linear and nonlinear time series models implemented in this study. In what follows, we illustrate the linear time series followed by nonlinear machine learning models.

2.2.1. Linear Time Series Models

In this section, we give a brief outline of linear time series models, such as the Box–Jenkins ARIMA, trigonometric seasonality, Box–Cox transformation, ARMA errors, trend and seasonal model, naïve model, and exponential smoothing model.

(1) Box–Jenkins ARIMA Modelling Approach. The ARIMA (p, d, q) [35, 36] can be represented by

X^t=μ+θ1x^t1++θpx^tp++α1et1++αqetq+et. (1)

With lagged values θ1y^t1,,θpy^tp and lagged errors α1et−1,…, αqetq. The p, d, and q of the series x^t are the autoregressive term order, the differencing degree of the series, and the moving average order term, respectively. The white noise et has mean 0 and variance σ2. The differencing of the series x^t can be conducted once or more times.

The seasonal multiplicative ARIMA model has the form ARIMA (p, d, q) × (P, D, Q) written as

θpKδPEf1Ed1EfDX^t=+ϑqEμQEfεt, (2)

with θp(E)=1 − θ1E − …θpEp; δP(Ef)=1 − δ1Ef − …−δPEPf;  ϑq(K)=1 − ϑ1E − …ϑqEq;  an d μQ(Ef)=1 − μ1Kf − …−μQEQf.

In the formula, f and K are the seasonality frequency and the shift balanced operator, respectively. The seasonal difference and ordinary differencing degrees are represented by D and d, respectively. δP(Ef)an d θp(E) are the seasonal autoregressive polynomial of order P and regular autoregressive polynomial of order p, respectively while μQ(Ef)an dϑq(E) are the seasonal moving average of order Q and polynomials of the regular moving average of order q, respectively. Similarly, ∆ = ρ(1 − θ1 − …θp)(1 − δ1 − …−δP), where the mean of the process 1Ed1EfDX^t is ρ. The white noise εt has mean 0 and variance σ2.

Figure 2 shows the diagrammatical display of the steps involved in this methodology. In Figure 2, it can be noticed that there are four steps involved in ARIMA modelling. We discuss the main steps involved in this method briefly.

Figure 2.

Figure 2

Steps involved in the Box–Jenkins methodology (source: Anvari et al. [37]).

Step 1 . —

Identification of the ARIMA Model

This step requires the series to be stationary and parameters independent of time. In time series analysis, it is often found that the series is not white noised in advance. To make the series white noised, differencing is required. To test whether the series is stationary or not, we use the augmented Dickey–Fuller test. Once the series becomes stationary, graphical tools such as the autocorrelation function (ACF) and partial autocorrelation function (PACF) are implemented to identify the order of the candidate model.

Step 2 . —

Model Estimation

With the help of the visual display of the autocorrelation function (ACF) and partial autocorrelation function (PACF) of the series, the appropriate candidate model for the series is estimated. Different combinations of candidate models are applied, and the final model is selected based on the criteria of accuracy of parameters.

Step 3 . —

Diagnostic Checks

Here, the selected candidate model passes through various tests, which include the ACF plot, QQ-Norm plot, and Shapiro–Wilk to verify the normality of residuals. Once the model passes all these tests, the model is considered one of the best models for the next step.

Step 4 . —

Forecasting

In this step, the candidate model fulfilling the three steps discussed earlier is used for predicting the future values of the series. The efficiency of the forecasted model is measured by the root mean square error (RMSE) and the mean absolute error (MAE). The mathematical expressions for RMSE and MAE are given by

RMSE=1tni=n+1txix^i2,MAE=1tni=N+1txix^i, (3)

where x1,…, xn and xn+1,…, xt are the partitions of the data for training and forecasting, respectively. In our case, we used the complete data for both training and forecasting. The RMSE and MAE assess the quality as well as summarize the model for superior forecasting. The model with the smallest RMSE and MAE values is chosen for forecasting [5]. Other indicators of measuring efficiency can also be implemented.

(2) Trigonometric seasonality, Box-Cox transformation, ARMA errors, Trend, and Seasonal (TBATS) Modelling Approach

The trigonometric seasonality, Box–Cox transformation, ARMA errors, trend, and seasonal component (TBATS) model is mainly used to predict the time series with complex seasonal patterns by exponential smoothing. The TBATS model utilizes the representation of trigonometric seasonal components based on the Fourier series. The TBATS model [38, 39] is given by

xtα=τt1+βkt1+i=1Tαtnii+ϑt,τt=τt1+βkt1+γϑt,kt=βkt1+ωϑt,ϑt=i=1pδiϑt1+i=1qθiεt1+εt, (4)

wherext(α) is the Box–Cox-transformed time series at moment t and αtni(i) is the ith seasonal component. τt is the local level, while kt is the trend with damping. ϑt is ARMA (p, q) and εt is the white noise from a Gaussian process. T is the seasonality amount, whereas ni is the length of the ith seasonal period. The Box–Cox transformation is α, while β is trending damping. Also, ω, γ represent the smoothing and δi,θi are the ARMA (p, q) coefficients. εt is the white noise. The seasonal component of the TBATS model is given by

zti=i=1lizj,ti,zj,ti=zj,t1icosξi+zj,t1isinξi+ϕ1iϑt,zj,ti=zj,t1isinξi+zj,t1icosξi+ϕ2iϑt, (5)

whereξi = 2πj/ni and ϕ1(i), ϕ2(i) are the seasonal smoothing.

(3) Naïve Modelling Approach. The naïve modelling approach utilizes the last year's genuine data as a figure for the current year. The following year's estimate t+1 will be equivalent to the current year's information [33]. The supposition that its interest lies later in the period will be equivalent to the interest in the last period. The expression of the naïve method [37, 40, 41] is

X^t+k=X^t, (6)

The seasonal naïve model for the period t+k is of the form

X^t+k=X^t+kng+1, (7)

where n is the period of seasonality and g is the integer part of k − 1/n, which is the number of years prior to the forecast year t+k.

(4) Exponential Smoothing (ETS) Modelling Approach. The exponential smoothing (ETS) technique is identically connected to one or more stochastic models. It likewise follows the property of robustness and treated as a model of critical extrapolation. The thought behind the ETS is that the estimates from this approach are the weighted normal average of past data, and weights exponentially decay with time. That is, the current data have bigger loads in contrast to the previous ones. In this way, ETS delivers savvy forecasting in contrast to typical techniques [37, 42, 43].

The simplest form of ETS [37, 42, 43] is given by

x^t+1t=βxt+β1βxt1+β1β2xt2+, (8)

where β, 0 ≤ β ≤ 1, is the parameter for smoothing and t + 1 is the time for one-step-forward average forecast for the series x1,…, xt. β controls the rate of decrease in weights.

For ETS (A, N, and N), the additive model [37, 42, 43] is represented mathematically by

x^t=Ft1+εt, (9)
Ft=Ft1+βεt, (10)

where (9) is the equation for the data and (10) is for the transition.

The ETS (A, N, and N) multiplicative model [37, 42, 43] is of the form

x^t=Ft11+εt,Ft=Ft11+βεt, (11)

where t is the new level and t−1 is the previews level. εt is white noise with mean 0 and variance, σ2.

We, therefore, present the nonlinear machine learning time series model after thoroughly discussing linear time series models.

2.2.2. Nonlinear Machine Learning Time Series Models

Here, we briefly discuss neural network autoregressive and multilayer perceptron nonlinear machine learning time series models.

(1) Neural network autoregressive (NNAR) modelling approach. A neural network autoregressive (NNAR) model is obtained when the lagged values of the time series are input to a neural network. NNAR (p, r) denotes p delayed inputs and r nodes. NNAR (p, 0) equals ARIMA (p, 0,0) without stationarity parameterization. The mathematical expression for NNAR (p, r) [36, 4446] is of the form

hy=β0+r=1Rβrszk0+j=1pzrjyj, (12)

Model (12) can be constructed in two different stages, with the first being the activation R. Also, the hidden layer B(r) is computed as

Br=br=szr0+j=1pzrjyj, (13)

where r = 1,…, R an dyj = yt−1, …, ytp.s is a previously defined nonlinear activation function. Every B(r) acts as a unique transformation characteristic br(y). Mathematically, the hidden layer,

hy=β0+r=1RβrBr, (14)

receives R instigations.

The NNAR model assumes a sigmoid function,

fω=11+expω, (15)

used in the linear function translation of 0 probability to a probability of 1.

(2) Multilayer Perceptron (MLP) Model. The structure of the multilayer perceptron (MLP) model [4750] is composed of the input layer, hidden layer, and output layer with limited nonlinear functions that are differentiable. In the MLP, information in the form of a nonlinear differentiable function is processed by artificial neurons from the input layer through the hidden layer to the output layer, yielding a response in the form of a disjointed feed-forward network algorithm. The mathematical expression of the MLP is given by

tx=ksp=0Kg1p0kn=0Ngpnicn+dn, (16)

where cn, dn,k, ks, t(x), gpni, and g1p0 are the network inputs, bias of the network, activation function of intermediate layers, output layer activation function, output signal, weights of the intermediate layer, and connections of output neurons, respectively. Figure 3 shows the MLP model with a single hidden layer.

Figure 3.

Figure 3

Structure of multilayer perceptron (MLP) modelling with a single hidden layer.

The RMSE and MAE will be used for all model comparisons.

R packages, forecastHybrid, forecast, Tplyr, caTools, T-series, nnet, TTR, and ForecastTB [5157] were used for all modelling and forecasting.

3. Results and Discussion

From the time series plotted in Figure 3, the series is not stationary. We verified this by applying stationarity at the level (0) and found that the series is not stationary at the level (0). This means that the mean, variance, and covariance are not stationary over some time. This violation of nonstationarity was removed by taking the first difference of the series. We implemented the Dickey–Fuller (DF) test without drift, the Phillips–Perron (PP) test without drift, to verify whether our series is stationary or not, and the structural break (Chow test) [57] test to check whether there is a structural change in the series. The Chow test indicated that there is no structural change in the series as the p value is greater than 0.05. As a result, the series becomes stationary at the level (0). The output of these tests is given in Table 2 for the level (0).

Table 2.

The augmented Dickey–Fuller, Phillips–Perron, and structural break (Chow test) tests for CO2 emission data at the level (0).

Item Estimate
Dickey–Fuller −3.0238
Lag order 3
p value 0.161
Phillips–Perron unit root test −9.6469
Lag order 3
p value 0.5404
Structural break test
(Chow test)
p value 1.00

The Phillips–Perron unit root test: 9.6469.

From Table 2, the series is not stationary at the level (0), so we implemented a single differencing technique to make the series stationary. After making the series stationary, we searched for the best candidate model for the CO2 emission data by producing the correlogram of the differenced series. The visual autocorrelation (ACF) and partial autocorrelation (PACF) plots are presented in Figure 4.

Figure 4.

Figure 4

Correlogram of the first difference of the series of CO2 emissions.

In Figure 4, the ACF and PACF suggest that the estimated ARIMA model for modelling CO2 emissions includes two terms from the autoregressive (AR) part and one term from the moving average (MA) part. In the correlogram, the suitable ARIMA model for the series is ARIMA (4,1,7) since the lags of ACF and PACF are out of the boundaries on 5 and 7.

Using the complete dataset, we presented the visual single horizon forecast of the ARIMA, ETS, TBATS, naïve, NNAR, and MLP models in Figure 5, Figure 6, Figure 7, Figure 8, Figure 9, and Figure 10, respectively, to assess their visual performance. We traced the model performance visually to show the fitted versus the observed values. Figure 5, Figure 6, Figure 7, and Figure 8 show the visuals of linear time series modelling and forecasting, while Figure 9 and 10 show the visuals of the modelling and forecasting of nonlinear machine learning time series models.

Figure 5.

Figure 5

The ARIMA model for modelling and forecasting CO2 emissions.

Figure 6.

Figure 6

The ETS model for modelling and forecasting CO2 emissions.

Figure 7.

Figure 7

The TBATS model for modelling and forecasting CO2 emissions.

Figure 8.

Figure 8

The naïve model for modelling and forecasting CO2 emissions.

Figure 9.

Figure 9

The NNAR model for modelling and forecasting CO2 emissions.

Figure 10.

Figure 10

The MLP model for modelling and forecasting CO2 emissions.

We used the nnfor and neuralnet packages in R to perform the analyses for the nonlinear machine learning time series models. The packages adjust itself for the optimum number of parameters according to the behaviour of the series.

Although all models showed a similar trend as the original series (observed), the nonlinear machine learning time series models had a more closer trend compared to the linear time series models. This is an indication that the nonlinear machine learning models have superior forecasting qualities than the linear time series models.

Except for ETS and naïve models (Figures 6 and 8) which forecasted stable CO2 emissions in subsequent years, all other models forecasted increasing CO2 emissions in Pakistan in subsequent years. This trend is worrying and devastating and similar to that observed by Butt et al. [11].

We estimated the RMSE and MAE for both linear and nonlinear time series models to illustrate the numerical performance of our models. Table 3 shows the RMSE and MAE estimates of all models using the complete dataset. From the table, we observe that RMSE values decrease in the order TBATS, naïve, ETS, ARIMA (4, 1, 7), MLP, and NNAR (1, 1), respectively, with the TBATS having the largest value and NNAR (1, 1) having the least value. For MAE values, the decreasing order is of the form TBATS, naïve, ETS, ARIMA (4, 1, 7), MLP, and NNAR (1, 1), respectively, with the TBATS having the largest value and NNAR (1, 1) having the least value. We noticed that both nonlinear machine learning time series models had smaller RSME and MAE values than all linear time series models, indicating their superiority to linear time series models. The NNAR (1, 1) model's least RSME and MAE values of 0.0007 and 0.0005, respectively, is a clear evidence that the machine learning nonlinear NNAR (1, 1) model possesses a superior forecasting quality and data assessment than the rest of the time series models. As a result, we utilized the NNAR (1, 1) model for forecasting CO2 emissions in Pakistan from 2019 to 2028. The forecast values as illustrated in Figure 9 show that CO2 emissions in Pakistan will be 1.048 metric tons per capita by 2028. This depicts a rapid increase in CO2 emissions from previous years. These results are in line with those of Aftab et al. [2] and Faruque et al. [3].

Table 3.

RMSE and MAE estimates for linear and nonlinear time series models for the complete data.

Candidate models RMSE MAE
Naïve 0.0304 0.0236
ETS 0.0302 0.0232
ARIMA (4, 1, 7) 0.0239 0.0189
TBATS 0.0308 0.0240
NNAR (1, 1) 0.0007 0.0005
MLP 0.0020 0.0017

4. Conclusion

The present study related to forecasting and modelling CO2 emissions in Pakistan clearly warns that the current level of efforts made by the Government of Pakistan is not enough to capture the changes in climate change by reducing CO2 emissions. The increasing trend in emissions is a frightening and clear warning to policymakers. The increasing CO2 emissions is a serious threat to healthy living and fuels global warming vis-à-vis climate change. We call for pragmatic policy implementation by all stakeholders to bring finality to this alarming situation. To get a clearer picture of its increasing trend and to look for an accurate prediction of future CO2 emissions in Pakistan, this study employed a linear and nonlinear time series modelling approach to study CO2 emission data in Pakistan from 1960 to 2018, namely, ARIMA, naïve, TBATS, ETS, NNAR, and MLP. These linear and nonlinear time series models were used to forecast CO2 emissions for almost a decade. We utilized the complete data to present visual single horizon forecast for all models for easy visual comparison. Apart from the ETS and naïve models, which forecasted stable CO2 emissions in subsequent years, all other models forecasted increasing CO2 emissions in Pakistan in subsequent years. Our results reflected the findings of Ullah et al. [58]. Studies in China and India showed similar CO2 emission trends [15, 59, 60]. The RMSE and the MAE were used to check and assess the quality of the forecasting of models using the complete dataset. We observed that both nonlinear machine learning time series models had smaller RMSE and MAE values than all linear time series models, indicating their superiority to them. The NNAR (1, 1) model had the least RMSE and MAE values for the complete dataset. We, therefore, chose it as our best model for forecasting. Based on the forecasted value of the NNAR (1, 1) model, Pakistan's CO2 emissions will be 1.048 metric tons per capita by 2028. These results are in line with those of Aftab et al. [2] and Faruque et al. [3]. To achieve the targeted goal in the fight of reducing CO2 emissions, in order to accomplish the Paris Agreement of the UNFCCC, we propose that the Pakistan government should adapt the following:

  1. We should charge US$ 1000 per ton of CO2 emitted by companies and entities in every quarter of the year [61, 62]. This amount, which is set out by international authorities to curtail CO2 emissions, should be increased with time to discourage the use of high emitting fuels and encourage the use of low emitting ones. When this is initiated, it can reduce CO2 emissions by over fifty percent [61, 62].

  2. We should produce electricity from hydro, wind, and different sources with no emissions of CO2. This can reduce CO2 emissions in excess of seventy percent as well as make electricity cheaper [61].

  3. We should initiate rigorous planting of more trees, at least five million per year, in the populated areas of Pakistan as forest covers to absorb a greater chunk of emitted CO2 in the atmosphere. Studies reveal that five million trees in a particular area can absorb seventy percent of CO2 concentrates in the atmosphere in the area [61]. Therefore, this could reduce the concentration of CO2 in the atmosphere by over seventy percent of the recent value [61].

  4. We should provide, as a matter of urgency, incentives to companies, organisations, institutions, and households to come out with clean technologies or use technologies with no CO2 emissions or those with lower ones [61].

  5. We should fund more studies to develop clean and innovative technologies with less or no emissions [61]. This will go a long way to reduce emissions by over forty percent.

Our study can be used as a suggestion for forecasting CO2 emissions depending on different and dynamic settings. New policies for future purposes can be formulated and devised in light of this study. Our study can be extended to comparative analysis with other machine learning models using different activation functions.

Data Availability

The data consist of CO2 emissions in Pakistan from 1960 to 2018, available at https://databank.worldbank.org/.

Conflicts of Interest

The authors declare that there have no conflicts of interest.

Authors' Contributions

KT, MD, and MQ conceived the idea, planned the study, suggested the statistical methodology, drafted the manuscript, and reviewed the manuscript. All authors read and approved the final manuscript.

References

  • 1.United Nations. Climate actions notes. 2021. https://www.unep.org/explore-topics/climate-action/what-we-do/climate-action-note/state-of-climate.html?gclid=Cj0KCQjwyt-ZBhCNARIsAKH1177SvL-qGAEaJyJtSDcYVKRAaMUh46G9VnwAloRnSPA8Wz5_aVTCKcEaAsoeEALw_wcB .
  • 2.Aftab S., Ahmed A., Chandio A. A., Korankye B. A., Ali A., Fang W. Modeling the nexus between carbon emissions, energy consumption, and economic progress in Pakistan: evidence from cointegration and causality analysis. Energy Reports . 2021;7:4642–4658. doi: 10.1016/j.egyr.2021.07.020. [DOI] [Google Scholar]
  • 3.Faruque M. O., Rabby M. A. J., Hossain M. A., Islam M. R., Rashid M. M. U., Muyeen S. M. A comparative analysis to forecast carbon dioxide emissions. Energy Reports . 2022;8:8046–8060. doi: 10.1016/j.egyr.2022.06.025. [DOI] [Google Scholar]
  • 4.Government of Pakistan. Updated national determined contributions. 2021.
  • 5.Bokde N. D., Tranberg B., Andresen G. B. Short-term CO2 emissions forecasting based on decomposition approaches and its impact on electricity market scheduling. Applied Energy . 2021;281 doi: 10.1016/j.apenergy.2020.116061.116061 [DOI] [Google Scholar]
  • 6.Sun W., Li Z. An ensemble-driven long short-term memory model based on mode decomposition for carbon price forecasting of all eight carbon trading pilots in China. Energy Science & Engineering . 2020;8(11):4094–4115. doi: 10.1002/ese3.799. [DOI] [Google Scholar]
  • 7.Wu Q., Liu Z. Forecasting the carbon price sequence in the Hubei emissions exchange using a hybrid model based on ensemble empirical mode decomposition. Energy Science & Engineering . 2020;8:2708–2721. doi: 10.1002/ese3.703. [DOI] [Google Scholar]
  • 8.Yun P., Huang X., Wu Y., Yang X. Forecasting carbon dioxide emission price using a novel mode decomposition machine learning hybrid model of CEEMDAN-LSTM. Energy Science & Engineering . 2022:1–18. doi: 10.1002/ese3.1304. [DOI] [Google Scholar]
  • 9.Eckstein D., Künzel V., Laura Schäfer L., Winges M. Global climate risk index 2020: who suffers most from extreme weather events? Weather-Related Loss Events in 2018 and 1999 to 2018. 2020.
  • 10.Iqair. World’s most polluted countries and regions (historical data from 2018 to 2021) 2021. https://www.iqair.com/world-most-polluted-countries .
  • 11.Butt D., Myllyvirta L., Dahiya S. CO2 Emissions from Pakistan’s Energy Sector. 2021. https://energyandcleanair.org/wp/wp-content/uploads/2021/07/CO2-Emissions-from-Pakistans-Energy-sector_30_07_2021.pdf .
  • 12.Wu L., Liu S., Liu D., Fang Z., Xu H. Modelling and forecasting CO2 emissions in the BRICS (Brazil, Russia, India, China, and South Africa) countries using a novel multi-variable grey model. Energy . 2015;79:489–495. doi: 10.1016/j.energy.2014.11.052. [DOI] [Google Scholar]
  • 13.Pao H. T., Tsai C. M. Modeling and forecasting the CO2 emissions, energy consumption, and economic growth in Brazil. Energy . 2011;36(5):2450–2458. doi: 10.1016/j.energy.2011.01.032. [DOI] [Google Scholar]
  • 14.Chang H., Sun W., Gu X. Forecasting energy CO2 emissions using a quantum harmony search algorithm-based DMSFE combination model. Energies . 2013;6(3):1456–1477. doi: 10.3390/en6031456. [DOI] [Google Scholar]
  • 15.Auffhammer M., Carson R. T. Forecasting the path of China’s CO2 emissions using province-level information. Journal of Environmental Economics and Management . 2008;55(3):229–247. doi: 10.1016/j.jeem.2007.10.002. [DOI] [Google Scholar]
  • 16.Meng M., Niu D. X., Shang W. A small-sample hybrid model for forecasting energy-related CO2 emissions. Energy . 2014;64:673–677. doi: 10.1016/j.energy.2013.10.017. [DOI] [Google Scholar]
  • 17.Lin C. S., Liou F. M., Huang C. P. Grey forecasting model for CO2 emissions: a Taiwan study. Applied Energy . 2011;88(11):3816–3820. doi: 10.1016/j.apenergy.2011.05.013. [DOI] [Google Scholar]
  • 18.Pao H. T., Fu H. C., Tseng C. L. Forecasting of CO2 emissions, energy consumption and economic growth in China using an improved grey model. Energy . 2012;40(1):400–409. doi: 10.1016/j.energy.2012.01.037. [DOI] [Google Scholar]
  • 19.Amarpuri L., Yadav N., Kumar G., Agrawal S. Prediction of CO 2 emissions using deep learning hybrid approach: a Case Study in Indian Context. Proceedings of the 2019 Twelfth International Conference on Contemporary Computing (IC3); August 2019; Noida, India. IEEE; pp. 1–6. [Google Scholar]
  • 20.Lu I. J., Lewis C., Lin S. J. The forecast of motor vehicle, energy demand and CO2 emission from Taiwan’s road transportation sector. Energy Policy . 2009;37(8):2952–2961. doi: 10.1016/j.enpol.2009.03.039. [DOI] [Google Scholar]
  • 21.Paravantis J. A., Georgakellos D. A. Trends in energy consumption and carbon dioxide emissions of passenger cars and buses. Technological Forecasting and Social Change . 2007;74(5):682–707. doi: 10.1016/j.techfore.2006.05.005. [DOI] [Google Scholar]
  • 22.Rentziou A., Gkritza K., Souleyrette R. R. VMT, energy consumption, and GHG emissions forecasting for passenger transportation. Transportation Research Part A: Policy and Practice . 2012;46(3):487–500. doi: 10.1016/j.tra.2011.11.009. [DOI] [Google Scholar]
  • 23.Al-mulali U., Normee Che Sab C. Energy consumption, pollution and economic development in 16 emerging countries. Journal of Economics Studies . 2013;40(5):686–698. doi: 10.1108/jes-05-2012-0055. [DOI] [Google Scholar]
  • 24.Wang T., Li H., Zhang J., Lu Y. Influencing factors of carbon emission in China’s road freight transport. Procedia-Social and Behavioral Sciences . 2012;43:54–64. doi: 10.1016/j.sbspro.2012.04.077. [DOI] [Google Scholar]
  • 25.Zhang C., Nian J. Panel estimation for transport sector CO2 emissions and its affecting factors: a regional analysis in China. Energy Policy . 2013;63:918–926. doi: 10.1016/j.enpol.2013.07.142. [DOI] [Google Scholar]
  • 26.Lin F.-L., Inglesi-Lotz R., Chang T. Revisit coal consumption, CO2 emissions and economic growth nexus in China and India using a newly developed bootstrap ARDL bound test. Energy Exploration & Exploitation . 2018;36(3):450–463. doi: 10.1177/0144598717741031. [DOI] [Google Scholar]
  • 27.Pao H. T., Tsai C. M. CO2 emissions, energy consumption and economic growth in BRIC countries. Energy Policy . 2010;38(12):7850–7860. doi: 10.1016/j.enpol.2010.08.045. [DOI] [Google Scholar]
  • 28.Alam M. J., Begum I. A., Buysse J., Rahman S., Van Huylenbroeck G. Dynamic modeling of causal relationship between energy consumption, CO2 emissions and economic growth in India. Renewable and Sustainable Energy Reviews . 2011;15(6):3243–3251. doi: 10.1016/j.rser.2011.04.029. [DOI] [Google Scholar]
  • 29.Pao H.-T., Yu H. C., Yang Y. H. Modeling the CO2 emissions, energy use, and economic growth in Russia. Energy . 2011;36(8):5094–5100. doi: 10.1016/j.energy.2011.06.004. [DOI] [Google Scholar]
  • 30.Pao H.-T., Tsai C. M., Yang Y.-H. Multivariate Granger causality between CO2 emissions, energy consumption, FDI (foreign direct investment) and GDP (gross domestic product): evidence from a panel of BRIC (Brazil, Russian Federation, India, and China) countries. Energy . 2011;36(1):685–693. doi: 10.1016/j.energy.2010.09.041. [DOI] [Google Scholar]
  • 31.Ozcan B. The nexus between carbon emissions, energy consumption and economic growth in Middle East countries: a panel data analysis. Energy Policy . 2013;62:1138–1147. doi: 10.1016/j.enpol.2013.07.016. [DOI] [Google Scholar]
  • 32.Zhang X. P., Cheng X. M. Energy consumption, carbon emissions, and economic growth in China. Ecological Economics . 2009;68(10):2706–2712. doi: 10.1016/j.ecolecon.2009.05.011. [DOI] [Google Scholar]
  • 33.Cowan W. N., Chang T. Y., Inglesi-Lotz R., Gupta R. The nexus of electricity consumption, economic growth and CO2 emissions in the BRICS countries. Energy Policy . 2014;66:359–368. doi: 10.1016/j.enpol.2013.10.081. [DOI] [Google Scholar]
  • 34.Bashir M. F., Ma B., Shahbaz M., Jiao Z. The nexus between environmental tax and carbon emissions with the roles of environmental technology and financial development. PLoS One . 2020;15(11) doi: 10.1371/journal.pone.0242412.e0242412 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Box G., Jenkins G., Reinsel G. 4. Time Series Analysis . Hoboken, NJ, USA: Wiley; 2008. [Google Scholar]
  • 36.Daniyal M., Tawiah K., Muhammadullah S., Opoku-Ameyaw K. Comparison of conventional modeling techniques with the neural network autoregressive model (NNAR): application to COVID-19 data. Journal of Healthcare Engineering . 2022;2022:9. doi: 10.1155/2022/4802743.4802743 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Anvari S., Tuna S., Canci M., Turkay M. Automated Box–Jenkins forecasting tool with an application for passenger demand in urban rail systems. Journal of Advanced Transportation . 2016;50(1):25–49. doi: 10.1002/atr.1332. [DOI] [Google Scholar]
  • 38.Hyndman R. J., Anthanapoulous G. 2. Forecasting: Principles and Practice . Australia: Monash University; 2018. [Google Scholar]
  • 39.RPubs by RStudio. Why TBATS? 2022. https://rpubs.com/chenx/tbats_notes .
  • 40.Basri K I., Sumitra I. D. Comparison of forecasting the number of outpatients visitors based on naïve method and exponential smoothing. IOP Conference Series: Materials Science and Engineering . 2019;662 doi: 10.1088/1757-899X/662/4/042002.42002 [DOI] [Google Scholar]
  • 41.Ahuja S., Kumar A. Expectation-based probabilistic naive approach for forecasting involving optimized parameter estimation. Arabian Journal for Science and Engineering . 2022:1–8. doi: 10.1007/s13369-022-06819-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Arumugam R., Raji R. A Markov Model for Prediction of Corona Virus COVID-19 in India-A Statistical Study. Xi’an Dianzi Keji Daxue Xuebao/Journal of Xidian University . 2020;14:1422–1426. [Google Scholar]
  • 43.Jain G., Mallick B. A Study of Time Series Models ARIMA and ETS . Rochester, NY, USA: SSRN; 2017. [DOI] [Google Scholar]
  • 44.Tealab A., Hefny H., Badr A. Forecasting of nonlinear time series using ANN. Future Computing and Informatics Journal . 2017;2(1):39–47. doi: 10.1016/j.fcij.2017.05.001. [DOI] [Google Scholar]
  • 45.Benrhmach G., Namir K., Namir A., Bouyaghroumni J. Nonlinear autoregressive neural network and extended kalman filters for prediction of financial time series. Journal of Applied Mathematics . 2020;2020:6. doi: 10.1155/2020/5057801.5057801 [DOI] [Google Scholar]
  • 46.Hopfield J. J. Artificial neural networks. IEEE Circuits and Devices Magazine . 1988;4(5):3–10. doi: 10.1109/101.8118. [DOI] [Google Scholar]
  • 47.Srinivasa K., Thilagam P. S. Multi-layer perceptron based fake news classification using knowledge base triples. Applied Intelligence . 2022:1–12. doi: 10.1007/s13369-022-06819-0. [DOI] [Google Scholar]
  • 48.Deyasi A., Bhattacharjee A. K., Mukherjee S., Sarkar A. Multi-layer perceptron based comparative analysis between CNTFET and quantum wire FET for optimum design performance. Solid State Electronics Letters . 2021;3:42–52. doi: 10.1016/j.ssel.2021.12.003. [DOI] [Google Scholar]
  • 49.Chai S. S., Cheah W. L., Goh K. L., Chang Y. H. R., Sim K. Y., Chin K. O. A multilayer perceptron neural network model to classify hypertension in adolescents using anthropometric measurements: a cross-sectional study in sarawak, Malaysia. Computational and Mathematical Methods in Medicine . 2021;2021:11. doi: 10.1155/2021/2794888.2794888 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Qureshi M., Daniyal M., Tawiah K. Comparative evaluation of multi-layer perceptron approach with conventional ARIMA in modeling and prediction of COVID-19 daily death cases. Journal of Healthcare Engineering . 2022;2022:p. 7. doi: 10.1155/2022/4864920.4864920 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.forecastHybrid. Convenient Functions for Ensemble Time Series Forecasts. 2020. https://cran.r-project.org/web/packages/forecastHybrid/forecastHybrid.pdf .
  • 52.Bokde N. D., Yaseen Z. M., Andersen G. B. ForecastTB—an R package as a test-bench for time series forecasting—application of wind speed and solar radiation modeling. Energies . 2020;13(10):p. 2578. doi: 10.3390/en13102578. [DOI] [Google Scholar]
  • 53.Tuszynski J., Dietze M. caTools: moving window statistics, GIF, Base64, ROC AUC, etc. 2021. https://search.r-project.org/CRAN/refmans/caTools/html/00Index.html .
  • 54.Trapletti A., Hornik K. Tseries: time series analysis and computational finance. 2022. https://cran.r-project.org/web/packages/tseries/tseries.pdf .
  • 55.Ripley B., Venables W. Feed-forward neural network and multinomial log-linear models. 2022. https://cran.r-project.org/web/packages/nnet/nnet.pdf .
  • 56.Ulrich J. TTR: Technical Trading Rules. 2021. https://cran.r-project.org/web/packages/TTR/TTR.pdf .
  • 57.Zeileis A., Leisch F., Hornik K., Kleiber C. Strucchange: an R package for testing for structural change in linear regression models. Journal of Statistical Software . 2002;7(2):1–38. doi: 10.18637/jss.v007.i02. [DOI] [Google Scholar]
  • 58.Ullah S., Ozturk I., Usman A., Majeed M. T., Akhtar P. On the asymmetric effects of premature deindustrialization on CO2 emissions: evidence from Pakistan. Environmental Science and Pollution Research . 2020;27(12) doi: 10.1007/s11356-020-07931-0.13692 [DOI] [PubMed] [Google Scholar]
  • 59.Kirikkaleli D., Adebayo T. S. Do public-private partnerships in energy and renewable energy consumption matter for consumption-based carbon dioxide emissions in India? Environmental Science and Pollution Research . 2021;28(23) doi: 10.1007/s11356-021-12692-5.30139 [DOI] [PubMed] [Google Scholar]
  • 60.Köne A. Ç., Büke T. Forecasting of CO2 emissions from fuel combustion using trend analysis. Renewable and Sustainable Energy Reviews . 2010;14(9):2906–2915. doi: 10.1016/j.rser.2010.06.006. [DOI] [Google Scholar]
  • 61.Newell R. G. Federal Climate Policy 101: Reducing Emissions. 2021.
  • 62.I M F B. More countries are pricing carbon, but emission are too cheap. 2022. https://www.imf.org/en/Blogs/Articles/2022/07/21/blog-more-countries-are-pricing-carbon-but-emissions-are-still-too-cheap .

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data consist of CO2 emissions in Pakistan from 1960 to 2018, available at https://databank.worldbank.org/.


Articles from Journal of Environmental and Public Health are provided here courtesy of Wiley

RESOURCES