Abstract
The spread of COVID-19 implied a large and fast increase of demand for intensive care services. To face this increase in demand, health care systems need to adapt their response by increasing hospital beds, intensive care unit (ICU) capacity and by (re-)deploying doctors and other personnel. This paper proposes a forecast approach based on the Vector Error Correction model for the daily counts of hospitalized patients with symptoms and of patients in ICU, using publicly available data on the current COVID-19 outbreak in Italy, Switzerland and Spain. The level of analysis is the local government managing the health care system response, which corresponds to regions for Italy. The one-week-ahead forecasts are validated with out-of-sample data over successive weeks; they are found to provide timely and robust prediction of ICU capacity needs in Lombardy, the most-affected Italian region, starting from the sample of the first 2 weeks of data. The same methodology is successfully validated on other Italian regions, Switzerland and Spain. This approach may be used in other countries/regions/provinces to help adapt the health care system response to COVID-19 (or other similar disease); for this purpose, the open-source software code to produce the forecasts is provided with the paper.
1. Introduction
The human-to-human transmission of SARS-CoV-2 has spread around the world, prompting the World Health Organization to characterise it as a pandemic on March 11, 2020, see [1]. The European Centre for Disease Prevention and Control (ECDC) reported 2, 520, 522 infected between 31 December 2019 and April 22, 2020, including 176, 786 deaths, affecting more than 210 countries from all five continents, see the ECDC Epidemiological Update.
Several regions and countries have introduced strong measures to mitigate contagion through rigid social-distance measures and partial or total lock-downs. Italy has been the country with an early onset of the pandemic, with the region of Lombardy particularly affected by COVID-19.
Each local area experiences specificities in the spread of this epidemic: the spread of the virus is likely to differ between regions with different population density, age structure, and culture or habits influencing social proximity; similarly, different characteristics of the health care system and different approaches in testing for SARS-CoV-2 and related confinement approaches are expected to affect the diffusion of the disease. These specificities need to be accounted for in forecasting models.
Different indicators on the spread of the epidemic may have different levels of reliability. Indicators frequently reported are number of tested, infected, hospitalised, deceased and patients in intensive care units. Many of these indicators may reflect regional differences in reporting and suffer from selection bias. For instance the number of infected may be biased because of undetected or untested cases; the number of deaths due to Covid-19 may be biased by different practice in testing policies of deceased patients; see [2].
From the perspective of the health-system response to the local epidemic, the most important indicators are the number of patients requiring ICU admission, and their population at risk, associated to the hospitalized patients with COVID-19 symptoms. Both these indicators appear to be less prone to misreporting or selection bias than the number of infected, because hospitalisations are officially recorded following standardised protocols; this makes the corresponding time series more reliable, see [3]. These time series provide key information to adapt hospital facilities and staff in response to the evolution of the outbreak; see [4] for a classification of various patients groups and flows among them.
Moreover, the ECDC [5] indicated bed occupancy in ICU as the reference indicator of seriousness and of impact of the COVID-19 disease, both at community (region) and at hospital level. All these element indicate that ICU bed occupancy and its forecasts should be a central piece of information to craft the local/regional health care response, shifting or enhancing ICU capacity accordingly, [6]. This indicates a clear need for mathematical, econometric and statistical models to obtain timely short-term forecasts of ICU capacity, for local governments to plan—and quickly adjust—their health care resources.
Within the econometrics and statistics literature on COVID-19, forecasting has so far mainly focused on the evolution of infected people, deaths and hospitalized patients, including the ones in ICU. Most forecasting approaches in this area have been so far univariate, i.e. they have used the dynamics of the single time series of the outcome of interest to produce multi-step-ahead predictions.
Using only information contained in a single time series as in univariate approaches imposes limitations to the associated forecasts. One of them is caused by ignoring possible co-movements with other available time-series. Adding other time series as predictors, similarly to ARIMAX models, increases the explanatory power of the model. However, in order to use these models for multi-step-ahead predictions, one needs to forecast also the predictors; this is equivalent to a multivariate time-series approach, which is the one proposed in this paper.
For example, Remuzzi and Remuzzi [7] forecast the number of infected patients and hospitalized ICU admissions using an exponential trend model at the onset of the epidemic. Other examples use phenomenological models, [see 8, 9], exponential smoothing, trend and seasonal models [10], ARIMA models [11, 12] or state space models for disease transmission over time within Susceptible-Infectious-Recovered (SIR)-like models [13].
Unlike previous approaches, the present paper proposes a bivariate structural model to forecast the daily need of ICUs beds (IC) jointly with the number of patients hospitalized with COVID-19 symptoms (HwS), which approximates the population at risk for IC. In this approach, the joint dynamics of IC and HwS is modeled and used to produce multi-step-ahead predictions.
The model exploits the fact that these two time series are linked, theoretically as well as empirically; in technical language, they are cointegrated, as detailed below. The proposed forecast tool is obtained as the prediction of an estimated multivariate Vector Error Correction Model (VECM) to describe the joint dynamics of IC and HwS, see Engle and Granger [14], Johansen [15, 16].
There are three main advantages of using a VECM approach to forecast IC. First, it embodies the structural relation existing between the level of IC and the level of HwS: hospitalised patients may worsen their health status within a few days in hospital, and require ICU acute treatment. The HwS series de facto proxies the population at risk of treatment in ICU, and it is a natural predictor of IC flows.
Second, the use of HwS as a predictor summarises many external factors affecting ICU. In fact IC depends on hospital admission policies (ICU turnover, co-presence in ICU of COVID-19 and non-COVID-19 patients) and/or public health measures put in place during the epidemic (social distancing, lockdown). These factors all affect HwS prior to affecting IC, and using the information in HwS can help summarise the influence of all these factors on the forecast of IC.
Third, the VECM model presents well interpretable parameters, and it allows flexibility in the specification of the deterministic trend component. This permits to define various specifications, which can be fitted and compared. In particular, the strength of the cointegration relation, the speed of adjustment between series towards the equilibrium, the number of significant lags of the IC equations are all useful indications on the goodness of fit of the model. Out-of-sample forecasts can also be used to compare alternative models.
The aim of the present paper is to propose the VECM class of models as a predictive tool for the forecast of IC demand. Given the large class of alternative models, the present paper does not attempt to run an exhaustive forecast comparison with all alternative predictive models; this comparison is bound not to be exhaustive, both in terms of alternative models and of datasets. Rather, in line with the forecasting literature [17], it uses the Random Walk model as a benchmark reference forecast, on a selection of datasets from Italy, Spain and Switzerland.
In principle, the Random Walk benchmark allows to compare any given model with all present and future competing models and forecast techniques. This benchmark is found to be effective in the applications in the paper, as it is fund to outperform the present approach close to the flat peak of ICU demand. Note that prediction of the flat period is not troublesome, both from a health care system response viewpoint and from the methodological perspective. In fact, on the one hand, the health care system does not require immediate re-adjustments of resources. On the other hand, the proposed methodology is not expected to outperform the Random walk model when ICU demand is flat.
The bivariate nature of the proposed approach, while an advancement with respect to existing univariate time series techniques, falls short of incorporating (and predicting) the time series of other relevant population groups, such as the susceptible, infected, diagnosed, recognised, threatened, healed, ailing and extinct, see Giordano et al. [18]. This is done in micro-simulation models widely adopted by epidemiologists.
The purpose of such models is to simulate and explore the impact of non-pharmaceutical interventions to reduce (COVID-19) mortality and healthcare demand. These approaches are broadly used to predict aggregate trends in disease incidence and mortality under alternative scenarios to guide critical policy decisions. However, they typically require detailed parameterizations of high-resolution information on population density, progression of the disease and travel patterns. Examples are (area’s) average age, household distribution size, class sizes, staff-student ratios (to model transmission events) or age-stratified proportion of infections, delay from onset of symptoms hospitalisation, duration of hospitalisation, infection fatality ratio (to model the disease progression and healthcare demand), see Ferguson et al. [19]
While these detailed data are not available to the authors in the present case hence preventing the application of this approach, the multivariate nature of the proposed VECM class of models may allow extensions of the present approach to other time series representing these groups. These extensions appear a promising area for future research.
The VECM class of models is applied in this paper to the daily time series of IC and HwS, with the aim to predict IC one week ahead. These model could be extended to include information from lower-frequency time series, as in [20] and [21], if this type of information is available. This is however, not the case in the applications presented in this paper. Another possible alternative to the VECM class of models could be to combine forecast obtained from several econometric time series models [22, 23], also in the context of mixed frequency data [24, 25]. This option, however, would prevent interpretability of model coefficients, which appears an essential issue, especially in a health policy context. Moreover, adapting the response of the health-system requires ease-of-use and simplicity of the forecasting tool for the health authorities; both these aspect are better addressed by the present recourse to a single class of models.
This paper applies the forecast tools to the daily counts of hospitalized patients with symptoms and of patients in ICU, using open data from the current COVID-19 outbreak in Italian regions, as well data from Switzerland and Spain. The level of analysis is taken to be regions in Italy, because they are responsible for the management the health care system. Results are presented for the most affected regions in Northern Italy, such as Lombardy, Veneto, Emilia-Romagna. The proposed approach is validated also on data for Switzerland and Spain.
The out-of-sample daily forecast accuracy is presented and compared across different models within the VECM class, using standard forecasting accuracy indices, [17]. The benchmark model is taken to be the Random Walk, as often done in the literature. Results are presented both for the overall sample of data, and for a set of rolling sequence of weeks. The results show a good forecast performance for the proposed models.
2. Data
The outbreak in northern Italian regions had similar evolutions, which are well represented by the one of Lombardy (10 million inhabitants), which is the first and the most affected Italian region. Many hospitals in Lombardy have undergone dramatic changes after the onset of COVID-19: many wards have been reallocated to host COVID-19 patients, outpatients activities suspended, elective surgeries postponed, new hirings of doctors and nurses.
The first Italian patient tested positive on February 20, 2020 in Lombardy [see 4]. The COVID-19 ICU Lombardy Network was established on February 21 to coordinate the critical care response to the outbreak, with the identified main priority to increase ICU capacity. The Italian Civil Protection started publishing daily data at national, regional and provincial level on the evolution of the COVID-19 contagion in the country from February 24, 2020, see the dedicated Github repository, including numbers of infected, patients hospitalized with COVID-19 symptoms and patients in ICU. The initial ICU capacity of 842 beds in Lombardy was increased to approximately 1755 by the middle of April.
The Italian national government and the one of Lombardy adopted successive restrictive measures to contain the spread of the virus, with increasing intensity of social distancing. The major decrees are dated March 1 (“Decreto Zone Rosse”—red-zones Decree), March 9 (“Decreto #IoRestoaCasa”—#IStayHome Decree) and March 22 (“Decreto Chiusa Italia”—Locked-Italy Decree).
Fig 1 shows the time evolution of the main indicators for Lombardy. The number of tested and of infected patients grows exponentially, the number of patients needing hospitalization grows rapidly, causing distress to the health care system. The right panel of Fig 1 shows the increase of patients hospitalized with symptoms (HwS) and of inpatients in ICU (IC).
The right panel in Fig 1 plots the time series of ic and hws (the log of IC and HwS). Note the parallel trend between the two, which suggests that the two may have a common trend. This is the same as saying that they are contegrated in the sense of Engle and Granger [14], i.e. that there exists a linear combination of the two that has no trend. The prediction of IC for Lombardy is the main empirical application of the present paper.
The time series of other Italian regions corresponding to Fig 1 show similar patterns, albeit with a different starting date; they are not graphed here for brevity. Similar patterns are also observed for data from Spain and Switzerland. The Italian regions of Veneto and Emilia-Romagna, as Spain and Switzerland, are used to validate the proposed approach.
The available data for Lombardy covers the period February 24–April 5 2020, split into a training sample (February 28–March 29) and an out-of-sample period (March 30–April 5). Data from February 24 to 27 was used to initialize differences and lags, i.e. as pre-estimation period. This sample and its divisions are called the ‘full sample’ in the following. A lag length of k = 4 was used, resulting in 3 lagged changes appearing in Eq (1). The pre-estimation period was chosen so as to align the estimation sample in the first and second stage.
3. Method
The forecasts are produced using the VECM. The VECM is a special case of the Vector AutoRegressive (VAR) models, and allows for the presence of stationary and nonstationary variables, see Hendry [26]. VECMs incorporate one (or more) underlying long-run equilibrium relation(s) among the time series, called the cointegration relation(s), as well as adjustment dynamics that corrects towards equilibrium. Finally, they allow the growth rates of lagged variables to influence each other.
3.1 Vector error correction models
The nature of the trends in the system can be both deterministic, i.e. a function of time, and stochastic, as represented by random walks. A p × 1 vector xt of variables observed at times t = 1, 2, … that is stationary in its first differences Δxt = xt − xt−1 is called integrated of order 1, or I(1). The vector xt is generated by a VECM if it satisfies Eq (1) below
(1) |
where εt is a p × 1 vector of random variables (not necessary Gaussian), i.i.d. over time, with mean 0 and positive variance-covariance matrix Ω. Here Δxt = (Δx1,t, …, Δxp,t)′ is the p × 1 vector of the first differences, Dt is an nD×1 vector of deterministic variables; Dt is taken here to include a constant 1 and a linear trend t. The Γj are p × p parameter matrices with i, ℓ element Γj,iℓ describing the influence of Δxℓ,t−j on Δxi,t.
The term β′xt−1 cointains r (in the present case r = 1) cointegrating relations among xt−1, and α measures adjustment towards equilibrium. Eq (1) is referred to as the VECM, in reference to the adjustment mechanism. The long-run equilibrium (cointegrating) relation β′xt−1, properly corrected for Dt, is stationary, so that (1) provides a balanced equation: the l.h.s. is stationary, similarly to the terms εt, β′xt−1 and Δxt−j on the r.h.s., possibly corrected for Dt.
When r = 1, and the long-run equilibrium relation β′ in (1) is normalised setting the coefficient to x1t−1 equal to 1, β′ = (1, −β2)′, the cointegrating relation can also be written as . Hence, correcting for Dt,
(2) |
The β2 coefficients (only one coefficient in the present case) describe the proportionality factor between x1t and x2t in the long-run, where in (2) x1t and x2t may be trending (nonstationary) while their deviation wt−1 is stationary. In (2), is a design matrix that can select a subset of Dt in .
3.2 Estimation
Estimation of (1) can be done in two stages following Engle and Granger [14]. The first stage is performed by estimating (2) by regression of x1t−1 on x2,t−1 and ; this generates or, equivalently . These quantities are substituted in (1) in the second stage, which estimates the remaining parameters by regression of Δxt on β′xt−1, Δxt−j for j = 1, …, k − 1 and , where is a design matrix that selects a subset of Dt in .
The specification of the deterministic terms in , m = 1, 2, in the two stages is labeled d(i, j) in the following, where i refers to the first stage and j to the second one. Consider first i; when i = 2, both the constant and the trend t are included in , while when i = 1 only the constant is included. Finally when i = 0, no deterministic component is included in (2). The same convention is used for j in relation to and (1). These models are the ones proposed in Johansen [16] (eq. (5.13) to (5.17), page 81); more details on the properties of the model are provided in Berta et al. [27].
The time series of ic and hws are clearly non-stationary and apparently show the same common trend (Fig 1). This motivates the use of the VECM class of models d(i, j) introduced above. Two of these models appeared of special relevance, namely the d(2, 1) and d(1, 1) models, because they both are characterized by growth rates with constant and possibly non-zero expectations, see Berta et al. [27].
When x1,t = ict (the log of ICt) and x2,t = hwst (the log of HwSt), the interpretation of the coefficients is as follows. The cointegrating relation (2) reads ict−1 = β2 hwst−1 + a0 + a1 t + wt−1 for j = 2, where for j = 1, a1 = 0 and for j = 0, a1 = a2 = 0. Taking exponentials, , where γt = exp(a0 + a1t) is the coefficient of proportionality and ηt = exp(wt) is an error term.
This relation is a relation of semi-proportionality when 0 < β2 < 1 i.e. IC moves less than proportionally with respect to HwS. On the contrary, when β2 = 1, IC are directly proportional to HwS. The constant of proportionality γt equals 1 when a0 = a1 = 0. When a1 ≠ 0, γt changes over time; this reflects shifts in the transition to ICU admission from symptomatic status, as well as possibly the severity of the disease.
The adjustment Eq (1) describes how changes of x1,t = ict and x2,t = hwst adjust to deviations wt (equilibrium-correction), as well as to past changes. The analysis of the i, ℓ coefficients in Γj,iℓ describing the influence of Δxℓ,t−j on Δxi,t allows to verify if the dynamics of x2,t is autonomous, and how it affects Δx1,t via its lagged values Δx2,t−j.
3.3 Forecast performance
The forecast performance of the model is evaluated by means of standard forecast accuracy indices, namely the Mean Absolute Error (MAE), , and Mean Absolute Percentage Error (MAPE), , where predictions are made at time t, yt + j indicates the time series to be predicted at time t + j, and its i-th predictor based on information up to time t; see e.g. [17].
Because the forecast are made on the log of the ICt series, its MAE also corresponds to ‘ln Q’ metric indicator of Tofallis [28] and provides an approximation to the MAPE for the corresponding ICt forecast. In fact, one has |log(at/bt)| ≈ |(at − bt)/bt| where the approximation hold for |(at − bt)/bt| (possibly much) lower than 1, where at is the predictor of ICt and bt is ICt. This implies that the average of |log(at/bt)| (i.e. the MAE for forecasts of ict = log ICt) is approximately equal to the average of |(at − bt)/bt|, (which is the MAPE for forecasts of ICt), when the predictor of ICt is constructed from the one of ict = log ICt.
Moreover, to allow comparability with other predictions, the forecast accuracy indices of the proposed models are compared with the ones for the Random Walk model, which is often used as the benchmark in the forecast literature, see e.g. [17]. Specifically, the relative version of MAE and MAPE are labelled MAER and MAPER in the following; they equal MAERi = MAEi/MAE0 and MAPERi = MAPEi/MAPE0, where the index 0 corresponds to the Random Walk. Values of MAERi and MAPERi below one are indication of better performance of model i as compared with the Random Walk.
A practical procedure to select a forecast model is to estimate all models on the available (rolling or full) estimation sample(s), retaining one week of data for out-of-sample forecast comparison. If a preferred model is needed, one can choose the model that gives minimal MAE and MAPE in the out-of-sample forecasts for the last week.
To better validate estimates and to assess the flexibility of the proposed models, a set of rolling training samples and out-of-sample periods were considered. The first training sample covered the first two weeks of data, i.e. from February 24 to March 8 for Lombardy; out-of-sample forecast were made for the following seven days. Additional training samples of data were considered, each one obtained by adding one extra week of data. The last out-of-sample forecast exercise coincides with the full training sample. In case more than three weeks of data were available, the first week of data was discarded to allow for a ‘burn-in’ period. The same setup of full sample and rolling samples analyses was used for the other Italian regions, Switzerland and Spain.
4. Results
The models d(i, j) described in the previous section were applied to data on x1,t = ict and x2,t = hwst from Italian regions, Switzerland and Spain. Results are presented in this section, starting with Lombardy.
4.1 Model estimation for Lombardy
Table 1 reports estimates of the model parameters for the d(1, 1), which is selected here as the d(i, j) model with lowest MAE and MAPE over this sample, as detailed in the following susection.
Table 1. Estimates for the d(1, 1) model for Lombardy, estimation period March 2 to March 29 2020.
Engle Granger first stage, Eq (2) | ||||||||||
Cointegrating relation: | ||||||||||
Engle Granger second stage, Eq (1) | ||||||||||
eq Δict | eq Δhwst | |||||||||
predictors | coeff | se | t-ratio | p-value | coeff | se | t-ratio | p-value | ||
-0.28 | 0.11 | -2.5 | 0.01 | ** | 0.69 | 0.24 | 2.81 | 0.00 | *** | |
Δict−1 | -0.02 | 0.13 | -0.12 | 0.90 | -0.66 | 0.29 | -2.33 | 0.02 | * | |
Δhwst−1 | -0.04 | 0.08 | -0.53 | 0.59 | 0.18 | 0.18 | 1.02 | 0.31 | ||
Δict−2 | 0.16 | 0.06 | 2.59 | 0.01 | ** | 0.26 | 0.13 | 1.97 | 0.05 | * |
Δhwst−2 | 0.07 | 0.06 | 1.18 | 0.24 | 0.30 | 0.12 | 2.42 | 0.02 | * | |
Δict−3 | 0.38 | 0.06 | 6.27 | 0.00 | *** | 0.44 | 0.13 | 3.32 | 0.00 | *** |
Δhwst−3 | 0.18 | 0.05 | 3.78 | 0.00 | *** | 0.24 | 0.11 | 2.22 | 0.03 | * |
1 | 0.00 | 0.01 | -0.44 | 0.66 | 0.00 | 0.02 | 0.01 | 0.99 |
*, **, ***: significant at 5%, 1%, 0.1% level.
Several remarks are in order. The β2 coefficient is estimated equal to 0.74, showing a less-than-proportional relations between HwS and IC. The constant of proportionality γt is approximately 1.27, higher than 1. The adjustment of Δict to the disequilibrium error is strong and significant, (−0.28). The one of Δhwst is larger in magnitude and also significant. This indicates that both Δict and Δhwst adjust towards equilibrium.
The analysis of the Γj,iℓ coefficients for the equation of IC patients shows that the growth rates of IC and HwS lagged 2 or 3 days significantly help to predict the growth rate of IC. This shows the value added of the multivariate approach over univariate ones, where HwS helps to predict IC both because it enters the equilibrium-correction term and via the growth rates lagged 2 or 3 days. Note that the dynamic equation for HwS is also useful in computing multi-step-ahead predictions for IC.
The fit of the corresponding first stage regression is plotted in Fig 2. The graph also includes the pre-estimation sample between February 24 to March 1 and the out-of-sample forecast period from March 30 to April 5, 2020. Note that Fig 2 represents a cross-section relation and not a predictive dynamic equation; in fact the contemporaneous value of hwst appears as an explanatory variable. This equation represents, instead, the equilibrium relation between ict and hwst towards which Δict adjusts via the equilibrium correction Eq (1).
The d(1, 1) model was then used to forecast ict+j setting t at March 29, 2020, and considering j = 1, …, 7, which correspond to March 30–April 5, 2020. The forecasts of ict + j and Δict + j are graphed in Fig 3, respectively in the left and right graphs; the corresponding out-of-sample forecast intervals, obtained as ± 3 times the model forecast standard error, are also reported. The model graphically shows a good forecast performance.
4.2 Forecasting accuracy for Lombardy
The MAE and MAPE indices—as well as their relative versions with respect to the Random Walk MAER and MAPER— were calculated in the out-of-sample period both for models d(1, 0), d(1, 1), d(2, 0), d(2, 1); results are reported in Table 2. In the out-of-sample period of March 30 to April 5, MAE and MAPE are smallest for the d(1, 1) model, which is hence the preferred model. The absolute forecast percentage error for this model of ICt is approximately 4%.
Table 2. Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE) along with their relative versions with respect to the Random Walk model (MAER and MAPER, respectively) for models d(i, j) over different training samples and forecast periods.
forecast week: | week 3 | week 4 | ||||||
model: | d(1, 0) | d(1, 1) | d(2, 0) | d(2, 1) | d(1, 0) | d(1, 1) | d(2, 0) | d(2, 1) |
MAE | 0.04 | 0.64 | 0.34 | 0.56 | 0.04 | 0.09 | 0.01 | 0.06 |
MAER | 0.10 | 1.62 | 0.87 | 1.42 | 0.15 | 0.35 | 0.06 | 0.25 |
MAPE | 0.01 | 0.10 | 0.05 | 0.09 | 0.01 | 0.01 | 0.00 | 0.01 |
MAPER | 0.09 | 1.49 | 0.80 | 1.30 | 0.15 | 0.33 | 0.06 | 0.24 |
forecast week: | week 5 | week 6 | ||||||
model: | d(1, 0) | d(1, 1) | d(2, 0) | d(2, 1) | d(1, 0) | d(1, 1) | d(2, 0) | d(2, 1) |
MAE | 0.05 | 0.06 | 0.08 | 0.08 | 0.05 | 0.04 | 0.06 | 0.05 |
MAER | 0.55 | 0.62 | 0.79 | 0.86 | 4.16 | 3.26 | 5.32 | 4.38 |
MAPE | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 |
MAPER | 0.54 | 0.61 | 0.78 | 0.85 | 4.16 | 3.26 | 5.32 | 4.38 |
Bold entries correspond to the model with smallest MAE or MAPE for the given forecast week.
The rolling out-of-sample performance of the models is also reported in Table 2, which shows that the best performance was obtained by the d(1, 0) model for weeks 3 and 5, model d(2, 0) for week 4 and model d(1, 1) for week 6. The level of absolute forecast percentage error of ICt for the best models is between 1% and 5% across all weeks.
It can be noted that models with fewer parameters work best in the early weeks, where there is less information available. The best specification of the deterministic component appears to change over weeks; this suggest that more flexible specifications of this component may be useful over (future) longer spans of data.
Overall, both the absolute forecast error (MAE) and the absolute relative forecast error (MAPE) of the best models are small, indicating a satisfactory forecast accuracy. The out-of-sample performance of the best model for weeks 3, 4 and 5 for Lombardy are reported in Figs 4, 5 and 6.
Table 2 also reports MAER and MAPER. It can be observed that the best d(i, j) model outperforms the Random Walk model expect for week 6, where the level evolution of the ICt time series is flat, and the flat prediction of the Random Walk has lower absolute average forecast errors. Note that the level of MAE and MAPE of the best d(i, j) model in weeks 6 is 4%, within the range of values observed in previous weeks (1% to 5%). This shows that week 6 is the period where the Random Walk has an improved forecast performance rather than a period of lower performance of the best d(i, j) model.
4.3 External validation
Validation of the models was performed considering the data available at the time of writing (April 2020) for the other Italian regions of Emilia Romagna and Veneto, as well as for Spain and Switzerland. For Emilia Romagna and Veneto the forecast exercise is performed in week 6. The best d(i, j) models are d(1, 1) for Emilia Romagna and d(2, 1) for Veneto in term of MAE and MAPE. Fig 7 plots the level forecasts for Emilia Romagna and Veneto of these model. Both show a good forecast performance.
A further validation of the model was performed considering the data for Switzerland and Spain until April 25. The best d(i, j) models are d(2, 1) for Spain and d(1, 1) for Switzerland in term of MAE and MAPE. Fig 8 shows forecasts of these models, again showing a good forecast performance.
5. Conclusions
Unlike other methodologies and approaches used for analyzing COVID-19 data, the VECM approach proposed here relies on real-time daily official data as inputs, exploits the multivariate nature of the structural relationship between IC and HwS, and can be estimated over relatively short samples. Moreover, the model adapts over time not only to the evolution of contagion, but also to a number of local regional characteristics, such as hospital access policies and containment measures, as both are typically reflected in the time series of hospitalized patients with symptoms.
In the estimated VECM for Italian regions, parameters are significant and consistent with a priori expectations: the IC series evolves over time adjusting to its past values, to deviations from the log run equilibrium with HwS and to the lagged values of HwS. The level cointegrating relation often show an overall less-than-proportional relationship between HwS and IC.
The present study has some limitations. It focuses on the available data, provided at regional level for ICU and hence ignores local (cities, provinces) variation. This prevents to link the evolutions of the analyzed time series with local conditions (local distribution of age, co-morbidities) and/or to the distribution of hospitals’ characteristics. Moreover, the models and related forecasts do not consider issues related to reaching or approaching the threshold of maximum ICU capacity; this was due both to lack of available data on the thresholds and to the dynamic adaptation of the capacity to the increased needs induced by the pandemic in the case of Italy.
Despite these limitations, the VECM forecasting tool is found to offer reliable and timely forecasts for various Italian regions including Lombardy, Veneto and Emilia-Romagna. These results are confirmed by the validation on Spanish and Swiss aggregate data.
The reliability of the forecasts was confirmed also using different small incremental samples, where forecasts seven days ahead are found to be reliable already using only two weeks of training data since the onset of contagion. Forecast accuracy largely improves with increased availability of data. It was also found that flexible specifications of the deterministic component become increasingly useful using longer spans of data.
These forecasts appear important tools for health care system managers, who need to predict the needs of ICU beds in a fast way in the fast-rising upwards phase, allowing them to calibrate and adapt the response of the health care system to the epidemic (modifying staff resources, hospital facilities and resources), in a timely fashion.
The approach proposed in this paper may be used in other countries/regions to provide local health authorities with real-time forecast numbers of ICU beds, helping to shape the health care response and to prevent shortage of hospital resources (beds, ventilators, doctors and nurses), [19].
A free software program (in the form of a public R package called ‘Presize’, available here, is provided with this article. This can facilitate replication the proposed analysis in other countries, regions or provinces, even at higher granularity than the regional level, if appropriate in terms of health-system decision-making and if data are available.
Acknowledgments
Opinions and views expressed in this paper are those of the authors and do not necessarily reflect the ones of the institutions of affiliation.
The authors thank Camillo Rossi, MD (Healthcare Director at Spedali Civili di Brescia, Italy), for stimulating this research by requesting reliable short-run forecasts of ICU needs at the very beginning of the COVID-19 outbreak. The authors gratefully acknowledge useful comments from the Editor, two anonymous referees, Ian Vollbracht, Brian Doherty, and Helen Johnson from the European Commission and the ECDC respectively.
Data Availability
Data are publicly available at: https://github.com/pcm-dpc/COVID-19.
Funding Statement
Paolo Berta, Pietro Giorgio Lovaglio acknowledge funding from Regione Lombardia (2014IT16RFOP012). Paolo Paruolo and Stefano Verzillo received no specific funding for this work. The funder had no roles in our study design, data collection and analysis, decision to publish, and preparation of the manuscript.
References
- 1.WHO (2020). Director-General’s opening remarks at the media briefing on COVID-19—11 March 2020. Available on WHO Coronavirus disease 2019, Media resources, News website https://www.who.int/emergencies/diseases/novel-coronavirus-2019/events-as-they-happen Last accessed April 20, 2020.
- 2.Richardson, S. and D. Spiegelhalter (2020). Coronavirus statistics: what can we trust and what should we ignore? The Guardian (April 12).
- 3.Bartoszek, K., E. Guidotti, S. M. Iacus, and M. Okrój (2020). Are official confirmed cases and fatalities counts good enough to study the COVID-19 pandemic dynamics? a critical assessment through the case of Italy. Technical report. [DOI] [PMC free article] [PubMed]
- 4. Grasselli G., Pesenti A., and Cecconi M. (2020). Critical care utilization for the covid-19 outbreak in lombardy, italy: early experience and forecast during an emergency response. JAMA 323(16), 1545–1546. 10.1001/jama.2020.4031 [DOI] [PubMed] [Google Scholar]
- 5.ECDC (2020). Coronavirus disease 2019 (COVID-19) pandemic: increased transmission in the EU/EEA and the UK? Seventh update. Technical report, European Centre for Disease Prevention and Control, https://www.ecdc.europa.eu/sites/default/files/documents/RRA-seventh-update-Outbreak-of-coronavirus-disease-COVID-19.pdf.
- 6.Rodriguez-Llanes, J., R. Delgado, M. Pedersen, M. Meneghini, and P. González (2020). COVID-19: Surging critical care capacity across Europe. Technical report, https://www.researchgate.net/publication/340528505. [DOI] [PMC free article] [PubMed]
- 7. Remuzzi A. and Remuzzi G. (2020). COVID-19 and Italy: what next? The Lancet, 10.1016/S0140-6736(20)30627-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Roosa K., Lee Y., Luo R., Kirpich A., Rothenberg R., Hyman J., (2020). Real-time forecasts of the Covid-19 epidemic in China from February 5th to February 24th, 2020. Infectious Disease Modelling 5, 256–263. 10.1016/j.idm.2020.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Chowell G., Tariq A., and Hyman J. M. (2019). A novel sub-epidemic modeling framework for short-term forecasting epidemic waves. BMC Medicine 17(1), 164 10.1186/s12916-019-1406-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Petropoulos F. and Makridakis S. (2020). Forecasting the novel coronavirus covid-19. PloS One 15(3). 10.1371/journal.pone.0231236 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Benvenuto D., Giovanetti M., Vassallo L., Angeletti S., and Ciccozzi M. (2020). Application of the ARIMA model on the COVID-2019 epidemic dataset. Data in Brief, 10.1016/j.dib.2020.105340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Peracchi, F. (2020). The Covid-19 pandemic in Italy. Working paper, EIEF.
- 13.Branas, C. C., A. Rundle, S. Pei, W. Yang, B. G. Carr, S. Sims, et al. (2020). Flattening the curve before it flattens us: hospital critical care capacity limits and mortality from novel coronavirus (SARS-CoV2) cases in US counties. Technical Report medRxiv, 10.1101/2020.04.01.20049759. [DOI]
- 14. Engle R. F. and Granger C. W. J. (1987). Co-integration and error correction: representation, estimation, and testing. Econometrica 55(2), 251–276. 10.2307/1913236 [DOI] [Google Scholar]
- 15. Johansen S. (1991). Estimation and hypothesis testing of cointegration vectors in Gaussian vector autoregressive models. Econometrica 59(6), 1551–1580. 10.2307/2938278 [DOI] [Google Scholar]
- 16. Johansen S. (1996). Likelihood-based Inference in Cointegrated Vector Auto-Regressive Models. Oxford University Press. [Google Scholar]
- 17. Makridakis S., Spiliotis E., and Assimakopoulos V. (2020). The M4 competition: 100,000 time series and 61 forecasting methods. International Journal of Forecasting 36(1), 54–74. 10.1016/j.ijforecast.2019.04.014 [DOI] [Google Scholar]
- 18. Giordano G., Blanchini F., Bruno R., Colaneri P., Di Filippo A., Di Matteo A. (2020). Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nature Medicine, 1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ferguson, N. M. et al. (2020). Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand. Technical report, Imperial College COVID-19 Response Team, 10.25561/77482. [DOI]
- 20. Kuzin V., Marcellino M., and Schumacher C. (2011). MIDAS vs. mixed-frequency VAR: Nowcasting GDP in the euro area. International Journal of Forecasting 27(2), 529–542. 10.1016/j.ijforecast.2010.02.006 [DOI] [Google Scholar]
- 21. Ghysels E. (2016). Macroeconomics and the reality of mixed frequency data. Journal of Econometrics 193(2), 294–314. 10.1016/j.jeconom.2016.04.007 [DOI] [Google Scholar]
- 22. McAlinn K. and West M. (2019). Dynamic Bayesian predictive synthesis in time series forecasting. Journal of Econometrics 210(1), 155–169. 10.1016/j.jeconom.2018.11.010 [DOI] [Google Scholar]
- 23. McAlinn K., Aastveit K. A., Nakajima J., and West M. (2019). Multivariate Bayesian predictive synthesis in macroeconomic forecasting. Journal of the American Statistical Association, 1–19. [Google Scholar]
- 24. Stübinger J. (2019). Statistical arbitrage with optimal causal paths on high-frequency data of the s&p 500. Quantitative Finance 19(6), 921–935. 10.1080/14697688.2018.1537503 [DOI] [Google Scholar]
- 25. Stübinger J. and Adler K. (2020). How to identify varying lead–lag effects in time series data: Implementation, validation, and application of the generalized causality algorithm. Algorithms 13(4), 95 10.3390/a13040095 [DOI] [Google Scholar]
- 26. Hendry D. F. (1995). Dynamic Econometrics. Oxford University Press, USA. [Google Scholar]
- 27.Berta, P., P. G. Lovaglio, P. Paruolo, and S. Verzillo (2020). Real time forecasting of Covid-19 intensive care units demand. Working paper.
- 28. Tofallis C. (2015). A better measure of relative prediction accuracy for model selection and model estimation. Journal of the Operational Research Society 66, 1352–1362. 10.1057/jors.2014.124 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data are publicly available at: https://github.com/pcm-dpc/COVID-19.