Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2007 Jun 21;29(1):79–88. doi: 10.1016/j.tourman.2007.04.003

A fractionally integrated autoregressive moving average approach to forecasting tourism demand

Fong-Lin Chu 1,
PMCID: PMC7115486  PMID: 32287722

Abstract

The primary aim of this paper is to incorporate fractionally integrated ARMA (p, d, q) (ARFIMA) models into tourism forecasting, and to compare the accuracy of forecasts with those obtained by previous studies. The models are estimated using the volume of monthly international tourist arrivals in Singapore. Empirical findings demonstrate the evidence that the approach we propose generates relatively lower sample mean absolute percentage errors (MAPEs). This study also deals with the volatile data faced by a forecaster. We use the Asian financial crisis and the September 11 event as examples of economic and political shocks. With respect to the objective of shaving the coefficient MAPE, forecasts based on the selected ARFIMA models dominate convincingly.

Keywords: Fractionally integrated ARMA (p, d, q); Forecast; Singapore; Tourist arrivals

1. Introduction

International tourism has experienced an overwhelming boom over the last two decades and has now been called the largest industry in the world. Despite this positive outcome the rate of growth has varied unevenly from year to year. While most countries have enjoyed a steady increase in their tourist arrivals, some have experienced a decline in numbers at times. Planning under these circumstances is exceptionally difficult and important. To be successful, accurate forecasts of tourism demand are required. This can avoid the financial costs of excess capacity or the opportunity costs of unfulfilled demand.

The end of World War II, as well as the development of relatively fast, safe and comfortable air travel, marked the genesis of modern-day international tourism. Slightly more than two-quarters of a century ago, the serious study of international tourism had begun. By virtue of the accumulating record of international travel statistics at this time, researchers began to study the international tourism demand, one area of which has been forecasts. Over the last three to four decades, a substantial number of such studies have been undertaken. A large portion of these have yielded empirical information and a considerable majority have reported the accuracy of forecasts. Forecasting methods are generally classified into quantitative and qualitative (Archer, 1980; Uysal & Crompton, 1985). The quantitative approach, which gives more accurate forecasts than judgement forecasts are further divided into causal methods (e.g., regression and structural models) and time-series methods. The latter identify stochastic components (i.e., autoregressive and moving average components) in each time series and its distinguishing feature is that no attempt is made to relate a variable, say yt to other variables. The movements in yt are ‘explained’ solely in terms of its own past, or by its position in relation to time (Harvey, 1981).

The literature on modelling and forecasting tourism is huge. It includes approaches based on regression models, which to a large extent had been examined by Leob (1982), Uysal and Crompton (1985), Witt and Martin (1987), Crouch, Schultz, and Valerio (1992), Admowicz (1994), Morris, Wilson, and Bakalis (1995), Malenberg and Van Soest (1996), Kulendran and King (1997), and Song and Witt (2006). Other authors such as Geurts and Ibrahim (1975), Choy (1984), Van Doorn (1984), Martin and Witt (1989), Chan (1993), Chan, Hui, and Yuen (1999), Witt, Witt, and Wilson (1994), Turner, Kulendran, and Pergat (1995), Turner, Kulendran, and Fernando (1997), Garcia-Ferrer and Queralt (1997), Chu (1998a), Chu (1998b), Kim (1999), Lim & McAleer (2001), Lim & McAleer (2002), Goh and Law (2002), Gustavsson and Nordström (2001), Brännäs, Hellström, and Nordström (2002), Chu (2004), and Gil-Alana (2005) use pure time-series analytical models. In testing the accuracy of different forecasting models for tourist arrivals, researchers had found that time-series models often generate acceptable forecasts at low cost with reasonable benefits. Therefore, a spate of literature had investigated the forecasting ability along this line. Since the purpose of this paper is mainly concerned with the worldwide visitors in Singapore, a brief review of the articles dealing with the prediction of this island state's tourism demand will be conducted. Chan (1993) examined various forecasting models on monthly data between July 1977 and December 1988 and found that the sine wave nonlinear regression model outperformed Naïve I, Naïve II, simple linear regression, and ARIMA (2, 1, 2). The mean absolute percentage error (MAPE) of the 19-months-ahead forecast is 2.567. Applying a seasonal ARIMA model and using the same dataset and forecasting horizon, Chu (1998a) found that the MAPE reduced to 1.857. Chu (1998b) further improved the MAPE to 1.813 by using a combined approach to the same dataset. This paper tries to improve the accuracy of forecasts by employing a fractionally integrated autoregressive moving average (ARFIMA) model. Sutcliffe (1994) argued that the use of fractional differencing opens up a much wider and realistic behavior for the trend and seasonal components than traditional integer differencing. He showed several advantages of using fractional differencing for forecasting monthly data. Franses and Ooms (1997) extended the standard ARFIMA (0, d, 0) model for quarterly UK inflation, where they allowed the fractional integration parameter d to vary with the season s. For quarterly out-of-sample forecasting, this so-called periodic ARFIMA (0, d, 0) model does not appear to be the overall winner when compared with rival forecasting schemes. Porter-Hudak (1990) characterized a set of monetary aggregate series using a seasonal ARFIMA model. She estimated the seasonal fractional differencing parameter using an extension of a method proposed by Geweke and Porter-Hudak (1983) for the non-seasonal fractional differencing case. She also looked at forecasts generated by the seasonal fractional model and compared the one-step-ahead forecasts with those of a standard seasonal ARIMA model. Ray (1993) used a series of monthly IBM product revenues to illustrate the usefulness of seasonal fractionally differenced ARMA models for business forecasting. She found that the fractionally differenced seasonal model gives a more accurate next-quarter, next-half-year, next-year, and next-two-year forecasts than the non-fractional model based on the criteria that are specifically constructed to reflect the accuracy of long-range periodic forecasts. Applying a seasonal fractional integration to the number of monthly visitors to the US for the time period January 1996–February 2003, Gil-Alana (2005) shows that the series can be well described in terms of a seasonal I(d) process, with d higher than 0 but smaller than 1. To examine the forecasting properties in the same article, Gil-Alina used a longer dataset (the short-term arrivals at the Auckland International Airport, April 1978–December 1999) and found that the seasonal fractional models outperform the nonseasonal ones in practically all cases.

However, existing literature regarding forecasting tourism demand for Singapore so far had not adopted ARFIMA processes and this paper will fill this gap. Monthly data from July 1977 to December 1988 are used to identify a model for the series. Immediately after that, a 19-months-ahead forecast is implemented. This is a time period that covers relatively stable data. The reason why a period of 19 months was selected as the forecasting horizon is to compare our results with that of previous studies. Subsequently, the dataset will be extended to account for economic and political shocks. Shocks normally generate relatively unstable data. Therefore, the model will be tested by relatively stable and unstable data. Using the Gulf War as an example of sudden environment change, Chan et al. (1999) had explored the relative performance of various forecasting models and found Naïve II to be the best in terms of forecasting accuracy. Chan, however, did not consider the ARFIMA model and restricted their prediction interval to 12 months. This paper considers ARFIMA and extends the sample data to cover the Asian financial crisis and the September 11 event. The prediction interval is extended to 19 months as well. The organization of this paper is as follows. The next section briefly discusses Singapore's tourist industry and its time-series data. The third section presents the ARFIMA model. The fourth section analyzes model estimation and reports the application to tourist arrivals forecasting. The final section presents a summary and conclusions.

2. Singapore tourism industry and data

The performance of the Singapore tourism industry has been remarkable in recent years. The island state's tourism sector, one of its pillar industries, employs some 150,000 people and accounts for about 10% of its economy of $87 billion a year. It also attracts some 8 million visitors—twice its population—each year from all over the world. The data of monthly number of international tourists visiting Singapore between July 1977 and November 2004 are obtained from Singapore Tourist Promotion Board and are plotted in Fig. 1 . The marked seasonal pattern dominates the movement in the series. The plot exhibited a permanent deterministic pattern of a long-term upward trend with short-term fluctuations that were independent from one time period to the next. Note that Singapore's popularity has been growing; the series appears to be nonstationary in that the mean is increasing over time. The tourism industry survived the heavy blows of the first Iraq War, the Asian financial crisis, the September 11 terrorist attacks on the United States in 2001, the subsequent Iraq War, and the SARS (severe acute respiratory syndrome) outbreak in early 2003. These events, however, had influenced the decision of many of the potential travelers, and tourist arrivals therefore had been reduced significantly below the linear trend line in Fig. 1. Among them, the Asian financial crisis and the September 11 event are two different demand shocks in nature. The former is economic while the latter is political. The impacts of both disturbances on arrivals are demonstrated by a fenced rectangular in Fig. 1. For the forecasting experiment, the sample was split into the last 19 observations, henceforth called the ‘prediction interval’, and the remaining data, henceforth called the ‘pre-prediction interval’. Parameter estimates were obtained from fitting the models to the pre-prediction interval. Three different types of pre-prediction and prediction interval are considered. One of them has stable tourist arrivals within the entire sample, henceforth called the ‘tranquil period’. The other two, characterized by economic and political shocks, are further classified into cases, depending upon whether shock has taken place at the time the policy maker or the planning manager implemented the forecasts. Suppose the policy maker had already implemented the forecasts based on his best forecasting model but found later that shock had occurred. In this situation, the policy maker may be reluctant to change the forecasts because cost may surpass benefit. Even if he does change the forecasts based on alternative models at hand, there is no guarantee that the results will be better than those obtained from the previous models. Alternatively, if the shock had already taken place at the time the forecast is made, but the industry still shows no sign of partial or full recovery, the policy maker in this situation has the option to try alternative forecasting models if he deems appropriate. The Asian financial crisis took place in the second half of 1997. Accordingly, prediction intervals consist of January 1997–July 1998 and January 1998–July 1999. In the former, demand shock was not present at the time the forecast was made. In the latter, shock had already taken place. Parameter estimates were obtained from fitting the models to the pre-prediction interval, which corresponds to July 1977–December 1996 and July 1977–December 1997, respectively. The September 11 event took place in the latter half of 2001. Likewise, the prediction intervals are January 2001–July 2002 and January 2002–July 2003. Pre-prediction and prediction intervals for various cases are summarized in Table 1 .

Fig. 1.

Fig. 1

Monthly tourist arrivals.

Table 1.

Pre-prediction and prediction interval for various stages

Pre-prediction interval Prediction interval
Tranquil period
July 1977–December 1988 January 1989–July 1990


Asian financial crisis
July 1977–December 1996 January 1997–July 1998
July 1977–December 1997 January 1998–July 1999


September 11 event
July 1977–December 2000 January 2001–July 2002
July 1977–December 2001 January 2002–July 2003

3. A fractionally integrated ARMA model

A long memory model, which has become popular in econometrics, is the (discrete time) fractionally differenced model (Brockwell & Davis, 2002). It was suggested as an alternative to ARIMA models by Hosking (1981), and Granger and Joyeux (1980). A fractionally integrated ARMA processes, or more precisely, the ARIMA (p, d, q) processes with 0<∣d∣<0.5, satisfy difference equations of the form

(1-B)dφ(B)Xt=θ(B)Zt, (1)

where φ(z) and θ(z) are polynomials of degrees p and q, respectively, satisfying φ(z)≠0 and θ(z)≠0 for all z such that ∣z∣⩽1, B is the backward shift operator (i.e., BXt=Xt −1) and {Zt} is a Gaussian white noise sequence with mean 0 and variance σ 2. The difference operator (1−B) is raised to a fractional power, d, denoting the fractional order of integration. The operator (1−B)d is defined by the binomial expansion

(1-B)d=j=0πjBj,

where

πj=0<kjk-1-dk,j=1,2,.

The autocorrelations ρ(h) at lag h of an ARIMA(p, d, q) process with 0<∣d∣<0.5 has the property

ρ(h)h1-2dcash. (2)

This hyperbolic decay at high lags distinguishes series with long memory from series with short memory. It is the main characteristic for empirical identification.

The spectral density of {Xt} is given by

f(λ)=σ22π|θ(e-iλ)|2|φ(e-iλ)|2|1-e-iλ|-2d. (3)

Calculation of exact Gaussian likelihood of observations [x 1, …,  xn] of a fractionally integrated ARMA process is very slow and demanding in terms of computer memory. Instead of estimating the parameters d, φ 1, …  φp, θ 1, …,  θq, and σ 2 by maximizing the exact Gaussian likelihood, it is simpler to maximize the Whittle approximation Lw, defined by

-2ln(Lw)=nln(2π)+2nlnσ+σ-2jIn(ωj)g(ωj)+jlng(ωj), (4)

where In is the periodogram σ 2 g/(2π)(=f) is the model spectral density, and j denotes the sum over all nonzero Fourier frequencies ωj=2π/n∈(−π, π]. The parameters of the fractionally integrated ARIMA (p, d, q) process can be estimated and the results can be used for out-of-sample forecasting evaluation.

4. Model estimation and forecasting

Because the series illustrated in Fig. 1 demonstrates a combination of strong seasonality and nonstationarity, they are 12th differenced in an effort to obtain the seasonally differenced data. In order to impart stationarity into the series, we next take the first difference of the already seasonally differenced data. The results are displayed in Fig. 2 . Alternatively, one may just take the first difference of the raw data and ignore the 12th difference. This is illustrated in Fig. 3 . Sometimes forecasts based on the model fit to first differenced data render better results.

Fig. 2.

Fig. 2

Seasonally adjusted and first differenced data.

Fig. 3.

Fig. 3

First differenced data.

4.1. Empirical results for the tranquil period

The following models, each with different parameters, are found to provide a good fit to the data for the tranquil period. The first model considered was the parsimonious (0, d, 1) structure as suggested by Akaike information criterion (AIC):

(1-B)0.098(Δ1Δ121Xt)=θ(B)Zt,θ(B)=1-0.592B. (5)

The dth difference of a series is denoted by Δd. A seasonal difference is denoted by Δs, where s is the period of the data. The Dth such seasonal difference is ΔsD. In Eq. (5), we use monthly data and therefore 12th difference the data first and then first difference the resulting series, which is denoted by Δ1Δ121Xt. The second best model suggested by AIC was the (4, d, 6) structure:

φ(B)(1-B)0.269(Δ1Xt)=θ(B)Zt,φ(B)=1+0.716B+0.144B2+0.297B3+0.618B4,θ(B)=1-0.041B-0.643B2+0.103B3+0.851B4-0.214B5-0.550B6. (6)

Note that the resulting structure in Eq. (5) took into account the seasonal difference, while Eq. (6) ignored it. The forecasting performance of both models will be compared later. A frequently used descriptive device for measuring forecasting performance is the sample MAPE (Makridakis & Hibon, 1979; Makridakis et al., 1984). The coefficient MAPE measures the mean of absolute percent error of prediction and is given by

MAPE=1Kt=N+1N+K|1-X^tXt|·100,

where X^t and Xt represent the forecast and actual values. The MAPEs for Eqs. (5), (6) were 1.738% and 7.578%, respectively. Guided by AIC and out-of-sample analysis of forecasting accuracy, we choose the structure of Eq. (5) for the tranquil period. The results are illustrated in Table 2 along with the outcomes of previous empirical studies in an attempt to make a comparison.

Table 2.

Sample mean absolute percentage errors of various models

Model MAPE
Simple linear regression model 17.567%a
ARIMA (2, 1, 2) 8.577a
Naïve I model 5.526a
Naïve II model 4.459a
Cubic polynomial model 4.010b
Sine wave nonlinear model 2.567a
ARIMA (3, 1, 0)(0, 1, 0)12 1.857b
Combined forecast model 1.813b
Fractionally integrated ARMA (0, 0.098, 1) 1.738

Table 2 illustrates the forecasting ability of nine time-series models, in which the fractionally integrated ARMA approach turned out to be the best. It is notoriously difficult to produce a value of MAPE less than 2%, as far as the ex post forecasting of monthly tourist arrivals is concerned. This is especially so when the prediction horizon lasts for 19 months. Among the models which rendered a MAPE of less than 2%, the fractionally integrated ARMA approach has a 4.15% and 6.41% improvement over the combined forecast and ARIMA (3, 1, 0) (0, 1, 0)12, respectively. The improvement over the rest of the models is even more remarkable. All pose double-digit percentage MAPE improvement. The sample forecasts and actual arrivals are graphically illustrated in Fig. 4 . The forecasts follow the overall trend in tourist arrivals, and do capture the cyclical fluctuations that occurred during the prediction interval. The forecasts reproduce most of the turning points as well.

Fig. 4.

Fig. 4

Actual and forecasted number of tourist arrivals.

Table 3 shows both the forecasts and actual numbers. For comparison among the models which manufactured MAPEs below 2%, the results of ARIMA (3, 1, 0) (0, 1, 0)12 and the combined forecast models are also presented. Forecasts with minimal errors are in bold face. The mean and standard deviation of the absolute percentage errors are illustrated in Table 4 . Regarding the overall performances, the selected ARFIMA dominated convincingly.

Table 3.

Actual arrivals and forecasts of various models

Year/month Actual Forecasts
ARFIMA Combineda Seasonal ARIMAb
1989
Jan 380,024 378,930 378,862 378,662
Feb 353,852 363,100 363,683 364,047
Mar 404,794 398,300 398,368 399,818
Apr 375,673 382,810 384,103 384,128
May 363,122 369,840 373,638 371,059
Jun 367,298 370,950 372,092 372,326
Jul 435,974 429,770 431,177 431,302
Aug 443,399 440,510 443,750 441,671
Sep 406,419 393,410 394,894 395,124
Oct 423,403 417,900 419,070 419,748
Nov 425,947 414,220 415,952 416,202
Dec 450,045 446,410 448,577 448,532


1990
Jan 433,549 431,720 433,915 433,916
Feb 425,093 416,030 418,886 419,394
Mar 445,492 451,380 453,865 455,461
Apr 430,812 436,020 440,031 440,064
May 410,356 423,180 425,568 427,290
Jun 409,917 424,420 424,669 428,851
Jul 474,880 483,360 488,014 488,121

Table 4.

Mean and standard deviation of absolute percentage errors of various models

Model Mean Std. deviation
Seasonal ARIMA 1.857% 1.260
Combined forecast 1.813 1.124
Fractionally integrated ARMA 1.738 0.951

4.2. The period of the Asian financial crisis and the September 11 event

Next we proceed to analyze the choice of the models pertaining to the period of the Asian financial crisis. At first we look at situation where economic shock only hit the prediction interval. In other words, when the policy maker implements the prediction task, shock has not yet taken place. Subsequently, shock did indeed hit the prediction interval so that the policy maker was dumbfounded and could do nothing but wait for the actual arrivals. The first model considered was the parsimonious (2, d, 2) structure as suggested by AIC:

φ(B)(1-B)-0.411(Δ1Δ121Xt)=θ(B)Zt,φ(B)=1+1.026B+0.832B2,θ(B)=1+1.042B+0.982B2. (7)

The second best model suggested by AIC was the (4, d, 6) structure:

φ(B)(1-B)0.126(Δ1Xt)=θ(B)Zt,φ(B)=1+1.744B+1.988B2+1.654B3+0.936B4,θ(B)=1+1.361B+0.907B2-0.087B3-0.732B4-0.865B5-0.227B6. (8)

The calculated sample MAPEs for Eqs. (7), (8) are 14.40% and 11.86%, respectively. Guided by the out-of-sample analysis of forecasting accuracy, we choose the structure of Eq. (8). For the situation where the policy makers face a pre-prediction interval in which economic shock has already taken place, the first model considered was the parsimonious (1, d, 1) structure as suggested by AIC:

φ(B)(1-B)-0.213(Δ1Δ121Xt)=θ(B)Zt,φ(B)=1+0.665B,θ(B)=1+0.432B. (9)

The second best model suggested by AIC was the (11, d, 12) structure:

φ(B)(1-B)-0.061(Δ1Xt)=θ(B)Zt,φ(B)=1+0.619B+0.715B2+0.454B3+0.184B11,θ(B)=1+0.303B+0.438B2+0.447B12. (10)

The calculated sample MAPEs for Eqs. (9), (10) are 8.79% and 6.36%, respectively. Guided by the out-of-sample analysis of forecasting accuracy, we choose the structure of Eq. (10).

Preferred structures pertaining to the period of the September 11 event are the following alternative models. This is the situation where political shock only hit the prediction interval. The first model considered was the parsimonious (1, d, 0) structure as suggested by (AIC):

φ(B)(1-B)-0.179(Δ1Δ121Xt)=Zt,φ(B)=1+0.252B. (11)

The second best model suggested by AIC was the (2, d, 5) structure:

φ(B)(1-B)-0.128(Δ1Xt)=θ(B)Zt,φ(B)=1+0.008B+0.999B2,θ(B)=1-0.319B+1.057B2-0.700B3+0.174B4-0.323B5. (12)

The MAPEs for Eqs. (11), (12) are 14.69% and 4.71%, respectively. Guided by the out-of-sample analysis of forecasting accuracy, we choose the structure of Eq. (12). In the situation where political shock hit the pre-prediction interval, the first model considered was the parsimonious (1, d, 0) structure as suggested by AIC:

φ(B)(1-B)-0.202(Δ1Δ121Xt)=Zt,φ(B)=1+0.229B. (13)

The second best model suggested by AIC was the (3, d, 6) structure:

φ(B)(1-B)-0.277(Δ1Xt)=θ(B)Zt,φ(B)=1+0.622B+1.002B2+0.617B3,θ(B)=1+0.562B+0.955B2+0.301B3-0.360B4-0.133B5-0.373B6. (14)

The MAPEs for Eqs. (13), (14) are 30.82% and 31.15%, respectively. Guided by AIC and the out-of-sample analysis of forecasting accuracy, we choose the structure of Eq. (13).

The ex post forecasts for the period of the Asian financial crisis and the September 11 event are generated. MAPEs are calculated and illustrated in Table 5, Table 6 , respectively. For comparison, parameter estimates are obtained from fitting the various models to the pre-prediction interval. These include univariate extrapolative models such as simple linear regression, Naïve I, Naïve II, cubic polynomial, and sine wave nonlinear regression. Modern stochastic time-series model such as nonseasonal ARIMA (p, d, q) and seasonal ARIMA models are also considered. Model-based out-of-sample forecasts are obtained and the MAPEs are calculated and displayed in Table 5, Table 6. The minimal MAPE manufactured by a specific model is illustrated by bold-faced numerical values.

Table 5.

Sample MAPE of various models for the period of the Asian financial crisis

Model Prediction interval
1/97–7/98 1/98–7/99
Naïve I 20.64% 8.89%
Naïve II 36.20 16.34
Linear regression 12.95 19.62
Cubic polynomial regression 20.00 21.20
Sine wave nonlinear regression 13.05 20.97
ARIMA (p, d, q) 13.61a 7.31b
ARIMA (p, d, q)(P, D, Q)12 14.03c 6.55d
Fractionally integrated ARIMA (p, d, q) 11.86 6.36
a

ARIMA (2, 1, 2).

b

ARIMA (2, 1, 2).

c

ARIMA (1, 1, 1)(0, 1, 1)12.

d

ARIMA (1, 1, 2)(0, 1, 1)12.

Table 6.

Sample MAPE of various models for the period of the September 11 event

Model Prediction interval
1/01–7/02 1/02–7/03
Naïve I 11.28% 39.16%
Naïve II 23.03 74.09
Linear regression 8.81 43.74
Cubic polynomial regression 7.78 29.86
Sine wave nonlinear regression 8.90 43.73
ARIMA (p, d, q) 6.84a 33.73b
ARIMA (p, d, q)(P, D, Q)12 8.46c 31.54d
Fractionally integrated ARMA (p, d, q) 4.71 30.82
a

ARIMA (2, 1, 3).

b

ARIMA (4, 1, 3).

c

ARIMA (3, 1, 3)(1, 0, 1)12.

d

ARIMA (0, 0, 1)(1, 1, 1)12.

Again the results in Table 5, Table 6 seem to indicate that the fractionally integrated ARMA approach relatively outperformed the rest of the models. It generated the best predictor for both prediction intervals during the Asian financial crisis. It also generated the best and the second best predictor for both prediction intervals during the September 11 event. The result for the prediction interval of January 2001–July 2002 is worth mentioning. Here, the coefficient MAPE is merely 4.71%, despite the tourist flows being influenced by the September 11 event. The deterioration of MAPEs seen in the prediction interval of January 2002–July 2003 mostly reflected the plunge of international visitors when the SARS crisis was at its peak. For that interval, the cubic polynomial regression seems to produce the best outcome but its outperformance over fractionally integrated ARMA is only marginal. If this period of prediction interval had not been severely disturbed by the SARS epidemic, the approach we proposed might have done better than the cubic polynomial regression method. In all cases, fractionally integrated ARMA (p, d, q) models beat the rival nonseasonal ARIMA (p, d, q) and seasonal ARIMA (p, d, q).

We mentioned earlier that the reason why a period of 19 months was selected as the forecasting horizon is to compare our results with that of past authors. Since 12 or 24 months seem more logical given the series is a monthly series, we fit all models and make 12- and 24-month forecasts. Table 7, Table 8, Table 9 illustrate the MAPEs of various models for the period of tranquility, the Asian financial crisis, and the September 11 event, respectively. The minimal MAPE manufactured by a specific model is illustrated by bold-faced numerical values. Indeed, the fractional integrated ARMA outperformed the rest of the models in most of the cases.

Table 7.

Sample MAPE of various models for 12- and 24-months-ahead forecasts

Model Prediction interval
1/89–12/89 1/89–12/90
Naïve I 7.42% 9.17%
Naïve II 8.81 6.76
Linear regression 16.16 18.38
Cubic polynomial regression 2.11 5.77
Sine wave nonlinear regression 2.58 2.82
ARIMA (p, d, q) 7.83 11.12
ARIMA (p, d, q)(P, D, Q)12 1.74 2.03
Fractionally integrated ARIMA (p, d, q) 1.63 2.28

Table 8.

Sample MAPE of various models for 12- and 24-months-ahead forecasts

Model Prediction interval
1/97–12/97 1/97–12/98 1/98–12/98 1/98–12/99
Naïve I 14.12% 22.47% 11.17% 8.12%
Naïve II 28.77 38.32 19.69 13.79
Linear regression 7.81 15.52 22.65 18.52
Cubic polynomial regression 12.61 24.03 24.73 19.76
Sine wave nonlinear regression 8.15 15.72 2..96 19.85
ARIMA (p, d, q) 9.14 14.76 8.63 7.29
ARIMA (p, d, q)(P, D, Q)12 9.13 20.13 5.76 9.13
Fractionally integrated ARIMA (p, d, q) 7.19 13.18 5.29 7.48

Table 9.

Sample MAPE of various models for 12- and 24-months-ahead forecasts

Model Prediction interval
1/01–12/01 1/01–12/02 1/02–12/02 1/02–12/03
Naïve I 11.79% 11.19% 7.65% 33.89%
Naïve II 22.96 22.38 32.89 66.20
Linear regression 8.25 9.59 9.63 38.75
Cubic Polynomial regression 6.99 8.12 5.85 25.15
Sine wave nonlinear regression 8.24 9.57 9.62 38.75
ARIMA (p, d, q) 7.49 6.73 5.83 28.03
ARIMA (p, d, q)(P, D, Q)12 7.23 9.03 3.73 26.25
Fractionally integrated ARIMA (p, d, q) 5.67 4.98 6.23 26.65

To this end, forecasting comparison was done statically, i.e., the models were fitted with the data up to a certain time, and then the estimated models were used to forecast the tourist arrivals over the next 12, 19 and 24 months. An alternative way for the forecasting assessment is to calculate the MAPEs recursively and then compare the forecasting performance with different forecasting horizons. Here, 1- 2- and 6-period out-of-sample forecasts are implemented. The parameters of each model are re-estimated at the beginning of each forecast period using all of the observations up to the forecast origin. Table 10, Table 11, Table 12 give the MAPE statistics for the periods of tranquility, the Asian financial crisis, and the September 11 event, respectively. The forecast generated by the fractionally integrated ARMA model appears to predict the monthly arrivals quite well except those generated during the months when the SARS crisis took place.

Table 10.

Sample MAPE for forecasts of monthly tourism arrivals

Forecast origin Forecast period MAPE
12/88 1/89 1.01
1/89 2/89 4.07
2/89 3/89 2.71
3/89 4/89 2.39
4/89 5/89 4.21
5/89 6/89 4.32
6/89 7 89 1.98
7 89 8/89 5.84
8/89 9/89 0.44
9/89 10/89 0.21
10/89 11/89 2.21
11/89 12/89 2.28
12/89 1/90 1.37
1/90 2/90 1.19
2/90 3/90 2.73
3/90 4/90 4.26
4/90 5/90 8.76
5/90 6/90 8.65
6/90 7/89 7.15


12/89 1/89–2/89 2.47
2/89 3/89–4/89 1.79
4/89 5/89–6/89 4.04
6/89 7/89–8/89 4.49
8/89 9/89–10/89 1.01
10/89 11/89–12/89 2.54
12/89 1/90–2/90 0.95
2/90 3/90–4/90 2.16
4/90 5/90–6/90 8.34


12/89 1/89–6/89 1.54
6/90 7/89–12/89 2.76
12/90 1/90–6/90 2.80

Table 11.

Sample MAPE for forecasts of monthly tourism arrivals

Forecast origin Forecast period MAPE Forecast origin Forecast period MAPE
12/96 1/97 2.27 12/97 1/98 5.12
1/97 2/97 5.83 1/98 2/98 0.83
2/97 3/97 2.51 2/98 3/98 4.71
3/97 4/97 1.36 3/98 4/98 4.70
4/97 5/97 5.23 4/98 5/98 8.27
5/97 6/97 5.85 5/98 6/98 7.29
6/97 7/97 8.15 6/98 7/98 1.51
7/97 8/97 1.89 7/98 8/89 8.21
8/97 9/97 8.79 8/89 9/98 0.83
9/97 10/97 13.59 9/98 10/98 2.60
10/97 11/97 3.89 10/98 11/98 2.56
11/97 12/97 4.79 11/98 12/98 2.29
12/97 1/98 10.66 12/98 1/99 9.56
1/98 2/98 10.26 1/99 2/99 1.59
2/98 3/98 3.51 2/99 3/99 4.73
3/98 4/98 8.91 3/99 4/99 2.04
4/98 5/98 10.95 4/99 5/99 4.84
5/98 6/98 10.31 5/99 6/99 4.39
6/98 7/98 4.67 6/99 7/99 1.18


12/96 1/97–2/97 4.29 12/97 1/98–2/98 2.98
2/97 3/97–4/97 2.67 2/98 3/98–4/98 6.22
4/97 5/97–6/97 5.12 4/98 5/98–6/98 6.06
6/97 7/97–8/97 1.90 6/98 7/98–8/98 5.29
8/97 9/97–10/97 16.1 8/98 9/98–10/98 3.28
10/97 11/97–12/97 2.93 10/98 11/98–12/98 1.70
12/97 1/98–2/98 12.63 12/98 1/99–2/99 5.13
2/98 3/98–4/98 6.02 2/99 3/99–4/99 2.96
4/98 5/98–6/98 5.78 4/99 5/99–6/99 5.74


12/96 1/97–6/97 3.31 12/97 1/98–6/98 6.14
6/97 7/97–12/97 12.15 6/98 7/98–12/98 6.99
12/97 1/98–6/98 12.07 12/98 1/99–6/99 6.60

Table 12.

Sample MAPE for forecasts of monthly tourism arrivals

Forecast origin Forecast period MAPE Forecast origin Forecast period MAPE
12/00 1/01 4.01 12/01 1/02 0.47
1/01 2/01 4.90 1/02 2/02 3.40
2/01 3/01 0.65 2/02 3/02 2.28
3/01 4/01 7.89 3/02 4/02 1.76
4/01 5/01 0.45 4/02 5/02 0.85
5/01 6/01 4.64 5/02 6/02 1.41
6/01 7/01 3.97 6/02 7/02 1.39
7/01 8/01 0.09 7/02 8/02 0.22
8/01 9/01 8.59 8/02 9/02 2.88
9/01 10/01 4.98 9/02 10/02 2.62
10/01 11/01 10.02 10/02 11/02 1.75
11/01 12/01 6.51 11/02 12/02 1.41
12/01 1/02 6.01 12/02 1/02 5.72
1/02 2/02 5.35 1/02 2/02 5.66
2/02 3/02 2.05 2/02 3/02 29.41
3/02 4/02 1.78 3/02 4/02 177.65
4/02 5/02 0.95 4/02 5/02 37.09
5/02 6/02 1.76 5/02 6/02 2.83
6/02 7/02 1.41 6/02 7/02 6.67


12/00 1/01–2/01 4.52 12/02 1/02–2/02 0.56
2/01 3/01–4/01 4.04 2/02 3/02–4/02 1.16
4/01 5/01–6/01 2.69 4/02 5/02–6/02 2.03
6/01 7/01–8/01 2.87 6/02 7/02–8/02 1.04
8/01 9/01–10/01 8.81 8/02 9/02–10/02 2.63
10/01 11/01–12/01 7.27 10/02 11/02–12/02 1.18
12/01 1/02–2/02 3.82 12/02 1/03–2/03 4.10
2/02 3/02–4/02 2.58 2/03 3/03–4/03 134.15
4/02 5/02–6/02 1.65 4/03 5/03–6/03 24.54


12/00 1/01–6/01 3.53 12/01 1/02–6/02 1.51
6/01 7/01–12/01 6.35 6/02 7/02–12/02 2.45
12/01 1/02–6/02 2.38 12/02 1/03–6/03 99.21

Forecasting performance of the tourism demand models tend to vary with the nature of the data used, i.e., monthly series, viz., quarterly series. To ensure the reliability of the forecasting evaluation, our estimated models were further checked by quarterly time series in the forecasting exercise for all datasets. Guided by AIC and the out-of-sample analysis of forecasting accuracy, we choose the models represented by the following equations:

φ(B)(1-B)0.369(Δ1Xt)=θ(B)Zt,φ(B)=1+0.867B+0.136B2+0.091B3-0.848B4,θ(B)=1-0.315B+0.298B2-0.039B3-0.416B4, (15)
φ(B)(1-B)-0.241(Δ1Xt)=θ(B)Zt,φ(B)=1-0.014B+0.057B2-1.166B4+0.239B8-0.015B9,θ(B)=1-0.249B+0.41B2-0.712B4-0.097B5+0.337B8, (16)
φ(B)(1-B)-0.097(Δ1Xt)=θ(B)Zt,φ(B)=1+0.032B+0.9428B4,θ(B)=1-0.053B-0.040B3-0.546B4, (17)
φ(B)(1-B)0.112(Δ1Xt)=θ(B)Zt,φ(B)=1+0.992B+0.995B2+0.994B3,θ(B)=1+0.657B+0.767B2+0.686B3, (18)
φ(B)(1-B)0.109(Δ1Xt)=θ(B)Zt,φ(B)=1+0.001B-0.008B2-0.962B4,θ(B)=1-0.241B+0.020B2-0.719B4. (19)

The results of the calculated sample MAPEs are illustrated in Table 13 . We produced forecasts for 4, 6, and 8 quarters and the results demonstrated that the forecasts generated by quarterly series perform as well as those of monthly series. For example, the forecasts generated for the tranquil period is so magnificent that the coefficient MAPE is approximately 2%.

Table 13.

Sample MAPE for forecasts of quarterly tourism demand

Model Prediction interval MAPE
Eq. (15) 1Q/89–4Q/89 2.07%
1Q/89–2Q/90 2.06
1Q/89–4Q/90 1.73


Eq. (16) 1Q/97–4Q/97 6.43
1Q/97–2Q/98 10.35
1Q/97–4Q/98 12.85


Eq. (17) 1Q/98–4Q/98 6.38
1Q/98–2Q/99 5.02
1Q/98–4Q/99 5.72


Eq. (18) 1Q/01–4Q/01 5.68
1Q/01–2Q/02 6.37
1Q/01–4Q/02 5.83


Eq. (19) 1Q/02–4Q/02 4.62
1Q/02–2Q/03 28.56
1Q/02–4Q/03 23.68

5. Conclusions

A fractionally integrated ARMA (p, d, q) model has been constructed to explain the demand for Singapore tourism. The data for the inflow of international visitors over the periods July 1977–December 1988, July 1977–December 1996, July 1977–December 1997, July 1977–December 2000, and July 1977–December 2001 are fitted to the mathematical model that we constructed. For each pre-prediction interval mentioned above, we employ the resulting parameter estimates to generate the ex post out-of-sample forecasts of the number of arrivals for the next 19 months. MAPE was the criterion used to evaluate our forecasting performance. During the tranquil period, the fractionally integrated ARMA (p, d, q) model is the clear winner amongst the nine rival models. In fact, it improves the accuracy of forecast by several percentage points depending upon which rival models we are comparing it with. For instance, the improvement over the combined forecasting approach and seasonal ARMA (3, 1, 0)(0, 1, 0)12 model are 4.15% and 6.41%, respectively. Improvement over the rest of the models is even more magnificent.

The model we proposed also performed quite well in environments disturbed by economic and political shocks. Although we differentiate two cases that the forecaster or the policy maker may face in the vicinity of the shock, the proposed models seem to render the best performance to the policy maker among the methods we consider in most cases. This study is motivated by the potential of gains in predictive accuracy by using a fractionally integrated ARMA (p, d, q) model, in which the estimates of differencing parameter d are treated as fractional. Apart from treating d as a noninteger, appropriate integer differences are also implemented simultaneously in an effort to impart the data stationary. This may explain why the fractionally integrated ARMA (p, d, q) model performed relatively better than that of its rival traditional and seasonal ARIMA (p, d, q).

We also calculated the MAPEs recursively and then compared the forecasting performance with different forecasting horizons as an alternative approach for the forecasting assessment. One, two, and six periods ahead, forecasts are executed. The study was further expanded by including quarterly series in the forecasting exercise to ensure the reliability of the forecasting evaluation. Apart from the SARS period, the fractionally integrated ARMA (p, d, q) model outperforms the rival forecasting models most of the time.

The contribution of this paper is to show that a fractionally integrated ARMA model can be a credible alternative when modeling monthly tourism arrivals in Singapore. When model-based out-of-sample forecasts are entertained, more accurate results are generated. Hence, the results presented in this paper lead us to suggest that they can be used by recreation managers and decision makers in regional and national tourism offices in implementation of policies in an attempt to enhance tourism demand and growth.

An interesting topic for future research is to employ this estimating approach to international tourism demand for other countries. Model-based out-of-sample forecasts for the arrivals can also be implemented and used to find out whether the accuracy of forecast has improved. Tourism market in the West might possess characteristics different from those of the Asian Pacific countries. But when demand shocks take place, the global tourism industry can hardly escape being impacted. The need to thoroughly test the model inclusive of a static and disturbed environment is inevitable for a sound model.

References

  1. Admowicz W.L. Habit formation and variety seeding in a discrete choice model of recreation demand. Journal of Agricultural and Resource Economics. 1994;19:19–31. [Google Scholar]
  2. Archer B.H. Forecasting demand: Quantitative and intuitive technique. International Journal of Tourism Management. 1980;1:5–12. [Google Scholar]
  3. Brännäs K., Hellström J., Nordström J. A new approach to modeling and forecasting monthly guest nights in hotels. International Journal of Forecasting. 2002;18:19–30. [Google Scholar]
  4. Brockwell P.J., Davis R.A. 2nd ed. Springer; New York: 2002. Introduction to time series and forecasting. [Google Scholar]
  5. Chan Y.M. Forecasting tourism: A sine wave time series regression approach. Journal of Travel Research. 1993;32:58–60. [Google Scholar]
  6. Chan Y.M., Hui T.K., Yuen E. Modeling the impact of sudden environmental changes on visitor arrival forecasts: The case of the Gulf War. Journal of Travel Research. 1999;37:391–394. [Google Scholar]
  7. Choy D.L. Forecasting tourism revisited. Tourism Management. 1984;5:171–176. [Google Scholar]
  8. Chu F.L. Forecasting tourism arrivals: Nonlinear sine wave or ARIMA? Journal of Travel Research. 1998;36:79–84. [Google Scholar]
  9. Chu F.L. Forecasting tourism arrivals: A combined approach. Tourism Management. 1998;19:515–520. [Google Scholar]
  10. Chu F.L. Forecasting tourism demand: A cubic polynomial approach. Tourism Management. 2004;25:209–218. doi: 10.1016/j.tourman.2007.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Crouch G.I., Schultz L., Valerio P. Marketing international tourism to Australia: A regression analysis. Tourism Management. 1992;13:196–208. [Google Scholar]
  12. Franses P.H., Ooms M. A periodic long-memory model for quarterly UK inflation. International Journal of Forecasting. 1997;13:117–126. [Google Scholar]
  13. Garcia-Ferrer A., Queralt R.A. A note on forecasting international tourism demand in Spain. International Journal of Forecasting. 1997;13:539–549. [Google Scholar]
  14. Geurts M., Ibrahim I.B. Comparing the Box–Jenkins approach with the exponentially smoothed forecasting model: Application to Hawaii tourists. Journal of Marketing Research. 1975;12:182–188. [Google Scholar]
  15. Geweke J., Porter-Hudak S. The estimation and application of long memory time series models. Journal of Time Series Analysis. 1983;4:221–238. [Google Scholar]
  16. Goh C., Law R. Modeling and forecasting tourism demand for arrivals with stochastic nonstationary seasonality and intervention. Tourism Management. 2002;23:499–510. [Google Scholar]
  17. Gil-Alana L.A. Modeling international monthly arrivals using seasonal univariate long-memory processes. Tourism Management. 2005;26:867–878. [Google Scholar]
  18. Granger C.W.J., Joyeux R. An introduction to long-memory time series models and fractional differencing. Journal of Time Series Analysis. 1980;1:15–39. [Google Scholar]
  19. Gustavsson P., Nordström J. The impact of seasonal unit roots and vector ARMA modeling on forecasting monthly tourism flows. Tourism Economics. 2001;7:117–133. [Google Scholar]
  20. Harvey A.C. Philip Allan; Deddington: 1981. Time series models. [Google Scholar]
  21. Hosking J.R.M. Fractional differencing. Biometrika. 1981;68:165–176. [Google Scholar]
  22. Kim J.H. Forecasting monthly tourist departure from Australia. Tourism Economics. 1999;5:277–291. [Google Scholar]
  23. Kulendran N., King M.L. Forecasting international quarterly tourist flows using error-correction and time-series models. International Journal of Forecasting. 1997;13:319–327. [Google Scholar]
  24. Leob P.D. International travel to the United States: An econometric evaluation. Annals of Tourism Research. 1982;9:7–20. [Google Scholar]
  25. Lim C., McAleer M. Cointegration analysis of quarterly tourism demand by Hong Kong and Singapore for Australia. Applied Economics. 2001;33:1599–1619. [Google Scholar]
  26. Lim C., McAleer M. Time series forecasts of international travel demand for Australia. Tourism Management. 2002;23:389–396. [Google Scholar]
  27. Makridakis S., Anderson A., Carbone R., Fildes R., Hibon M., Lewandowske R., editors. The accuracy of major forecasting procedure. Wiley; New York: 1984. [Google Scholar]
  28. Makridakis S., Hibon M. Accuracy of forecasting: An empirical investigation. Journal of the Royal Statistical Society. 1979;142:97–125. [Google Scholar]
  29. Martin C.A., Witt S.F. Forecasting tourism demand: A comparison of the accuracy of several quantitative methods. International Journal of Forecasting. 1989;5:7–19. [Google Scholar]
  30. Malenberg B., Van Soest A. Parametric and semiparametric modeling of vacation expenditures. Journal of Applied Econometrics. 1996;11:59–76. [Google Scholar]
  31. Morris A., Wilson K., Bakalis S. Modeling tourism flows from Europe to Australia. Tourism Economics. 1995;2:147–167. [Google Scholar]
  32. Porter-Hudak S. An application of seasonal fractionally differenced model to the monetary aggregates. Journal of the American Statistical Association. 1990;85:338–344. [Google Scholar]
  33. Ray B.K. Long-range forecasting of IBM product revenues using a fractionally differenced ARMA model. International Journal of Forecasting. 1993;9:255–269. [Google Scholar]
  34. Singapore Tourist Promotion Board. Annual statistical report on visitor arrivals to Singapore—various issues 1975–2004.
  35. Song H., Witt S.F. Forecasting international tourist flows to Macau. Tourism Management. 2006;27:214–224. [Google Scholar]
  36. Sutcliffe A. Time series forecasting using fractional differencing. Journal of Forecasting. 1994;13:183–393. [Google Scholar]
  37. Turner L.W., Kulendran N., Fernando H. Univariate modeling using periodic and non-periodic analysis: Inbound tourism to Japan, Australia and New Zealand compared. Tourism Economics. 1997;3:39–56. [Google Scholar]
  38. Turner L.W., Kulendran N., Pergat V. Forecasting New Zealand tourism demand with disaggregated data. Tourism Economics. 1995;1:51–69. [Google Scholar]
  39. Uysal M., Crompton J. An overview of approaches used to forecast tourism demand. Journal of Travel Research. 1985;23:7–15. [Google Scholar]
  40. Van Doorn J.W.M. Tourism forecasting and policy maker-criteria of usefulness. Tourism Management. 1984;5:24–39. [Google Scholar]
  41. Witt S.F., Martin C. Econometric models for forecasting international tourism demand. Journal of Travel Research. 1987;25:23–30. [Google Scholar]
  42. Witt C.A., Witt S.F., Wilson N. Forecasting international tourist flows. Annals of Tourism Research. 1994;21:612–628. [Google Scholar]

Articles from Tourism Management are provided here courtesy of Elsevier

RESOURCES