Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2009 Jan 10;25(3):531–549. doi: 10.1016/j.ijforecast.2008.11.014

Modeling tourism: A fully identified VECM approach

Carl Bonham a,, Byron Gangnes a,1, Ting Zhou b,2
PMCID: PMC7114862  PMID: 32287620

Abstract

System-based cointegration methods have become popular tools for economic analysis and forecasting. However, the identification of structural relationships is often problematic. Using a theory-directed sequential reduction method suggested by Hall, Henry and Greenslade [Hall, S. G., Henry, S., & Greenslade, J. (2002). On the identification of cointegrated systems in small samples: A modelling strategy with an application to UK wages and prices. Journal of Economic Dynamics and Control, 26, 1517–1537], we estimate a vector error correction model of Hawaii tourism, where both demand and supply-side influences are important. We identify reasonable long-run equilibrium relationships, and Diebold–Mariano tests for forecast accuracy demonstrate satisfactory forecasting performance.

Keywords: Cointegration, Vector error correction model, Identification, Tourism demand and supply analysis, Tourism forecasting, Hawaii

1. Introduction

System-based cointegration methods, and their dynamic counterpart vector error correction models (VECMs), have become popular tools for economic analysis and forecasting. Cointegration analysis addresses the problem of spurious regressions among non-stationary time series. Estimation in a system context may shed light on important interrelationships among series, while reducing the risk of endogeneity bias. 3

However, system methods introduce additional challenges, chief among them being the problem of identifying individual structural relationships. In a system with cointegrating rank r, Pesaran and Shin (2001) show that exact identification requires r restrictions in each of the r cointegrating vectors. The popular Johansen, 1988, Johansen, 1991, Johansen, 1995 method uses a statistical approach to achieve the needed restrictions. Pesaran and Shin (2001) and Pesaran and Smith (1998) criticize this approach as a pure mathematical convenience, and instead advocate a theory-based approach. Hall, Henry, and Greenslade (2002) argue that the different identification methods proposed in the literature are almost impossible to implement in practice, due to the limited sample sizes typically available for empirical research. As an alternative, they suggest testing and imposing theory-based weak exogeneity assumptions at the earliest stage of the model reduction process. In this paper, we apply the Hall et al. (2002) strategy to the problem of estimating a structural econometric model of Hawaii tourism, where both demand and supply-side influences may be important.

There exists a large body of empirical literature on modeling and forecasting tourism flows. Nearly all existing studies focus solely on the demand side of the market, attempting to model either the demand arising from alternative source-country markets for a particular tourism destination, or the allocation of outbound travel demand to alternative destinations. 4 The application of cointegration analysis and the associated error-correction dynamic specifications to tourism demand modeling began in the mid-1990s, and is now relatively common. In many cases, single-equation methods have been used with little or no consideration of potential endogeneity problems (see for example Gonzalez and Moral, 1995, Kim and Song, 1998, Song et al., 2000, Vogt and Wittayakorn, 1998). When system approaches are used (for example Dritsakis, 2004, Gangnes and Bonham, 1998, Kulendran and King, 1997, Kulendran and Witt, 2001, Kulendran and Witt, 2003, Lim and McAleer, 2001, Song and Witt, 2003), identification is obtained exclusively using Johansen’s reduced rank regression technique, and identified cointegrating relationships are often simply assumed to represent demand functions. 5

The identification problem in tourism systems has been noted by some researchers and addressed in various ways, none of which are fully satisfactory. For example, Song and Witt (2003) note the difficulty of interpreting multiple cointegrating relationships, and omit from consideration cases where more than one relationship is found. Citing Kulendran and Witt (2001), Muscatelli and Hurn (1992) select vectors for which the estimated parameters conform to demand theory. Without testing, De Mello and Nell (2005) impose identifying restrictions implied by the AIDS model.

Our VECM approach explicitly allows for endogeneity and permits the identification of relationships governing both demand and supply (pricing) behavior. Hawaii is a particularly apt case for such analysis, because tourists from two markets–the United States and Japan–represent a dominant 85% of the total market. Clearly, in this case demand parameters cannot be estimated reliably without regard to supply constraints and potential price responses. And of course, a knowledge of supply-side behavior is of interest in its own right. Our identified model describes relatively long-run equilibrium relationships governing tourism demand and visitor accommodations pricing in Hawaii, and the forecasts compare favorably with those of three competing models according to the Diebold and Mariano (1995) tests of forecast accuracy.

The organization of the paper is as follows. Section 2 derives the visitor demand and room price equations, and identifies the variables to be used in the modeling exercise. Section 3 outlines our estimation methodology. Section 4 presents the empirical results of the Hawaii tourism model. Section 5 evaluates the forecast performance of the model. Section 6 concludes.

2. Tourism model specification

There is a relatively small body of theoretical literature on tourism economics, and no single unifying conceptual framework. Some early perspectives are reflected by Gray (1970) and Quandt (1970)Bull (1995)Sinclair and Stabler (1997), and Vanhove (2005) provide textbook overviews of tourism theory. In recent years, we have begun to see the development of optimization-based models of aspects of the tourism industry, often directed toward the question of optimal taxation of tourist spending (see for example Copeland, 1989, Copeland, 1990, Gooroochurn, 2004, Lu et al., 2008, Morely, 1992, Piga, 2006, Taylor, 1995). While theoretical work remains relatively sparse, we have noted above that there exists a large and well-defined body of empirical literature that can be used to guide the specification of visitor demand equations. Literature on the supply-side of the industry and pricing behavior is much more limited.

2.1. Tourism demand

Empirical models of tourism demand borrow heavily from consumer theory (Varian, 1992), which predicts that the optimal consumption level depends on the consumer’s income, the price of the good in question, the prices of related goods (substitutes and complements), and other demand shifters. Formally, the Marshallian demand for the tourism product can be expressed as,

Dij=F(Yi,Pi,Pj,PjS,Z), (1)

where Dij is the tourism product demanded in destination j by consumers from origin country i; Yi is the income of origin country i; Pi is the price of other goods and services in the origin country i; Pj is the price of the tourism product in destination country j; PjS is the price of tourism product in competing destinations; and Z is the vector of other factors affecting tourism demand. Assuming homogeneity, demand can be written as a function of real income and relative destination and substitute prices,

Dij=F(YiPi,PjPi,PjSPi,Z). (2)

In the literature, there are at least two classes of tourism models, those explaining the distribution of outward flows from a single source market (outbound modeling) and those explaining aggregate tourism flows into a single destination (inbound modeling). For outbound modeling, market shares of visitors or expenditures are the typical dependent variables. For inbound modeling, the most appropriate measure is real expenditures on tourism-related goods and services. However, the unavailability and often poor quality of expenditure data confine the typical study to total visitor arrivals (Anastasopoulos, 1984, O’Hagan and Harrison, 1984). Of the 81 tourism demand studies reviewed by Li et al. (2005), nearly 70% choose the number of visitor arrivals as the measure of demand, while 35% use expenditures or expenditure shares. (These total to more than 100% because some studies include more than one measure.) Visitor arrivals continue to be the most popular measure in studies completed since 2000 (Song & Li, 2008).

Proxies for demand determinants vary considerably, but often include measures of income, relative prices, substitute prices, travel costs, exchange rates, deterministic trends, and dummy variables for one-time events (Li et al., 2005). Typical income measures include gross domestic product, gross national product, national disposable income, personal income, and consumption expenditures, measured in either real, nominal, aggregate, or per capita form, depending on data availability and the nature of the tourism demand being modeled. 6

Several types of prices appear in the demand specification. The first is the own price of tourism products, usually approximated by the consumer price index in the destination market. 7 Second are measures of substitute prices. Because domestic travel may substitute for foreign travel, aggregate prices in the country of origin are often included. At the same time, competition among different overseas destinations may call for the inclusion of variables that represent the costs of substitute destinations. Exchange rate adjusted relative prices (real exchange rates) are commonly used as proxies for both effects. 8 Finally, transportation costs are sometimes included as a separate factor in determining travel. 9 Many studies augment income and price variables with deterministic effects: time trends to capture evolving consumer tastes; a constant term to account for “utility image” that does not vary greatly with time; dummies to account for various one-off events such as the Olympic Games, large-scale fairs, foreign currency/travel restrictions and oil crises; seasonality; or changes in data collection methods. These types of events, if otherwise neglected, might lead to bias in the estimated parameters (Anastasopoulos, 1984, Crouch et al., 1992, Kliman, 1981, Mak et al., 1977).

For our Hawaii tourism model (HTM), we use the number of visitor arrivals as the dependent variable, because high frequency expenditure data is not available for a sufficiently long continuous time span. We seek to identify the demand relationship for each of the two primary Hawaii tourism markets, US and Japanese visitors. Tourists from these two markets consistently account for over 85% of all visitors. To keep the model size manageable, we do not include visitors from other destinations. For the same reason, we include only the principle determinants of tourism demand, leaving out influences that are deemed less central to our analysis. In addition, some conceptually relevant factors are excluded because of the difficulty of finding appropriate proxies. The model includes five demand determinants: US real personal income (yr_us), US consumer price index (cpi_us), Japanese real personal income (yr_jp), Japanese exchange rate adjusted CPI (cpi_E_jp) and Hawaii average daily hotel room price (prm). The variables used throughout the text are described in Table 1 . All series are seasonally adjusted at a quarterly frequency and are expressed as natural logarithms, with the exception of the occupancy rate, which is expressed as a percentage.

Table 1.

Summary of variables in the Hawaii tourism model.

Mnemonic Description Units Source
Hawaii variables
vus US visitors to Hawaii thou DBEDT
vjp Japanese visitors to Hawaii thou DBEDT
prm Hawaii average daily hotel room rate dollars DBEDT
ocup Hawaii average daily hotel occupancy rate % DBEDT

US variables
yr_us US real personal income bil 82–84$ BEA
cpi_us US CPI (1982–1984 =  100) index BLS

Japan variables
yr_jp Japan real personal income bil 95Yen ESRI
cpi_jp Japan CPI (1995 =  100) index SBSC
xr_jp Yen/dollar exchange rate Yen/dollar FED

Calculated variable
cpi_E_jp cpi_jp/xr_jp index Authors’ calc.

Note: Except for the hotel occupancy rate, natural logarithms of each series are used in the analysis.

Sources: DBEDT: Department of Business, Economic Development and Tourism, State of Hawaii; BEA: Bureau of Economic Analysis, US; BLS: Bureau of Labor Statistics, US; FED: Federal Reserve Bank at St. Louis; ESRI: Economic and Social Research Institute, Japan; SBSC: Statistics Bureau and Statistics Center, Japan.

2.2. Tourism supply and pricing behavior

Both theoretical and empirical research on the supply side of tourism markets is scant (Crouch, 1994b, Dwyer and Forsyth, 2006). In much of the empirical tourism literature, supply is assumed to be perfectly elastic, and the parameters of demand relationships are estimated by Ordinary Least Squares (OLS). However, the infinite elasticity assumption is a convenient simplification rather than a tested hypothesis. Fujii et al. (1985) estimate the supply elasticity of Hawaii lodging services to be close to two, and it is not uncommon to observe sizable fluctuations in hotel room prices. Satisfactorily accounting for supply responses is therefore indispensable in deriving unbiased demand elasticities, 10 and supply behavior is of interest in its own right.

It is rather difficult to give a precise definition of tourism supply, considering the variety of products and services that tourists consume. 11 In this paper, we focus on the supply of accommodations, in part because lodging services represent the single largest component of visitor expenditures in Hawaii, and also because it is possible to obtain reliable data on hotel room prices. 12 Visitor accommodations are structure intensive; that is, they have high fixed costs relative to operating costs. As a result, in the short run there is a considerable incentive for the hotelier to fill an empty room by discounting the price to almost as low as the negligible marginal cost of filling a room, leading to price incentives during off-peak periods. 13 Over longer horizons, capacity is adjusted through expansion and contraction of room inventory.

One approach to modeling the room supply is to estimate an inverted supply curve. Examples of this appear in the hotel room tax literature (Bonham and Gangnes, 1996, Fujii et al., 1985). The supply price of hotel rooms is assumed to be a mark-up over the marginal cost,

PR=markupMC=MR(QR,PL,PK,PZ), (3)

where QR is the total quantity of rented rooms; PL,PK and PZ are the input prices of labor, capital and other inputs; and M is the markup factor.

Because the average length of stay and the number of visitors per room are relatively stationary, the number of hotel rooms rented will follow the same trend path as the number of visitors, and therefore visitor arrivals can be taken as a measure of the quantity of rented rooms. Here, for consistency with demand-side modeling, we use the sum of US and Japanese visitors (vus+vjp), ignoring the limited number of Hawaii visitors from other markets. The parameter on arrivals is a measure of the (inverse) long-run supply elasticity of lodging services. In the short term, the markup over the marginal cost should be sensitive to excess demand/supply conditions, as suggested above. We include the occupancy rate (ocup) as a proxy for these influences. It would be desirable to include Hawaii-specific input cost measures, but other than local wage rates, such measures do not exist. Considering the limited time span of available data (86 observations) and the number of variables already present in the demand model, we have elected to treat the US consumer price index (cpi_us) as a rough proxy for cost influences.

3. Empirical methodology

We model Hawaii tourism activity using a vector error correction framework. In this section we present the econometric framework, and describe the procedures used to identify the system and select a parsimonious model.

Consider the Vector Error Correction Model (VECM) for an m×1 vector of I(1) variables, zt,

Δzt=Πzt1+i=1k1ΓiΔzti+d+ϵt  ,t=1,2,,T, (4)

where k is the number of lags in the unrestricted VAR representation of zt, and d is an m vector of deterministic terms. 14 The equilibrium properties of (4) are characterized by the rank of Π. 15 If the elements of zt are I(1) and cointegrated with rank(Π)=r<m, Π can be decomposed into two m×r full column rank matrices α and β, where Π=αβ. This implies that there exist r<m stationary linear combinations of zt, such that ξt=βztI(0). 16 The matrix of adjustment coefficients, α, measures how strongly deviations from the long-run equilibrium, ξt, feed back into the system.

To select a parsimonious version of the VECM in Eq. (4), at least four types of restrictions are relevant: restrictions on the rank of the long-run matrix, Π; restrictions on the long-run cointegrating vectors, β; restrictions on the short-run dynamic coefficients, Γi; and restrictions on the loading parameters, α. Researchers have proposed various different ways to impose these restrictions (Johansen, 1988, Johansen, 1991, Johansen, 1995, Pesaran and Shin, 2001, Phillips, 1991, Phillips, 1995, Saikkonen, 1993a, Saikkonen, 1993b). However, Hall et al. (2002) argue that these approaches are almost impossible to implement in practice when a fairly rich specification encounters a limited sample size. The interaction of dynamic and long-run parameters has an enormous effect on the size and power of the statistical tests conventionally adopted. The Monte Carlo results of Hall et al. (2002) reveal that imposing valid weak exogeneity restrictions before testing the cointegrating rank generally improves the power of Johansen rank tests. At the same time, restricting the cointegrating rank has little impact on weak exogeneity tests, at least as long as the rank is not restricted to be less than the true rank.

We follow Hall et al. (2002) and apply the following pragmatic strategy in reducing our general VECM to a more parsimonious representation:

  • 1.

    Test and impose weak exogeneity restrictions;

  • 2.

    Test the rank of the long-run matrix, Π (cointegrating rank);

  • 3.

    Use Johansen’s reduced rank procedure to estimate the cointegrating vectors;

  • 4.

    Test and impose theory-based just- and over-identifying restrictions on the cointegrating vector, β;

  • 5.

    Estimate the complete dynamic model and simplify the dynamics. At this stage, the causality structure of the model is established by eliminating cointegrating vectors with insignificant loading parameters.

3.1. Testing weak exogeneity

A well known problem with VARs, and one which is particularly important in the identification of a VECM, is the prohibitively large number of parameters. Each equation involves estimating m×k lag coefficients, plus one or more parameters for the deterministic components. Even moderate values of m and k quickly exhaust typical samples for macroeconometric research.

One way to address the over-parameterization problem is to test and impose weak exogeneity assumptions. For each series that is treated as weakly exogenous, the number of equations in the system is reduced by one, and the number of parameters by (mk+d), where d is the number of deterministic components. For the HTM, if the external drivers (yr_us, yr_jp, cpi_us, cpi_E_jp) are treated as weakly exogenous, the number of equations is reduced from eight to four, and the number of parameters to estimate is reduced from 272 to 136.

To see the effect of weak exogeneity on the system, partition the m-vector of I(1) random variables zt into the n-vector yt and the q-vector xt, such that zt=(yt,xt) and q=mn. Our primary interest is in the structural modeling of yt, conditional on its own past values, yt1, yt2, , and the current and past values of xt. The parameters, matrices, and errors in the VECM equation (4) can be partitioned conformably as d=(dy,dx), α=(αy,αx), Γi=(Γyi,Γxi), i=1,2,,k1, ϵt =(ϵyt,ϵxt), and the variance–covariance matrix as

Ω=(ΩyyΩyxΩxyΩxx). (5)

The model is transformed into a conditional model for yt and a marginal model for xt,

Δyt=(dyωdx)+ωΔxt+(αyωαx)βzt1+i=1k1(ΓyiωΓxi)Δzti+(ϵytωϵxt), (6)
Δxt=dx+αxβzt1+i=1k1ΓxiΔzti+ϵxt, (7)

where ω=ΩyxΩxx1.

The parameters of interest, β, enter both the conditional model (6) and the marginal model (7), and the adjustment coefficients (αyωαx) depend on the covariance matrix, Ω, and all the adjustment coefficients (αy, αx). Therefore, the parameters of interest cannot be variation free, and a full system analysis is required. When the parameters of interest are the cointegrating vector βxt is weakly exogenous if and only if αx=0 (Johansen, 1991). The condition αx=0 causes β to drop out of the marginal distribution for xt in Eq. (7), and ensures that αx does not appear in the conditional model in Eq. (6). 17 Therefore, the conditional model (6) contains as much information about the cointegrating relationships, βzt1, as the full system, and analysis of the conditional model alone is sufficient.

Following Hall et al. (2002), once weak exogeneity restrictions are tested and imposed, we conduct Johansen rank tests. 18 The resulting tests have greater power than tests conducted without the theory-based exogeneity restrictions.

3.2. Restricting cointegrating vectors

Even with a known rank for the long run matrix, Π, an identification problem arises because the matrices α and β are not uniquely identified without additional information. Pesaran and Shin (2001) show that r2 restrictions (r restrictions per vector) are needed for exact identification. The most common approach to imposing the r2 identifying restrictions is Johansen’s statistical approach. Specifically, Johansen’s just identified estimator of β is obtained by selecting the r largest eigenvectors of the system, subject to “ortho-normalization” and “orthogonalization” restrictions. Pesaran and Shin (2001) criticize this approach as “pure mathematical convenience” rather than an economically justified approach. 19 They emphasize the use of economic theory to guide the choice of long-run exact- and over-identifying restrictions. The theory-guided approach takes Johansen’s just identified vector βJ as given, and replaces the “statistical” restrictions with ones that are economically meaningful.

In the following section we adopt the pragmatic reduction strategy of Hall et al. (2002), testing for weak exogeneity, testing for cointegrating rank, and applying theory-based exact- and over-identifying restrictions to the cointegrating vectors.

4. The Hawaii tourism model

Historical data for the HTM variables on a quarterly basis are available for the period 1980 to 2005. To preserve data for out-of-sample forecast evaluation, we identify the model using a truncated sample from 1980Q1 through 2001Q2. This choice maximizes our sample period for initial estimation and identification, avoids the difficult task of modeling the September 11, 2001 (9/11) shock to Hawaii tourism, and preserves a sufficiently large post-estimation period for out-of-sample forecast evaluation.

4.1. Weak exogeneity

We begin with the vector of eight variables, zt=(vus,vjp,prm,ocup,yr_us,cpi_us,yr_jp,cpi_E_jp) discussed in Section 2. We hypothesize that the four tourism variables, yt=(vus,vjp,prm,ocup), are endogenous, and the remaining external factors, xt=(yr_us,cpi_us,yr_jp,cpi_E_jp), are exogenous. For a system with eight variables, there can exist at most seven cointegrating vectors. 20 Following the strategy outlined in Section 3, we leave the cointegrating rank unrestricted (r=7) and test the null hypothesis, H0:αx=0, for each candidate exogenous variable. (That is, we exclude all cointegrating vectors from equations explaining the “theoretically” exogenous variables.) We cannot reject weak exogeneity of US real income, yr_us, or the exchange-rate-adjusted Japanese price level, cpi_E_jp; tests for both variables have marginal significance levels (p-values) in excess of 10% (see Table 2 , Panel 1). In contrast, weak exogeneity of both the US price level, cpi_us, and Japanese real income, yr_jp, is strongly rejected at the 1% level. 21

Table 2.

Weak exogeneity tests.

Δyt=(cyωcx)+ωΔxt+(αyωαx)βzt1+ik1(ΓyiωΓxi)Δzti+(ϵytωϵxt)(6)
H0:αx=0
Panel 1: rank(Π)=7
Variable χ2(7) p-value

yr_us 10.42 0.17
cpi_us 37.67 0.00
yr_jp 31.90 0.00
cpi_E_jp 11.02 0.14

Panel 2: Harbo weak exogeneity tests
Variable F-test p-value

yr_us 0.34 0.79
cpi_us 1.21 0.32
yr_jp 2.64 0.06
cpi_E_jp 0.03 0.99

Note: Column 1 lists the variables tested for weak exogeneity. Column 2 presents the χ2 statistic (F statistic in the case of Panel 2) for the null hypothesis of weak exogeneity. Column 3 presents the marginal significance level of the statistic in Column 2.

While αx=0 is a necessary and sufficient condition for weak exogeneity of xt with respect to β, this condition often proves to be too strong in practice, because exogenous variables may form cointegrating relationships among themselves (Pesaran, Shin, & Smith, 2000). In our case, because of macroeconomic relationships within and between the US and Japan, it is likely that our vector, xt=(yr_us,cpi_us,yr_jp,cpi_E_jp), of hypothesized weakly exogenous variables is cointegrated. 22 The rejection of αx=0 may occur due to cointegration among the exogenous variables, rather than because of their endogeneity for the parameters of interest in the HTM.

Nevertheless, weak exogeneity can still be tested, following the approach suggested by Harbo, Johansen, Nielsen, and Rahbek (1998). Instead of estimating the whole system and testing whether a subset of α is zero, they suggest estimating the conditional model alone and checking for weak exogeneity by adding the empirically derived cointegrating relationships to the marginal model. The null hypothesis of weak exogeneity implies that in the marginal model, the loading parameters on the estimated equilibrium relationships are insignificantly different from zero.

The results from the Harbo test for weak exogeneity are reported in Panel 2 of Table 2 (note that the cointegrating vectors used in these tests are identified in Section 4.3 below). The first differences of the “exogenous” variables (Δyr_us, Δcpi_us, Δyr_jp and Δcpi_E_jp) are each regressed on the lagged first differences of all variables, the three identified cointegrating vectors and a constant. We test the joint null hypothesis that the loading parameters on all three cointegrating vectors are insignificantly different from zero in each equation in the marginal system. F-tests for this null hypothesis are presented in Panel 2 of Table 2. We do not reject the null hypothesis of weak exogeneity for any of the variables in the xt=(yr_us,cpi_us,yr_jp,cpi_E_jp) vector at the 5% marginal significance level. Below, we test for the rank of the cointegrating space, subject to these weak exogeneity restrictions.

4.2. Cointegrating rank

Imposing the weak exogeneity restrictions tested above, we proceed to test the rank of the long run matrix, Π, using Johansen’s reduced rank methodology. Table 3 reports the test statistics and the corresponding asymptotic critical values at the 5% and 10% significance levels, as tabulated in Table T.4 of Pesaran et al. (2000) for a system with four weakly exogenous variables.

Table 3.

Cointegration rank tests.

Trace test Max eigenvalue test
H(r) Statistic 0.05 0.10 Statistic 0.05 0.10

r=0 139.30 99.11 93.98 52.82 43.75 41.01
r1 86.48 69.84 65.90 42.80 37.44 34.66
r2 43.68 45.10 41.57 23.78 30.55 27.86
r3 19.90 23.17 20.73 19.90 23.17 20.73

Note: Column 1 lists the null hypothesis of zero, or at most one, two, three, or four cointegrating vectors; Column 2 lists the trace statistic; Column 3 and 4 are the critical values for the trace statistic at the 5% and 10% significance levels from Table T.4 of Pesaran et al. (2000); Column 5 lists the maximum eigenvalue statistic; Column 6 and 7 are the critical values for the maximum eigenvalue statistic at the 5% and 10% significance levels from Table T.4 of Pesaran et al. (2000); bolded numbers indicate significance at the 10% level.

Based on both the trace and maximum eigenvalue statistics, we reject the null of both zero and one or fewer cointegrating vectors at the 5% significance level. The null of two or fewer cointegrating vectors is rejected at the 10% level using the trace test, but not the maximum eigenvalue test. Because of the potential for three cointegrating relationships, and our objective of modeling one supply and two demand relationships, we proceed under the assumption that the system has three cointegrating vectors.

4.3. Long-run cointegrating vectors

Our goal here is to identify three long run equilibrium relationships, βzt1, where β is the 3×9 matrix of unrestricted cointegrating parameters. 23 We test theory-driven restrictions under the assumption that the three cointegrating vectors (β1, β2, and β3) represent the demand for tourism services by US visitors, demand by Japanese visitors, and a pricing rule for Hawaii visitor accommodations. To obtain the just-identified system, we impose r=3 restrictions per equation reported in Table 4 . In the cointegrating vector representing US visitor demand, Eq. (8), we normalize on US visitor arrivals, vus, and exclude Japanese visitor arrivals (β1,2=0) and Japanese real income, (β1,7=0). In the vector representing Japanese visitor demand (9), we normalize on Japanese visitor arrivals, and exclude US visitor arrivals (β2,1=0) and US real income (β2,5=0). Finally, for the supply vector (10) we normalize on the hotel room price, and exclude both US and Japanese income (β3,5,β3,7=0). Parameter estimates are reported in Table 4. This system of equations serves as the starting point for the tests of over-identifying restrictions presented below.

Table 4.

Just-identified system.

US visitor demand
vus=β1,3prm+β1,4ocup+β1,5yr_us+β1,6cpi_us+β1,8cpi_E_jp+β1,9t+ξDus(8)

β1,3 β1,4 β1,5 β1,6 β1,8 β1,9
−9.58 −4.19 25.89 16.44 0.46 −0.19
(2.70) (4.52) (7.26) (8.66) (0.73) (0.09)

Japanese visitor demand
vjp=β2,3prm+β2,4ocup+β2,6cpi_us+β2,7yr_jp+β2,8cpi_E_jp+β2,9t+ξDjp(9)

β2,3 β2,4 β2,6 β2,7 β2,8 β2,9
−1.83 1.70 −3.04 4.82 0.02 0.03
(0.44) (0.60) (1.15) (0.89) (0.12) (0.01)

Accommodations pricing
prm=β3,1vus+β3,2vjp+β3,4ocup+β3,6cpi_us+β3,8cpi_E_jp+β3,9t+ξS(10)

β3,1 β3,2 β3,4 β3,6 β3,8 β3,9
0.44 0.43 1.89 −0.24 −0.26 0.01
(0.11) (0.11) (0.46) (0.59) (0.09) (0.00)

log likelihood = 1288.23

Note: Each column presents parameter estimates, with standard errors in parentheses. Computations are carried out using PcGive 10.

4.3.1. US tourism demand

To identify a US tourism demand relationship, we test four over-identifying restrictions. We test exclusion restrictions on the occupancy rate, ocup, and Japanese consumer prices, cpi_E_jp; a homogeneity restriction on the hotel room price, prm, and the US consumer price index, cpi_us, (β1,3=β1,6); and a restriction on the magnitude of the US income elasticity. Note that the income elasticity in the just-identified US demand relationship is implausibly large and estimated quite imprecisely. While the tourism literature often reports income elasticities in excess of two, we restrict the elasticity to the smallest statistically acceptable value, because we expect that a larger income elasticity will adversely impact the forecasting performance of the HTM. We cannot reject the restriction that the US real income elasticity (β1,5) is 3.5, but smaller values are rejected. The estimated relative price elasticity of −0.55 is well within the range of estimates reported in the literature. 24

The comparison of price elasticity estimates is more difficult because of the many alternative price measures used. Witt and Witt (1995) report a median own price elasticity of −0.7 for studies using destination cost. Sheldon’s (1993) results imply a median destination price elasticity of −1.2, and an exchange rate elasticity of −1.6. Again, the range of price elasticity estimates is very large, ranging from −7.3 to 1.6 for destination prices, and from −7.6 to 4.1 for exchange rates. The resulting US demand relationship is presented in Table 5 .

Table 5.

Over-identified system.

US visitor demand
vus=β1,3prm+β1,4ocup+β1,5yr_us+β1,6cpi_us+β1,8cpi_E_jp+β1,9t+ξDus(8)

β1,3 β1,4 β1,5 β1,6 β1,8 β1,9
−0.55 0 3.5 0.55 0 −0.02
(−0.31) (0.0) (0.31) (0.00)

Japanese visitor demand
vjp=β2,3prm+β2,4ocup+β2,6cpi_us+β2,7yr_jp+β2,8cpi_E_jp+β2,9t+ξDjp(9)

β2,3 β2,4 β2,6 β2,7 β2,8 β2,9
−0.37 0 0 2.23 0.37 0
(−0.09) (0.13) (0.09)

Accommodations pricing
prm=β3,1vus+β3,2vjp+β3,4ocup+β3,6cpi_us+β3,8cpi_E_jp+β3,9t+ξS(10)

β3,1 β3,2 β3,4 β3,6 β3,8 β3,9
0.54 0.13 1.83 0 0 0.01
(0.10) (0.07) (0.42) (0.00)

log likelihood = 1279.23
LR-test, χ2(10) =  18.01 [0.055]

Note: Each column presents parameter estimates, with standard errors in parentheses. The last panel of the table presents the likelihood ratio test for the joint null that all over-identifying restrictions are valid. The marginal significance levels for this test are in brackets. Computations are carried out using Pc-Fiml 9.10.

4.3.2. Japanese tourism demand

To identify a Japanese tourism demand equation, we test four over-identifying restrictions similar to those used for US tourism demand. We test exclusion restrictions on the hotel occupancy rate, ocup, US consumer prices, cpi_us, and the time trend. In addition, we test one homogeneity restriction on the hotel room price and Japanese prices, prm and cpi_E_jp, (β2,3=β2,8). The Japanese income elasticity is left unrestricted, as there is no clear indication in the literature of a consensus estimate, and the estimated elasticity is an economically reasonable 2.23. The relative price elasticity estimate of −0.37 falls well within the range of other studies. The resulting Japanese demand relationship is presented in Table 5.

4.3.3. Accommodations pricing (supply)

To identify a room pricing relationship for visitor accommodations, we test two over-identifying restrictions; we exclude both the US and Japanese price levels (cpi_us and cpi_E_jp). While we have eliminated any possible proxy for production costs, both the occupancy rate and the number of visitors have the expected sign, and it is possible that the deterministic trend and/or occupancy rate also proxy for production costs. We tested and rejected the restriction that US and Japanese visitors enter the supply equation with the same coefficient. The implied weighted-average supply price elasticity is 2.4, similar to the estimate of approximately 2 found by Fujii et al. (1985). The resulting pricing equation is presented in Table 5.

Taken as a group, we cannot reject these over-identifying restrictions at the 5% level. The likelihood ratio statistic for the joint test of all over-identifying restrictions has a value of 18, and a marginal significance level of 5.5%. (The relatively low significance level is primarily a result of the large magnitude of the restriction on the US income elasticity.) The overidentified cointegrating relationships presented in Table 5 represent the long-run equilibria of the system. Below we further restrict the system by testing and imposing zero restrictions on system dynamics.

4.4. The dynamic model

At this stage, the dynamics are simplified by dropping the statistically insignificant terms from Eqs. (8)–(10). This involves excluding first differenced terms with t-values less than 2, starting from the smallest. The error correction terms are eliminated by the same criterion. A total of 58 zero restrictions are applied. The joint test of all zero restrictions produces a χ2 statistic of 30.75, with a p-value of 0.99. We do not reject these exclusion restrictions at the 1% level. The estimated loading parameters and corresponding diagnostic test statistics are shown in Table 6 .

Table 6.

Dynamic model: Loading parameters and diagnostics.

Equation α1 α2 α3 R¯2 AR1-5 Normality Arch
Δvus −0.11 0.33 0.51 2.25 2.05 0.32
(−5.18) (5.04) [0.07] [0.36] [0.86]

Δvjp −0.34 −.13 0.62 2.32 1.68 0.19
(−4.57) (−1.57) [0.06] [0.43] [0.94]

Δprm −0.09 −0.16 0.46 2.15 0.83 1.06
(−5.06) (−4.43) [0.08] [0.66] [0.38]

Δocup −0.02 −0.10 0.22 0.65 2.21 1.25 0.35
(−1.25) (−3.59) (5.86) [0.07] [0.53] [0.84]

Log likelihood = 1263.85
LR-test, χ2(58) =  30.75 [0.99]

Note: Column 1 lists the dependent variable for each equation in the system; Columns 2–4 give the loading parameters, α1α3, and the corresponding Student t-statistics for the three identified cointegrating vectors; Column 5 presents the coefficient of determination R2; Column 6 gives the F-test results (and corresponding p-values) for the null hypothesis that the equation residuals are independent up to lag 5. Column 7 contains the results of a χ2 test (with p-values) for the null hypothesis that the regression residuals are normally distributed. Column 8 is a test for the null that the residuals do not exhibit autoregressive conditional heteroscedasticity (ARCH) (Engle, 1982). Figures in parentheses (.) are the Student t-statistics corresponding to the loading parameters, whereas those in square brackets [.] are p-values for individual tests. All computations are carried out using Pc-Fiml 9.10, with the exception of the R2 values, which are calculated using RATS v 5.0.

The estimated system appears to be an adequate model for Hawaii tourism activity. All equations perform reasonably well, explaining 51%, 62%, 46%, and 65% of the variation in Δvus, Δvjp, Δprm, and Δocup, respectively. All equations pass all diagnostic tests at the 5% significance level. The existence of long-run equilibrium error terms in the model equations allows for a temporary disequilibrium between causal variables and the demand and supply variables. The adjustment factors (αs) capture the speed of adjustment toward the equilibrium relationship. For example, if US arrivals are less than predicted by US real income growth and the relative cost of a Hawaii vacation, arrivals would increase over time to eliminate the disequilibrium error. The three long-run equilibrium errors enter the four equations differently. The equation for US visitor growth, Δvus, contains both ξDus and ξS. The loading parameter on the US demand equilibrium error, ξDus, is −0.11, so 11% of the equilibrium error is corrected each period. In the equation for Japanese visitor growth, Δvjp, the equilibrium error associated with Japanese visitor demand, ξDjp, enters with a coefficient of −0.34, implying a complete adjustment towards equilibrium in slightly less than three quarters. The equilibrium errors for US demand, ξDus, and the pricing rule, ξS, enter the hotel room price equation, Δprm, while all three errors enter the equation for the change in hotel occupancy, Δocup. In the case of the Japanese visitor demand equation, the equilibrium error for the pricing rule, ξS, is retained, despite the fact that its t-value (−1.57) is below the 5% critical value. The same is true for the US demand equilibrium error in the occupancy rate equation. In both cases excluding these equilibrium errors led to a rejection of the null hypothesis of no serial correlation (up to lag 5) in the equation residuals. To more fully evaluate the performance of the HTM, we perform an out of sample forecast evaluation below.

5. Forecast evaluation

This section evaluates the forecasting performance of the newly identified HTM. To preserve data for an out-of-sample forecast evaluation, we identified the HTM and its rivals using a truncated sample from 1980Q1 through 2001Q2. This also allowed us to avoid the difficulty of modeling the significant shock to Hawaii tourism from the September 11, 2001 terrorist attacks. In the years since 9/11, Hawaii tourism has also been adversely affected by terrorism worries, anthrax scares, the invasion of Afghanistan followed by the Iraq War, and the outbreak of Severe Acute Respiratory Syndrome and Avian flu. As a result, the period since 9/11 represents a particularly challenging one for forecasting. 25

We compare forecasts from the HTM with those from three rival models. Two are VARX systems (vector autoregressions with exogenous variables), one in log levels (LVARX) and the other in log first differences (DLVARX). The LVARX in levels admits the possibility of cointegration, but does not impose cointegrating restrictions as in the HTM. The DLVARX in differences precludes the possibility of cointegration, but has the advantage of converting some forms of structural change into one period shocks. In both cases, the rival models are identified (one equation at a time) using the model selection algorithm in PcGETS 10.3 (Hendry & Krolzig, 2001). 26 Specifically, we construct a Generalized Unrestricted Model (GUM) in log levels (or differences) for each endogenous variable in the HTM just-identified system. That is, each endogenous variable is explained by up to four lags (three in the differenced model) of each of the four endogenous and four weakly exogenous variables, and we make use of the theory-motivated exclusion restrictions used in the just-identified model presented in Table 4. 27 PcGETS is used in its default “liberal-testimation” mode. In the liberal mode, significance levels are adjusted to minimize the non-selection probability, i.e., keep as many of the GUM variables as possible, at the risk of retaining irrelevant variables more often. Trivedi (1984) characterized such an algorithm as “testimation.”

The third competing model is a system of Autoregressive Integrated Moving Average (ARIMA) models selected automatically, based on an algorithm suggested by Hannan and Rissanen (1982). 28 The HR approach selects the ARIMA(p,1,q) model that minimizes the residual variance, with a penalty for higher autoregressive and moving average orders, p and q.

While it is common in the forecasting literature, both generic and tourism specific, to compare model forecasts with a naive no change or random walk forecast, it is important to note that both the GETS and ARIMA model selection algorithms can chose the random walk specification. In fact, minimizing the HR criterion results in an ARIMA(0, 1, 0)–random walk–model being selected for the hotel room price.

As described above, we initially identified each model specification over the sample period 1980Q1–2001Q2. Each model is then used to generate dynamic forecasts from four to twelve steps ahead. The sample is then rolled forward one quarter, and another set of four- through twelve-step-ahead dynamic forecasts are generated. 29 We obtain 12 four-step-ahead, 8 eight-step-ahead and 4 twelve-step-ahead dynamic forecasts.

Rather than simply rank the rival models based on loss functions such as the mean absolute percent error (MAPE) or mean squared error (MSE), we test the accuracy of the out-of-sample forecasts from the HTM relative to the accuracy of the LVARX and DLVARX competitors using the Harvey, Leybourne, and Newbold (1998) modification of the Diebold and Mariano (1995) test.

We construct pairwise rankings between the HTM and its three competitor models for each endogenous variable: US visitor arrivals, Japanese visitor arrivals, the hotel room price, and the occupancy rate. Table 7, Table 8, Table 9 present MAPE and MSE rankings, along with Diebold and Mariano (DM) test results, for the four-, eight-, and twelve-step-ahead forecasts respectively. The second column of Table 7, Table 8, Table 9, labeled HTM, shows the p-values for the DM test of the null hypothesis that the HTM model forecast is equal in accuracy to its competitor (listed in Column 1), versus the alternative hypothesis that the HTM forecast has a lower MSE. Tests with the opposite alternative hypothesis are presented in the row labeled HTM. That is, the null hypothesis is that the HTM model forecast produces the same MSE as its competitor, and the alternative hypothesis is that the competitor MSE is lower. For all tests, p-values below 5% are underlined.

Table 7.

4-step-ahead forecast comparisons 2001:3–2005:1.

Model j H0:MSEi=MSEj vs Ha:MSEi<MSEj
MSE MAPE
Model i
HTM LVARX DLVARX ARIMA
US visitors
HTM 0.999 0.212 0.373 0.0102 0.0115
LVARX 0.001 0.0194 0.0183
DLVARX 0.788 0.0058 0.0100
ARIMA(3, 1, 2) 0.627 0.0098 0.0112

JP visitors
HTM 0.951 0.970 1.000 0.0325 0.0276
LVARX 0.049 0.0455 0.0305
DLVARX 0.030 0.0778 0.0334
ARIMA(0, 1, 1) 0.000 0.1280 0.0583

Room price
HTM 0.999 0.001 1.000 0.0008 0.0044
LVARX 0.000 0.0021 0.0083
DLVARX 0.999 0.0007 0.0032
ARIMA(0, 1, 0) 0.000 0.0164 0.0255

Occupancy
HTM 0.368 0.557 0.0422 0.0023 0.0492
LVARX 0.632 0.0018 0.0455
DLVARX 0.443 0.0024 0.0576
ARIMA(2, 1, 3) 0.958 0.0014 0.0112

Note: Each panel presents results for a different target variable. In each case, column 1 lists the competitor model j, and columns 6–7 present the Mean Squared Error (MSE) and Mean Absolute Percentage Error (MAPE) for the model j forecasts. Columns 2–5 list competitor models i and present the p-value for a test of the null hypothesis that H0:MSEi=MSEj versus the alternative hypothesis, Ha:MSEi<MSEj. Thus, p-values below the conventional 5% significance level in column 2 indicate a rejection of the hypothesis that the MSE of the HTM forecast is equal to its competitor forecast from model j in favor of the alternative that the HTM forecast produces a smaller MSE. For each forecast target, the minimum MSE and MAPE are underlined, as are p-values below the conventional 5% significance level.

Table 8.

8-step-ahead forecast comparisons 2001:3–2005:1.

Model j H0:MSEi=MSEj vs Ha:MSEi<MSEj
MSE MAPE
Model i
HTM LVARX DLVARX ARIMA
US visitors
HTM 1.000 0.830 0.190 0.0156 0.01583
LVARX 0.000 0.0432 0.02794
DLVARX 0.170 0.0186 0.01813
ARIMA(3, 1, 2) 0.810 0.0146 0.01604

JP visitors
HTM 0.997 0.947 1.000 0.0551 0.03644
LVARX 0.003 0.0854 0.0447
DLVARX 0.053 0.1064 0.0387
ARIMA(0, 1, 1) 0.000 0.1587 0.0659

Room price
HTM 0.968 0.461 1.000 0.0011 0.0049
LVARX 0.032 0.0025 0.0084
DLVARX 0.539 0.0010 0.0043
ARIMA(0, 1, 0) 0.000 0.0185 0.0271

Occupancy
HTM 0.999 1.000 0.006 0.0028 0.0607
LVARX 0.000 0.0059 0.08904
DLVARX 0.000 0.0071 0.1049
ARIMA(2, 1, 3) 0.994 0.0009 0.0307

Note: Each panel presents results for a different target variable. In each case, column 1 lists competitor model j, and columns 6–7 present the Mean Squared Error (MSE) and Mean Absolute Percentage Error (MAPE) for the model j forecasts. Columns 2–5 list competitor models i and present the p-values for a test of the null hypothesis that H0:MSEi=MSEj versus the alternative hypothesis, Ha:MSEi<MSEj. Thus, p-values below the conventional 5% significance level in column 2 indicate a rejection of the hypothesis that the MSE of the HTM forecast is equal to its competitor forecast from model j in favor of the alternative that the HTM forecast produces a smaller MSE. For each forecast target, the minimum MSE and MAPE are underlined, as are p-values below the conventional 5% significance level.

Table 9.

12-step-ahead forecast comparisons 2001:3–2005:1.

Model j H0:MSEi=MSEj vs Ha:MSEi<MSEj
MSE MAPE
Model i
HTM LVARX DLVARX ARIMA
US visitors
HTM 1.000 0.966 0.033 0.0356 0.0212
LVARX 0.000 0.0566 0.0331
DLVARX 0.034 0.0433 0.0278
ARIMA(3, 1, 2) 0.967 0.0231 0.0212

JP visitors
HTM 1.000 1.000 1.000 0.0266 0.0272
LVARX 0.000 0.0551 0.0395
DLVARX 0.000 0.1356 0.0454
ARIMA(0, 1, 1) 0.000 0.1140 0.0568

Room price
HTM 0.455 0.038 1.000 0.0032 0.0089
LVARX 0.545 0.0031 0.0108
DLVARX 0.962 0.00122 0.0050
ARIMA(0, 1, 0) 0.000 0.0188 0.0272

Occupancy
HTM 1.000 1.000 0.0000 0.0022 0.0570
LVARX 0.000 0.0085 0.1160
DLVARX 0.000 0.0111 0.1218
ARIMA(2, 1, 3) 1.000 0.0009 0.0357

Note: Each panel presents results for a different target variable. In each case, column 1 lists competitor model j, and columns 6–7 present the Mean Squared Error (MSE) and Mean Absolute Percentage Error (MAPE) for the model j forecasts. Columns 2–5 list competitor models i and present the p-values for a test of the null hypothesis that H0:MSEi=MSEj versus the alternative hypothesis, Ha:MSEi<MSEj. Thus, p-values below the conventional 5% significance level in column 2 indicate a rejection of the hypothesis that the MSE of the HTM forecast is equal to its competitor forecast from model j in favor of the alternative that the HTM forecast produces a smaller MSE. For each forecast target, the minimum MSE and MAPE are underlined, as are p-values below the conventional 5% significance level.

For the relatively short-term four-step-ahead forecasts, the HTM produces forecasts with a statistically lower MSE for three out of four comparisons with the LVARX model, one out of four comparisons with the DLVARX, and two of four comparisons with the ARIMA model forecasts. In contrast, the DLVARX and ARIMA forecasts produce MSEs that are significantly smaller than the HTM only once each, while the LVARX model never statistically dominates the HTM.

The results are much the same when forecasting over longer horizons. For the sample of eight-step-ahead forecasts, the HTM forecasts statistically dominate the LVARX forecasts in three out of four comparisons, the DLVARX forecasts once, and the ARIMA forecasts twice. In only one case, the ARIMA model forecast of the occupancy rate, does a competitor forecast result in a MSE statistically smaller than the HTM forecasts. The same basic conclusion holds when evaluating the relatively small sample (four forecasts) of twelve-step-ahead forecasts. The HTM produces forecasts with MSEs that are statistically smaller than those of the LVARX and DLVARX forecasts in three of four cases, and statistically dominate the ARIMA forecasts in two of three cases. The DLVARX model again produces the best forecast for the hotel room price, and the ARIMA forecasts are statistically more accurate than the HTM in two cases.

In summary, the HTM model produced statistically more accurate forecasts than the LVARX, DLVARX, and ARIMA models in 83%, 42%, and 50% of the comparisons. The same three models produced statistically more accurate forecasts than the HTM model in 0%, 17%, and 30% of the comparisons. Much of the literature that has examined the accuracy of tourism forecasting models has found that sophisticated methods, including error correction models, often produce forecasts which are no better than or inferior to those from relatively simple ARIMA models. 30 In the case of the HTM, based on formal tests of accuracy, we find that the HTM routinely outperforms other VARX competitors and has a slight advantage over its ARIMA competitor. In forecasting both Japanese visitor arrivals and room prices, the HTM dominates the ARIMA model at each forecast horizon, while the ARIMA model dominates the HTM for all forecasts of the occupancy rate. The relatively good performance of the VECM model suggests that the methodology warrants further study.

6. Concluding remarks

Cointegration analysis and error-correction modeling have become standard components of the economic modeling and forecasting toolkit. However, the application of these tools in a system setting introduces challenges, including identifying economically meaningful structural relationships and choosing an appropriate strategy for model reduction. These problems are particularly challenging given the limited data samples often available in practice.

In this paper, we apply Hall et al.’s (2002) theory-directed sequential reduction method to select a vector error correction model (VECM) for forecasting Hawaii tourism. We test and impose theory-based weak exogeneity assumptions at the earliest stage in the model reduction process. By doing so, the number of parameters to be estimated is greatly reduced, saving degrees of freedom and improving the efficiency of the estimated coefficients.

To our knowledge, ours is the first paper in the empirical tourism literature to tackle the important problem of identifying both demand- and supply-side relationships in a cointegrated system. The theory-guided approach has intuitive appeal, and we identify economically meaningful cointegrating vectors. For tourism activities in Hawaii, the paper identifies one demand relationship each for US and Japanese visitors, and an inverse supply curve for average hotel room prices. By formally incorporating the supply side, the Hawaii tourism model is less vulnerable to endogeneity biases caused by neglecting demand and supply interactions.

We perform out of sample forecast comparisons against two competing VARs which were identified automatically (one equation at a time) using the model selection algorithm in PcGETS 10.3 (Hendry & Krolzig, 2001), as well as an ARIMA alternative. Based on the Diebold and Mariano (1995) tests for forecast accuracy, the HTM routinely outperforms other VARX competitors, but has an advantage over its ARIMA competitor in fewer cases. The failure to consistently outperform simple ARIMA models is consistent with the findings of many previous studies, but the common observation bears repeating, that structural models like the HTM provide additional benefits over pure time series models by providing structural insight and by permitting scenario analysis.

Hall et al.’s (2002) pragmatic approach to identifying structural relationships in a VECM appears to hold promise for tourism modeling and forecasting, as well as for other settings where there are important sources of endogeneity and where available data samples are limited. The approach can be applied wherever the model’s scope can be restricted to a limited number of cointegrating relationships, for example in analyzing demand (and supply) behavior for an overall visitor market, or distinguishing the behavior of two or perhaps three separate markets, for example domestic and international, within a region and outside of the region, etc. And certainly there are markets like Hawaii where one or two source countries dominate arrivals; an example is Las Vegas, where in 2007 88% of visitors were domestic and only 12% were from foreign countries.

The high data demands of this system approach impose some restrictions on its application. As we have noted above, in order to keep the model size manageable we were forced to limit the scope of our analysis to a few key determinants of tourism flows. In particular, we were not able to include some of the demand shifters that often appear in the empirical tourism literature, such as prices of tourism services in competing destinations and policy measures such as tourism marketing expenditures. Our necessarily simpler modeling of the demand side is a limitation of this analysis.

For similar reasons, the VECM approach may become impractical in cases where the practitioner wants to estimate relationships for a relatively large number of separate markets. Even here, one could use a VECM approach to model overall visitor activity, to explicitly allow for endogeneity, and then in a second stage estimate a separate set of equations to allocate activity across the destination or source markets. Regardless of the empirical strategy adopted, our results suggest that it is important to address price endogeneity in some way whenever modeling activity in the tourism industry.

Biographies

Carl Bonham studied economics at Tulane University and received his Ph.D. in economics from the University of Texas at Austin in 1989, where he was a University Fellow. He has taught economics at the University of Hawaii since 1989. He serves on the State of Hawaii Council on Revenues and is the Director of the University of Hawaii Economic Research Organization.

Byron Gangnes received his B.A. in Economics from the University of Puget Sound. He received his Ph.D. in Economics from the University of Pennsylvania in 1990 where he was a University Fellow and a Research Fellow with Project LINK. Dr. Gangnes has taught at the University of Hawaii and the Singapore Management University, and was a visiting fellow at the Project Link Research Center at the University of Toronto Institute for Policy Analysis.

Ting Zhou did her undergraduate work in International Economics at Fudan University in Shanghai and received a Ph.D. in Economics from the University of Hawaii at Manoa in 2003, where she was also a research assistant at the University of Hawaii Economic Research Organization (UHERO). Her fields of specialty included applied dynamic econometric analysis, regional econometric modeling, and macroeconomics. Following her graduation she worked with Evolution Capital Management LLC, an Asian focused multi-strategy hedge fund, where she was the Head of Operations and Risk. Since 2007 Ms. Zhou has concentrated on China macroeconomics and policy and helps managing a greater China portfolio (Mainland, Hong Kong and Taiwan) within the fund.

Footnotes

3

See Banerjee, Dolado, Galbraith, and Hendry (1993) for a discussion of finite sample endogeneity bias in error correction models.

4

There are a number of comprehensive reviews of tourism demand modeling and forecasting, including Crouch, 1994a, Crouch, 1994b, Li, Song, and Witt (2005), Lim, 1997, Lim, 1999, Song and Li (2008) and Witt and Witt, 1992, Witt and Witt, 1995. Methodology overviews include Archer (1994) and Frechtling (2001).

5

In recent years, hybrid versions of error-correction models have appeared in the tourism literature, including a time-varying parameter ECM (Li, Wong, Song, & Witt, 2006), a linear AIDS–ECM approach (De Mello and Fortuna, 2005, Durbarry and Sinclair, 2003, Li et al., 2004), and a combination of the two (Li, Song, & Witt, 2006).

6

Generally speaking, personal income or consumption expenditures are used to model leisure and holiday travel, while gross domestic or national product and national disposable income are used to model business travel. As for the choice between nominal and real incomes, Eqs. (1), (2) make it clear that both are acceptable, provided that prices are specified accordingly. A per capita income specification is justified by Witt and Witt (1995) as a solution to the multicollinearity problem when both income and population are used to measure market size.

7

This practice is sometimes criticized on the grounds that “the cost of living for local residents does not always reflect the cost of living for foreign visitors to that destination, especially in poor countries” (Song & Witt, 2000). Occasionally tourism-specific prices are employed. For example, Gangnes and Bonham (1998) use the hotel room price. Others (Edwards, 1995, Martin and Witt, 1987, Witt and Witt, 1992) argue against the use of tourism-specific indices because their coverage may be suspect, and there is little evidence of superior performance.

8

Martin and Witt (1987) report that the CPI-based real exchange rate is a good proxy for tourism cost, while the nominal exchange rate itself is not. Some studies (Kim and Song, 1998, Song et al., 2000) include real exchange rates from a number of competing countries, while others (Vogt & Wittayakorn, 1998) use a single weighted real exchange rate. Some authors (Lathiras and Siriopoulos, 1998, Vogt and Wittayakorn, 1998) argue that nominal exchange rates should be included separately from source and destination price levels, because tourists may respond very differently to them.

9

Song and Witt (2000) suggest using “representative air fares between origin and destination for air travel,” as in Crouch (1991) and Fujii, Khaled, and Mak (1985)Gangnes and Bonham (1998) reject such a practice on the ground that “frequent discounting and package trips” imply a significantly lower actual price than published fares. Edwards (1995) uses International Air Transport Association (IATA) data on revenues per passenger ton/km. Perhaps because of data limitations, Li et al. (2005) report that only about 30% of recent tourism demand models included a measure of travel cost.

10

This point is also made by Li et al. (2005) and Sinclair and Stabler (1997).

11

The difficulty defining tourism supply is discussed by Sinclair and Stabler (1997) and Smith (1998). This issue has been a key challenge in the development of an international system of tourism satellite accounts (United Nations Statistics Division, 2008).

12

Visitors to Hawaii have spent an average of 33% of total expenditures on hotel lodging services over the past three decades.

13

A similar argument is made by Blair, Mak, and Bonham (2007). There is ample evidence that Hawaii hoteliers price according to market conditions. For example, there was aggressive discounting following the 9/11 terror attacks. (Pacific Business News, 2002).

14

In addition to an intercept, pulse-type dummy variables are included for the 1985 United Airlines strike and the 1991 Persian Gulf War. The estimation period ends in 2001Q2, so we do not need to include a dummy for the 9/11 terror attacks.

15

In Eq. (4), Π=Ini=1kΦi, Γi=j=i+1kΦj, i=1,,k1, where Φi, i=1,2,,k, are m×m matrices of unknown parameters in the unrestricted VAR representation of zt. We assume that the roots of InΦ1λΦ2λ2Φkλk=0 lie either on or outside the unit circle, but rule out the possibility that one or more elements of zt are I(2). A review of the econometric analysis of I(2) variables is provided by Haldrup (1998).

16

If all elements of zt are stationary, Π is a full rank m×m matrix. If the elements of zt are I(1) but not cointegrated, Π is rank zero and a VAR model in first differences is appropriate.

17

Two conditions must be satisfied for xt to be weakly exogenous (Hall et al., 2002). (1) The parameters of interest must be functions of the parameters in the conditional model alone. (2) The parameters in the conditional model and the parameters in the marginal model must be variation-free; that is, they do not have any joint restrictions.

18

The methodology for testing the rank of Π is well known, and is addressed in standard graduate level econometrics texts (see, e.g., Davidson & MacKinnon, 2004); for this reason it will not be covered here.

19

Another non-theoretical method of identification is the triangularization approach of Phillips, 1991, Phillips, 1995.

20

We treat all variables in the HTM as I(1). Zhou, Bonham, and Gangnes (2004) report augmented Dickey Fuller (Dickey and Fuller, 1979, Dickey and Fuller, 1981), Perron (1990) and Schwert (1989) tests for unit roots in each variable studied here. We select the lag length for our initial VAR by estimating a VAR in levels with a maximum of five lags, and sequentially reducing the lag length by one lag until we maximize the Schwarz information criterion, subject to non-rejection of the null hypothesis of no serial correlation up to lag 6. We select a lag length of 4. (The results of these tests are available from the authors on request.)

21

Because weak exogeneity depends on the model specification, Hall et al. (2002) suggest exogenizing any non-rejecting weakly exogenous variables and re-testing the remaining variables. Treating yr_us and cpi_E_jp as weakly exogenous, we re-estimate the system with six endogenous variables, two exogenous variables, and five unrestricted cointegrating vectors, r=5. The test results (not shown) continue to strongly reject the null hypothesis of weak exogeneity of both cpi_us and yr_jp at less than the 1% significance level.

22

Using a restricted trend, unrestricted intercept VAR specification, we cannot reject the hypothesis that there is at least one cointegrating relationship among the four variables.

23

Following Pesaran et al. (2000), we allow for an unrestricted intercept in the VECM (4) and restrict the time trends to lie in the cointegrating space. We can then test the hypothesis that the time trend can be excluded from the cointegrating vectors.

24

Witt and Witt (1995) find that income elasticities tend to exceed unity, consistent with the notion that international travel is a luxury good. For a sample of fourteen models from four studies, they report a median income elasticity of 2.4. Edwards (1995) obtains an income elasticity of 5 for US travelers to the Asia-Pacific region. Sheldon (1993) surveys ten econometric studies of tourism expenditures from 1966 to 1987 for a wide range of source–destination pairs, including US travel to Canada, Europe and Mexico; Canadian tourism to the US and other countries; and US destination tourism by major foreign countries. She finds a large range of income elasticities (from −0.15 to 6.6), with a median of 2.2.

25

See Bonham, Edmonds, and Mak (2006) for an analysis of the impact of 9/11 and other shocks on US and Hawaii tourism.

26

See Krolzig (2003) for an evaluation of the use of the PcGETS algorithms to identify structural VARs.

27

As for the HTM, we also include dummy variables for the 1985 United Airlines strike and the 1991 Persian Gulf War.

28

For a detailed discussion and applications of the identification procedure employed here, see Granger and Newbold (1986) and Hannan and McDougall (1988).

29

All models are re-estimated, but not re-selected, once every four quarters. Estimation results are not reported here, but are available upon request.

30

For example, Gonzalez and Moral (1995) and Kulendran and King (1997) find that ECM forecasts are dominated by ARIMA forecasts. However, Kulendran and Witt (2001) find that it is possible to identify an ARIMA model that is not outperformed by an ECM.

References

  1. Anastasopoulos, P. (1984). Interdependencies in international travel: The role of relative prices—A case study of the mediterranean region. Ph.D. thesis. New School for Social Research
  2. Archer B. Demand forecasting and estimation. In: Ritchie J., Goeldner C.R., editors. Travel, tourism, and hospitality research: A handbook for managers and researchers. 2nd ed. John Wiley and Sons Ltd.; New York: 1994. pp. 105–114. [Google Scholar]
  3. Banerjee A., Dolado J.J., Galbraith J.W., Hendry D. Oxford University Press; Oxford: 1993. Co-integration, error correction, and the econometric analysis of non-stationary data. [Google Scholar]
  4. Blair R.D., Mak J., Bonham C. Collusive duopoly: The economic effects of the Aloha and Hawaiian Airlines’ agreement to reduce capacity. Antitrust Law Journal. 2007;74(2):409–438. [Google Scholar]
  5. Bonham C., Edmonds C., Mak J. The impact of 9/11 and other terrible global events on tourism in the United States and Hawaii. Journal of Travel Research. 2006;45(1):99–110. [Google Scholar]
  6. Bonham C., Gangnes B. Intervention analysis with cointegrated time series: The case of the Hawaii hotel room tax. Applied Economics. 1996;28:1281–1293. [Google Scholar]
  7. Bull A. 2nd Rev. Ed. Longman Publishing Group; 1995. The economics of travel and tourism. [Google Scholar]
  8. Copeland, B. (1989). Taxing tourists: Optimal commodity taxation and public goods provision in the presence of international tourism. University of British Columbia mimeo
  9. Copeland, B. (1990). Tourism, welfare and de-industrialization in a small open economy. University of Alberta research paper No. 90–11
  10. Crouch, G. (1991). Determinants of international tourist flows: A multi-variable regression analysis. Australian tourist commission report
  11. Crouch G. The study of international tourism demand: A review of findings. Journal of Travel Research. 1994;33(1):12–23. [Google Scholar]
  12. Crouch G. The study of international tourism demand: A survey of practice. Journal of Travel Research. 1994;32(4):41–55. [Google Scholar]
  13. Crouch G., Schultz L., Valerio P. Marketing international tourism to Australia: A regression analysis. Tourism Management. 1992;13:196–208. [Google Scholar]
  14. Davidson R., MacKinnon J. Oxford University Press; New York: 2004. Econometric theory and methods. [Google Scholar]
  15. De Mello M., Fortuna N. Testing alternative dynamic systems for modelling tourism demand. Tourism Economics. 2005;11:517–537. [Google Scholar]
  16. De Mello M.M., Nell K.S. The forecasting ability of a cointegrated VAR system of the UK tourism demand for France, Spain and Portugal. Empirical Economics. 2005;30:277–308. [Google Scholar]
  17. Dickey D., Fuller W. Distribution of the estimators for autoregressive time series with a unit root. Journal of the American Statistical Association. 1979;74:427–431. [Google Scholar]
  18. Dickey D., Fuller W. Likelihood ratio statistics for autoregressive time series with a unit root. Econometrica. 1981;49:1057–1072. [Google Scholar]
  19. Diebold F.X., Mariano R.S. Comparing predictive accuracy. Journal of Business and Economic Statistics. 1995;13(3):253–263. [Google Scholar]
  20. Dritsakis N. Cointegration analysis of German and British tourism demand for Greece. Tourism Management. 2004;25:111–119. [Google Scholar]
  21. Durbarry R., Sinclair M. Market share analysis: The case of French tourism demand. Annals of Tourism Research. 2003;30:927–941. [Google Scholar]
  22. Dwyer L., Forsyth P. Edward Elgar; 2006. International handbook on the economics of tourism. [Google Scholar]
  23. Edwards, A. (1995). Asia-Pacific travel forecasts to 2005. Economist intelligence unit research report
  24. Engle R. Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica. 1982;50:987–1007. [Google Scholar]
  25. Frechtling D.C. Butterworth–Heinemann; 2001. Forecasting tourism demand: Methods and strategies. [Google Scholar]
  26. Fujii E., Khaled M., Mak J. The exportability of hotel occupancy and other tourist taxes. National Tax Journal. 1985;38:169–177. [Google Scholar]
  27. Gangnes, B., & Bonham, C. (1998). A tourism forecasting model for the State of Hawaii: Final report. Tech. rep. University of Hawaii Economic Research Organzation. Consultant’s report to the Research and Economic Analysis Division, State of Hawai’i Department of Business, Economic Development and Tourism
  28. Gonzalez P., Moral P. An analysis of the international tourism demand in Spain. International Journal of Forecasting. 1995;11:233–251. [Google Scholar]
  29. Gooroochurn, N. (2004). Tourism taxation: A theoretical and empirical investigation. Mimeo, presented at the ECOMOD international confernce on input–output and general equilibrium: Data, modeling and policy analysis
  30. Granger C., Newbold P. 2nd ed. Academic Press; 1986. Forecasting economic time series. [Google Scholar]
  31. Gray H. Heath Lexington Books; Lexington, MA: 1970. International travel-international trade. [Google Scholar]
  32. Haldrup N. An econometric analysis of I(2) variables. Journal of Economic Surveys. 1998;12:595–650. [Google Scholar]
  33. Hall S.G., Henry S., Greenslade J. On the identification of cointegrated systems in small samples: A modelling strategy with an application to UK wages and prices. Journal of Economic Dynamics and Control. 2002;26:1517–1537. [Google Scholar]
  34. Hannan E., Rissanen J. Recursive estimation of mixed autoregressive-moving average order. Biometrika. 1982;69:81–94. [Google Scholar]
  35. Hannan E.J., McDougall A.J. Regression procedures for ARMA estimation. Journal of the American Statistical Association. 1988;83(402):490–498. [Google Scholar]
  36. Harbo I., Johansen S., Nielsen B., Rahbek A. Asymptotic inference on cointegrating rank in partial systems. Journal of Business and Economic Statistics. 1998;16:388–399. [Google Scholar]
  37. Harvey D.I., Leybourne S.J., Newbold P. Tests for forecast encompassing. Journal of Business and Economic Statistics. 1998;16(2):254–259. [Google Scholar]
  38. Hendry D.F., Krolzig H. -M. London: Timberlake Consultants Press; 2001. Automatic econometric model selection using PcGETS. [Google Scholar]
  39. Johansen S. Statistical analysis of cointegration vectors. Journal of Economic Dynamics and Control. 1988;12:231–254. [Google Scholar]
  40. Johansen S. Estimation and hypothesis testing of cointegrating vectors in Gaussian vector autoregressive models. Econometrica. 1991;59:1551–1580. [Google Scholar]
  41. Johansen S. Oxford University Press; London: 1995. Likelihood-based inference in cointegrated vector autoregressive models. [Google Scholar]
  42. Kim S., Song H. An empirical analysis of demand for Korean tourism: A cointegration and error correction approach. Tourism Economics. 1998;3:25–41. [Google Scholar]
  43. Kliman M. A quantitative analysis of Canadian overseas tourism. Transportation Research. 1981;15:487–497. [Google Scholar]
  44. Krolzig, H. -M. (2003). General-to-specific model selection procedures for structural vector autoregressions. Economics Papers 2003-W15. Economics Group, Nuffield College, University of Oxford. http://ideas.repec.org/p/nuf/econwp/0315.html. Accessed 01.07.07
  45. Kulendran N., King M. Forecasting international quarterly tourism flows using error correction and time series models. International Journal of Forecasting. 1997;13:319–327. [Google Scholar]
  46. Kulendran N., Witt S.F. Cointegration versus least squares regression. Annals of Tourism Research. 2001;28(2):291–311. [Google Scholar]
  47. Kulendran N., Witt S. Forecasting the demand for international business tourism. Journal of Travel Research. 2003;41:265–271. [Google Scholar]
  48. Lathiras P., Siriopoulos C. The demand for tourism to Greece: A cointegration approach. Tourism Economics. 1998;4:171–185. [Google Scholar]
  49. Li G., Song H., Witt S. Modeling tourism demand: A dynamic linear AIDS approach. Journal of Travel Research. 2004;43:141–150. [Google Scholar]
  50. Li G., Song H., Witt S. Time varying parameter and fixed parameter linear AIDS: An applicaton to tourism demand forecasting. International Journal of Forecasting. 2006;22:57–71. [Google Scholar]
  51. Li G., Wong K., Song H., Witt S. Tourism demand forecasting: A time varing parameter error correction model. Journal of Travel Research. 2006;45:175–185. [Google Scholar]
  52. Li G., Song H., Witt S. Recent developments in econometric modeling and forecasting. Journal of Travel Research. 2005;44(1):82–99. [Google Scholar]
  53. Lim C. An econometric classification and review of international tourism demand models. Tourism Economics. 1997;3:69–81. [Google Scholar]
  54. Lim C. A meta-analytic review of international tourism demand. Journal of Travel Research. 1999;37:273–284. [Google Scholar]
  55. Lim C., McAleer M. Cointegration analysis of quarterly tourism demand by Hong Kong and Singapore for Australia. Applied Economics. 2001;33:1599–1619. [Google Scholar]
  56. Lu, L. -J., Chang, J. -J., & Hu, S. -W. (2008). Congestion externalities of tourism, Dutch disease and optimal taxation: Macroeconomic implications. Mimeo, Taipei. Institute of Economics, Academia Sinica
  57. Mak J., Moncur J., Yonamine D. Determinants of visitor expenditures and visitor lengths of stay: A cross-section analysis of US visitors to Hawaii. Journal of Travel Research. 1977;15:5–8. [Google Scholar]
  58. Martin C., Witt S. Tourism demand forecasting models: Choice of appropriate variable to represent tourists’ cost of living. Tourism Management. 1987;8:223–246. [Google Scholar]
  59. Morely C. A microeconomic theory of international tourism demand. Annals of Tourism Research. 1992;19:250–267. [Google Scholar]
  60. Muscatelli V.A., Hurn S. Cointegration and dynamic time series models. Journal of Economic Surveys. 1992;6:1–43. [Google Scholar]
  61. O’Hagan J., Harrison M. Market shares of US tourist expenditures in Europe: an econometric analysis. Applied Economics. 1984;16:919–931. [Google Scholar]
  62. Pacific Business News (2002). Outrigger rolls out deep discounts. URL: http://www.bizjournals.com/pacific/stories/2002/04/22/daily15.html
  63. Perron P. Testing for a unit root in a time series with a changing mean. Journal of Business and Economic Statistics. 1990;8:153–162. [Google Scholar]
  64. Pesaran, M., & Shin, Y. (2001). Long-run structural modeling. University of Cambridge DAE Working Paper No. 9419
  65. Pesaran M., Shin Y., Smith R. Structural analysis of vector error correction models with exogenous I(1) variables. Journal of Econometrics. 2000;97:293–343. [Google Scholar]
  66. Pesaran M., Smith R. Structural analysis of cointegrating VARs. Journal of Economic Surveys. 1998;12:471–505. [Google Scholar]
  67. Phillips P. Optimal inference in cointegrated systems. Econometrica. 1991;59:283–306. [Google Scholar]
  68. Phillips P. Fully modified least squares and vector autoregressions. Econometrica. 1995;63:1023–1078. [Google Scholar]
  69. Piga, C. A. (2006). Pigouvian taxation in tourism. Discussion Paper Series 2006. Department of Economics, Loughborough University
  70. Quandt R. Heath Lexington Books; Lexington, MA: 1970. The demand for travel: Theory and measurement. [Google Scholar]
  71. Saikkonen P. Estimation of cointegration vectors with linear restrictions. Econometric Theory. 1993;9:19–35. [Google Scholar]
  72. Saikkonen P. Continuous weak convergence and stochastic equicontinuity results for integrated processes with an application to the estimation of a regression model. Econometric Theory. 1993;9:155–188. [Google Scholar]
  73. Schwert G.W. Tests for unit roots: A Monte Carlo investigation. Journal of Business and Economic Statistics. 1989;7(2):147–159. [Google Scholar]
  74. Sheldon P. Forecasting tourism: Expenditure versus arrivals. Journal of Travel Research. 1993;32:13–20. [Google Scholar]
  75. Sinclair M., Stabler M. Routledge; London: 1997. The economics of tourism. [Google Scholar]
  76. Smith S. Tourism as an industry: Debates and concepts. In: Ioannides D., Debbage K.G., editors. The economic geography of the tourist industry: A supply-side analysis. Routledge; 1998. pp. 31–52. [Google Scholar]
  77. Song H., Romilly P., Liu X. An empirical study of outbound tourism demand in the UK. Applied Economics. 2000;32:611–624. [Google Scholar]
  78. Song H., Witt S. Elsevier Science Ltd; Kidlington, Oxford, UK: 2000. Tourism demand modelling and forecasting. [Google Scholar]
  79. Song H., Witt S. Tourism forecasting: The general-to-specific approach. Journal of Travel Research. 2003;42:65–74. [Google Scholar]
  80. Song H., Li G. Tourism demand modelling and research—A review of recent research. Tourism Management. 2008;29:203–220. [Google Scholar]
  81. Taylor, L. (1995). Taxing sales to tourists over time. Tulane University mimeo
  82. Trivedi P.K. Uncertain prior information and distributed lag analysis. In: Henddry D.F., Wallis K.F., editors. Econometrics and quantitative economics. Basil Blackwell (Oxford); 1984. pp. 173–210. [Google Scholar]
  83. United Nations Statistics Division, Statistical Office of the European Communities, Organization for Economic Co-operation and Development, & World Tourism Organization (2008). 2008 tourism satellite account: Recommended methodological framework. Mimeo
  84. Vanhove N. Butterworth-Heinemann; 2005. The economics of tourism destinations. [Google Scholar]
  85. Varian H. 3rd ed. W. W. Norton & Company; 1992. Microeconomic analysis. [Google Scholar]
  86. Vogt M., Wittayakorn C. Determinants of the demand for Thailand’s exports of tourism. Applied Economics. 1998;30:711–715. [Google Scholar]
  87. Witt S., Witt C. Academic Press; London: 1992. Modelling and forecasting demand in tourism. [Google Scholar]
  88. Witt S., Witt C. Forecasting tourism demand: A review of empirical research. International Journal of Forecasting. 1995;11:447–475. [Google Scholar]
  89. Zhou, A., Bonham, C., & Gangnes, B. (2004). Identifying long-run cointegrating relations: An application to the Hawaii tourism model. Working Paper 2004-2. University of Hawaii Economic Research Organization. http://www.uhero.hawaii.edu/workingpaper/vartourmod.pdf. Accessed on 01.07.07

Articles from International Journal of Forecasting are provided here courtesy of Elsevier

RESOURCES