Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2022 Mar 19;21(2):31–43. doi: 10.1007/s44199-022-00041-5

Classical Forecasting of International Tourist Arrivals to Thailand

Sukanya Intarapak 1, Thidaporn Supapakorn 2,, Witchanee Vuthipongse 3
PMCID: PMC8934021  PMID: 35340991

Abstract

The objectives of this work are to find the suitable forecasting model and forecasting period of the number of foreign tourists traveling to Thailand. The monthly data is gathered during January 2008 to December 2019 and is divided into two sets. The first set is the data from January 2008 to December 2018 for the modelling by the method of decomposition, Holt–Winter’s exponential smoothing method and the Box–Jenkins. The second is the monthly data in 2019 for comparing the performance of the forecasting models via the criteria of the lowest mean absolute percentage error (MAPE) and the root mean square error (RMSE). The results show that, in term of forecasting, the multiplicative decomposition is the most accurate technique for the short-term (3 months) forecasting period with the lowest MAPE and RMSE of 1.04% and 42,054.29 international tourists, respectively.

Keywords: Foreign tourist, Forecasting, Box–Jenkins, Decomposition, Exponential smoothing, Mean absolute percentage error, Root mean square error

Introduction

The kingdom of Thailand is located in the center of mainland Southeast Asia. Thailand shares the border with Myanmar and Laos to the north, with Laos and Cambodia to the east, with the Gulf of Thailand and Malaysia to the south and with the Andaman Sea and Myanmar to the west. There are 76 provinces and Bangkok is the capital. The popular provinces are Phuket, Krabi, Chiang Mai, Nan, Buriram, Maehongson, Ayutthaya also Bangkok. Thailand encompasses diverse ecosystems, cultures, also, religious backgrounds. This diversity enchants tourists both Thais and foreigners. The famous tourist attractions are, for example, beaches, islands, floating markets, temples, mountains, waterfalls, national park, full moon parties and civilization of the cities. According to the data from [1], Fig. 1 clearly displays the increasing trend of both the number of foreign tourists (in million people) entering to Thailand and their expenditures (in billion baht). The percentage increase of foreign tourist arrivals of 2019 to 2008 is 172.88%. This may be due to the political factors in Thailand that are likely to improve which affect investors’ confidence, the recovery of the economy of Thailand and other countries in the world, organizing an illegal tour, the low-cost airlines, the more direct flight, or even an increase in inflation rate according to oil prices.

Fig. 1.

Fig. 1

The international tourist arrivals (in million people) and their expenditures (in billion baht) in 2008–2019

Tourism becomes one of the significant sectors that drives the economic growth of Thailand. It can be seen that the income from the tourism industry varies with the number of foreign tourists entering Thailand. The Thai government realizes the importance of the tourism industry as evidenced by the tourism action plan of the Tourism Authority of Thailand (TAT) which has four goals [2]. One of those is aiming to be the popular tourist destination for sustainability and still generating the highest tourism income in the world top 7. Hence, there are several strategies to build the international market, i.e., adjusting the image of the Thai tourism brand, boosting consumer spending, and expanding the market base of special interests such as medial and wellness tourism, sport Tourism, marriage celebration, wedding anniversary including traveling to visit relatives.

Due to the COVID-19 outbreak situation, countries all over the world have implemented travel restrictions to control the outbreak within the country. For the impact of the COVID-19 outbreak on international travel, the World Tourism Organization (UNWTO) has reported that in the first quarter of 2020, the world's international tourism has declined 22% and is expected for the full year of 2020 to drop by 60% from the previous year. In Thailand, the pursuant to the Declaration of Emergency Situations in all areas of the Kingdom of Thailand starting on 26 March 2020 including the announcement of the Civil Aviation Authority of Thailand (CAAT) to temporarily ban the inbound passenger flights result in no foreign tourists traveling to Thailand since April throughout September 2020 [1, 3].

Tourism is one of the significant contributors to Thailand’s economy. Moreover, the tourist numbers help in understanding the tourism story. The purpose of this paper is to forecast the number of foreign tourists entering Thailand which is very important for the policy planning of the country's economy. The pre-COVID-19 data of the number of tourist arrivals to Thailand is considered to analyze under normal circumstance. In order to achieve the research goal, three classical forecasting methods: decomposition, Holt–Winter’s exponential smoothing and the Box–Jenkins, are applied to get the estimates and the details will be stated in the next section.

Literature Review

Time series forecasting is an important topic in practical application including business and industry, government, economics, environmental sciences, medicine, social science, politics, and finance [4]. Several forecasting approaches of modeling are employed to get accurate forecasts based on previous data. In 2012, Tularam, Wong and Nejad [5] analyzed tourist arrivals to Australia using time series analysis by an autoregressive integrated moving average (ARIMA) and Vector Auto-Regression models using Australian tourist arrival data (1956–2010). The results showed that ARIMA(2,2,2) performed better in term of the prediction in 2010.

In 2013, Keerativibool [6] applied the methods of Box–Jenkins, Winters’ multiplicative exponential smoothing, decomposition, and combined forecasting to forecast the number of international tourist arrivals to Thailand. The combined forecasting method was the most powerful in terms of forecasting the next 6 monthly values based on the criterion of maximum correlation coefficient between the actual data and the forecasts followed by the decomposition, Winters’ multiplicative exponential smoothing, and Box–Jenkins, respectively. Later, in 2014, Saothayanun et al. [7] compared the methods of Box–Jenkins and Holt–Winters’ multiplicative for forecasting the number of international tourists to Thailand. Base on the criteria of the root mean square error and mean absolute percent error of the forecasting values, the Winters’ method performed better than the Box–Jenkins.

In 2017, Subedi [8] suggested the alternative approach of modeling combining the autoregressive model with polynomial (biquadratic) function on time series data with monthly/seasonal fluctuation to forecast the number of tourist arrivals in Nepal. The trend was represented by the autoregressive part and the monthly fluctuation/seasons was done by biquadratic part. Recently, Roshan and Jahufer [9] implemented the Holt–Winters’ Method and Seasonal Autoregressive Integrated Moving Average (SARIMA) method to forecast the tourist arrivals in Sri Lanka. The results revealed that SARIMA provided the least root mean square error and mean absolute deviation.

Recently, Xie et al. [10] proposed the decomposition-ensemble approach to enhance the predictive accuracy of tourism demand forecasting of Hong Kong from nine sources: mainland China, Korea, Japan, the USA, Philippines Singapore, Australia, the UK, and Thailand. The competing models are the naïve, ARIMA and the artificial neural network (ANN) models. Hwandee and Phumchusri [11] aimed to forecast the international tourist arrivals to Thailand from 5 major countries: mainland China, Malaysia, Korea, and Russia during Jan 2013 to September 2018. In terms of the mean absolute percentage error, the seasonal autoregressive integrated moving average outperformed the multiple regression model with several important economic factors: income, price, exchange rate, and seasonal effect. Additionally, Rahman and Lee [12] presented the artificial neural network forecasting with the missing values imputation. The accuracy performance was validated by the mean absolute error and root mean square error of two forecasting methods which are SARIMA and ANN. Based on the imputation method of the decomposition, SARIMA performed better. In the other way, based on the imputation method of the spatial weighting method, ANN outperformed. Lately, in 2021, Janjua et al. [13] adopted the univariate time series forecasting method of tourist arrival based on the data from January 1991 to March 2020. The ARIMA(12, 1, 12) was suggested to forecast the international arrival from April 2020 to December 2020. The forecasts showed that Thailand will face significant negative zone arrival of international tourists due to the COVID-19 pandemic crises, which adversely affect Thailand’s economic due to the shortfall of international tourist arrivals.

From the forecasting methods mentioned above, different models have been applied and developed for tourism demand forecasting. They corporate the classical approach [513], the machine learning approach [11, 13], and the combination of classical approaches and/or the machine learning approaches [6, 8, 9, 11, 13]. The objective of this paper is to apply three classical forecasting methods: decomposition, Holt–Winter’s exponential smoothing and the Box–Jenkins to get the estimate of the tourism demand base on the monthly historical time series from 2008 to 2019. The remainder of this paper is organized as follows. Section 3 describes the data sets, the classical forecasting methods; decomposition, Holt–Winters’ exponential smoothing and Box–Jenkins, and the performance evaluation based on the root mean square error and mean absolute percent error. The results of the time series forecasting analysis are in Sect. 4. Section 5 is finally the conclusions and discussion.

Methodology

Data Sets

The monthly data of the foreign tourist arrivals to Thailand from 2008 to 2019 [1] is divided into two segments—the fitting and the forecasting. The fitting data segment from January 2008 to December 2018 is for constructing the time series modeling and displayed in Fig. 2. The forecasting segment contains the monthly data in 2019 is for forecasting and competing forecasting models.

Fig. 2.

Fig. 2

Time series plot of international tourist arrivals (in million people) from January 2008 to December 2018

Forecasting Methods

Decomposition

The classical decomposition approach [4] is to break the time series (Yt) down into the component parts of trend (Tt) and season (St). Typically, the model of decomposition is categorized as the additive model

Yt=Tt+St+εt

and multiplicative model

Yt=TtStεt

where εt is the random error.

The trend component is estimated using the simple linear regression model with the least squares estimates. The seasonal factor is calculated for each period in the season by detrended time series. Herein, the multiplicative model of the decomposition method is considered because the amplitude of the seasonal fluctuations varies with the level of the series as displayed in Fig. 2.

Holt–Winters’ Exponential Smoothing

The forecasting equation of (Holt) Winters’ multiplicative exponential smoothing [14], an extension of Holt’s method, is

Y^t+p=(Lt+pTt)St-s+p,

where, Lt=αYtSt-s+(1-α)(Lt-1+Tt-1) is the exponentially smooth series or level estimate, Tt=β(Lt-Lt-1)+(1-β)Tt-1 is the trend estimate, St=γYtLt+(1-γ)St-s is the seasonality estimate, α is smooth constant for the level, β is smooth constant for the trend estimate, γ is smooth constant for seasonality estimate, p is the period to be forecast into the future and 0α,β,γ1.

Box–Jenkins

The Box–Jenkins approach [14] uses an iterative model-building strategy that consists of selecting an initial model (model identification), estimating the model coefficients (parameter estimation), and analyzing the residuals (model checking). The model fits well if the residuals ae generally small, randomly distributed and contain no useful information.

The general class of models, representing time series containing seasonal fluctuations, is a seasonal ARIMA process. It is formed by including additional seasonal terms in the ARIMA, that is, SARIMA(p,d,q) (P,D,Q)s defined as follows

ϕp(B)ΦP(B)(1-B)d(1-Bs)DYt=ϕ0+ωq(B)ΩQ(Bs)εt

where (1-Bs)Yt=Yt-Yt-s, d is the order of differencing, s is the number of seasons per year, and D is the order of seasonal differencing. The operator polynomials are

ϕp(B)=(1-ϕ1B--ϕpBp)
ωq(B)=(1-ω1B--ωqBq)
ΦP(Bs)=(1-Φ1Bs--ΦPBsP)
ΩQ(Bs)=(1-Ω1Bs--ΩQBsQ).

In addition, the residuals should be independent and normally distributed.

Measures of Accuracy

Mean Absolute Percentage Error (MAPE)

The mean absolute percentage error [14] is computed by finding the absolute error in each period, dividing by the actual observed value for that period, and then averaging these absolute percentage errors, that is, MAPE=1nt=1n|Yt-Y^t|Yt×100.

Root Mean Squared Error (RMSE)

The RMSE [14] is the square root of the mean squared error which is the sum of the squared residuals and divided by the number of the observations. The unit of RMSE is the same as the original series and RMSE is given by RMSE=1nt=1n(Yt-Y^t)2.

Results

Forecasting Methods

Decomposition

Based on the multiplicative decomposition model, Yt=TtStεt, the forecasting equation is Y^t=T^tS^t where T^=910,099+17,509t; t = 1, 2, … where 1 represents January of 2008 and the seasonal index is shown in Table 1. The number of foreign tourists arriving to Thailand is higher than usual during December–March, especially the New Year celebration, that is, there will be the increase in the number of tourists, representing 16%, 13%, 10% and 8%, respectively.

Table 1.

Seasonal index of international tourist arrivals to Thailand

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
1.13 1.10 1.08 0.97 0.87 0.90 1.00 1.02 0.85 0.92 1.00 1.16

Holt–Winters’ Exponential Smoothing

Examination of the data for the international tourist arrivals from January 2008 to December 2018 in Fig. 2 indicates that trend and seasonal pattern appear to exist. Winters’ three-parameter linear and seasonal exponential smoothing method decomposes the time series into three components: level, trend, and season. The multiplicative model is demonstrated with the smoothing coefficients of level, trend, and seasonality as α^ = 0.762, β^ = 0.001, and γ^ = 0.710, respectively.

Box–Jenkins

The monthly international tourist arrivals from January 2008 to December 2018 shows non-stationary pattern because of the upward trend and seasonal components as displayed in Fig. 2. After transforming the time series data with the first season differences, the time series plot in Fig. 3 shows the fluctuation around 200,000. This suggests stationary pattern and there is not necessary to make the first differences. For the model identification, the autocorrelation coefficients trail off to zero gradually as displayed in Fig. 4a, whereas the autocorrelation coefficients drop to zero after the first time lag as shown in Fig. 4b. This suggests AR (p = 1). In addition, the sample partial autocorrelations have significant spike at lag 1 and 13. This behavior suggests an MA (Q = 1) term at the seasonal lag 12.

Fig. 3.

Fig. 3

Time series plot for the seasonal differences of international tourist arrivals to Thailand

Fig. 4.

Fig. 4

Plots of a ACF and b PACF for the seasonal differences of international tourist arrivals to Thailand

The suitable model SARIMA(1,0,0)(0,1,1)12 in the form of backward operator is

(1-B12)(1-ϕ1B)Yt=θ0+(1-θ12B12)εt

or it can be rewritten as

Yt=θ0+ϕ1(Yt-1-Yt-13)+Yt-12-θ12et-12+εt

with parameter estimates in Table 2. Hence, the forecasting equation is

Y^t=206,864.98+0.774(Yt-1-Yt-13)+Yt-12-0.636et-12.
Table 2.

Estimated parameters for SARIMA(1,0,0)(0,1,1)12

Parameter Estimate s.e. t test statistic p-value
θ0 206,864.98 23,340.63 8.863 0.000
ϕ1 0.774 0.061 12.616 0.000
θ12 0.636 0.096 6.643 0.000

The model adequacy is done accordingly to the plots of ACF and PACF for the residuals of SARIMA(1,0,0)(0,1,1)12 as shown in Fig. 5. These plots satisfy the model assumption and reveal that the residuals are independent because there is no lag of ACF and PACF falling outside the 95% confidence interval. Also, Fig. 6 reveals equal scatter of error terms.

Fig. 5.

Fig. 5

Plots of ACF and PACF for the residuals of SARIMA(1,0,0)(0,1,1)12

Fig. 6.

Fig. 6

Plots of residuals vs forecast values of international tourist arrivals to Thailand

Model Comparison

According to the criteria of the performance comparison on the mean absolute percentage error and the root mean square error of the three forecasting approaches: decomposition, Holt–Winters and Box–Jenkins, Table 3 shows that the multiplicative Holt–Winters model provides the least values of MAPE and RMSE.

Table 3.

Comparison of model performance

Method MAPE (%) RMSE (people)
Decomposition 6.31 146,491.24
Holt–Winters 5.61 134,694.82
Box–Jenkins 5.88 138,124.91

Suitable Model for Forecasting and Forecasting Period

The forecasting values of the monthly international tourist arrivals to Thailand in 2019 are computed based on the forecasting models in Sect. 4.1. Figure 7 displays the comparison of the actual values and the forecasting values of the multiplicative model of decomposition and Holt–Winters and the Box–Jenkins model of SARIMA(1,0,0)(0,1,1)12. For the performance of forecasting, Table 4 shows that the multiplicative decomposition method performs the best for short-term forecast.

Fig. 7.

Fig. 7

Actual and forecast values of international tourist arrivals to Thailand in 2019

Table 4.

Comparison of forecasting performance

Method MAPE (%) RMSE (people)
3 months 6 months 12 months 3 months 6 months 12 months
Decomposition 1.04 1.81 1.62 42,054.29 71,285.35 62,522.32
Holt–Winter 2.01 2.81 3.05 75,291.41 115,287.00 116,954.59
Box–Jenkins 2.54 4.51 2.86 97,749.53 164,327.05 123,202.81

Conclusions and Discussion

The demonstration of the classical forecasting techniques and algorithms which are decomposition, Holt–Winters and Box–Jenkins are successfully done using SPSS. The time series of the international tourist arrivals of the year before the Covid-19 pandemic, 2008–2019, are applied. The performances of the model are separated into model and forecasting based on the criterial of the lowest of MAPE and RMSE. For the model comparison, the multiplicative Holt–Winters’ exponential smoothing performs the best followed by the Box–Jenkins, SARIMA(1,0,0)(0,1,1)12, and the multiplicative decomposition, respectively. In the view of forecasting performance, the short-, medium-, and long-term forecasts are focused. The multiplicative decomposition is superior for all terms of forecasting followed by Holt–Winters and Box–Jenkins. This point shows that the best model does not always support the superior forecasting.

The principle of time series forecasting is that the forecast model is created by the pattern or the behavior of the past data. This is the main reason of selecting the time series before the emergence of corona virus pandemic. The limitation of this study is that the forecasting models are valid only for Thailand during the specific period of time. As a result, the number of foreign tourists has changed greatly since February 2020 [1] (see Fig. 8) and more importantly this affects the component of time series data. This severely affects tourism revenue which is the major contributor to the GDP and economics of Thailand and exactly all over the world. The government stakeholders including the private sectors should find policies or measures whether the campaign of getting the Covid-19 vaccine shots to build up the immune response, or the epidemic prevention measures, or the policy of maintaining the tourist attractions to encourage foreign tourists to return to Thailand under normal conditions.

Fig. 8.

Fig. 8

Percentage change of international tourist arrivals to Thailand of 2021 to 2020

Acknowledgements

We would like to credit the Ministry of Tourism and Sports for providing the data. In addition, the authors acknowledge the two anonymous reviewers and the editor-in-chief Professor M. Ishaq Bhatti for their valuable comments to enrich this article. However, authors take full responsibilities of any errors there may be.

Abbreviations

MAPE

Mean absolute percentage error

RMSE

Root mean square error

TAT

Tourism Authority of Thailand

UNWTO

World Tourism Organization

CAAT

Civil Aviation Authority of Thailand

ARIMA

Autoregressive integrated moving average

SARIMA

Seasonal autoregressive integrated moving average

ANN

Artificial neural network

ACF

Autocorrelation function

PACF

Partial autocorrelation function

Author contributions

TS carried out the study conception and design, analysis, interpret the results and draft manuscript preparation: WV provided us the data. TS and SI reviewed the results. All authors approved the final version of the manuscript.

Funding

This research was funding supported by the Program Management Unit for Human Resources and Institutional Development (B05F630034).

Declarations

Conflict of ınterest

The authors declare they have no conflicts of interest.

References

  • 1.Ministry of Tourism and Sports. Tourism statistics (2021). https://www.mots.go.th/more_news_new.php?cid=411
  • 2.Tourism Authority of Thailand. Policy and marketing plan (2021). https://www.tat.or.th/en/about-tat/market-plan
  • 3.Ministry of Foreign Affairs. Declaration of an emergency situation in all areas of the Kingdom of Thailand (2020). https://www.mfa.go.th/en/content/115754-declaration-of-an-emergency?cate=5f2916be298eeb02ac2eb389
  • 4.Montgomery DC, Jennings CL, Kulahci M. Introduction to Time Series Analysis and Forecasting. New Jersey: Wiley; 2007. [Google Scholar]
  • 5.Tularam GA, Wong VSH, Nejad SAS. Modeling tourist arrivals using time series analysis. J. Math. Stat. 2012;8(3):348–360. doi: 10.3844/jmssp.2012.348.360. [DOI] [Google Scholar]
  • 6.Keerativibool W. Forecasting model for the number of international tourist arrivals to Thailand. Srinakharinwirot Sci. J. 2013;29(2):9–26. [Google Scholar]
  • 7.Saothayanun L, Taweesakulvatchara S, Kanjanasakda Y, Somrang B. Forecasting methods for the number of international tourists in Thailand: Box–Jenkins method and winter’s method. Thai Sci. Technol. J. 2014;22(1):89–98. [Google Scholar]
  • 8.Subedi A. Time series modeling on monthly data of tourist arrivals in Nepal: an alternative approach. Nepal. J. Stat. 2017;1:41–54. doi: 10.3126/njs.v1i0.18816. [DOI] [Google Scholar]
  • 9.Roshan, A.M.F., Jahufer, A.: Forecasting Sri Lankan tourist arrivals: a comparative study of Holt–Winter's method versus ARIMA model. Proceedings of 8th International Symposium-2018, SEUSL, pp. 11–20 (2018)
  • 10.Xie G, Qian Y, Wang S. A decomposition-ensemble approach for tourism forecasting. Ann. Tour. Res. 2020;81(102891):1–16. doi: 10.1016/j.annals.2020.102891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hwandee O, Phumchusri N. Advances in Electronics Engineering. Singapore: Springer; 2020. pp. 115–125. [Google Scholar]
  • 12.Rahman NHA, Lee MH. Artificial neural network forecasting performance with missing value imputations. IAES Int. J. Artif. Intell. (IJ-AI) 2020;9(1):33–39. [Google Scholar]
  • 13.Janjua LR, Muhammad F, Sukjai P, Rehman A, Yu Z. Impact of COVID-19 pandemic on logistics performance, economic growth and tourism industry of Thailand: an empirical forecasting using ARIMA. Braz J Oper Prod Manag Res. 2021;18(2):1–13. doi: 10.14488/BJOPM.2021.001. [DOI] [Google Scholar]
  • 14.Hanke JE, Wichern D. Business Forecasting. New Jersey: Pearson Prentice Hall; 2014. [Google Scholar]

Articles from Journal of Statistical Theory and Applications are provided here courtesy of Nature Publishing Group

RESOURCES