Skip to main content
Heliyon logoLink to Heliyon
. 2021 Apr 16;7(4):e06655. doi: 10.1016/j.heliyon.2021.e06655

Methodological proposal for the identification of tourist routes in a particular region through clustering techniques

Juan B Duarte-Duarte 1, Leonardo H Talero-Sarmiento 1, Diana C Rodríguez-Padilla 1,
PMCID: PMC8060598  PMID: 33898811

Abstract

The tourism sector has an essential role in the sustainable development of a country. Therefore, in this research we propose a methodology to identify tourist routes that integrate the most important Points Of Interest in a region taking up as criteria profile characteristics in common between the sites evaluated using clustering techniques. To attain this goal, firstly, a literature review focused on compiled information used for location selection and evaluation in attraction potential sites. Then, clustering techniques are applied to identify similarities between sites, and finally, a layout of tourist routes is presented. We applied this methodology using data from a Region in Colombia. As a result, eight factors are proposed: Natural, Cultural, Tourist Plant, Infrastructure, Superstructure, Accessibility, Human and Tourist Capital and Security. From the second phase, three tourist groups were identified with three tourism factors for each of them; and then, two examples of tourist routes are proposed.

Keywords: Clustering techniques, Sustainable development, Tourism factors, Tourist clusters, Tourist routes, Tourism


Clustering techniques, Sustainable development, Tourism factors, Tourist clusters, Tourist routes, Tourism.

1. Introduction

Tourism site integration using curated routes is a strategy that could improve sustainable tourism development and, at the same time, increase the quality of life of local people. Different countries often regard tourism as the preferred alternative to development (Briedenhann and Wickens, 2004) because of its economic and social value at the global scale. Tourism plays a key role in sustainable development and impacts the climate (UNWTO, 2008); tourism has also been identified as a tool to raise the economic growth of underdeveloped regions and to improve the standard of living of local communities (Rátz and Puczkó, 1998) (Kombol, 2000). Additionally, tourism presents sociocultural benefits, improves education, and reinforces the maintenance of culture and heritage (Jafari, 2005).

Taking advantage of the benefits of tourism, over the years, its economic impact has been studied using different methodologies. For example, Deng et al. (2002) have used multicriteria analysis techniques to categorize Victoria Park in Australia as a tourism resource, and the same technique have been used by Triantaphyllou (2000) in Bosnia, Laguna Marín-Yaseli and Nogués Bravo (2003) in Iberian Mountain Ranges, Infante Sánchez (2014) in Colombia, Egül & Ik (2014) in Turkey, Rubio et al. (2016) in Argentina, Camarena (2016) in México, and Morteza et al. (2016) in Qeshm Island, Iran, among others. On the other hand, Zimmer and Grassman (1996), Padín and Pardellas (2004), Blanco (2008), Andrés Cabello and Pascual Bellido (2015) have used the LEADER method, a guide to evaluate the tourism potential of a territory, in different regions of Spain and Mexico. Furthermore, Eagles et al. (2001) and Mishra (2009) in different areas of Asia; Mart et al. (2013) in Mexico; and Alam et al. (2015) in Malaysia, have used statistical methods to evaluate tourism sites. Many other authors have employed other methodologies.

Most methodologies usually use qualitative techniques that require expert analysis (Laitamaki et al., 2016). However, there are different quantitative techniques based on the structural information of tourist sites, such as clustering techniques (CTs). There is evidence that the application of CTs can create cooperative relations in high-tech sectors, where the most valuable capital is knowledge and cooperation occurs between companies and academia (Porter, 2000) (Rocha, 2004). However, it is possible to apply these techniques in more traditional sectors, such as tourism: for instance, Pitchayadejanant and Nakpathom (2018) clustered the agrotourism activities in an orchard using machine learning, Gosal et al. (2019) processed and clustered the attractiveness of different tourist locations, Nilashi et al. (2019) used clusters to predict touristic choice preferences in ecofriendly tourism, and Guo et al. (2020) clustered tourism attractions to predict the competitiveness of a city, among others. These clustering techniques can create a promising new development direction for the tourism industry (Estevão et al., 2009).

We propose the use of CTs to design tourist routes (TRs) in a region to maximize the potential benefits of tourism. These routes are important because TRs are tools that highlight the cultural heritage of sites, promoting and benefiting those who are part of it (Jeambey, 2016) (Briedenhann and Wickens, 2004). Additionally, TRs integrate several attractions that have not been studied independently, allowing an equitable distribution of their benefits (Meyer, 2004). Consequently, tourist route implementation is a strategy to optimize job creation and tourism and conserve the historical, cultural, and folkloric heritage of any site (Open Africa, 2002).

Therefore, the research question is, “How can we identify a methodology to propose tourist routes in a particular region?” In this paper, we propose a methodology to identify theoretical tourist routes in such a way that different tourist attractions can be integrated into a region. In Phase 1, a literature review is performed to collect characteristics or attributes to describe the tourism cluster of a site. In Phase 2, we apply a clustering technique among sites to find patterns and similarities among them, identifying specific clusters. Finally, Phase 3 designs tourist routes by integrating these clusters.

2. Literature review

To allow tourism on a site, different organizations and researchers have proposed diverse methodologies to identify and categorize touristic factors (UNWTO, 2013). These factors are an elemental guide in the tourism process (Knezevic, 2008). Consequently, this investigation uses predefined characteristics and proposes to group them into a hierarchical structure, with eight factors, covering 27 subfactors (Figure 1) described in Table 1.

Figure 1.

Figure 1

Tourism Factors and sub-factors.

Table 1.

Tourism Factors proposed.

Factor Description Author
Natural Any geomorphologic or biophysical element that allows the visit, appreciation, and satisfaction of tourists. Zimmer states that a natural resource is one that people can use to satisfy their needs to be alive (Zimmer and Grassman, 1996). Vegetal associations, hydrography, flora, fauna, and climate, among others. (SECTUR, 1975) (OEA, 1987)
Cultural Keeping historical traditions, which, according to the International Council of ICOMOS, are defined as cultural heritage, historical sites, past and present traditions, and monuments (ICOMOS, 1999). Folklore, art, programmed events, and gastronomy, among others. (OEA, 1987) (Clawson and Knetsch, 1968)
Tourist Plant Related to establishments managed by the public or private sector. These establishments provide tourist services (Camara, 2014), attending the needs and desires of the tourist (Quesada Castro, 2006). Hotels, restaurants, and touristic agencies, among others. (SECTUR, 1975) (Zimmer and Grassman, 1996) (Boullón, 1997)
Infrastructure Concerning the set of elements, equipment or services necessary for the proper functioning of a country, of a city or any organization (RAE, 2015). Transport systems, sanitation, public services, signage, and medical facilities, among others. (Zimmer and Grassman, 1996) (Boullón, 1997)
Superstructure All the specialized agencies, public and private, of the local community responsible for optimizing and changing the functioning of each part of the tourist system (Labrada, 2014) (Quesada Castro, 2006). (Zimmer and Grassman, 1996) (Boullón, 1997) (Almeida, 2009)
Human and Tourist Capital Defined as the number of people with technical knowledge, skills, singularity, and qualifications (Lillo, 2006). Attention in another language, native community, education, health conditions, and social characteristics of the population, among others. (Maass et al., 2010)
Accessibility The characteristic that allows environments, products, and services to be used seamlessly by all people, achieving the objectives they are designed for, as well as the relationship with the three primary forms of human activity: mobility, and understanding. These three forms of human activity are limited by barriers (López, 2002) (SECTUR, 1975) (Padín Fabeiro and Pardellas de Blas, 2004)
Security Interpreted as entities that allow us to perceive that we move in a space without real or potential risks (MINCIT, 2003). Fire prevention, food sanitation, and police services, among others. (Solís et al., 2013) (Maass et al., 2010)

Next, we describe the methodologies used in different studies due to their essential contributions to the classification of tourist factors.

2.1. Pierre defert

Defert (1972) proposes the categorization of tourist resources as follows: Hidrom (water); Litom (elements built by man); Antropóm (socioeconomic structure); Phitom (Natural terrestrial elements); and Mnénome (elements that generate memory). This categorization of resources has been used by authors such as Camara and Morcate Labrada (2014) to evaluate the city of Fort de France. Additionally, the consulting company Auren Consultores (2014), directed by the School of Industrial organization of Spain, compared Defert's classification to those of other authors to choose the most suitable classification for Andalusia, Spain. This comparison was also performed by researchers at the University of San Luis Potosí in Mexico (2015) and by Rubio et al. (2016), a Ph.D. in Geography of the “Universidad Nacional del Sur” in Argentina.

2.2. SECTUR

The Secretary of Tourism of Mexico designs and implements policies aimed at strengthening tourism activity (SECTUR, 1975). These policies have been applied by various authors, such as Reyez and Sánchez (2005). Reyes has used three variables to calculate the potentiality index of natural tourism: 1) natural variables (geomorphologic, vegetal associations, and natural elements); 2) accessibility variables (terrestrial, maritime and air transport, gas station, and road density); and 3) equipment variables (hotels, food establishments, tourist promotion units, banks, and commercial establishments). Other authors who have used the SECTUR information are Martínez (2010)), Mikery Gutiérrez and Pérez-Vázquez (2014), Camara and Morcate Labrada (2014) and Cortés (2015); these authors used SECTUR information to analyze the tourism potential of their respective case studies.

2.3. OAS

The Organization of American States (1987) proposed a resource categorization divided into the following categories: 1) natural sites (landscape-based resources); 2) museums and cultural traditions (art, history, and monuments); 3) folklore (related to the traditions of the resident population); 4) contemporary technical, scientific and artistic realizations (elements that by their singularity have tourist interest and are more current than historical characters); and 5) programmed events (organized events, current or traditional). This categorization is easy to replicate; several authors such as Blanco (2008), Conti et al. (2012), Solís et al. (2013), Mikery Gutiérrez and Pérez-Vázquez (2014), Castellanos Menjura and Ariza Cortes (2015), and Navarro (2015) have applied this methodology to their tourist resources inventory.

2.4. LEADER guide

Zimmer and Grassman (1996) created the LEADER guide to evaluate the tourist potential of a territory. He took into account the offer (natural factors, socioeconomic factors, infrastructures and services available, cultural factors, sports and leisure offerings, health offerings, accommodation, restaurants, and sites with the possibility of organizing conferences and seminars) and the demand (defined from customer surveys). Then, Peter carried out an analysis of the competition and, finally, performed a study of the market trends to anticipate the opportunities and risks linked to the new expectations of clients. Authors such as Blanco (2008), Vázquez (2014), Andrés Cabello and Pascual Bellido (2015), and Vanegas (2017) have used the guide to apply tourism research methodologies in different parts of Europe and Latin America.

2.5. Gunn

Gunn (2002) has proposed factors such as market preferences, advocacy, information systems, restrictions and benefits (social environment, governmental disposition, among others), and physical factors, composed of potential destination areas, by geographic and competitive contexts, to identify new and improved attractions and threats to the environment. The authors who have used this categorization are Cha and Uysal (1995), Smith (1987) and Smith (2013).

3. Methodology

3.1. Data

First, a region is selected (taking into account a region could be divided into micro regions or sites to apply the following methodology where the identified characteristics in the literature review are consulted from primary or secondary sources such as: interviews from the inhabitants of the region, going to each site of interest, looking for information in the regional official website where you can find its general tourist characteristics; official website of general and specific statistics of the sites of the region, newsletters, articles, housing census, citizenship, development plans, tourism development plans, inventories of the region's assets, among others.

The data must be considered in terms of different types of variables (e.g., the scale of measurement, reason variables, quantity variables, and dichotomous variables). These variables have to be stored and classified; furthermore, each evaluated site should have a numerical value for each of these variables; that is, no missing data are allowed.

3.2. Touristic clusters

Clustering techniques can recognize patterns and similarities between the characteristics of entities (Instituto Valenciano de Tecnologías Turísticas, 2015). CTs organize studied entities from the information provided by the decision maker; in this case, the information represents the characteristics collected in the literature review section. Moreover, the hierarchical clustering technique has excellent performance (Everitt et al., 2011), allowing classifications of greater accuracy than permutation methods such as K-means (Karypis et al., 2000) (Li et al., 2002). Nevertheless, to apply this technique, there are two requirements: 1) the selection of a link method and 2) the implementation of a distance function (Wilks, 2011).

However, considering that, the proposed factors and their corresponding characteristics have different types of variables, such as the scale of measurement, according to reason, quantity and dichotomous, it is necessary to transform them to the same scale. To do this, we use the Gower coefficient of similarity because it allows the integration of quantitative and qualitative variables (Lopez and Villa, 2011) (Podani, 2012). Furthermore, we can estimate the Euclidean distance Dij between each pair of townships ij using the Gower similarity (Sij) according to the following relation Dij2=2(1Sij). The coefficient of similarity between the townships ij is explained by Eq. (1):

Sij=i=1nWijkGijki=1nWijk (1)

where Wijk=0 if entities j and k cannot be compared for variable i because xij or xik is unknown.

Additionally, Gower defines:

For binary variables: Wijk=1 y Gijk=0 if xijxik; Wijk=Gijk=1 if xij=xik=1 or if xij=xik=0 and double zeros (mutual absences) are included; Wijk=Gijk=0 if xij=xik=0 and double zeros are excluded from the comparison.

For nominal variables: Wijk=1 if xij and xik are known; then let, Gijk=0 if xijxik; Gijk=1 if xij=xik.

Furthermore, for variables measured on the Interval and ratio scale: Wijk=1 if xij y xik are both known and Gijk=1{|xijxik|/(rangeofvariablei)} (Gower and Legendre, 1986).

Finally, we study the link method in each case; however, considering the distance functions estimated in this investigation and its characteristics, we suggest applying methods that use squared distances.

3.3. Tourist routes: related sites integration

For the design of the routes, we consider the recommendations of Aravena Paillalef et al. (2013), Cadena et al. (2013) and Garrido et al. (2015). The design process begins by identifying tourist points of interest to integrate using routes. These points are subsequently located on a map to determine their geographical distribution and to analyze the possible integrations to make between them.

The design of these routes must consider the location of the sites and their geographical proximity to one another, without taking into account the road infrastructure between them. That is, routes can have two kinds of sections: real and hypothetical (a.k.a. theoretical) roads. The real routes link the sites without taking into account the road type (i.e., whether the road is paved or not). The hypothetical routes, on the other hand, are unbuilt connections that will be required. We use this methodology to identify possible infrastructure improvements that could be helpful in connecting tourist sites.

4. Results

The methodology proposed was applied in Santander, which is one of the 32 departments in Colombia. Santander is in the northwest part of the country (Figure 2). Santander is composed politically of 87 townships and is considered the fourth most important economy in Colombia, contributing 7.8% of the total national gross domestic product (GDP). We select Santander in this study due to its diversified economy, which causes heterogeneity in the distribution of wealth among its inhabitants (de Santander, 2016), as represented in the indices of monetary poverty, the Gini coefficient (indicator of the distribution of income of individuals within an economy) and the index of unmet basic needs (Departamento Administrativo Nacional de estadística, 2018). By applying this methodology and taking into account the benefits of tourism, the economic disparity in Santander may be able to be reduced.

Figure 2.

Figure 2

Representation of Santander-Colombia.

4.1. Data

According to the information available from secondary sources (the official website of Santander governorate, the official website of the National Administrative Department of Statistic (DANE), among others), we collect 79 characteristics for every township (Annex 1. 79 characteristics for each Townships) 12 belong to the Accessibility factor, 10 belong to the Human and Tourist Capital factor, 11 belong to the Cultural factor, 11 belong to the Natural factor, 11 belong to the Infrastructure factor, 16 belong to Tourist Plant, 5 belong to Security and 3 belong to Superstructure. Applying the hierarchical clustering technique, we identified that 6 of the 79 characteristics are dichotomous variables, 50 are variables of quantity, 21 are expressed as a percentage, and one is a reason variable (Annex 1. 79 characteristics for each Townships).

4.2. Touristic clusters

According to the characteristics identified, to apply the clustering techniques, we selected the Ward link method and the squared Euclidean distance due to the ease of identifying atypical values between entities in the same group; as a result, three tourist groups are proposed considering the potential touristic identification. The Natural, Cultural, and Human and Tourist Capital factor data are used to form the clusters; additional data (Accessibility, Infrastructure, Tourist Plant, Security, and Superstructure) are used to analyze the general infrastructure of each township. The Similarity Matrix and the Distance Matrix are in Annex 2. Similarity Matrix and Annex 3. Distance Matrix, respectively.

We choose three groups because fewer groups would lead to negative similarity (atypical relationships) and more groups would not significantly increase the similarity. We highlight that the groups are heterogeneous in size, as indicated in the dendrogram, a diagram showing the similarity of attributes between each pair of entities grouped sequentially (Minitab 18, 2018). We use yellow, blue, and red (Figure 3) to differentiate groups; the geographical distribution of each group is given on Santander's political map (Figure 4).

Figure 3.

Figure 3

Dendrogram showing the three clusters selected.

Figure 4.

Figure 4

Geographic clusters on Santander map.

Touristic Cluster 1 (Blue): This cluster has the most significant “raizal” population (he native population of San Andrés, Providencia and Santa Catalina due to crossbreeding between indigenous, Spanish, French, English, Dutch and Africans (Ministerio de Cultura de Colombia, 2004)) located in the El Peñón, Coromoro, Guadalupe and Oiba townships. This cluster also includes some of the most representative cultural events in the region, such as "Las Ferias Bonitas" (the beautiful fairs), the Tiple national week celebrating the traditional musical instrument, the Luis A. Calvo festival in Bucaramanga, the Guabina and Tiple festival in Vélez, the music Festival in San Vicente de Chucurí, and the Guascarrilera festival in Matanza. Additionally, most of the townships around the Chicamocha canyon (Aratoca, Cepitá, Los Santos, and Villanueva) are in this group.

Touristic Cluster 2 (Red): This group of townships stand out because they have large indigenous populations, mulattos and Afro-Colombians (referring to Colombian citizens of African descent), mostly distributed in the towns of Cimitarra (with 3326 inhabitants of this type of population), Puerto Parra (with 1454), Palmas de Socorro (with 1369) and Puerto Wilches (with 1337) according to DANE's census and projections for the population of each township in 2015. This group also has the fewest cultural components compared to the other groups, with only 5 of the 40 townships having representative aspects in this area. These components are found in Socorro, widely known due to the insurrection of the “comuneros” community, where the basilica "Nuestra Señora del Socorro” is located; in Suratá, where the Bambuco National Festival (traditional dance) is celebrated; in Pinchote, where the Trios National Festival is celebrated; in Güepsa, where the “Panela” festival is held, featuring traditional brown sugar food; and in Chipatá, the location of the song festival. Finally, this group includes the Páramo of Santurbán, which includes the townships of California, Santa Barbara, Suratá, and Vetas.

Touristic Cluster 3 (Yellow): Represented by the most significant number of the “ROM” population, or Gypsy ethnic communities, established in Colombia (Ministerio de Cultura, 2010). Located in Girón, Floridablanca and Lebrija, these townships have several museums, such as the Guane Archaeological Museum located in Floridablanca, "Mansión del Fraile” museum in Girón, the “Quijote de la Mancha” house museum in Zapatoca, and Petroleum National Museum in Barrancabermeja. This group is also recognized for its fairs and festivities and its handicrafts as well as because there are some renowned typical food delicacies in these townships, such as "Obleas" in Floridablanca, “Culonas” ants in Barichara, artisan wine in Zapatoca and "Panuchas" (traditional candy) in Malaga. These townships have several well-known caves, including "La Pintada" cave in Lebrija, Caverne in Malaga, Nitro cave in Zapatoca, and Guane cave in Girón, among others.

4.3. Tourist routes: related sites integration

The routes are shown on a political map of Santander and have been added to Google Earth, a web-based mapping service platform. The routes can be designed considering each cluster; for example, from Cluster 1, a route was designed considering the proximity of each site of interest to the townships without taking into account the existing routes (Figure 5 and Figure 6) (this route is in Google Earth in Annex 4. Cluster 1 Route 1)

Route: Chicamocha canyon and more.”

Figure 5.

Figure 5

Touristic Cluster 1 Route 1 sites.

Figure 6.

Figure 6

Touristic Cluster 1 Route 1.

This route focuses on a scenic tourism tour, and it begins at Los Santos Township, Cepitá, Aratoca; continues to Curití, San Gil; and ends at Villanueva. These locations contain the extensive Chicamocha canyon, and it is also possible to appreciate other touristic sites, such as “el Salto del Duende” in Los Santos, “Salto del Mono Aullador” in Aratoca, the “Yeso” cave and the “La Vaca” cave in Curití. In addition, tourists can enjoy the famous Fonce River in San Gil, where foreigners can practice sports such as rafting or canoeing and, finally, the “Indio” cave in Villanueva.

Analyzing the general infrastructure of these townships, considering their geographical information and other data factors (Accessibility, Infrastructure, Tourist Plant, Security, Superstructure), the canyon of Chicamocha provides opportunities for landscape and scenery sightseeing, extreme sports, and historic tours. Accordingly, it will be necessary to build access roads between Los Santos and Aratoca due to the lack of existing connections between them. Furthermore, most of the townships on this route must improve and expand their internet coverage. Additionally, the pavement of the roads in this region should increase the number of tourist agencies as well as the options to practice different kinds of sports.

Another tourist route we can find in Cluster 2 is route 1 (Figure 7 and Figure 8) named:

“Route: Biodiversity of the Santurbán Páramo”

Figure 7.

Figure 7

Touristic Cluster 2. Route 1.

Figure 8.

Figure 8

Touristic Cluster 2 Route 1 sites.

This route focuses on landscape sightseeing tourism that begins with Suratá, continues to California, and ends in Vetas Township to develop tourism around the Santurbán Páramo. Here, tourists can see the Cattleya Mendelii and Befaria flowers, the Frailejon, 15 species of mosses, and several species of animals, including the Chirrador, the “Pato Zambullidor,” and the Curí and the Condor, among others. Additionally, tourists can visit “La Laguna” and buffaloes in Suratá; the Páez, Pico, Quelpa and Toro lagoons; hot springs; the San Antonio de Padua sanctuary, which is located in the foothills of the eastern Colombian Cordillera in the Andean Mountain system in California; and the La negra Lagoon in Vetas.

Analyzing the infrastructures of these three townships, we note that although California is geographically next to Vetas, these townships are not connected in terms of roadway infrastructure; therefore, it is proposed that these townships be connected to take advantage of the tourist activities provided by the Páramo.

5. Discussion

This article proposed to answer the following research question: How can a methodology be identified to proposed touristic routes in a particular region? To accomplish this aim, artificial intelligence techniques that allow us to systematically analyze all the characteristics that can define a possible tourist site are proposed. However, considering the literature review, we found that there are multiple variables to determine potential tourists of a site that can vary depending on the site analyzed and the methodology used to evaluate the site. For example, there are discrepancies in SECTUR (1975) methods; Zimmer and Grassman (1996), among others. Additionally, it should be noted that not all sites have the same characteristics, and therefore, as a preliminary step, the fields must be filtered. In this study, since we evaluated the department of Santander, we eliminated some characteristics considering its geographical location (e.g., number of beaches, and seasons).

This process is essential, as the characteristics found can also change according to the data available at the time of the study. Variable selection is a critical process because if there are different variables, it could be possible to develop different tourist profiles than those found in the present study. However, if in the future, different characteristics are considered, these will not invalidate the proposed routes; instead, complementary routes could be designed.

On the other hand, in the data collection, we used secondary sources; more specifically, the official national, departmental, and township websites, on which it is possible to find reliable information. However, according to the publication and updating policies, some information required the prior permission of the owner, and some had outdated publications, which means that although the characteristics selected are appropriate for the site, their figures may vary.

In the last methodological section, the purpose of designing tourist routes was to integrate potential tourist sites considering the position and geographical proximity between townships. In Santander, some of this integration was made taking into account the roadway infrastructure available; however, we found nearby townships that do not have a direct connection; for these cases, we propose theoretical roads so that, in the future, stakeholders could propose infrastructure projects.

The results of this study are aligned with those of Pitchayadejanant & Nakpathom (2018), who use data mining to identify associations and patterns of activities in tourism, as we take advantage of the association of patterns to create touristic clusters that are linked by routes. On the other hand, Gosal et al. (2019) perform clustering that includes tourist factors, including such as nature, ornithology, and religious pilgrimage, among others, that complement our proposed factors; also, Nilashi et al. (2019) conclude that big data and machine learning are essential factors with enormous growth potential in the tourism industry. Finally, Guo et al. (2020) compared different methods to identify economic clusters that are essential to improve the tourism competitiveness of a city.

Taking into account the proposed data hierarchy, as well as the proposed method, for future work, it could be necessary to update the information with data in situ, using deep techniques to analyze the construction of building roads, bridges, and other infrastructures. This research could be proposed to government entities as an alternative method to growth and to improve the social, cultural, and economic situation of each township of Santander. The proposed clusters could be part of the departmental development plans around Colombia, taking into account that this methodology could be replicable anywhere. In addition, the tourism agencies could offer different attractions in Santander using the proposed routes that include the preference of tourists in terms of nature, culture, history or adventure; finally, these routes can be promoted on platforms such as social networks and official websites of the department's cities.

6. Conclusions

In this study, we introduce a novel approach to speed up the decision-making process related to designing and creating routes by identifying tourism clusters that consider the similarity between their characteristics based on a hierarchical structure. These clusters are geographical sites that offer a particular type of tourism; however, the cluster groups depend on the attributes that describe each location. That is, if we use different characteristics, it is possible to find a diverse cluster group. For this, it is essential to justify the selection of attributes, the data collection process and the sources. We also highlight that this methodology is based on quantitative methods. At the route creation phase, we need to use an expert criterion to speed up the interest site connection, meaning that when we choose among feasible sites, we decide which ones to integrate with the main route based on the expert criterion. In conclusion, clustering techniques reduce the use of resources such as time, money and expertise. Additionally, the application of this technique can be used in future tourism studies in other places or sites where a tourism manager can identify touristic activities, as well as natural, cultural and other resources. Thus, this technique can be used to determine and design adventurous, religious, gastronomic, and other exciting and attractive tourism routes. Finally, this methodology allows government staff and entrepreneurs to highlight and prioritize future investments, such as transportation structure (e.g., new roads, bridges, cableways, among others), which can improve tourism development.

Declarations

Author contribution statement

Diana Carolina Rodriguez-Padilla: Performed the experiments; Analyzed and interpreted the data; Wrote the paper.

Juan Benjamín Duarte-Duarte: Conceived and designed the experiments; Contributed reagents, materials, analysis tools or data.

Leonardo Hernán Talero-Sarmiento: Conceived and designed the experiments; Performed the experiments; Contributed reagents, materials, analysis tools or data; Wrote the paper.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data availability statement

Data will be made available on request.

Declaration of interests statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.

Appendix A. Supplementary data

The following is the supplementary data related to this article:

Aditional Information.pdf
mmc1.pdf (52.7KB, pdf)
Annex 1. 79 characteristics for each Township.csv
mmc2.csv (456.4KB, csv)
Annex 2. Similarity Matrix.csv
mmc3.csv (20.8KB, csv)
Annex 3. Distance Matrix.csv
mmc4.csv (49.4KB, csv)
Annex 4. doc.kml
mmc5.zip (1.9MB, zip)

References

  1. Alam A.S.A.F., Er A.C., Begum H., Alam M.M. The factors of selecting Malaysia as tourist destination. Mediterr. J. Soc. Sci. 2015;6(3):491–498. [Google Scholar]
  2. Almeida M.V. Matriz de Avaliação do potential turístico de localidades receptoras. Revista Turismo Em Análise. 2009;20(3):541–563. [Google Scholar]
  3. Andrés Cabello S., Pascual Bellido N. La construcción del turismo en nuevos destinos: luces y sombras. El caso de La Rioja (España) Revista de Ciencias Sociales y Humanidades. 2015;24:30–49. [Google Scholar]
  4. Aravena Paillalef G., Espinosa Sepúlveda A., Flores Chávez J. 2013. Manual para el desarrollo de circuitos de turismo de intereses especiales 1. [Google Scholar]
  5. Auren Consultores Recursos Turísticos. 2014;30 [Google Scholar]
  6. Blanco M. Guía para la elaboración del plan de desarrollo turístico de un territorio. 2008;45 [Google Scholar]
  7. Boullón R.C. 1997. Planificacion del espácio turístico. 3°ed. México, Trillas.http://www.aptae.pe/archivos_up/0107-planificacion-del-espacio-turistico-roberto-c-ballon.pdf Retrieved from. [Google Scholar]
  8. Briedenhann J., Wickens E. Tourism routes as a tool for the economic development of rural areas-vibrant hope or impossible dream? Tourism Manag. 2004;25(1):71–79. [Google Scholar]
  9. Cadena A., Christian V., Ing F. 2013. Nuevos atractivos turísticos tangibles en los cantones: Tulcán, Espejo y Mira para la creación de circuitos turísticos. [Google Scholar]
  10. Camara Charles Jean, de los Ángeles MorcateLabrada Flora. Arquitectura y urbanismo. Arquit. Urban. 2014;35(1):48–67. http://scielo.sld.cu/scielo.php?script=sci_arttext&pid=S1815-58982014000100005 Retrieved from. [Google Scholar]
  11. Camarena D.A. 2016. Evaluación de los recursos turísticos naturales del municipio de San Pedro Lagunillas, Nayarit, México, a partir de la metodología multicriterio; pp. 43–60. [Google Scholar]
  12. Castellanos Menjura Claudia Patricia. Universidad Distrital; 2015. Evaluacion de los recursos turisticos con vocacion ecoturistica y caracterizacion de la demanda turistica en las zonas de uso publico de la Reserva Forestal Protectora del Cerro Quinini (Tibacuy-Cundinamarca)http://repository.udistrital.edu.co/bitstream/11349/2383/1/CastellanosMenjuraClaudiaPatricia2015.pdf Retrieved from. [Google Scholar]
  13. Cha S., Uysal M. Regional analysis of tourism resources for marketing purposes. J. Hospit. Leisure Market. 1995;2:61–74. [Google Scholar]
  14. Conti A., Charne U., Moscoso F.V., Comparato G.J., Cassani M.J., Avalís V.S., Rucci A.C. Vol. 18. 2012. Evaluación de atractivos para la identificación de nuevos productos turísticos. (Caso de estudio: región Capital de la provincia de Buenos Aires). [Google Scholar]
  15. Gobernación de Santander . Gobernación de Santander; 2016. Plan Desarrollo Departamental. [Google Scholar]
  16. Defert P. Méthodes de Recheches Touristiques et Leur Application Aux Pays et Régions En Voie de Développement. 1972. Essai de formulation d’une typologie intégreé des resources et activités touristiques; pp. 64–75. [Google Scholar]
  17. Deng J., King B., Bauer T. Evaluating natural attractions for tourism. Ann. Tourism Res. 2002;29(2):422–438. [Google Scholar]
  18. Departamento Administrativo Nacional de estadística . Departamento Administrativo Nacional de estadística; 2018. Boletín Técnico Pobreza Monetaria Santander Boletín Técnico; pp. 1–13. [Google Scholar]
  19. Eagles P.F.J., Bowman Margaret E., Chang-Hung Tao Teresa. 2001. Guidelines for Tourism in Parks and Protected Areas of East Asia. [Google Scholar]
  20. Egül A.Y.Ş., Ik T.U.Ş.I.Ş. Tourist hotel location selection with analytic hierarchy process. Market. Tourism Manag. Conf. 2014:47–58. [Google Scholar]
  21. Estevao C., Ferreira J. The tourism clusters role in regional development: presenting a competitiveness conceptual model. Tourism Destinat. Develop. Brand. 2009;2003:127–139. https://repositorio.ipcb.pt/bitstream/10400.11/1479/1/tourism.pdf Retrieved from. [Google Scholar]
  22. Everitt B.S., Landau S., Leese M., Stahl D. Vol. 14. 2011. Cluster Analysis. (Quality and Quantity). [Google Scholar]
  23. Garrido M.A., Sánchez J.A.L., Enriquez A.F. Rutas turísticos-culturales e itinerarios culturales como productos turísticos: reflexiones sobre una metodología para su diseño y evaluación. Análisis Espacial y Representación Geográfica: Innovación y Aplicación. 2015;(October):463–471. [Google Scholar]
  24. Gosal A.S., Geijzendorffer I.R., Václavík T., Poulin B., Ziv G. Using social media , machine learning and natural language processing to map multiple recreational beneficiaries. Ecosyst. Serv. 2019;38(June):100958. [Google Scholar]
  25. Gower J.C., Legendre P. Metric and Euclidean properties of dissimilarity coefficients. J. Classif. 1986;3(1):5–48. [Google Scholar]
  26. Gunn C.A. 2002. Tourism Planning: Basics, Concepts, Cases/Clare A. Gunn with Turgut Var.https://trove.nla.gov.au/work/10503424?q&sort=holdings+desc&_=1520179818638&versionId=45468001 Retrieved from. [Google Scholar]
  27. Guo S., Jiang Y., Long W. Urban tourism competitiveness evaluation system and its application: comparison and analysis of regression and classification methods. Proc. Comput. Sci. 2020;162(Itqm 2019):429–437. [Google Scholar]
  28. ICOMOS Carta internacional sobre el turismo cultural. La Gestión del Turismo en los sitios con Patrimonio. 1999;6 [Google Scholar]
  29. Infante Sánchez E. del P. Elementos determinantes en Cundinamarca para el desarrollo del turismo como actividad estratégica regional. Suma de Negocios. 2014;5(10):40–48. [Google Scholar]
  30. Instituto Valenciano de Tecnologías Turísticas . Instituto Valenciano de Tecnologías Turísticas; 2015. BIG DATA: retos y oportunidades para el turismo; p. 84. [Google Scholar]
  31. Jafari J. El turismo como disciplina científica. Politic. Soc. 2005;42(1):39–56. [Google Scholar]
  32. Jeambey Z. Rutas Gastronómicas y Desarrollo local: un ensayo de conceptualización en Cataluña. Pasos: Revista de Turismo y Patrimonio Cultural, ISSN-e 1695-7121, Vol. 14, No. 5, 2016, Págs. 1187-1198. 2016;14(5):1187–1198. [Google Scholar]
  33. Karypis M., Kumar V., Steinbach M. A comparison of document clustering techniques. TextMining Workshop At. 2000 [Google Scholar]
  34. Knezevic R. Contents and assessment of basic tourism resources. 2008;14(1):79–94. [Google Scholar]
  35. Kombol T.P. Rural tourism on the Croatian islands - Sustainable development and regenerative strategies. 2000;102:425–431. [Google Scholar]
  36. Laguna Marín- Yaseli M., Nogués Bravo D. La potencialidad turística del medio natural en el lic de las Sierras Ibericas Riojanas mediante evaluación multicriterio. Zubia - Monogr. 2003;13(2001):227–240. https://dialnet.unirioja.es/descarga/articulo/1032626.pdf Retrieved from. [Google Scholar]
  37. Laitamaki J., Tada M., Liu S., Setyady N., Vatcharasoontorn N., Zheng F. Tourism Industry. 2016;14(1):7–29. [Google Scholar]
  38. Landaluce Calvo M. Isabel. 1st. Vol. 21. QUESTII ¨ O´; 2011. Análisis Factorial Tratamiento De Tablas Mixtas: Nuevas equivalencias entre el ACP normado y el ACM; pp. 99–108. [Google Scholar]
  39. Li J., Chu C.-H., Wang Y., Yan W. 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE’02. Proceedings (Cat. No.02CH37291), 2. 2002. An improved fuzzy c-means algorithm for manufacturing cell formation; pp. 1505–1510. [Google Scholar]
  40. Lillo Adelaida, Ramón Rodríguez Ana, Sevilla-Jiménez Martín. Un marco de análisis del capital humano en turismo. Papers de Turisme. 2006;39:45–59. [Google Scholar]
  41. Lopez Fernando. Libro Verde de la Accesibilidad en España. Diagnóstico y bases para un plan integral de supresión de barreras. Madrid: Instituto de Migraciones y Servicios Sociales. 2002 [Google Scholar]
  42. Lucas Robert C. Clawson and Knetsch, economics of outdoor recreation. Nat. Resour. J. 1968;8:738–743. http://digitalrepository.unm.edu/nrj/vol8/iss4/13 Available at: [Google Scholar]
  43. Maass S.F., Luz I., De R. 2010. Evaluación multicriterio de los recursos turísticos del Parque Estatal Multicriteria evaluation: of tourism resources of the park sierra de. [Google Scholar]
  44. Mart M., Nicol S., Melissa H., Galv O. Políticas públicas Y factores que determinan La competitividad turística de Morelia. México Y De Alcalá De Henares. 2013;8(2):1598–1603. [Google Scholar]
  45. Martínez Mario Alberto, Osorio García Maribel, Franco Maass Sergio, Ramírez de la O Irma Luz, Nava Bernal Gabino. Evaluación Multicriterio de los recursos turísticos del Parque Estatal. El periplo Sustentable. 2010;18:1–33. [Google Scholar]
  46. Meyer D. 2004. Tourism Routes and Gateways: Key Issues for the Development of Tourism Routes and Gateways and Their Potential for Pro-poor Tourism; pp. 1–31. (April) [Google Scholar]
  47. Mikery Gutiérrez Mildred Joselyn, Pérez-Vázquez Arturo. Métodos para el análisis del potential turístico del territorio rural. Revista Mexicana de Ciencias Agrícolas. 2014;(9):1729–1740. doi: 10.29312/remexca.v0i9.1060. [DOI] [Google Scholar]
  48. Ministerio de Cultura . Ministerio de Cultura; 2010. Cartilla Política Cultura para el Pueblo Gitano; p. 24. [Google Scholar]
  49. Ministerio de Cultura de Colombia . Awá Kuaiker, Gente de La Montaña, (018000) Ministerio de Cultura de Colombia; 2004. Raizales, isleños descendientes de europeos y africanos; pp. 1–11.http://www.mincultura.gov.co/areas/poblaciones/noticias/Documents/Caracterizacin%20del%20pueblo%20Awa.pdf Retrieved from. [Google Scholar]
  50. Minitab 18 Dendrograma. 2018. https://support.minitab.com/es-mx/minitab/18/help-and-how-to/modeling-statistics/multivariate/how-to/cluster-observations/interpret-the-results/all-statistics-and-graphs/dendrogram/ Retrieved from.
  51. MINCIT . Ministerio de comercio de Colombia; 2003. Seguridad Turística: Reto competitivo de Colombia. [Google Scholar]
  52. Mishra P.G.M.S. 2009. Asia Pacific Journal of Marketing and Logistics.http://www.emeraldinsight.com/doi/pdfplus/10.1108/13555850910926281 Retrieved from. [Google Scholar]
  53. Morteza Z., Reza F.M., Seddiq M.M., Sharareh P., Jamal G. Selection of the optimal tourism site using the ANP and fuzzy TOPSIS in the framework of Integrated Coastal Zone Management: a case of Qeshm Island. Ocean Coast Manag. 2016;130:179–187. [Google Scholar]
  54. Navarro D. Recursos turísticos y atractivos turísticos: conceptualización, clasificación y valoración. Cuad. Tur. 2015;(35):335. [Google Scholar]
  55. Nilashi M., Ahani A., Dalvi M., Yadegaridehkordi E., Samad S., Ibrahim O.…Akbari E. Preference learning for eco-friendly hotels recommendation: a multi-criteria collaborative filtering approach. J. Clean. Prod. 2019;215:767–783. [Google Scholar]
  56. OEA . CICATUR-OEA; 1987. Touristics Resources. [Google Scholar]
  57. Open Africa . Open Africa; Cape Town: 2002. Information Relating to the African Dream Project. [Google Scholar]
  58. Padín Fabeiro C., Pardellas de Blas X.X. Una propuesta de turismo sostenible para el municipio de Caldas de Reis (Pontevedra) Cuad. Tur. 2004;13(13):107–126. http://dialnet.unirioja.es/servlet/articulo?codigo=1033354&info=resumen&idioma=SPA Retrieved from. [Google Scholar]
  59. Pitchayadejanant K., Nakpathom P. Data mining approach for arranging and clustering the agro-tourism activities in orchard. Kasetsart J. Soc. Sci. 2018;39(3):407–413. [Google Scholar]
  60. Podani J. Extending Gower’s General Coefficient of Similarity to Ordinal Characters. 2012;48(2):331–340. [Google Scholar]
  61. Porter M.E. Location, competition, and economic development: local clusters in A global economy. Econ. Dev. Q. 2000;14(1):15–21. [Google Scholar]
  62. Quesada Castro Renato. EUNED; 2006. Elementos Del Turismo; pp. 1–316. [Google Scholar]
  63. Rátz T., Puczkó L. SAC; Auchincruive, Ayr: 1998. Rural Tpurism and Sustainable Development in Hungary. “Rural Tourism Management: Sustainable Options” International Conference, Conference Proceedings; pp. 450–464. [Google Scholar]
  64. Real Academia Española (RAE). (Spanish Royal Academy) Spanish dictionary. Twenty-second edition. 2005. https://dle.rae.es/infraestructura?m=form Available at: (consulted in octubre, 2018) [Google Scholar]
  65. Reyez O., Sánchez C. Metodología para determinar el potential de los recursos turísticos naturales en el Estado de Oaxaca, México. Cuad. Tur. 2005;1(16):153–173. [Google Scholar]
  66. Rocha H. Entrepreneurship and development: the role of clusters. Small Bus. Econ. 2004;23:363–400. [Google Scholar]
  67. Rubio M.L., Duval V.S., Pezzola A. Análisis de Localización de Emprendimientos Turísticos en El Sector Norte del Partido de Villarino (Argentina) InterEspaço: Revista de Geografia e Interdisciplinaridade. 2016;2(5):9–35. [Google Scholar]
  68. SECTUR . 1975. ¿Que Hacemos?https://www.gob.mx/sectur/es/#192 Retrieved from. [Google Scholar]
  69. Smith S.L.J. Regional tourism analysis of tourism resources. Tourism. 1987;14:254–273. [Google Scholar]
  70. Smith S. Identification of funtional tourism in North America. Am. Pharmaceut. Rev. 2013;16(5):13–20. [Google Scholar]
  71. Solís M.J., Errazuriz M.J., Caradeuc C., Seabra G. Evaluación multicriterio de la potencialidad turística de un territorio. caso de estudio parque nacional pan de azúcar, región de Atacama. Chile. 2013:1–12. [Google Scholar]
  72. Triantaphyllou E. 2000. Multi-Criteria Decision Making Methods. (March) pp. 5–21. [Google Scholar]
  73. UNWTO . Responding to Global Challenges Climate Change and Tourism Responding to Global Challenges (Copyright; World Tourism Organization and United Nations Environment. World Tourism Organization and the United Nations Environment Programme; Madrid, Spain: 2008. [Google Scholar]
  74. UNWTO Sustainable Tourism for Development Guidebook. Enhancing Capacities for Sustainable Tourism for Development in Developing Countries. 2013;228 [Google Scholar]
  75. Vanegas Juan Gabriel. Evaluación Multicriterio e inventario de atractivos turísticos. Espacios. 2017;38:1–6. [Google Scholar]
  76. Wilks D.S. Cluster Analysis. Int. Geophys. 2011;100:603–616. [Google Scholar]
  77. Zimmer P., Grassman S. Seminario Leader. 1996. Evaluar el potential turístico de un territorio. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Aditional Information.pdf
mmc1.pdf (52.7KB, pdf)
Annex 1. 79 characteristics for each Township.csv
mmc2.csv (456.4KB, csv)
Annex 2. Similarity Matrix.csv
mmc3.csv (20.8KB, csv)
Annex 3. Distance Matrix.csv
mmc4.csv (49.4KB, csv)
Annex 4. doc.kml
mmc5.zip (1.9MB, zip)

Data Availability Statement

Data will be made available on request.


Articles from Heliyon are provided here courtesy of Elsevier

RESOURCES