Skip to main content
Data in Brief logoLink to Data in Brief
. 2016 Aug 27;9:123–127. doi: 10.1016/j.dib.2016.08.040

Infodemiological data of Ironman Triathlon in the study period 2004–2013

Sofiane Mnadla a,1, Nicola Luigi Bragazzi b,c,d,1,, Mehdi Rouissi e, Anis Chaalali e, Anna Siri d, Johnny Padulo f,g, Luca Paolo Ardigò h, Francesco Brigo i,j,2, Karim Chamari k,2, Beat Knechtle l,m,2
PMCID: PMC5018091  PMID: 27642618

Abstract

This article reports data concerning the Internet-related activities and interest for Ironman Triathlon competition. Google Trends (GT) was used and mined from 2004 onwards. The interest for Ironman Triathlon was found to be cyclic over time. The Triathlon-related Internet activities negatively correlated with the number of finishers per year (Pearson׳s correlation r=−0.690, p-value<0.05), while an increasing participation of female athletes who were less likely to surf the Internet could be noticed (r=−0.811, p-value<0.05). Further, younger athletes, who were more likely to access the web, were underrepresented in the Ironman Triathlon event. Moreover, there was a correlation between the biking time and the Internet query volumes (r=0.590, p-value<0.05), and, in particular, for the male athletes (r=0.664, p-value<0.05). Finally, the countries which most contributed to the Internet query volumes were those with the highest number of medals.

Keywords: Digital era, Google Trends, Infodemiology, Ironman Triathlon, Web 2.0


Specifications Table

Subject area Sports sciences
More specific subject area Sports data mining
Type of data Graphs, heat-maps
How data was acquired Outsourcing of Google Trends site and the Ironman site
Data format Raw and Analyzed
Experimental factors Google Trends search volumes were obtained through graphs and heat-maps
Experimental features Validation of Google Trends-based data with “real-world” data taken from the Ironman site was performed by means of correlational analysis
Data source location Worldwide
Data accessibility Data are within this article

Value of the data

  • Google Trends (GT)-based data (infodemiological data) could be useful for scientific community and researchers in that they show good correlation with “real world” data obtained from the Ironman site, thus proving to be reliable.

  • These data could be further statistically processed, analyzed, refined and validated.

  • These data could be used to understand sports-related web activities.

1. Data

This article contains infodemiological data on Ironman Triathlon searched worldwide in the study period 2004–2013, obtained from Google Trends (GT) (Fig. 1, Fig. 2). These data showed a cyclic pattern (Fig. 3) and well correlated with “real-world” data obtained from the Ironman Triathlon site for the same study period (Fig. 4, Fig. 5, Fig. 6, Fig. 7).

Fig. 1.

Fig. 1

Heat-map of interest for Ironman Triathlon for each country.

Fig. 2.

Fig. 2

Interest for Ironman Triathlon over time in the period 2004–2013, worldwide.

Fig. 3.

Fig. 3

Wavelet Spectral Analysis of Ironman Triathlon-related web searches.

Fig. 4.

Fig. 4

Correlation between Ironman Triathlon-related web activities and number of finishers per year.

Fig. 5.

Fig. 5

Correlation between Ironman Triathlon-related web activities and number/percentage of female finishers per year.

Fig. 6.

Fig. 6

Correlation between Ironman Triathlon-related web activities and number/percentage of male finishers per year.

Fig. 7.

Fig. 7

Correlation between Ironman Triathlon-related web activities and average biking time per year/percentage of biking time per year (overall and for male athletes).

2. Experimental design, materials and methods

GT (freely available at https://www.google.com/trends) was used to explore Internet activities and interest related to Ironman Triathlon competition [1]. GT was searched worldwide, looking for “Ironman triathlon” as keyword, and using “search topic” as search strategy option, from its inception until 2013. “Real-world” statistical data were collected from the Ironman Triathlon site (available at http://ironmanworldchampionship.com) for the same study period 2004–2013.

In order to capture regular time patterns, spectral analysis was carried out using algorithms written in Matlab, freely accessible at http://paos.colorado.edu/research/wavelets/ [2].

Correlational analysis was carried out between the GT-based search volumes and the “real-world” statistical data about Ironman Triathlon. All statistical analyses were performed using commercial software, namely the Statistical Package for Social Science version 23.0 (SPSS, IBM, IL, USA) and STATISTICA version 12 (StatSoft Inc., Tulsa, OK, USA). Figures with a p-value<0.05 were considered statistically significant.

Conflicts of interest

The authors declare no conflicts of interest.

Footnotes

Transparency document

Transparency data associated with this article can be found in the online version at http://dx.doi.org/10.1016/j.dib.2016.08.040.

Transparency document. Supplementary material

Supplementary material

mmc1.pdf (79.3KB, pdf)

.

References

  • 1.Knechtle B., Nikolaidis P.T., Rosemann T., Rüst C.A. Ironman Triathlon. Prax. (Bern. 1994) 2016;105:761–773. doi: 10.1024/1661-8157/a002369. [DOI] [PubMed] [Google Scholar]
  • 2.Torrence C., Compo G.P. A practical guide to wavelet analysis. Bull. Am. Meteorol. Soc. 1998;79:61–78. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.pdf (79.3KB, pdf)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES