Abstract
Statistical modelling of sports data has become more and more popular in the recent years and different types of models have been proposed to achieve a variety of objectives: from identifying the key characteristics which lead a team to win or lose to predicting the outcome of a game or the team rankings in national leagues. Although not as popular as football or basketball, volleyball is a team sport with both national and international level competitions in almost every country. However, there is almost no study investigating the prediction of volleyball game outcomes and team rankings in national leagues. We propose a Bayesian hierarchical model for the prediction of the rankings of volleyball national teams, which also allows to estimate the results of each match in the league. We consider two alternative model specifications of different complexity which are validated using data from the women's volleyball Italian Serie A1 2017–2018 season.
Keywords: Bayesian statistics, volleyball, poisson distribution, hierarchical models
1. Introduction
A considerable amount of research has focused around the analysis of sport data, mostly for the prediction of sport scores in each game and team rankings in national leagues. Examples of different types of applications of statistical models to sport data include football [2,4,11–13,20], but also basketball [14] and hockey [16]. However, there is a shortage of published works on the analysis of volleyball, and in particular women's volleyball data. Notable exceptions include: [10], who explored the differences in playing characteristics between winning and losing teams in the international federation of men's beach volleyball world tour tournament; [6] and [24], who examined the overall performance of volleyball teams based on many game factors using data from the men's Italian serie A and the women's National Collegiate Athletic Association (NCAA) Division I volleyball league, respectively; [1] and [21], who used different approaches to either analyse the results from the games or to predict future team rankings in the Turkish men's and women's volleyball leagues.
A potential factor which limited the analysis of volleyball data in the literature is the fact that the playing rules and the scoring system vary across countries and have also changed over the years, e.g. from the ‘side-out’ to the ‘rally’ scoring system. According to the current regulations of the Federation International de Volleyball [5], volleyball games have six players on each side. Six rotational spots on the court are changed every time a particular side serves the ball. The aim is to deliver the ball over the net and ground it, or the ball touches on the ground of the opposing side, while preventing the ball from touching the ground on their own side. The volleyball game begins with a ‘serve’ of the ball. Then players take turns rotating around the court so each player has a chance to serve the ball. The members of the opposing team will attempt to ‘save’ it from hitting the ground and knock it to one of their own players or hit it back over the net. Play will continue until one team fails by allowing the ball to touch the ground in its own court or they correctly return it to the opposing court. A point is awarded to the serving team if the opposing team makes a mistake. If the serving team fails, however, then the receiving team has control of the ball and becomes the serving team.
According to the current scoring system, matches are played until a team wins a set of three games, with each game typically going to 25 points. However, if the two teams won two games each, the third game then goes only up to 15 points. A team must win each game by 2 points. If the score is tied with even numbers, both teams have to continue playing the game until a 2-point lead is obtained. Otherwise, points keep accumulating until one team wins with a margin of victory of 2 points, even if the score is greater than 25 or 15 points. When playing in professional volleyball leagues, the teams get points according to set numbers at the end of all matches in the league. More specifically, the team points are awarded as follows: if the match is won 3–0 or 3–1, 3 points are assigned to the winner and 0 points to the loser; if the match is won 3–2, 2 points are assigned to the winner and 1 point to the loser.
In this paper, building on the previous literature on modelling sport data, we propose a novel Bayesian hierarchical modelling framework for the analysis of volleyball data which allows to jointly predict match results and team rankings in national leagues. We use data from the women's volleyball Italian Serie A1 2017–2018 season as a motivating example to implement and validate the proposed framework. The rest of the article is structured as follows: Section 2 introduces the notation and the data used for the analysis. Sections 3 and 4 describe the modelling framework and two alternative model specifications implemented for the analysis of the volleyball data. Section 5 summarises the results in terms of parameter estimation as well as outcome prediction and assesses the relative performance of the models. Finally, Section 5 presents some issues and possible extensions that can be the material for future work, while Appendix 1 includes the JAGS code for our analysis.
2. Data from the italian serie A1 2017–2018
Data were obtained from the Italian women's volleyball federation web site (http://www.legavolleyfemminile.it/) and include different types of in-game statistics related to each of the 132 matches of the regular season 2017–2018 for all 12 teams in the league. Generally, there are six categories of volleyball statistics which have been found to be associated with match results [10], which can be broadly distinguished in: attack, setting, serving, passing, defence, and blocking. Data on variables related only to four of these categories are included in the analysis since for some of the categories data are not publicly available on the federation website. In particular, for each team and match of the women's Italian Serie A1 regular season 2017–2018, the following set of statistics are included in our analysis (in brackets the names assigned to each variable):
Match. The number of the match.
Teams. The names of the home team () and away team () for each match, together with a unique numeric score assigned to each team regardless of whether it is the hosting (h) or visiting (a) team.
Points. Total number of points scored by the home team () and by the away team ().
Sets. Total number of sets won by the home team () and by the away team () for each match, and a binary indicator telling whether the 5-th set was played () or not (). Additionally, we define a second binary indicator , which takes value 1 or 0 according to whether the home team won or lost the match, respectively.
Serve. Total number of serves (), total number of perfect serves or ‘aces’, i.e. ending in a point for the serving team () and total number of serving errors (), i.e. ending in a point for the opponent team, for . Serve efficiency is then defined as .
Defence. Total number of digs (), total number of perfect ‘digs’, i.e. defence action causing the ball to follow a trajectory defined according to specific criteria () and total number of digging errors (). Defence efficiency is then defined as .
Attack. Total number of attacks (), total number of perfect attacks, i.e. attacks ending in a point for the attacking team () and total number of attacking errors (). Attack efficiency is then defined as .
Block. Total number of blocks (), total number of perfect blocks, i.e. blocks ending in a point for the blocking team () and total number of blocking errors or ‘invasions’ (). Block efficiency is then defined as .
Table 1 shows the structure of the dataset, which includes different types of in-game statistics associated with each game (we use the term … to indicate that the data related to some of the matches and variables have been omitted from the table for clarity). For example, the first row contains information about the first match of the season between the home team Casalmaggiore and the away team Conegliano, which are assigned with the unique numeric scores 3 and 4, respectively. The match ended with a total point score of 60 to 75 in favour of the away team, who won a total of 3 sets without losing any set.
Table 1. Summary of the type of data used in the analysis of the Italian Serie A1 women's volleyball league 2017–2018.
i | home team | away team | h | a | … | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Casalmaggiore | Conegliano | 3 | 4 | 60 | 75 | 0 | 3 | 0 | 0 | −0.17 | −0.06 | … |
2 | Novara | Busto Arsizio | 8 | 2 | 113 | 108 | 3 | 2 | 1 | 1 | −0.09 | −0.10 | … |
3 | Piacenza | Scandicci | 10 | 12 | 40 | 75 | 0 | 3 | 0 | 0 | −0.07 | 0.03 | … |
4 | San Casciano | Monza | 11 | 7 | 96 | 94 | 3 | 1 | 0 | 1 | −0.07 | −0.07 | … |
… | … | … | … | … | … | … | … | … | … | … | … | … | … |
129 | Casalmaggiore | Piacenza | 3 | 10 | 105 | 103 | 2 | 3 | 1 | 0 | −0.04 | −0.03 | … |
130 | Scandicci | Busto Arsizio | 12 | 2 | 110 | 98 | 3 | 2 | 1 | 1 | −0.06 | −0.06 | … |
131 | Filottrano | San Casciano | 5 | 11 | 113 | 108 | 3 | 2 | 1 | 1 | −0.08 | −0.03 | … |
132 | Novara | Legnano | 8 | 6 | 75 | 57 | 3 | 0 | 0 | 1 | −0.08 | −0.07 | … |
In our analysis, we also consider the efficiency measures of the different statistics available for each team when playing at home and visiting: serve and attack efficiency ( and ) – typically related to the offensive strength of each team – and block and defence efficiency ( and ) – typically associated with the defensive strength of each team. These variables are included in the model to improve the estimation of the attack and defence performances of the teams and the prediction of the results for future matches.
Before introducing the model, it is important to highlight some key characteristics of volleyball which needs to be taken into account when modelling the data. First, unlike other sports, the total number of points scored in a volleyball match by each team is not typically enough to determine which of the two teams effectively won the match. For example, consider the situation where a match ends with the home team winning a total of 3 sets with a relatively small point difference with respect to the opponent in each set, while the away team wins 2 sets with a substantially larger point gap in each set. Then, it is possible that the winning team is associated with a lower number of points across all sets compared with those scored by the losing team. Second, the total number of points assigned to the two opposing teams at the end of a match may vary depending on the number of sets played. If the total number of sets played is 3 or 4, the winning and losing team earns 3 and 0 points, respectively; if the total number of sets played is 5, the winning and losing team are given 2 and 1 points, respectively.
Finally, we note that, although the total number of points scored in a match could be potentially infinite, the total number of points scored in a single set may or may not have an upper bound. For example, when a team scores 25 points in the first set and the opponent has scored 23 or less, the match ends; however, if the opponent has scored 24, the match continues until the point difference between the two teams is equal to 2. Ideally, set-specific statistics for each match would provide more detailed information about the performance of each team compared with using the aggregated points across all the sets played. However, this type of information is not typically available to the public and their analysis would also introduce additional modelling issues which are discussed in Section 6.
3. Modelling framework
From a statistical perspective, one of the most interesting aspects in modelling team sport data is related to the distributional form associated with the number of scores in a single game by the two opponent teams. In the analysis of football data, the Poisson distribution has been widely used to model the game scores; for example, [12] proposed the use of a bivariate Poisson distribution which includes a parameter that explicitly accounts for the covariance between the goals scored by the two competing teams. More recently, [2] adopted a Bayesian hierarchical model which simultaneously accounts for the correlation between the two variables via the specification of two conditionally independent Poisson variables for the number of goals scored. Being defined within a Bayesian framework, this flexible modelling specification also allows to generate predictions of future games in a relatively easy way by means of the posterior predictive distribution.
We extend and adapt the modelling frameworks typically used for the analysis of football data and propose a novel Bayesian hierarchical modelling framework for the analysis and prediction of volleyball results in regular seasons. Three different sub-models or ‘modules’ form our framework: (1) the module of the observed number of points scored by the two opposing teams in a match ( and ); (2) the module of the binary indicator for the number of sets played (); (3) the module of the binary indicator for the winner of the match (). These three modules are jointly modelled using a flexible Bayesian parametric approach, which allows to fully propagate the uncertainty for each unobserved quantity and to assess the predictive performance of the model in a relatively easy way. In the following, we describe the notation and the model used in each of the three modules.
3.1. Module 1: modelling the scoring intensity
In the first module of the framework, we follow [2,12] and model the number of points scored by the home and away team in the i-th match of the season using two independent Poisson distributions
(1) |
conditionally on the set of parameters , representing the scoring intensity in the i-th match for the home and away team, respectively. The parameters in Equation (1) are then modelled using the log-linear regressions
(2) |
which corresponds to a Poisson log-linear model, whose parameters can be given specific interpretations [12]. Within the formulae of Equation (2), μ is a constant, while λ can be identified as the home effect and represents the advantage for the team hosting the game which is typically assumed to be constant for all the teams and throughout the season. The overall offensive and defensive performances of the k-th team is captured by the parameters att and def, whose nested indexes identify the home and away team in the i-th game of the season, where K denotes the total number of the teams.
We then expand the modelling framework to incorporate match-specific statistics related to the offensive and defensive performances of the home and away teams. More specifically, the effects associated with the attack intensity of the home teams and the defence effect of the away teams are:
(3) |
We omit the index i from the terms to the left-hand side of the above formulae to ease notation, i.e. and . The overall offensive effect of the home teams in Equation 3 is a function of a baseline team-specific parameter , and the attack and serve efficiencies of the home team, whose impact is captured by the parameters and . The overall defensive effect of the away team is a function of a baseline team-specific parameter , and the defence and block efficiencies of the away team, whose impact is captured by the parameters and , respectively.
Similarly, the effects associated with the attack intensity of the away teams and the defence effect of the home teams are:
(4) |
where the overall offensive effect in Equation 4 depends on the parameters , and , while the overall defensive effect depends on the parameters , and .
To achieve identifiability of the model, a set of parametric constraints needs to be imposed. Following [2,12], we impose sum-to-zero constraints on the team-specific parameters
(5) |
for and . Under this set of constraints, the overall offensive and defensive effects of the teams are expressed as departures from a team of average offensive and defensive performance. An alternative set of constraints that could be imposed to achieve identifiability is known as corner constraints, where the overall offensive and defensive effects of a ‘reference’ team are set to 0, and the effects of all other teams are expressed as departures from the effects of the reference team. However, we find the interpretation of the model parameters under Equation (5) to be more intuitive compared with the corner constraints, without the potential issue of the choice of the reference team.
Within a Bayesian framework, prior distributions need to be specified for all random parameters in the model. Weakly informative Normal distributions centred at 0 with a relatively large variances are specified for the fixed effect parameters (note that we specify Normal distributions in terms of mean and precision, where precision=1/variance)
The team-specific random effect parameters are instead modelled as exchangeable from common distributions
for and . Finally, the mean and precision of the team-specific effect distributions are modelled using weakly informative Normal and Gamma priors, respectively:
for . These hyperparameters represent the latent structure of the model which affects the estimation of the scoring intensity parameters , and thus indirectly takes into account a form of correlation between the two outcome variables and .
3.2. Module 2: modelling the probability of playing 5 sets
In the second module, we explicitly model the chance of playing 5 sets in the i-th match of the season, i.e. the sum of the sets won by the home () and away () team is equal to 5. This is necessary when generating predictions in order to correctly assign the points to the winning/losing teams throughout the season and evaluate the rankings of the teams at the end of the season (Section 2).
We model the indicator variable , taking value 1 if 5 sets were played in the i-th match and 0 otherwise, using a Bernoulli distribution
(6) |
where denotes an indicator function which takes value 1 when the sum of the sets won by the home and away teams equal 5 and 0 otherwise, while is the match probability of playing 5 sets, which is estimated on the logit scale as a function of a set of parameters . These include a baseline parameter and the total number of points scored by the home and away teams, whose effect is captured by the parameters and , respectively. For all logistic regression coefficients vague Normal prior distributions with mean 0 and standard deviation of 100 are specified.
3.3. Module 3: modelling the probability of winning the match
The last module deals with the chance of the home team to win the i-th match, i.e. the total number of sets won by the home team () is larger than that of the away team () – we note that we could have also equivalently decided to model the chance of the away team to win the i-th match. This part of the model is again necessary when predicting the results for future matches, since the team associated with the higher number of points scored in the i-th match may not correspond to the winning team (Section 2).
We model the indicator variable , taking value 1 if the home team won the i-th match and 0 otherwise, using another Bernoulli distribution
(7) |
where denotes an indicator function which takes value 1 when the number of the sets won by the home team is larger than that of the away team and 0 otherwise, while is the match probability for the home team to win the match, which is estimated on the logit scale as a function of a set of parameters . These include a baseline parameter , the total number of points scored by the home and away teams, and the binary indicator associated with playing 5 sets. The effects of the last three variables are captured by the parameters , and , respectively. For all logistic regression coefficients vague Normal prior distributions with mean 0 and standard deviation of 100 are specified.
Figure 1 shows a graphical representation of the structure of the model as well as the contribution of each of the three modules to the framework.
Figure 1.
Graphical representation of the modelling framework. The joint distribution is expressed in terms of a distribution of the scoring intensity , the conditional distributions of the probability of playing 5 sets and the conditional probability of winning the match , respectively indicated with a dashed blue, red and green line. The parameters indexing the corresponding distributions or ‘modules’ are indicated with different Greek letters, while i denotes the match index within the regular season. The solid black arrows show the dependence relationships between the parameters and variables within and between the three modules.
The framework corresponds to a joint distribution for all the observed quantities which are explicitly modelled. This is factored into the product of the marginal distribution of the total number of points scored by the two teams in each match, Module 1 – , the conditional distribution of the probability of playing 5 sets in a match given , Module 2 – , and the conditional probability of winning the match given and , Module 3 – . Module 1 also includes the different in-game statistics as covariates in the model. These are related to the either the offensive (serve and attack efficiency) or defensive (defence and block efficiency) effects of the home and away teams in each match of the season, and are respectively denoted in the graph as and to ease notation, for .
4. Accounting for the multilevel correlation
Although the individual-level correlation between the observable variables and is taken into account through the hierarchical structure of the framework, a potential limitation of the model is that it ignores the possible multilevel correlation between the team-specific offensive and defensive coefficients, for and . To show the implications of this assumption, it is convenient to represent the two categories of multilevel parameters, described in Section 3.1, using the matrix notation
(8) |
where and are the two vectors of length 3 containing the offensive and defensive regression coefficients associated with team k in the league, respectively. The hyperparameters
are two vectors of length 3, representing the mean of the joint distributions of and , while
are two matrices, whose elements form the covariance matrices of the distributions of and in Equation (8). In Section 3.1, we implicitly assumed independence between the multilevel parameters and specify prior distributions only on and , i.e. we set for .
To account for the possible multilevel correlation, prior distributions on can be specified. One possibility is to directly specify separate priors on each component of the covariance matrices , including the correlation parameters. However, when the number of multilevel parameters is more than 2, modelling is challenging, since they all need to be constrained between and 1 while also ensuring the positive definiteness of the covariance matrices. An alternative approach is to specify Inverse-Wishart prior distributions on (or equivalently Wishart priors on ). This is defined as and has the computational advantage of being the conjugate prior for the covariance matrix of a multivariate Normal distribution, but requires the specification of two hyperprior parameters. The scale matrix , which must be positive definite with dimension equal to the covariance matrix, and the degrees of freedom ν, which is bounded from below by the dimension of . A general problem when setting these parameters is that, depending on the prior choice, some undesirable constraints may be imposed on some of the parameters of . For example, setting ν equal to the dimension of corresponds to specifying uniform priors on the correlation parameters , but it may lead to too restrictive priors on the scale parameters of . Conversely, increasing the value of ν allows to estimate the scale parameters of with more freedom, but at the cost of constraining the correlation parameters [8].
In our analysis, we account for the multilevel correlation using Inverse-Wishart distributions on and , which are scaled in order to facilitate the specification of the priors. This prior is known as the scaled Inverse-Wishart distribution [8] and can be constructed in the following way. First, two vectors of scale parameters and of length 3 are introduced such that the covariance matrices can be decomposed as the products
(9) |
where and are two diagonal matrices with the entries of and being the diagonal elements. The Inverse-Wishart priors are then specified on the unscaled covariance matrices and , that is
We set and as identity matrices, and set and equal to 4. Weakly informative priors are then assigned to each scaling factor, i.e. and , for . Under this approach, the j-th variance components and of the covariance matrices can then be retrieved as the product between the corresponding diagonal elements of and and the appropriate scaling factors from and :
for . Similarly, the jl-th covariance components are given by
for and . Since this parametrisation is based on the decomposition of the covariance matrix , the parameters and cannot be interpreted separately. However, Equation (9) provides a useful way to set up the model and minimise the amount of undesirable prior constraints when estimating the parameters of interest.
5. Implementation and results
We fitted all models using JAGS [15], a software specifically designed for the analysis of Bayesian models using Markov Chain Monte Carlo (MCMC) simulation [3], which was interfaced with R through the package R2jags [19]. Samples from the posterior distribution of the parameters of interest generated by JAGS and saved to the R workspace were then used to produce summary statistics and plots. We ran two chains with 20,000 iterations per chain, using a burn-in of 10,000, for a total sample of 20,000 iterations for posterior inference. For each unknown quantity in the model, we assessed convergence and autocorrelation of the MCMC simulations using diagnostic measures such as the potential scale reduction factor and the effective sample size [7].
Alternative prior distributions were considered to check that no unintended information was included into the models through the priors. For example, weakly informative uniform priors for the standard deviations between 0 and an upper value in were explored. In addition, different values for the variance included in of normally distributed regression parameters on the log (Module 1) or logit (Module 2 and 3) scale were considered. Results were robust to these specifications. We provide in Appendix 1 the JAGS code to implement the models.
5.1. Results
In this section, we present the results of the models from a twofold perspective. First, we look at the estimates of the team-specific marginal offensive and defensive effects, which are denoted with and , respectively. This is achieved by setting all the attack and defence-related covariates in Equations (3) and (4) to their centred version, i.e. subtracting the means from the original vectors. In this way, it is possible to interpret the intercept parameters and as the marginal offensive and defensive effects associated with each team in the league. Second, we assess and compare the fit of the models using relative measures of predictive accuracy and graphical posterior checks. The first provides an approximation of the relative ability of the models to predict future results, while the second offers a relatively easy way to assess the fit of the model. Specifically, we use the estimates of at each posterior iteration to generate a vector of 1000 replications from the posterior predictive distribution of . This, in turn, is used to validate the model by comparing the results obtained from the replicated matches with those from the observed data (if the models are plausible, the observed data should look similar under the posterior predictive distribution).
5.1.1. Posterior estimates
Figure 2 displays the average marginal mean offensive () and defensive () effects for each of the 12 teams in the league, which are estimated using the basic model of Section 3 and the scaled Inverse-Wishart (IW) model of Section 4.
Figure 2.
Average marginal mean offensive () and defensive () effects for all teams in the league, estimated from the log-linear regression of the basic model from Section 3 (a) and the scaled IW model from Section 4 (b). The teams associated with the best (worst) effects are denoted with triangles (diamonds) to distinguish them from the others.
The position of the points in the graphs is related to the average offensive and defensive performance of the teams, which are in turn based on the number of points scored and conceded by each team throughout the season. We note that the teams who perform better are those associated with positive attack effects (more points scored) and negative defence effects (less points conceded).
The results from both models are similar with limited differences. In general, we can identify four main clusters of teams. The first cluster is formed by a single team (Novara) who is associated with the largest attack effect among all the teams, but with a close-to-zero defence effect (the effect is estimated to be slightly negative and positive under the first and second model, respectively). The second cluster is formed by two teams (Conegliano and Scandicci) who have small positive attack effects but relatively large negative defence effects; Scandicci is by far the team with the smallest defence effect (reflecting the fact that it has the minimum number of conceded points in the league).The third cluster is formed by six teams whose offensive and defensive effects place them in the top part of the graphs; among these, Casalmaggiore is the team with the largest attack and smallest defence effect, while Bergamo and San Casciano are associated with negative attack effects. Finally, the fourth cluster is formed by three teams (Filottrano, Legnano and Pesaro) who are located on the top-left part of the graphs, and are associated with the poorest offensive performances. Additional posterior summary measures, including the posterior 95% credible intervals, for the team-specific attack and defence effects, the home effect (λ) and the constant (μ) parameters are provided in Tables 4 and 5 in Appendix 2.
5.1.2. Model assessment
We compute two relative measures of predictive accuracy to compare the fit of the two models considered. We specifically rely on the Widely Applicable Information Criterion (WAIC) [23] and the Leave-One-Out Information Criterion (LOOIC), which provide estimates for the pointwise out-of-sample prediction accuracy from a fitted Bayesian model using the log-likelihood evaluated at the posterior simulations of the parameter values [22]. Both measures can be seen as approximations to different versions of cross-validation [18]. They can also be viewed as an improvement on the Deviance Information Criterion (DIC) [17] for Bayesian models in that they use the entire posterior distribution, are invariant to parametrisation, and are asymptotically equal to Bayesian cross-validation [9]. These information criteria are obtained based on the model deviance and a penalty for model complexity known as effective number of parameters () and, when comparing a set of models based on the same data, the one associated with the lowest WAIC or LOOIC is the best-performing, among those assessed. A detailed presentation about how these measures are computed and their interpretation with respect to cross-validation approaches is outside the scope of this article and we refer to [9] for a comprehensive review on the matter. Table 2 reports the estimates of WAIC and LOOIC (and respective effective number of parameters) associated with the two alternative model specifications.
Table 2. WAIC, LOOIC and respective effective number of parameter () estimates for the basic and scaled IW model.
Basic model | Scaled IW model | |
---|---|---|
WAIC () | 2646 (148) | 2640 (145) |
LOOIC () | 2665 (157) | 2661 (155) |
Both measures of predictive accuracy are higher when the Basic model is used compared with Scaled IW model, with relatively similar estimates for , therefore suggesting an overall better fit to the data of the second specification.
We additionally assess the predictive performance of the models using the observed and replicated data. The latter are generated from the posterior predictive distributions of the models using the posterior samples of the parameters indexing the three modules of the framework: the scoring intensity rates (Module 1), the probability of playing 5 sets (Module 2), and the probability of winning the match (Module 3). We use the posterior estimates from these parameters to jointly sample 1000 replications of the data , which are then used for model assessment. Table 3 compares the predictive performance of the models in terms of the total number of points scores and conceded, the number of matches won and the number of points in the league earned by each team at the end of the season. Overall, the predicted results from both the basic and the scaled IW model seem to replicate the observed data relatively well for most of the teams. The total number of points scored and conceded are similar between the observed and replicated data, with the teams scoring (conceding) the most being also associated with the highest replicated points scored (conceded) and vice versa. Relatively small discrepancies are observed between the results of the two models for some of the teams. The total number of wins and league points are almost identical between the observed and replicated data, with the scaled IW model being associated with slightly more accurate predictions compared with the basic model.
Table 3. Posterior predictive validation of the models.
Observed | Basic model | Scaled IW model | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Teams | Scored | Conc'd | Wins | Points | Scored | Conc'd | Wins | Points | Scored | Conc'd | Wins | Points |
Bergamo | 1848 | 2025 | 7 | 19 | 1846 | 2020 | 7 | 21 | 1847 | 2020 | 7 | 21 |
Busto Arsizio | 1999 | 1927 | 12 | 39 | 1998 | 1919 | 12 | 37 | 2006 | 1920 | 12 | 37 |
Casalmaggiore | 1922 | 2051 | 6 | 23 | 1918 | 2042 | 7 | 23 | 1906 | 2039 | 7 | 23 |
Conegliano | 1960 | 1696 | 17 | 50 | 1960 | 1706 | 18 | 50 | 1955 | 1713 | 18 | 50 |
Filottrano | 1781 | 1961 | 7 | 19 | 1790 | 1954 | 6 | 18 | 1810 | 1950 | 6 | 18 |
Legnano | 1642 | 1903 | 5 | 11 | 1656 | 1898 | 4 | 17 | 1660 | 1901 | 4 | 17 |
Monza | 2003 | 1943 | 13 | 37 | 1994 | 1938 | 13 | 38 | 2005 | 1935 | 13 | 38 |
Novara | 1987 | 1776 | 17 | 51 | 1963 | 1776 | 17 | 51 | 1963 | 1781 | 17 | 51 |
Pesaro | 1776 | 1820 | 10 | 32 | 1785 | 1828 | 11 | 32 | 1789 | 1831 | 11 | 33 |
Piacenza | 1888 | 1939 | 12 | 33 | 1886 | 1935 | 9 | 30 | 1890 | 1933 | 9 | 30 |
San Casciano | 1807 | 1881 | 8 | 27 | 1812 | 1880 | 9 | 29 | 1797 | 1879 | 9 | 28 |
Scandicci | 1865 | 1556 | 18 | 50 | 1858 | 1578 | 18 | 51 | 1862 | 1580 | 18 | 51 |
Note: The observed data and the replications from the basic and the scaled IW model are compared in terms of: total number of points scores and conceded, total number of wins and total number of points earned by each team at the end of the season.
Figure 3 compares the cumulative points derived from the observed results throughout the season (the black line) and the predictions from both the basic model (in red), and the scaled Inverse-Wishart model (in blue).
Figure 3.
Posterior predictive validation of the models. The black line represents the observed cumulative points through the season for each team in the league, while the red and blue lines represent the corresponding predictions from the basic and scaled IW model, respectively.
For almost all teams the predicted results are relatively close to the observed data and suggest a good performance of both models. In particular, the lines of the observed and replicated cumulative points intertwine in many cases, with only two teams for which the results are slightly overestimated (Filottrano and San Casciano). We note that, for some of the teams, the predictions from the scaled IW model are closer to the line of the observed points and outperform the predictions from the basic model (Bergamo, Busto Arsizio, Conegliano, Monza, San Casciano and Scandicci). For all the other teams, the performance of the two models is either similar or varies throughout the season.
Finally, we compare the predictive ability of the two models in terms of the rankings of the teams in the league (calculated based on the total number of points earned by each team at the end of the season). Figure 4 shows the probability of ending the season in different positions for each team in the league, which is calculated based on the replications from either the basic (top graph) or scaled IW (bottom graph) model. The order of the teams has been set according to the expected mean rank of the teams in the league across the replicated seasons to favour the interpretation of the graphs. Each position is associated with a different colour, from dark blue (1-st position) to yellow (12-th position).
Figure 4.
Posterior probabilities of the rankings for each team in the league, estimated using the replicated data generated from the basic model (top panel) and the scaled IW model (bottom panel). Each position is associated with a different colour with higher (lower) ranks being associated with darker (lighter) colours. Only the probability values equal or above 10% are displayed for clarity.
Across the replications, the expected mean rank from the Basic model (top graph) and the Scaled IW model (bottom graph) lead to the same ordering of the teams. In particular, the predictive probabilities of finishing the season in a specific position are very similar between the two models and are in line with the observed rankings of the teams in the league (see Table 3), with Novara, Scandicci and Conegliano being associated with the highest probabilities of placing in the top three positions and with Filottrano and Legnano having the highest chance of being placed at the bottom of the league. Overall, there are some variations (from to ) in the probability of being in a specific rank between the results from the two models for some of the teams, but in most of the cases these differences remain negligible.
6. Discussion
The model presented in this paper is an application of Bayesian hierarchical modelling to sport data. The basic structure presented in Section 3 and its extension in Section 4 can be easily implemented and run using standard MCMC algorithms, such as the one provided for JAGS in Appendix 1. Previous approaches have focused either on the identification of the key factors determining the performance of a team [6,24] or the prediction of team rankings [21]. To our knowledge, this is the first modelling framework which jointly allows to predict team rankings and the outcomes of the matches during a season in volleyball. The two alternative specifications implemented in our analysis show generally good predictive performances; between the two models, the scaled IW model seems to be slightly more accurate compared with the basic model, but is also associated with a higher level of complexity.
One potential limitation of the framework is that only match-specific statistics are used for estimation and prediction purposes. Ideally, the use of set-specific statistics could improve the predictive power of the model while also eliminating the need to include the models for the indicator variables of Equations (6) and (7). Indeed, since predictions would be related to the results from each set played, the information about the number of sets and the winner of the match would be directly incorporated. However, this would introduce additional problems related to the choice of the distributions for modelling the number of points scored by the opposing teams in a set, which is subject to specific constraints (e.g. difference of 2 points if one team scores more than 25). In addition, set-specific statistics are not typically available from national federation web sites and are difficult to collect for all the matches in a season.
Finally, the flexibility of the proposed framework allows the extension of the model in many ways. For example, additional types of in-game statistics (e.g. number of passes), if available, could be incorporated to further improve the predictions of the model; alternative distributions could also be specified to model the total number of scores during the season (e.g. Negative Binomial). The scaled IW model could be extended by jointly modelling the team-specific attack and defence effects () with a single multivariate Normal distribution and scaled IW prior to account for the possible multilevel correlation between all random effect parameters. Although in our analysis the implementation of this more complex model did not lead to any substantial variation in the results, compared with the model of Section 4, this may not be the case in other applications. In addition, following the approaches in the recent literature on the analysis of football matches [20], data from different leagues and seasons could also be analysed jointly to deliver a context-specific validation framework that accounts for the temporal dimension in the data and the clustering across the leagues.
Appendix 1.
Model code
The complete JAGS code for the models in Sections 3 and 4 used in the analysis is given below.
Appendix 2.
Posterior results
Table A1. Posterior summaries of the estimates of the marginal team-specific attack () and defence () effects, the home effect and the constant from the log-linera regression of the basic model in Section 3.
Attack effect | Defence effect | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Teams | Mean | Sd | 2.5% | Median | 97.5% | Mean | Sd | 2.5% | Median | 97.5% |
Bergamo | −0.0098 | 0.0246 | −0.0589 | −0.0098 | 0.0382 | 0.0434 | 0.0224 | −0.0005 | 0.0433 | 0.0873 |
Busto Arsizio | 0.0177 | 0.0223 | −0.0257 | 0.0178 | 0.0612 | 0.0594 | 0.0276 | 0.0054 | 0.0594 | 0.1138 |
Casalmaggiore | 0.0247 | 0.0221 | −0.0190 | 0.0251 | 0.0676 | 0.0859 | 0.0236 | 0.0399 | 0.0861 | 0.1320 |
Conegliano | 0.0042 | 0.0242 | −0.0433 | 0.0042 | 0.0518 | −0.0846 | 0.0235 | −0.1307 | −0.0843 | −0.0389 |
Filottrano | −0.0750 | 0.0276 | −0.1299 | −0.0746 | −0.0219 | 0.0318 | 0.0248 | −0.0172 | 0.0315 | 0.0799 |
Legnano | −0.0957 | 0.0260 | −0.1471 | −0.0957 | −0.0446 | 0.0144 | 0.0242 | −0.0327 | 0.0143 | 0.0622 |
Monza | 0.0309 | 0.0239 | −0.0161 | 0.0307 | 0.0777 | 0.0491 | 0.0250 | −0.0001 | 0.0492 | 0.0976 |
Novara | 0.1448 | 0.0334 | 0.0807 | 0.1447 | 0.2103 | −0.0069 | 0.0281 | −0.0612 | −0.0072 | 0.0487 |
Pesaro | −0.0527 | 0.0232 | −0.0982 | −0.0526 | −0.0078 | −0.0326 | 0.0237 | −0.0792 | −0.0324 | 0.0131 |
Piacenza | 0.0105 | 0.0247 | −0.0375 | 0.0104 | 0.0594 | 0.0311 | 0.0251 | −0.0178 | 0.0312 | 0.0800 |
San Casciano | −0.0140 | 0.0248 | −0.0628 | −0.0141 | 0.0345 | 0.0087 | 0.0256 | −0.0423 | 0.0091 | 0.0582 |
Scandicci | 0.0144 | 0.0294 | −0.0428 | 0.0142 | 0.0727 | −0.1998 | 0.0340 | −0.2680 | −0.1992 | −0.1351 |
Home | 0.0343 | 0.0147 | 0.0054 | 0.0343 | 0.0633 | |||||
Constant | 4.443 | 0.012 | 4.419 | 4.443 | 4.467 |
Note: The model is fitted to data from the women's volleyball Italian Serie A1 season 2017–2018.
Table A2. Posterior summaries of the estimates of the marginal team-specific attack () and defence () effects, the home effect and the constant from the log-linear regression of the scaled IW model in Section 4.
Attack effect | Defence effect | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Teams | Mean | Sd | 2.5% | Median | 97.5% | Mean | Sd | 2.5% | Median | 97.5% |
Bergamo | −0.0130 | 0.0211 | −0.0543 | −0.0131 | 0.0287 | 0.0401 | 0.0224 | −0.0037 | 0.0401 | 0.0841 |
Busto Arsizio | 0.0188 | 0.0207 | −0.0224 | 0.0190 | 0.0589 | 0.0610 | 0.0289 | 0.0050 | 0.0610 | 0.1176 |
Casalmaggiore | 0.0138 | 0.0209 | −0.0269 | 0.0136 | 0.0555 | 0.0817 | 0.0233 | 0.0364 | 0.0816 | 0.1272 |
Conegliano | 0.0011 | 0.0236 | −0.0452 | 0.0011 | 0.0473 | −0.0792 | 0.0234 | −0.1256 | −0.0790 | −0.0341 |
Filottrano | −0.0573 | 0.0244 | −0.1086 | −0.0563 | −0.0128 | 0.0312 | 0.0242 | −0.0165 | 0.0313 | 0.0782 |
Legnano | −0.0854 | 0.0233 | −0.1329 | −0.0849 | −0.0412 | 0.0160 | 0.0234 | −0.0300 | 0.0159 | 0.0619 |
Monza | 0.0298 | 0.0206 | −0.0111 | 0.0300 | 0.0702 | 0.0424 | 0.0247 | −0.0054 | 0.0422 | 0.0913 |
Novara | 0.1530 | 0.0350 | 0.0850 | 0.1527 | 0.2221 | 0.0073 | 0.0303 | −0.0520 | 0.0070 | 0.0666 |
Pesaro | −0.0537 | 0.0207 | −0.0951 | −0.0535 | −0.0133 | −0.0296 | 0.0238 | −0.0762 | −0.0297 | 0.0170 |
Piacenza | 0.0067 | 0.0220 | −0.0367 | 0.0069 | 0.0497 | 0.0309 | 0.0240 | −0.0163 | 0.0309 | 0.0781 |
San Casciano | −0.0291 | 0.0228 | −0.0725 | −0.0297 | 0.0175 | 0.0068 | 0.0251 | −0.0434 | 0.0071 | 0.0554 |
Scandicci | 0.0155 | 0.0264 | −0.0355 | 0.0153 | 0.0686 | −0.2084 | 0.0369 | −0.2818 | −0.2082 | −0.1360 |
Home | 0.0311 | 0.0147 | 0.0022 | 0.031 | 0.0601 | |||||
Constant | 4.447 | 0.012 | 4.424 | 4.447 | 4.471 |
Note: The model is fitted to data from the women's volleyball Italian Serie A1 season 2017–2018.
Disclosure statement
No potential conflict of interest was reported by the author.
ORCID
Andrea Gabrio http://orcid.org/0000-0002-7650-4534
References
- 1.Akarçeşme C., Is it possible to estimate match result in volleyball: A new prediction model, Cent. Eur. J. Sport Sci. Med. 19 (2017), pp. 5–17. [Google Scholar]
- 2.Baio G. and Blangiardo M., Bayesian hierarchical model for the prediction of football results, J. Appl. Stat. 37 (2010), pp. 253–264. doi: 10.1080/02664760802684177 [DOI] [Google Scholar]
- 3.Brooks S., Gelman A., Jones G., and Meng X., Handbook of Markov Chain Monte Carlo, CRC Press, Boca Raton, 2011. [Google Scholar]
- 4.Dixon M.J. and Coles S.G., Modelling association football scores and inefficiencies in the football betting market, J. R. Stat. Soc.: Ser. C (Appl. Stat.) 46 (1997), pp. 265–280. doi: 10.1111/1467-9876.00065 [DOI] [Google Scholar]
- 5.Federation International de Volleyball , Sports regulations volleyball (2013). Available at http://www.fivb.org/en/FIVB/Document/Legal/FIVB_Sports_Regulations_2018_20180504.pdf.
- 6.Gaetan C. and Favaretto P., Gli score determinanti per la classifica del campionato italiano di pallavolo, doctoral dissertation, University Ca' Foscari of Venice - Faculty of economics, Venice, 2008.
- 7.Gelman A., Carlin J., Stern H., and Rubin D., Bayesian Data Analysis, 2nd ed., Chapman and Hall, New York, NY, 2004. [Google Scholar]
- 8.Gelman A. and Hill J., Data Analysis Using Regression and Multilevel/Hierarchical Models, Cambridge University Press, New York, NY, 2007. [Google Scholar]
- 9.Gelman A., Hwang J., and Vehtari A., Understanding predictive information criteria for bayesian models, Stat. Comput. 24 (2014), pp. 997–1016. doi: 10.1007/s11222-013-9416-2 [DOI] [Google Scholar]
- 10.George G. and Panagiotis Z., Statistical analysis of men's fivb beach volleyball team performance, Int. J. Perform. Anal. Sport 8 (2008), pp. 31–43. doi: 10.1080/24748668.2008.11868420 [DOI] [Google Scholar]
- 11.Joseph A., Fenton N.E., and Neil M., Predicting football results using bayesian nets and other machine learning techniques, Knowl. Based Syst. 19 (2006), pp. 544–553. doi: 10.1016/j.knosys.2006.04.011 [DOI] [Google Scholar]
- 12.Karlis D. and Ntzoufras I., Analysis of sports data by using bivariate poisson models, Statistician 52 (2003), pp. 381–393. [Google Scholar]
- 13.Lee A.J., Modeling scores in the premier league: Is manchester united really the best?, Chance 10 (1997), pp. 15–19. doi: 10.1080/09332480.1997.10554791 [DOI] [Google Scholar]
- 14.Magel R. and Unruh S., Determining factors influencing the outcome of college basketball games, Open J. Stat. 3 (2013), pp. 225. doi: 10.4236/ojs.2013.34026 [DOI] [Google Scholar]
- 15.Plummer M., JAGS: Just another Gibbs sampler (2010). Available at http://www-fis.iarc.fr/martyn/software/jags/.
- 16.Roith J.M., An analysis of factors contributing to wins in the National hockey League, PhD thesis, North Dakota State University, 2013.
- 17.Spiegelhalter D.J., Best N.G., Carlin B.P., and Van Der Linde A., Bayesian measures of model complexity and fit, J. R. Stat. Soc.: Ser. B (Stat. Method.) 64 (2002), pp. 583–639. doi: 10.1111/1467-9868.00353 [DOI] [Google Scholar]
- 18.Stone M., An asymptotic equivalence of choice of model by cross-validation and akaike's criterion, J. R. Stat. Soc.: Ser. B (Stat. Method.) 39 (1977), pp. 44–47. [Google Scholar]
- 19.Su Y. and Yajima M., Package ‘R2jags’ (2015). Available at: https://cran.r-project.org/web/packages/R2jags/.
- 20.Tsokos A., Narayanan S., Kosmidis I., Baio G., Cucuringu M., Whitaker G., and Király F., Modeling outcomes of soccer matches, Mach. Learn. 108 (2019), pp. 77–95. doi: 10.1007/s10994-018-5741-1 [DOI] [Google Scholar]
- 21.Tümer A.E. and Koçer S., Prediction of team league's rankings in volleyball by artificial neural network method, Int. J. Perform. Anal. Sport 17 (2017), pp. 202–211. doi: 10.1080/24748668.2017.1331570 [DOI] [Google Scholar]
- 22.Vehtari A., Gelman A., and Gabry J., Practical bayesian model evaluation using leave-one-out cross-validation and waic, Stat. Comput. 27 (2017), pp. 1413–1432. doi: 10.1007/s11222-016-9696-4 [DOI] [Google Scholar]
- 23.Watanabe S., Asymptotic equivalence of bayes cross validation and widely applicable information criterion in singular learning theory, J. Mach. Learn. Res. 11 (2010), pp. 3571–3594. [Google Scholar]
- 24.Zhang D., Forecasting point spread for women's volleyball, Master of Science dissertation, North Dakota State University, Fargo, 2016.