A stochastic model for NFL games and point spread assessment

Muhammad Mohsin; Albrecht Gebhardt

doi:10.1080/02664763.2022.2120973

2022 Sep 16;51(2):216–229. doi: 10.1080/02664763.2022.2120973

PMCID: PMC10929675 PMID: 38476452

Abstract

Statistical modelling of sports data is indispensable for analysing sporting behaviour and drawing inferences that help in adopting decisive strategies before or during sports events. This paper introduces a stochastic model, the distribution of the difference derived from the Bivariate Affine-Linear Exponential (BALE) distribution. The distribution of the difference is used here for the first time to model the margin of victory, and it provides an adequate fit to the observed data. A simulation study is carried out to examine the stability of the model parameters through their average estimated values, biases, standard errors, root mean square errors and confidence intervals. The performance of the proposed model is examined by applying it to real data from the National Football League and comparing the results with those of existing models. Finally, the quantile function of the proposed distribution is used to assess the possible range of point spreads for winning the bet in a particular game.

Keywords: Distribution of the difference, bivariate affine-linear exponential distribution, point spread, simulation, national football league, quantile

1. Introduction

Football, being the most popular sport in the USA, generates billions of dollars every year and contributes considerable revenue to the sports industry as well as to the country's economy. Consequently, a massive amount is wagered in the betting market, where bettors seek to make more accurate predictions about the outcomes of games. According to the American Gaming Association, Americans gambled $95 billion on NFL and college football games in the 2015 season. The National Football League (NFL) is the richest premier football league in the USA and has 32 teams divided into two conferences: the National Football Conference and the American Football Conference.

While the NFL generates a huge turnover, the betting market associated with the NFL adds substantially to the economy as well. There are many ways to place a bet, but the point spread is the most popular one in American football. It allows the wagerer to bet on a team likely to win (favourite), or even on a team likely to lose (underdog), by a certain margin. The point spread is a perceived real number that does not actually predict which team will win or lose but gives bettors an opportunity to place bets even on weaker teams. For example, if the point spread is 3 for the favourite Washington DC football team versus the underdog New York team, the favourite team must win the match by a margin of more than 3 points; otherwise, a bet on the favourite is lost even though the favourite wins the game. If the favourite team wins by a margin of exactly 3 points, the bet is neither won nor lost. This situation is called a ‘push’. The basic objective of point spread betting is to create an equal opportunity to put money on the underdog as well. This is a rational assumption which convinces people to wager on a relatively weaker team, and it keeps the betting market balanced.
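The win, push and loss rule described above can be written down in a few lines. The following Python sketch is only an illustration of that rule for a bet placed on the favourite; the function name `settle_spread_bet` and the example scores are hypothetical and do not come from the paper.

```python
def settle_spread_bet(favourite_score, underdog_score, spread):
    """Outcome of a bet on the favourite: 'win', 'push' or 'loss'."""
    margin = favourite_score - underdog_score  # margin of victory of the favourite
    if margin > spread:
        return "win"    # favourite beat the spread
    if margin == spread:
        return "push"   # bet is neither won nor lost
    return "loss"       # favourite failed to cover, even if it won the game


# Hypothetical scores with a point spread of 3 for the favourite.
print(settle_spread_bet(27, 20, 3))  # margin of 7 exceeds the spread
print(settle_spread_bet(23, 20, 3))  # margin equals the spread
print(settle_spread_bet(21, 20, 3))  # favourite wins the game but not the bet
```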

In the literature, many statistical models have been proposed using different approaches to predict different aspects of the game, e.g. modelling the probability of a win, loss or draw, modelling the game score, modelling the difference of the scores made by two teams, etc. Warner [29] predicts the margin of victory in the NFL games using the Gaussian process model. Glickman and Stern [7] develop a state-space model for the NFL score assuming that team strength parameters follow a first-order autoregressive process. Matthews [19] studies the effect of data transformation on the paired comparison of Glickman and Stern [7] to reduce the influence of ‘blowouts’ on future predictions. Szalkowski and Nelson [28] investigate the opening and closing lines along with the margin of victory and find that the line difference can predict the divisional winner with 75% accuracy. Pelechrinis and Papalexakis [21] provide a descriptive model to find the winning probability of an NFL game and combine it with the bootstrap method to build a future matchup prediction scheme. They use the Bradley–Terry regression model to study the impact of several factors on the winning probability of a game. MacDonald and Dare [17] formulate a generalised model to test the efficiency of the point spread market of football. Baker and McHale [1] present a point process model for forecasting exact end-of-match scores in the NFL by using hazards of scoring and the bookmaker point spread.

The literature review reveals that several aspects of the NFL games have been analysed by using different methodologies and models, in particular regression, time series and descriptive models. However, the analysis and prediction of any event are incomplete without appropriate probability models. Probability distributions model random phenomena in real life, quantify the likelihood of particular outcomes and support inferences about the parameters involved. Since sports events have become an integral part of everyday life all over the world, probability models are equally helpful for analysing sports data and predicting different aspects of a game. Nevertheless, the role of probability distributions in analysing and predicting global sports events such as American football remains limited. Apart from Stern [26,27] and Carlin [4], who use the normal distribution to model the margin of victory over the point spread and compute the winning probabilities of the favourite team in the NFL games, we find no evidence of probability distributions being used to model the margin of victory in the NFL games and assess the point spread.

In the present paper, we develop a new stochastic distribution of the difference derived from the BALE distribution for modelling the margin of victory in the NFL games. The distribution of the difference is used here for the first time to model the margin of victory, which distinguishes this paper from previous work. Another salient feature of our research is a point spread assessment scheme that relies on inferences obtained from the quantiles of the model. Our approach differs from that of Stern [26,27] in three ways. Stern [26,27] uses the existing normal distribution, models the margin of victory over the point spread and computes the winning probability of the favourite team. We, on the other hand, derive a new stochastic distribution of the difference, model only the margin of victory and provide a point spread assessment scheme. The NFL data could indeed be modelled by using any bivariate distribution, but we select the BALE distribution to derive the distribution of the difference because it has the distinguishing characteristic of being absolutely continuous, whereas many well-known distributions, e.g. Marshall and Olkin [18], Gupta and Kundu [8], Sarhan and Balakrishnan [24], etc., have continuous as well as singular parts. Moreover, the BALE distribution has a closed-form distribution function, which widens its applicability; this is not the case with the distribution proposed by Regoli [23]. An additional merit of the BALE distribution is that it effectively models data with low negative correlation; see Mohsin et al. [20] for details.

The paper is organised as follows: A concise description of the BALE distribution along with the explicit expression for the probability density function and the characteristics of the stochastic distribution of difference are given in Section 2. A simulation study is conducted to observe the consistency of the model parameters in Section 3. For illustration, the application of the proposed model on a real data set of the NFL games and its comparison with other extant distributions are exhibited in Section 4. The computation of quantiles and the assessment of the point spread are presented in Section 5. Finally, conclusion and discussion are stated in Section 6.

2. The methodology

In this section, we first provide the expression and a brief description of the BALE distribution proposed by Mohsin et al. [20] along with some lemmas used. Then, we derive the stochastic distribution of difference and its moments by using the baseline BALE distribution.

The joint pdf and survival function of the BALE distribution are given by

$f_{XY} (x, y) = α (β + γx) \exp \{- αx - β y - γxy\}, \quad α > 0, \; β, γ, x, y \geq 0, \; β + γ > 0,$ (1)

and

${\bar{F}}_{XY} (x, y) = \exp \{- αx\} - \frac{α \exp \{- β y\} [1 - \exp \{- (α + γy) x\}]}{(α + γy)},$ (2)

respectively. The BALE distribution is a quite flexible bivariate distribution obtained by compounding the two exponentially distributed random variables, $X \sim Exp (α)$ and $Y | X \sim Exp (β + γX)$ . The parameter γ reflects the dependency between the random variables X and Y. For $γ = 0$ , the random variables X and Y are independently and exponentially distributed with the parameters α and β, respectively. The attractive characteristics and vast applications of the BALE distribution propel us to further probe into it and derive a new distribution of the difference by using (1). We use the following lemmas developed by Prudnikov et al. [22] to derive the distribution of difference and its moments.
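The compounding construction described above suggests a direct way to simulate the margin $D = X - Y$. The following Python sketch draws X from Exp(α) and then Y from Exp(β + γX), assuming that both exponential distributions are parameterised by their rate. The names `sample_bale` and `sample_differences`, the seed and the parameter values are illustrative choices, and this composition approach is not the sampling scheme used in the paper's simulation study.

```python
import random


def sample_bale(alpha, beta, gamma, rng):
    """Draw one (x, y) pair: X ~ Exp(alpha), then Y | X = x ~ Exp(beta + gamma * x)."""
    x = rng.expovariate(alpha)              # expovariate takes the rate parameter
    y = rng.expovariate(beta + gamma * x)   # conditional rate depends on x
    return x, y


def sample_differences(alpha, beta, gamma, size, seed=0):
    """Simulate `size` values of D = X - Y from the compounding construction."""
    rng = random.Random(seed)
    return [x - y for x, y in (sample_bale(alpha, beta, gamma, rng) for _ in range(size))]


# Illustrative parameter values (one of the combinations used in Table 1).
margins = sample_differences(1.0, 1.0, 1.0, size=20000)
print(sum(margins) / len(margins))  # sample mean of the simulated differences
```

Setting `gamma` to zero in this sketch reproduces the independent case mentioned above, in which Y no longer depends on the drawn value of X.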

Lemma 2.1 Prudnikov et al. [22, Vol.1, Equation (2.3.15.7), p.344] —

For $Re (p) > 0$ , $Re (q) > 0$ and n=0,1,…,

$\int_{0}^{\infty} x^{n} \exp (- p x^{2} - qx) d x = \frac{(- 1)^{n}}{2} \sqrt{\frac{π}{p}} \frac{\partial^{n}}{\partial q^{n}} [\exp (\frac{q^{2}}{4 p}) erfc (\frac{q}{2 \sqrt{p}})],$

where $erfc (x)$ is the complementary error function which is defined as

$erfc (x) = \frac{2}{\sqrt{π}} \int_{x}^{\infty} \exp (- w^{2}) d w .$

Lemma 2.2 Prudnikov et al. [22, Vol. 1, Equation (2.3.6.9), p. 324] —

If $a, b \in R$ , s>0 and $| \arg c | < π$ ,

$\int_{0}^{\infty} \frac{x^{a - 1}}{(c + x)^{b}} \exp (- sx) d x = Γ (a) (c)^{a - b} Ψ (a, a + 1 - b; cs),$

where $Ψ (.)$ is Kummer's (confluent hypergeometric) function of second kind which is given by

$Ψ (x, y; z) = \frac{1}{Γ (x)} \int_{0}^{\infty} \exp (- zt) t^{x - 1} (1 + t)^{y - x - 1} d t .$

2.1. The proposed model

The following theorems are presented to develop the distribution of difference and derive its moments.

Theorem 2.3

If X and Y are jointly distributed according to (1), the pdf of the difference is given as

$f_{D} (d) = \begin{cases} \frac{α}{4} \exp (- αd) \left\{ 2 + \sqrt{\frac{π}{γ}} (β + γd - α) \exp \left( \frac{(ϕ_{1} (d))^{2}}{4 γ} \right) erfc \left( \frac{ϕ_{1} (d)}{2 \sqrt{γ}} \right) \right\}, & \text{for } d > 0, \\ \frac{α}{2} \exp (β d) + \frac{1}{2} \sqrt{\frac{π}{γ}} erfc \left( \frac{ϕ_{2} (d)}{2 \sqrt{γ}} \right) \left\{ (α β + αγd) \exp (- αd) \exp \left( \frac{(ϕ_{1} (d))^{2}}{4 γ} \right) - \frac{1}{2} (α^{2} + α β + αγd) \exp (β d) \exp \left( \frac{(ϕ_{2} (d))^{2}}{4 γ} \right) \right\}, & \text{for } d < 0, \end{cases}$ (3)

where $ϕ_{1} (d) = α + β + γd$ and $ϕ_{2} (d) = α + β - γd$ .

Proof.

From (1), the joint pdf of $(D, W) = (X - Y, Y)$ becomes

$f (d, w) = α (β + γd + γw) \exp (- αd) \exp (- γ w^{2} - (α + β + γd) w) .$

The transformation maps x = 0 to w = −d, $x = \infty$ to $w = \infty$, y = 0 to w = 0 and $y = \infty$ to $w = \infty$. The conditions x>0 and y>0 of the BALE distribution therefore translate into $- \infty < d < \infty$, with $w > - d$ for d<0 and w>0 for d>0. The pdf of D can be written as

$f_{D} (d) = \begin{cases} (α β + αγd) \exp (- αd) I_{0} (d) + αγ \exp (- αd) I_{1} (d), & \text{if } d > 0, \\ (α β + αγd) \exp (- αd) M_{0} (d) + αγ \exp (- αd) M_{1} (d), & \text{if } d < 0, \end{cases}$ (4)

where $I_{n} (d) = \int_{0}^{\infty} w^{n} \exp (- γ w^{2} - (α + β + γd) w) d w, n = 0, 1.$ Now, using Lemma 2.1, the integrals $I_{n} (d)$ can be calculated as

$I_{n} (d) = \frac{{(- 1)}^{n}}{2} \sqrt{\frac{π}{γ}} \frac{\partial^{n}}{\partial q^{n}} {[\exp (\frac{q^{2}}{4 γ}) erfc (\frac{q}{2 \sqrt{γ}})]}_{q = α + β + γd}$ (5)

Evaluating the derivative in (5) for n = 0, 1, and simplifying, we get

$I_{0} (d) = \frac{\sqrt{π}}{2 \sqrt{γ}} \exp (\frac{{(α + β + γd)}^{2}}{4 γ}) erfc (\frac{(α + β + γd)}{2 \sqrt{γ}})$ (6)

and

$I_{1} (d) = \frac{1}{2 γ} - \frac{\sqrt{π} (α + β + γd)}{4 γ^{3 / 2}} \exp (\frac{{(α + β + γd)}^{2}}{4 γ}) erfc (\frac{(α + β + γd)}{2 \sqrt{γ}}) .$ (7)

Similarly

$M_{n} (d) = \int_{- d}^{\infty} w^{n} \exp (- γ w^{2} - (α + β + γd) w) d w,$ $n = 0, 1,$

follows

$M_{0} (d) = \frac{\sqrt{π}}{2 \sqrt{γ}} \exp (\frac{{(α + β + γd)}^{2}}{4 γ}) erfc (\frac{(α + β - γd)}{2 \sqrt{γ}})$ (8)

and

$M_{1} (d) = \frac{\exp (αd + β d) \left[ 2 \sqrt{γ} - \sqrt{π} (α + β + γd) \exp \left( \frac{(α + β - γd)^{2}}{4 γ} \right) erfc \left( \frac{α + β - γd}{2 \sqrt{γ}} \right) \right]}{4 γ^{3 / 2}} .$ (9)

The result follows by substituting (6), (7),(8) and (9) in (4) and then simplifying the expressions.
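The density in (3) can be evaluated with nothing more than the complementary error function. The Python sketch below is one possible implementation, assuming γ > 0. It uses the identity $ϕ_{1}^{2} - ϕ_{2}^{2} = 4 γ d (α + β)$ to rewrite the d < 0 branch as $\frac{α}{4} \exp (β d) \{2 + \sqrt{π / γ} (β + γd - α) \exp (ϕ_{2}^{2} / 4 γ) erfc (ϕ_{2} / 2 \sqrt{γ})\}$, which is algebraically the same expression. The helper `erfcx` (with an asymptotic fallback for large arguments) and the name `pdf_difference` are illustrative choices, not part of the paper.

```python
import math


def erfcx(x):
    """Scaled complementary error function exp(x**2) * erfc(x), for x >= 0."""
    if x > 25.0:
        # Leading terms of the large-x expansion; avoids overflow in exp(x**2).
        return (1.0 - 0.5 / (x * x)) / (x * math.sqrt(math.pi))
    return math.exp(x * x) * math.erfc(x)


def pdf_difference(d, alpha, beta, gamma):
    """Density of D = X - Y from equation (3); this sketch assumes gamma > 0."""
    root = math.sqrt(math.pi / gamma)
    slope = beta + gamma * d - alpha
    two_root_gamma = 2.0 * math.sqrt(gamma)
    if d >= 0:
        phi1 = alpha + beta + gamma * d
        bracket = 2.0 + root * slope * erfcx(phi1 / two_root_gamma)
        return 0.25 * alpha * math.exp(-alpha * d) * bracket
    # For d < 0: exp(-alpha*d) * exp(phi1^2/(4*gamma)) equals exp(beta*d) * exp(phi2^2/(4*gamma)).
    phi2 = alpha + beta - gamma * d
    bracket = 2.0 + root * slope * erfcx(phi2 / two_root_gamma)
    return 0.25 * alpha * math.exp(beta * d) * bracket


# Density at a few margins for illustrative parameter values.
for margin in (-3.0, 0.0, 3.0):
    print(margin, pdf_difference(margin, 1.5, 1.0, 0.5))
```

A quick sanity check on any such implementation is that the density is continuous at d = 0 and integrates to one over a wide grid.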

The different shapes of the pdf of D for different values of α, β and γ are given in Figure 1. The figure shows that the new distribution of the difference is asymmetric and unimodal. It is also observed that, as α approaches β, the proposed distribution approaches symmetry. The new distribution becomes right skewed with a lower peak when α decreases, whereas it becomes left skewed with a flat peak when β decreases. Also, when γ increases, the probability mass shifts to the right and the behaviour at the peak changes in terms of its smoothness.

The distribution of the difference is quite useful and is applied in many fields such as finance, physics and sports. Karlis and Ntzoufras [11] derive the distribution of the difference from the bivariate Poisson distribution to model the Italian Serie A data and water-polo games. Skellam [25] derives the distribution of the difference between two Poisson random variates with different means, which generalises the result of Irwin [10]. Cox and Isham [6] give an interesting stochastic connection of the distribution of the difference arising in stochastic point processes with thinning. For more references, see Kotz et al. [14].

In addition, the moments of $D = X - Y$ when X and Y are distributed according to (1) are presented in the following theorem. These moments are helpful to study the characteristics of the proposed distribution.

Theorem 2.4

If X and Y are jointly distributed according to (1), then it holds:

$E (D^{n}) = \sum_{k = 0}^{n} (- 1)^{k} \binom{n}{k} \frac{α β^{n - 2 k + 1} Γ (k + 1) Γ (n - k + 1)}{γ^{n - k + 1}} Ψ (n - k + 1, n - 2 k + 2; \frac{α β}{γ}) .$ (10)

Proof.

The proof of (10) requires the following result. The product moment of the distribution proposed in (1) is obtained as:

$\begin{aligned} E (X^{p} Y^{q}) & = α \int_{0}^{\infty} \int_{0}^{\infty} x^{p} y^{q} (β + γx) \exp \{- αx - β y - γxy\} d y d x, \quad p, q = 1, 2, \dots \\ & = \frac{α Γ (q + 1)}{γ^{q}} \int_{0}^{\infty} \frac{x^{(p + 1) - 1} \exp \{- αx\}}{(\frac{β}{γ} + x)^{q}} d x . \end{aligned}$

Using Lemma 2.2 and further simplifying we arrive at

$E (X^{p} Y^{q}) = \frac{α β^{p - q + 1}}{γ^{p + 1}} Γ (q + 1) Γ (p + 1) Ψ (p + 1, p - q + 2; \frac{α β}{γ}), \quad \text{for } p > - 1 \text{ and } q > - 1.$ (11)

The result follows by writing $E (D^{n}) = E (X - Y)^{n} = \sum_{k = 0}^{n} (- 1)^{k} \binom{n}{k} E (X^{n - k} Y^{k})$ and applying (11) with $p = n - k$ and $q = k$ to each expectation in the sum, which gives the result stated in (10).

For n = 1 and 2, the expressions for $E (D)$ and $E (D^{2})$ are given as:

$E (D) = \frac{γ + e^{\frac{α β}{γ}} (α^{2} - 2 α β - 2 γ) E_{3} (\frac{α β}{γ})}{αγ},$ (12)

where $E_{3}$ is exponential integral function which is defined as:

$E_{v} (z) = z^{v - 1} \int_{z}^{\infty} t^{- v} \exp (- t) d t, \quad | Arg (z) | < π ,$

and

$E (D^{2}) = \frac{β \left( α β e^{\frac{α β}{γ}} \left( 2 α^{3} Γ (- 3, \frac{α β}{γ}) + (α - β) (α β + 2 γ) Γ (0, \frac{α β}{γ}) \right) - γ (α - β) (α β + γ) \right)}{γ^{4}},$ (13)

where $Γ (- n, z)$ and $Γ (0, z)$ are incomplete gamma functions.
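The first moment can also be checked numerically without any special functions. Because $X \sim Exp (α)$ and $Y | X \sim Exp (β + γX)$, conditioning on X gives $E (X) = 1 / α$ and $E (Y) = E [1 / (β + γX)]$. The Python sketch below evaluates this expectation with a midpoint rule; it is an independent numerical check under the assumption β > 0, not an implementation of the closed forms (10) or (12). The function name `mean_difference`, the truncation point and the number of steps are illustrative choices.

```python
import math


def mean_difference(alpha, beta, gamma, steps=200000):
    """E(D) = E(X) - E(Y), with E(X) = 1/alpha and E(Y) = E[1 / (beta + gamma * X)].

    The integral of alpha * exp(-alpha * x) / (beta + gamma * x) over x > 0 is
    approximated by a midpoint rule. This sketch assumes beta > 0.
    """
    upper = 60.0 / alpha   # exp(-60) makes the truncated tail negligible
    h = upper / steps
    expected_y = 0.0
    for i in range(steps):
        x = (i + 0.5) * h  # midpoint of the i-th sub-interval
        expected_y += alpha * math.exp(-alpha * x) / (beta + gamma * x)
    return 1.0 / alpha - expected_y * h


# Illustrative parameter values; with gamma = 0 the mean reduces to 1/alpha - 1/beta.
print(mean_difference(1.0, 1.0, 1.0))
print(mean_difference(1.0, 1.0, 0.0))
```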

3. Simulation

In this section, a simulation is carried out to study the stability of the model parameters. The maximum-likelihood method is applied to estimate the model parameters $α, β$ and γ. Random samples are generated by numerically solving $F (x) = u$, where $u \sim U (0, 1)$, as discussed by Lange [15]. The simulation is run 1000 times for each of five different combinations of the model parameters, drawing a random sample of size n from the proposed model in every run. The ML estimates are obtained by the BFGS method implemented in the R package maxLik given by Henningsen and Toomet [9].

Table 1 presents the average estimates (AEs), biases, standard errors (SEs), root mean square errors (RMSEs) and corresponding confidence intervals (CIs) for the sample sizes 50, 200 and 500. It is observed from Table 1 that the AEs of the model parameters approach the true values of the parameters as n increases. The biases and SEs for each parameter approach zero as the sample size increases. The 95% CIs for all the model parameters tend to contain the estimated values of the parameters as the sample size increases. These findings agree with large-sample (asymptotic normal) theory, under which the errors of these estimates are expected to decrease as n increases.
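The summary measures named above can be computed from the replicated estimates of a single parameter as sketched below in Python. The paper does not spell out its formulas, so the definitions here are assumptions: the bias is the average estimate minus the true value, the SE is the standard error of that average across replications, the RMSE is taken about the true value, and the CI is the normal interval AE ± 1.96 SE. The function name `summarise_estimates` and the example numbers are hypothetical.

```python
import math


def summarise_estimates(estimates, true_value, z=1.96):
    """Summarise replicated estimates of one parameter (assumed definitions, see text)."""
    r = len(estimates)
    mean = sum(estimates) / r                                   # average estimate (AE)
    bias = mean - true_value
    variance = sum((e - mean) ** 2 for e in estimates) / (r - 1)
    se = math.sqrt(variance / r)                                # standard error of the AE
    rmse = math.sqrt(sum((e - true_value) ** 2 for e in estimates) / r)
    return {
        "AE": mean,
        "bias": bias,
        "SE": se,
        "RMSE": rmse,
        "LCL": mean - z * se,                                   # lower confidence limit
        "UCL": mean + z * se,                                   # upper confidence limit
    }


# Hypothetical replicated estimates of a parameter whose true value is 1.0.
print(summarise_estimates([1.0, 1.2, 0.8, 1.0], true_value=1.0))
```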

Table 1.

Estimated values, biases, standard errors, root mean square errors and 95% C.Is of the model parameters.

		$α = 1.5, β = 1, γ = 0.5$			$α = 2, β = 0.5, γ = 1$			$α = 0.7, β = 0.5, γ = 0.1$			$α = 1, β = 1, γ = 1$			$α = 2, β = 1, γ = 1$
		n			n			n			n			n
		50	200	500	50	200	500	50	200	500	50	200	500	50	200	500
Avg. Estimate	$\hat{α}$	1.5004	1.5003	1.5003	2.0031	2.0026	2.0022	0.7004	0.7003	0.7003	1.0004	1.0003	1.0003	2.0004	2.0003	2.0003
	$\hat{β}$	1.0006	1.0005	1.0005	0.5039	0.5038	0.5045	0.5006	0.5005	0.5004	1.0005	1.0005	1.0005	1.0006	1.0005	1.0004
	$\hat{γ}$	0.5008	0.5007	0.5007	1.0048	1.0059	1.0045	0.1007	0.1006	0.1006	1.0007	1.0007	1.0007	1.0006	1.0006	1.0006
Bias	$\hat{α}$	0.0007	0.0007	0.0007	0.0231	0.0005	0.0016	0.0008	0.0008	0.0008	0.0008	0.0008	0.0008	0.0008	0.0008	0.0007
	$\hat{β}$	0.0013	0.0010	0.0010	0.0392	0.0007	0.0038	0.0014	0.0011	0.0010	0.0010	0.0010	0.0012	0.0014	0.0011	0.0010
	$\hat{γ}$	0.0019	0.0018	0.0017	0.0426	0.0013	0.0044	0.0015	0.0012	0.0015	0.0018	0.0016	0.0017	0.0015	0.0012	0.0011
std. error	$\hat{α}$	8.4e–05	5.7e–05	0.0001	0.0022	0.0003	0.0004	0.0001	9.3e–05	6.2e–05	0.0001	0.0001	0.0001	0.0001	9.3e–05	6.2e–05
	$\hat{β}$	0.0001	0.0001	0.0001	0.0023	0.0005	0.0011	0.0002	9.7e–05	7.5e–05	9.8e–05	8.6e–05	0.0002	0.0002	9.7e–05	7.5e–05
	$\hat{γ}$	0.0002	0.0002	0.0002	0.0034	0.0008	0.0005	0.0002	9.2e–05	0.0002	0.0002	0.0002	0.0002	0.0002	9.2e–05	0.0002
RMSE	$\hat{α}$	0.0037	0.0025	0.0018	0.0158	0.0125	0.0082	0.0051	0.0041	0.0028	0.0051	0.0045	0.0036	0.0051	0.0041	0.0028
	$\hat{β}$	0.0064	0.0052	0.0050	0.0165	0.0213	0.0237	0.0078	0.0043	0.0034	0.0044	0.0038	0.0029	0.0078	0.0043	0.0034
	$\hat{γ}$	0.0094	0.0080	0.0073	0.0240	0.0345	0.0119	0.0068	0.0041	0.0068	0.0072	0.0082	0.0082	0.0068	0.0041	0.0068
LCL	$\hat{α}$	1.5002	1.5002	1.5001	1.9987	2.0020	2.0015	0.7002	0.7002	0.7002	1.0002	1.0002	1.0001	2.0002	2.0002	2.0002
	$\hat{β}$	1.0003	1.0003	1.0003	0.4994	0.5029	0.5024	0.5003	0.5003	0.5003	1.0003	1.0003	1.0002	1.0003	1.0003	1.0003
	$\hat{γ}$	0.5004	0.5004	0.5003	0.9981	1.0045	1.0034	0.1004	0.1005	0.1004	1.0004	1.0004	1.0003	1.0004	1.0005	1.0004
UCL	$\hat{α}$	1.5005	1.5005	1.5006	2.0075	2.0031	2.0030	0.7006	0.7006	0.7005	1.0006	1.0006	1.0006	2.0006	2.0006	2.0005
	$\hat{β}$	1.0008	1.0007	1.0008	0.5085	0.5048	0.5066	0.5009	0.5007	0.5006	1.0007	1.0007	1.0009	1.0009	1.0007	1.0006
	$\hat{γ}$	0.5012	0.5011	0.5011	1.0114	1.0075	1.0055	0.1010	0.1008	0.1010	1.0011	1.0011	1.0011	1.0010	1.0008	1.0010


4. Application

In this section, a real data set of the National Football League (NFL) is analysed to illustrate the application of the proposed model.

4.1. Fitting the model

For modelling the margin of victory, we use the data of 278 NFL games from 3 January 2015 to 7 February 2016 available on the website www.aussportsbetting.com. The data include complete details of the game statistics, consisting of the scores by the home team and the scores by the away team. Although the home team is not always the favourite, for consistency of terminology the scores by the home team and the scores by the away team are referred to as the scores by the favourite team (X) and the scores by the underdog team (Y), respectively. The margin of victory (D) is defined as the difference between the scores by the favourite team (X) and the scores by the underdog team (Y), i.e. $D = X - Y$.

To fit our proposed model, the maximum-likelihood (ML) method is used to estimate the model parameters from the observed differences between the scores by the two teams. If $d_{i}; i = 1, 2, \dots, n$ is a random sample of size n from (3), the ML estimates of α, β and γ are obtained by the BFGS method implemented in the R package maxLik given by Henningsen and Toomet [9]. The ML estimates of α, β and γ are 0.0998, 0.0721 and 0.0071, with standard errors 0.0092, 0.0151 and 0.0049, respectively. The ML estimates of α, β and γ for the margin of victory look quite stable, as their standard errors are smaller than their point estimates. The ML estimate of γ, which represents the joint effect of the scores by the favourite team and the scores by the underdog team, is very small; this is consistent with their low and insignificant correlation, i.e. −0.061. The physical interpretation of the parameters is that α and β represent the point-scoring capabilities of the favourite and the underdog teams, respectively, whereas γ reflects common factors such as team strength, home ground, home crowd, weather conditions, etc. The small value of γ suggests that the common factors contribute least to the margin of victory, while α and β affect it significantly.

Now (3) is applied to compute the fitted pdf using the ML estimates. This fitted distribution is compared to the histogram of D, difference between the scores by two teams, from the observed data. The fitted pdf of D reasonably follows the general pattern of the histogram. Figure 2(a) shows the histogram of the observed data D along with the fitted pdf of D for each game which depicts that our proposed distribution fits the NFL data adequately. Further to observe the goodness of fit of (3), the observed probabilities are plotted against the predicted probabilities for the proposed model. Figure 2(b) exhibits the probability plot for D, $F_{D} (d_{i})$ versus $(i - 0.375) / (n + 0.25)$ as recommended by Blom [2] and Chambers et al. [5] where $d_{i}$ are the sorted values of D in ascending order. The fit looks reasonably good for the margin of victory since the dots follow the diagonal line closely.
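The plotting positions used for the probability plot are simple to reproduce. The Python sketch below computes $(i - 0.375) / (n + 0.25)$ for $i = 1, \dots, n$ and pairs each position with the fitted distribution function evaluated at the sorted data. The names `blom_positions` and `probability_plot_points` are illustrative, and the `cdf` argument is a placeholder for any fitted distribution function supplied by the user; the identity function in the example is only a stand-in.

```python
def blom_positions(n):
    """Plotting positions (i - 0.375) / (n + 0.25) for i = 1, ..., n."""
    return [(i - 0.375) / (n + 0.25) for i in range(1, n + 1)]


def probability_plot_points(sample, cdf):
    """Pair each plotting position with the fitted CDF at the sorted observations."""
    ordered = sorted(sample)
    fitted = [cdf(d) for d in ordered]
    return list(zip(blom_positions(len(ordered)), fitted))


# Stand-in example: data on (0, 1) with the identity as a placeholder CDF.
for expected, fitted in probability_plot_points([0.9, 0.1, 0.5], lambda u: u):
    print(expected, fitted)
```

Points that lie close to the diagonal indicate agreement between the fitted and the empirical probabilities.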

Figure 2. — (a) Fitted values of the pdf of D on the histogram of the difference between the scores by the two teams and (b) Probability plot of D for the difference between the scores by the two teams, for 278 NFL games.

4.2. Comparison with other models

It is important to know how well the newly proposed distribution performs compared with other existing distributions. For this comparison, we use the asymmetric Laplace, Cauchy, skew t, skew Laplace and odd log-logistic normal distributions because they all belong to the asymmetric family and have the same domain for the random variable X as our proposed model. The negative log-likelihood, Akaike information criterion (AIC) and Bayesian information criterion (BIC) are used to compare and examine their relative performance using the difference of the scores by the two teams (D) in the NFL data. The model with the highest value in the negative log-likelihood column is taken as the best model. The AIC and BIC express the relative loss of information; thus smaller values of AIC and BIC indicate the better model. These measures can be calculated as $AIC = 2 k - 2 \hat{\ell}$ and $BIC = k \ln (n) - 2 \hat{\ell}$, where $\hat{\ell}$ = maximised value of the likelihood on the log scale, as reported in the negative log-likelihood column of Table 2, k = number of parameters estimated and n = number of observations. The asymmetric Laplace distribution discussed by Koenker and Machado [13] and Yu and Zhang [30] has three parameters, i.e. location, scale and skewness. The optimize function implemented in the R package ald is used to compute the negative log-likelihood value for the asymmetric Laplace distribution. Similarly, the mledist function implemented in the R package fitdistrplus is used to find the value of the negative log-likelihood for the Cauchy distribution, which has two parameters, i.e. location and scale. The skew t distribution has two parameters, whereas the skew Laplace distribution has three parameters; to find the values of their negative log-likelihood functions we use the R packages skewt (see King [12] for details) and rmutil (see Lindsey and Swihart [16] for details), respectively. The odd log-logistic normal distribution, given by Braga et al. [3], has three parameters, and to find its negative log-likelihood value we use the Simulated Annealing (SANN) method implemented in the R package maxLik given by Henningsen and Toomet [9]. Table 2 reports the negative log-likelihood, AIC and BIC values for the difference between the scores by the two teams in the NFL data. The results show that our proposed model provides the largest value of the negative log-likelihood function and the smallest values of AIC and BIC compared with the other models, and that it fits the difference between the scores by the two teams in the NFL data best.
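The information criteria are simple functions of the maximised log-likelihood. The Python sketch below uses the standard definitions AIC = 2k − 2ℓ and BIC = k ln(n) − 2ℓ, where ℓ is the log-likelihood at the ML estimates. It is applied to the first row of Table 2, with the tabulated value −1136.99 treated as ℓ, k = 3 parameters and n = 278 games. The function names `aic` and `bic` are illustrative.

```python
import math


def aic(log_likelihood, k):
    """Akaike information criterion for k estimated parameters."""
    return 2 * k - 2 * log_likelihood


def bic(log_likelihood, k, n):
    """Bayesian information criterion for k parameters and n observations."""
    return k * math.log(n) - 2 * log_likelihood


# Proposed model in Table 2: tabulated log-likelihood value, 3 parameters, 278 games.
print(round(aic(-1136.99, 3), 2))
print(round(bic(-1136.99, 3, 278), 2))
```

Under these definitions the two printed values can be compared directly with the AIC and BIC columns of Table 2; the two-parameter models use k = 2.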

Table 2.

Estimated values of negative log-likelihood function, AIC and BIC for different models.

Model	Negative log-likelihood	AIC	BIC
Proposed model	−1136.99	2279.98	2290.86
Asymmetric Laplace	−1927.89	3861.78	3872.66
Cauchy	−1167.28	2338.56	2345.82
Skew-t	−1483.06	2970.12	2977.38
Skew Laplace	−3249.70	6505.40	6516.28
Odd log-logistic normal	−1565.42	3136.84	3147.72


5. Quantiles and point spread assessment

In this section, we provide quantiles of the proposed model by using the arbitrary values of the parameters associated with the pdf of the difference. These quantiles are computed numerically by solving the equation:

$\int_{- \infty}^{z_{q}} f (t) d t = q, \quad \text{where } f \in \{f_{D}\} .$ (14)

The function uniroot in R software is used for the numerical solution of this equation. Table 3 provides tabulated values of $z_{q}$ using $α = 0.5, 0.7, 1.0, 1.5, 2.0$ , $β = 0.5, 1.0$ and $γ = 0.1, 0.5, 1.0$ .
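A comparable root search can be sketched in Python. Instead of integrating the density numerically, the sketch below uses a closed-form distribution function obtained by conditioning on X, which is an assumption of this example and is not stated in the paper: $F_{D} (d) = 1 - \exp (- αd) \{1 - α I_{0} (d)\}$ for $d \geq 0$ and $F_{D} (d) = \frac{α}{2} \sqrt{π / γ} \exp (β d) \exp (ϕ_{2}^{2} / 4 γ) erfc (ϕ_{2} / 2 \sqrt{γ})$ for $d < 0$, with $I_{0}$ as in (6). Plain bisection stands in for uniroot, and the names `erfcx`, `cdf_difference` and `quantile_difference` are illustrative. The sketch requires γ > 0 and 0 < q < 1.

```python
import math


def erfcx(x):
    """Scaled complementary error function exp(x**2) * erfc(x), for x >= 0."""
    if x > 25.0:
        # Leading terms of the large-x expansion; avoids overflow in exp(x**2).
        return (1.0 - 0.5 / (x * x)) / (x * math.sqrt(math.pi))
    return math.exp(x * x) * math.erfc(x)


def cdf_difference(d, alpha, beta, gamma):
    """Distribution function of D = X - Y (derived by conditioning on X; gamma > 0)."""
    scale = 0.5 * math.sqrt(math.pi / gamma)
    two_root_gamma = 2.0 * math.sqrt(gamma)
    if d >= 0:
        i0 = scale * erfcx((alpha + beta + gamma * d) / two_root_gamma)
        return 1.0 - math.exp(-alpha * d) * (1.0 - alpha * i0)
    phi2 = alpha + beta - gamma * d
    return alpha * math.exp(beta * d) * scale * erfcx(phi2 / two_root_gamma)


def quantile_difference(q, alpha, beta, gamma, iterations=100):
    """Solve F_D(z) = q by bisection after expanding a bracket around zero."""
    lo, hi = -1.0, 1.0
    while cdf_difference(lo, alpha, beta, gamma) > q:
        lo *= 2.0
    while cdf_difference(hi, alpha, beta, gamma) < q:
        hi *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if cdf_difference(mid, alpha, beta, gamma) < q:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Quantiles for the first parameter combination listed in Table 3.
for level in (0.99, 0.95, 0.90, 0.75, 0.50, 0.25):
    print(level, round(quantile_difference(level, 0.5, 0.5, 0.1), 4))
```

The printed values can be compared with the first row of Table 3 as a check on the derivation.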

Table 3.

The quantiles of the distribution of difference.

q
α	β	γ	0.99	0.95	0.90	0.75	0.50	0.25
0.5	0.5	0.1	8.6212	5.2647	3.7967	1.8213	0.2854	−0.9533
0.7	0.5	0.1	5.9191	3.4993	2.4416	1.0214	−0.0805	−1.3052
1.0	0.5	0.1	3.9053	2.2013	1.4582	0.4638	−0.3821	−1.6305
1.5	0.5	0.1	2.3773	1.2410	0.7472	0.0893	−0.6612	−1.9365
2.0	0.5	0.1	1.6460	0.7970	0.4291	−0.0646	−0.8194	−2.1129
0.5	1.0	0.1	8.7548	5.4580	4.0294	2.1291	0.6799	−0.2014
0.7	1.0	0.1	6.0336	3.7205	2.6972	1.3379	0.3034	−0.4172
1.0	1.0	0.1	4.0909	2.4339	1.7174	0.7668	0.0449	−0.6201
1.5	1.0	0.1	2.5656	1.4620	0.9855	0.3542	−0.1470	−0.8153
2.0	1.0	0.1	1.8231	0.9973	0.6410	0.1693	−0.2584	−0.9304
0.5	0.5	0.5	9.0262	5.7312	4.2888	2.3303	0.7491	−0.3595
0.7	0.5	0.5	6.3419	3.9555	2.9042	1.4657	0.2908	−0.6942
1.0	0.5	0.5	4.3086	2.6064	1.8516	0.8125	−0.0433	−1.0322
1.5	0.5	0.5	2.7129	1.5512	1.0338	0.3195	−0.3437	−1.3865
2.0	0.5	0.5	1.9156	1.0316	0.6377	0.0947	−0.5316	−1.6139
0.5	1.0	0.5	9.0416	5.7607	4.3314	2.4093	0.9033	−0.0255
0.7	1.0	0.5	6.3667	4.0002	2.9658	1.5711	0.4768	−0.2354
1.0	1.0	0.5	4.3467	2.6699	1.9353	0.9439	0.1680	−0.4422
1.5	1.0	0.5	2.7679	1.6351	1.1388	0.4704	−0.0525	−0.6531
2.0	1.0	0.5	1.9810	1.1258	0.7517	0.2493	−0.1732	−0.7847
0.5	0.5	1.0	9.1108	5.8458	4.4234	2.5028	0.9601	−0.0959
0.7	0.5	1.0	6.4461	4.0898	3.0567	1.6482	0.4935	−0.3926
1.0	0.5	1.0	4.4304	2.7538	2.0124	0.9905	0.1340	−0.7044
1.5	0.5	1.0	2.8435	1.6972	1.1854	0.4714	−0.1576	−1.0484
2.0	0.5	1.0	2.0419	1.1645	0.7705	0.2174	−0.3432	−1.2809
0.5	1.0	1.0	9.1155	5.8555	4.4383	2.5342	1.0342	0.0836
0.7	1.0	1.0	6.4543	4.1060	3.0805	1.6949	0.5939	−0.1185
1.0	1.0	1.0	4.4443	2.7796	2.0489	1.0564	0.2635	−0.3181
1.5	1.0	1.0	2.8665	1.7367	1.2385	0.5598	0.0178	−0.5291
2.0	1.0	1.0	2.0724	1.2140	0.8350	0.3189	−0.1047	−0.6658


The physical interpretation of a quantile, which locates the difference between the scores by the two teams in a certain game, is the point spread. Hence, Table 3 also provides a point spread assessment scheme. In Table 3, for α=0.5, β=0.5 and γ=0.1, one can be 95% confident that the difference between the scores of the two teams will not exceed 5.27. This means that the favourite team will win by a margin of at most 5 points, which provides an incentive to fix the point spread below 5 in order to win the bet. The point spread should be fixed below 5 because at 5 or above the chances of a push or of losing the bet are greater than those of winning the bet. Although the point spread is an arbitrary number that depends upon the contemporary facts about the particular games and teams, the quantiles help consultants, bettors and bookies to assess the possible range of point spreads and ultimately to make their decisions. The usefulness of these quantiles is not restricted to sports data; it extends to many other situations where the behaviour of differences of random variables with a linear exponential distribution is of vital importance.
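The confidence statement above can be turned around: for a given non-negative spread s, the model gives the probability that the favourite's margin exceeds s. The Python sketch below evaluates $P (D > s) = \exp (- αs) \{1 - α I_{0} (s)\}$ for $s \geq 0$, with $I_{0}$ as in (6). This survival form is derived here by conditioning on X and is an assumption of the example rather than a formula from the paper. The function name `cover_probability` and the spreads in the usage lines are illustrative; the parameter values in the loop are the ML estimates reported in Section 4.1.

```python
import math


def cover_probability(spread, alpha, beta, gamma):
    """P(D > spread) for spread >= 0 and gamma > 0, using I_0 from equation (6)."""
    if spread < 0:
        raise ValueError("this sketch only handles non-negative spreads")
    x = (alpha + beta + gamma * spread) / (2.0 * math.sqrt(gamma))
    if x > 25.0:
        # Large-x expansion of exp(x**2) * erfc(x); avoids overflow.
        scaled = (1.0 - 0.5 / (x * x)) / (x * math.sqrt(math.pi))
    else:
        scaled = math.exp(x * x) * math.erfc(x)
    i0 = 0.5 * math.sqrt(math.pi / gamma) * scaled
    return math.exp(-alpha * spread) * (1.0 - alpha * i0)


# Probability that the margin exceeds a few spreads under the fitted estimates.
for s in (3, 7, 10):
    print(s, cover_probability(s, 0.0998, 0.0721, 0.0071))
```

For α=0.5, β=0.5 and γ=0.1, evaluating the function at the tabulated 0.95 quantile should return a value close to 0.05, which matches the reading of Table 3 given above.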

6. Conclusion and discussion

We introduce a new stochastic model to analyse the NFL data and provide a point spread assessment scheme. Since the correlation between the scores by the favourite and the scores by the underdog in an NFL game is low, i.e. −0.061, we need a distribution such as the BALE distribution, which is quite suitable for modelling low and moderate negative dependence. The results of the simulation study indicate that the model parameters are quite stable and consistent: the estimated values of the model parameters move close to the true values and their standard errors decrease as the sample size increases, in accordance with asymptotic normal theory. In addition, the adequate fit of the model to the observed data supports its suitability, and the comparison of the proposed model with some existing distributions favours it. We therefore find that our proposed model is flexible enough to explain the NFL data adequately. Moreover, point spreads are estimated by using the quantiles of the proposed model. This estimation of the point spreads depends not only on the values of the parameters but also on the level of confidence assumed when fixing the point spread. As the level of confidence decreases, the point spread for the favourite team also decreases, which helps bookies and consultants to select an appropriate point spread. The tabulated quantiles are also helpful for drawing inferences that support better strategies to win a bet. The quantiles computed from previous data on two particular teams help to select the point spread for their upcoming football game. In addition, the use of quantiles of the proposed model offers consultants a new way to analyse the NFL data, find worthwhile clues for fixing the point spread and adopt appropriate betting strategies.

Sports industry, being one of the largest profit making industries, proves to be an attractive market for bookies, bettors and wagerers. Some authors and analysts state that football betting market is uncertain but majority of the professionals conclude that this market becomes stable if the betting strategy is handled tactfully. The new proposed model and the assessment of point spreads by using quantiles play an important role to handle the betting strategy tactfully. Now bookies, wagerers, bettors and consultants can be more confident while putting money on an NFL game that ensures the stability of the betting market convincing more and more people to wager eagerly.

Acknowledgments

The authors appreciate and acknowledge the suggestions/comments made by the reviewers as well as the associate editor which certainly helped to improve the paper.

Disclosure statement

The authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

References

1. Baker R.D. and McHale I.G., Forecasting exact scores in national football leagues games, Int. J. Forecast. 29 (2013), pp. 122–130.
2. Blom G., Statistical Estimates and Transformed Beta-variables, John Wiley & Sons, New York, 1958.
3. Braga A.D.S., Cordeiro G.M., Ortega E.M.M., and Cruz J.N.D., The odd log-logistic normal distribution: theory and applications in analysis of experiments, J. Stat. Theory. Pract. 10 (2016), pp. 311–335.
4. Carlin B.P., Improved NCAA basketball tournament modeling via point spread and team strength information, Am. Stat. 50 (1996), pp. 39–43.
5. Chambers J., Cleveland W., Kleiner B., and Tukey P., Graphical Methods for Data Analysis, Chapman & Hall/CRC, London, 1983.
6. Cox D.R. and Isham V., Point Processes, Chapman & Hall, London, 1980.
7. Glickman M.E. and Stern H.S., A state-space model for national football league scores, J. Am. Stat. Assoc. 93 (1998), pp. 25–35.
8. Gupta R.D. and Kundu D., Generalized exponential distribution, Australian and New Zealand Journal of Statistics 41 (1999), pp. 173–188.
9. Henningsen A. and Toomet O., maxLik: A package for maximum likelihood estimation in R, Computational Statistics 26 (2011), pp. 443–458.
10. Irwin J.O., The frequency distribution of the difference between two independent variates following the same Poisson distribution, Journal of the Royal Statistical Society, Series A 100 (1937), pp. 415–416.
11. Karlis D. and Ntzoufras I., Analysis of sports data using bivariate Poisson models, J. R. Stat. Soc., Ser. D 52 (2003), pp. 381–393.
12. King R., skewt. R package, preprint (2015). Available at https://cran.r-project.org/web/packages/skewt/skewt.pdf.
13. Koenker R. and Machado J., Goodness of fit and related inference processes for quantile regression, J. Am. Stat. Assoc. 94 (1999), pp. 1296–1309.
14. Kotz S., Balakrishnan N., and Johnson N.L., Continuous Multivariate Distributions: Models and Applications, John Wiley & Sons, New York, 2000.
15. Lange K., Numerical Analysis for Statisticians, 2nd ed., Springer, New York, 2010.
16. Lindsey J. and Swihart B., rmutil. R package, preprint (2018). Available at https://cran.r-project.org/web/packages/rmutil/rmutil.pdf.
17. MacDonald S.S. and Dare W.H., A generalized model for testing the home and favorite team advantage in point spread markets, J. Financ. Econ. 40 (1996), pp. 295–318.
18. Marshall A.W. and Olkin I., A multivariate exponential distribution, J. Am. Stat. Assoc. 62 (1967(a)), pp. 30–44.
19. Matthews G.J., Improving paired comparison models for NFL point spreads by data transformation, Master Thesis, Worcester Polytechnic Institute, 2005.
20. Mohsin M., Kazianka H., Pilz J., and Gebhardt A., A new bivariate exponential distribution for modeling moderately negative dependence, Statistical Methods and Applications 23 (2014), pp. 123–148.
21. Pelechrinis K. and Papalexakis E., The anatomy of American football: evidence from 7 years of NFL game data, PLoS. ONE. 11 (2016), pp. e0168716. doi: 10.1371/journal.pone.0168716.
22. Prudnikov A.P., Brychkov Y.A., and Marichev O.I., Integrals and Series, Vol. 1, Gordon and Breach Science, New York, 1986.
23. Regoli G., A class of bivariate exponential distribution, J. Multivar. Anal. 100 (2009), pp. 1261–1269.
24. Sarhan A.M. and Balakrishnan N., A new class of bivariate distributions and its mixture, J. Multivar. Anal. 98 (2007), pp. 1508–1527.
25. Skellam J.G., The frequency distribution of the difference between two Poisson variates belonging to different populations, Journal of the Royal Statistical Society, Series A 109 (1946), pp. 296.
26. Stern H., On the probability of winning a football game, Am. Stat. 45 (1991), pp. 179–183.
27. Stern H., A statistician reads the sports pages, Chance 11 (1998), pp. 17–21.
28. Szalkowski G. and Nelson M.L., The performance of betting lines for predicting the outcome of NFL games, Old Dominion University, Department of Computer Science, 2012, pp. 1–26.
29. Warner J., Predicting Margin of Victory in NFL Games: Machine Learning vs. the Las Vegas Line, Project: Department of Computer Science, Cornell University, 2010.
30. Yu K. and Zhang J., A three-parameter asymmetric Laplace distribution and its extension, Commun. Stat.-Theor. Meth. 34 (2005), pp. 1867–1879.

