A new bivariate Poisson distribution via conditional specification: properties and applications

Indranil Ghosh; Filipe Marques; Subrata Chakraborty

doi:10.1080/02664763.2020.1793307

. 2020 Jul 14;48(16):3025–3047. doi: 10.1080/02664763.2020.1793307

A new bivariate Poisson distribution via conditional specification: properties and applications

Indranil Ghosh ^a,^CONTACT, Filipe Marques ^b, Subrata Chakraborty ^c

PMCID: PMC9041993 PMID: 35707254

ABSTRACT

In this article, we discuss a bivariate Poisson distribution whose conditionals are univariate Poisson distributions and the marginals are not Poisson which exhibits negative correlation. Some useful structural properties of this distribution namely marginals, moments, generating functions, stochastic ordering are investigated. Simple proofs of negative correlation, marginal over-dispersion, distribution of sum and conditional given the sum are also derived. The distribution is shown to be a member of the multi-parameter exponential family and some natural but useful consequences are also outlined. Parameter estimation with maximum likelihood is implemented. Copula-based simulation experiments are carried out using Bivariate Normal and the Farlie–Gumbel–Morgenstern copulas to assess how the model behaves in dealing with the situation. Finally, the distribution is fitted to seven bivariate count data sets with an inherent negative correlation to illustrate suitability.

Keywords: Bivariate Poisson distribution, conditional specification, negative correlation, bivariate copula, copula-based simulation, English premier league data, seeds and plant grown data

1. Introduction

Bivariate discrete Poisson distributions have enjoyed a good amount of attention over the last couple of decades or so. These distributions are quite useful in modeling paired count data exhibiting correlation and possibly representing over and under dispersion. Paired count data arise in a wide context including marketing (number of purchases of different products), epidemiology (incidents of different diseases in a series of districts), accident analysis (number of road accidents in a site before and after infrastructure changes), medical research (the number of seizures before and after treatment), sports (the number of goals scored by each one of the two opponent teams in soccer), econometrics (number of voluntary and involuntary job changes), survival analysis, queuing and branching process just to name a few. However, not much is discussed and studied in thorough details on a correlated bivariate Poisson distribution with a negative correlation. The most celebrated form of a bivariate Poisson distribution that starts with three independent Poisson random variables $X_{i}, i = 1, 2, 3$ with $X_{i} \sim P o i s s o n (λ_{i}) .$ Then, the random variables $Y_{1} = X_{1} + X_{3},$ and $Y_{2} = X_{2} + X_{3}$ is said to follow jointly a bivariate Poisson distribution with the following probability mass function (p.m.f.)

P (Y_{1} = y_{1}, Y_{2} = y_{2}) = \exp (- [λ_{1} + λ_{2} + λ_{3}]) \frac{λ_{1}^{y_{1}} λ_{2}^{y_{2}}}{y_{1}! y_{2}!} \sum_{i = 0}^{min {y_{1}, y_{2}}} (\binom{y_{1}}{i}) (\binom{y_{2}}{i}) i {[\frac{λ_{3}}{λ_{1} + λ_{2}}]}^{i} .

For a comprehensive treatment of the bivariate Poisson distribution and its multivariate extensions, the reader can refer to [12] and [9]. Paul and Ho [21] discussed in the estimation of the bivariate Poisson distribution and hypothesis testing regarding independence.

The major limitation of the above bivariate distribution is that it is applicable to data which allows for positive dependence between the two random variables only. Additionally, since its marginal distributions are Poisson (with parameters $λ_{1} + λ_{3}$ and $λ_{2} + λ_{3}$ respectively), it cannot be used to model over-dispersion/under-dispersion. Several earlier works on the bivariate Poisson distribution utilizes the above joint p.m.f. Mimicking the words/phrases provided in [14], ‘It should be worth noting that the bivariate Poisson distribution reported by Teicher [24], Campbell [5], Holgate [10] has the inherent limitation that the correlation is necessarily positive, and hence is not useful in modeling the situations (e.g. in biology) involving competition for limited resources where a negative correlation is expected.’ As a consequence, Lakshminarayana et al. [14] studied a bivariate distribution whose marginals are Poisson developed as a product of Poisson marginals with a multiplicative factor. The resulting model exhibits both positive and negative correlation. The one apparent limitation of the bivariate Poisson model as described in [14] could be that in the case of a boundary value of the dependence parameter (i.e. $λ_{3}$ assuming a very small negative value), the model might fail to adequately fit the data and may provide a false positive scenario. Subsequently, any test for independence in such a case under the classical likelihood ratio set up might not be appropriate.

Motivated by this, we consider a different construction approach, based on conditional specification (for details, see [3] and the references cited therein) to construct a bivariate Poisson distribution. In this approach, we start with two conditionals which are Poisson type with the parameter(s) depending on the conditioning variable and obtain the most general form of a bivariate discrete Poisson distribution that accounts for negative correlation. The associated marginal distributions of X and Y can cater to both over and under dispersion under certain parametric restrictions as shown in Section 2. Noticeably, the marginals are no more Poisson. Therefore, the resulting model can be applied to model over-dispersion/under-dispersion as the case may be. Moreover, another motivation is that we can not have Poisson conditionals with positive correlation. Therefore, in a real-life scenario, if we have dependent variables whose random nature can well be approximated by a Poisson distribution, we can be sure of the fact that the inherent dependence structure will be negative. As a consequence, the bivariate Poisson distribution via conditional specification which is considered here will be the most preferred choice. This is a salient feature which outperforms all such bivariate Poisson models discussed and developed earlier in the literature. In the literature, several truncated versions of the bivariate Poisson distribution are also considered. Among them, noteworthy of mention is a work by Hamdan [8], who introduced and studied a truncated bivariate Poisson distribution with missing zeros and discussed the method of moments for a particular scenario of parametric restriction. The p.m.f. of the truncated bivariate Poisson distribution with missing zeros is given by

P_{1} (Y_{1} = y_{1}, Y_{2} = y_{2}) = V^{- 1} \exp (- [a + b + d]) \sum_{i = 0}^{min {y_{1}, y_{2}}} \frac{d^{x_{3}} a^{y_{1} - x_{3}} d^{x_{3}} b^{y_{2} - x_{3}}}{x_{3} (y - x_{3})},

where $V = 1 - \exp (a + d) - \exp (b + d) - \exp (a + b + d),$ $(y_{1}, y_{2}) \in C,$ where $C = {Y_{1} \geq 1, Y_{2} \geq 1} .$

Adamidis and Loukas [1] discussed the estimation of the above model using the EM algorithm. Dahiya [6] discussed the method of maximum-likelihood estimates for the special case $a = b .$ Kocherlakota and Kocherlakota [13] studied some problems associated with count data from a bivariate Poisson distribution, in which the marginal means are functions of explanatory variables.

Success of any new model heavily depends on its suitability in handling data observed in real situations. As such after investigating various useful properties, we consider a number negatively correlated bivariate count data sets from different contexts to justify its applicability. In what follows, we will first motivate with five data sets from the highly popular English Premier Football League (https://datahub.io/sports-data/english-premier-league-data) where the goal scored by the home team and away team in each game is seen to exhibit negative dependency. We consider data sets arising from five seasons, namely, 2014/15, 2015/16, 2016/17, 2017/18 and 2018/19 in order to understand the general tendency of the negative correlation that exists. For each season we have 380 observed pairs of counts, (Full Time Home Team Goal, Full time Away Team Goal). Both counts (in each data set) are over dispersed and are negatively correlated which are the two chief characteristics of our model. From the findings of our modeling, it can be argued that the proposed distribution provides an adequate fit to four out of these five observed bivariate data sets barring the season 2017/18.

Next, in order to further check the proposed model with some existing three parameter bivariate count models, we consider the example from [2] where the authors have analyzed a negatively correlated bivariate count data that originate as counts of surface faults (X) and interior faults (Y) of lenses. Findings from data fitting reveal that the proposed model is better than McKendrick–Wicksell bivariate Poisson distribution [2], BPHD (the Bivariate Poisson distribution proposed by Holgate [8] and the bivariate geometric [13].

Finally, we consider another negatively correlated bivariate data about the number of seeds and plants grown over a plot of size five square feet and plants grown studied in [20] and [12]. Here the observed small negative correlation is again −0.094 and the proposed distribution fit the data very well.

The rest of the paper is organized as follows. In Section 2, introduce some structural properties of the proposed bivariate Poisson distribution via conditional specification which was described in [3] and provide the expression for associated marginal p.m.f.'s, of X and Y. In Section 3, we provide some useful structural properties of the bivariate Poisson distribution described in Section 2. Section 4 deals with the statistical inference of the bivariate Poisson distribution under the classical setup using copula-based simulation study. Some well-known paired count data exhibiting negative correlation are re-analyzed to illustrate the applicability of this distribution in Section 5. Finally, some concluding remarks are provided in Section 6.

2. Bivariate dependent Poisson distribution via conditional specification

Let us assume the following:

$X | Y = y \sim P o i s s o n (λ_{1} λ_{3}^{y}),$ for each fixed $Y = y .$
$Y | X = x \sim P o i s s o n (λ_{2} λ_{3}^{x}),$ for each fixed $X = x .$

Here, $(λ_{1}, λ_{2}) > 0, 0 < λ_{3} \leq 1.$ Note that if $λ_{3} = 1,$ X and Y are independent. According to [3, see Theorem 4.1, page 76], the associated joint p.m.f. will be

P (X = x, Y = y) = K (λ_{1}, λ_{2}, λ_{3}) \times \frac{λ_{1}^{x} λ_{2}^{y} λ_{3}^{x y}}{x! y!},

(1)

where $x = 0, 1, 2, \dots; y = 0, 1, 2, \dots,$ and $K (λ_{1}, λ_{2}, λ_{3})$ is the normalizing constant and

K^{- 1} = K^{- 1} (λ_{1}, λ_{2}, λ_{3}) = \sum_{y = 0}^{\infty} \frac{λ_{2}^{y}}{y!} \exp (λ_{1} λ_{3}^{y}) = \sum_{x = 0}^{\infty} \frac{λ_{1}^{x}}{x!} \exp (λ_{2} λ_{3}^{x}) .

We will denote (henceforth, in short) the bivariate Poisson distribution of the pair $(X, Y)$ with the p.m.f. in (1) as $B P D (λ_{1}, λ_{2}, λ_{3}) .$ Note that the joint p.m.f. in (1) is also known as Obrechkoff's distribution [20]. It is also independently derived in [27].

Note that

The marginal p.m.f. of X will be
$\begin{aligned} P (X = x) & = K (λ_{1}, λ_{2}, λ_{3}) \frac{λ_{1}^{x}}{x!} \sum_{y = 0}^{\infty} \frac{{(λ_{2} λ_{3}^{x})}^{y}}{y!} \\ = K (λ_{1}, λ_{2}, λ_{3}) \frac{λ_{1}^{x}}{x!} \exp (λ_{2} λ_{3}^{x}), \end{aligned}$ (2)
for $= 0, 1, 2, \dots .$
Similarly, the marginal p.m.f. of Y will be
$P (Y = y) = K (λ_{1}, λ_{2}, λ_{3}) \frac{λ_{2}^{y}}{y!} \exp (λ_{1} λ_{3}^{y}),$ (3)
for $y = 0, 1, 2, \dots .$
For fixed $λ_{3},$ $P (X = x, Y = y | λ_{1}, λ_{2}, λ_{3}) = P (Y = y, X = x | λ_{2}, λ_{1}, λ_{3}) .$

Thus, if $λ_{1} = λ_{2} = λ, s a y$ then $P (X = x, Y = y | λ, λ_{3}) = P (Y = y, X = x | λ, λ_{3}) .$ Therefore, in this situation, we get symmetric shapes which are observed in Figures 3 and 4.

Figure 2. — The p.m.f. function for $λ_{1} = 10, λ_{2} = 2, λ_{3} = 0.5$ with correlation = −0.195.

Figure 5. — The p.m.f. function for $λ_{1} = 2, λ_{2} = 2, λ_{3} = 0.03,$ with correlation = −0.556.

Figure 3. — The p.m.f. function for $λ_{1} = 10, λ_{2} = 10, λ_{3} = 0.5$ with correlation = −0.812.

Figure 4. — The p.m.f. function for $λ_{1} = 30, λ_{2} = 30, λ_{3} = 0.1$ with correlation = −0.937.

Some representative p.m.f. plots for varying parameter choices are provided in Figures 1– 6 for the joint p.m.f. in (1). The negative dependence is quite apparent ; moreover, it can be easily seen that, for larger values of $λ_{1}, a n d λ_{2},$ the marginals tend to assume a normal shape.

Figure 1. — The p.m.f. function for $λ_{1} = 0.5, λ_{2} = 0.5, λ_{3} = 0.5$ with correlation = −0.2.

Figure 6. — The p.m.f. function for $λ_{1} = 2, λ_{2} = 2, λ_{3} = 0.97$ with correlation = −0.057.

3. Structural properties

Note that since, $X | Y = y \sim P o i s s o n (λ_{1} λ_{3}^{y}),$ $E (X | Y = y) = λ_{1} λ_{3}^{y} .$ Consequently,

\begin{aligned} E (X) & = E_{Y} (X | Y = y) \\ = λ_{1} K (λ_{1}, λ_{2}, λ_{3}) \sum_{y = 0}^{\infty} λ_{3}^{y} \frac{λ_{2}^{y}}{y!} \exp (λ_{1} λ_{3}^{y}) \\ = \frac{λ_{1} K (λ_{1}, λ_{2}, λ_{3})}{K_{1}}, \end{aligned}

(4)

where $K_{1} = K (λ_{1}, λ_{2} λ_{3}, λ_{3}),$ and

K_{1}^{- 1} = K^{- 1} (λ_{1}, λ_{2} λ_{3}, λ_{3}) = \sum_{y = 0}^{\infty} \frac{{(λ_{2} λ_{3})}^{y}}{y!} \exp (λ_{1} λ_{3}^{y}) .

Similarly,

E (Y) = \frac{K (λ_{1}, λ_{2}, λ_{3}) λ_{2}}{K_{2}},

(5)

where

K_{2}^{- 1} = K^{- 1} (λ_{1} λ_{3}, λ_{2}, λ_{3}) = \sum_{x = 0}^{\infty} \frac{{(λ_{1} λ_{3})}^{x}}{x!} \exp (λ_{2} λ_{3}^{x}) .

Furthermore,

\begin{aligned} E (X Y) & = K (λ_{1}, λ_{2}, λ_{3}) \sum_{x = 0}^{\infty} \sum_{y = 0}^{\infty} \frac{λ_{1}^{x} λ_{2}^{y} λ_{3}^{x y}}{(x - 1) (y - 1)} \\ = (λ_{1} λ_{2} λ_{3}) K (λ_{1}, λ_{2}, λ_{3}) \sum_{ℓ_{1} = 0}^{\infty} \sum_{ℓ_{2} = 0}^{\infty} \frac{{(λ_{1} λ_{3})}^{ℓ_{1}} {(λ_{2} λ_{3})}^{ℓ_{2}} λ_{3}^{ℓ_{1} ℓ_{2}}}{ℓ_{1}! ℓ_{2}!} \\ = \frac{(λ_{1} λ_{2} λ_{3}) K (λ_{1}, λ_{2}, λ_{3})}{K_{3}}, \end{aligned}

(6)

where $K_{3} = K (λ_{1} λ_{3}, λ_{2} λ_{3}, λ_{3}),$ and

K_{3}^{- 1} = \sum_{ℓ_{1} = 0}^{\infty} \sum_{ℓ_{2} = 0}^{\infty} \frac{(λ_{1} λ_{3})^{ℓ_{1}} (λ_{2} λ_{3})^{ℓ_{2}} λ_{3}^{ℓ_{1} ℓ_{2}}}{ℓ_{1}! ℓ_{2}!} .

One can find the individual variances as well. For example, from the conditional of X given $Y = y,$ $V a r (X | Y = y) = λ_{1} λ_{3}^{y} .$

Utilizing this fact in the following expression, we can obtain

\begin{aligned} V a r (X) & = E (V a r (X | Y = y)) + V a r (E (X | Y = y)) \\ = λ_{1} K (λ_{1}, λ_{2}, λ_{3}) [\frac{1}{K_{1}} + λ_{1} \{\frac{1}{K (λ_{1}, \sqrt{λ_{2}} λ_{3}, λ_{3})} - \frac{K (λ_{1}, λ_{2}, λ_{3})}{K_{1}^{2}}\}] . \end{aligned}

Therefore, the covariance between X and Y will be

\begin{aligned} C o v (X, Y) & = E (X Y) - E (X) E (Y) \\ = \frac{K (λ_{1}, λ_{2}, λ_{3})}{K_{3}} - \{\frac{K (λ_{1}, λ_{2}, λ_{3}) λ_{1}}{K_{1}}\} \{\frac{K (λ_{1}, λ_{2}, λ_{3}) λ_{2}}{K_{2}}\} \\ = λ_{1} λ_{2} K (λ_{1}, λ_{2}, λ_{3}) [\frac{λ_{3}}{K_{3}} - \frac{K (λ_{1}, λ_{2}, λ_{3})}{K_{1} K_{2}}] \\ = λ_{1} λ_{2} K (λ_{1}, λ_{2}, λ_{3}) [\frac{λ_{3}}{K (λ_{1} λ_{3}, λ_{2} λ_{3}, λ_{3})} - \frac{K (λ_{1}, λ_{2}, λ_{3})}{K (λ_{1} λ_{3}, λ_{2}, λ_{3}) K (λ_{1}, λ_{2} λ_{3}, λ_{3})}] . \end{aligned}

(7)

We list some results related to the sum $K^{- 1} = K^{- 1} (λ_{1}, λ_{2}, λ_{3})$ which will be used in derivation of results in sequel.

\begin{aligned} K^{- 1} (0, λ_{2}, λ_{3}) & = \exp (λ_{2}) \\ K^{- 1} (λ_{1}, 0, λ_{3}) & = \exp (λ_{1}) \\ K^{- 1} (λ_{1}, λ_{2}, 1) & = \exp (λ_{1} + λ_{2}) \\ K^{- 1} (λ_{1}, λ_{2}, 0) & = \exp (λ_{1} + λ_{2} - 1) \\ K_{2}^{- 1} (0, 0, λ_{3}) & = K_{1}^{- 1} (0, 0, λ_{3}) = 1 \\ K^{- 1} (λ_{1}, λ_{2}, λ_{3}) & = K^{- 1} (λ_{2}, λ_{1}, λ_{3}) \\ \frac{d}{d λ_{1}} K^{- 1} (λ_{1}, λ_{2}, λ_{3}) & = K^{- 1} (λ_{1}, λ_{2} λ_{3}, λ_{3}) \\ max {\exp (λ_{1}), \exp (λ_{2})} & < K^{- 1} (λ_{1}, λ_{2}, λ_{3}) \\ < \exp (λ_{1} + λ_{2}) . \end{aligned}

Theorem 3.1

If $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ then $C o v (X Y) < 0.$

Proof.

From (7), since $λ_{1}, λ_{2} > 0,$ it is sufficient to show that for any $0 < λ_{3} < 1,$

$\frac{K (λ_{1}, λ_{2}, λ_{3})}{λ_{3} K (λ_{1} λ_{3}, λ_{2}, λ_{3})} \times \frac{K (λ_{1} λ_{3}, λ_{2} λ_{3}, λ_{3})}{K (λ_{1}, λ_{2} λ_{3}, λ_{3})} > 1.$ (8)

Next, observe that for any $0 < λ_{3} < 1,$ one can write

$\frac{λ_{3}}{K (λ_{1}, λ_{2}, λ_{3})} < \frac{1}{K (λ_{1}, λ_{2}, λ_{3})} < \frac{1}{K (λ_{1} λ_{3}, λ_{2}, λ_{3})} .$ (9)

Therefore,

$\frac{K (λ_{1}, λ_{2}, λ_{3})}{λ_{3} K (λ_{1} λ_{3}, λ_{2}, λ_{3})} > 1.$

Similarly, for any $0 < λ_{3} < 1,$

$\frac{K (λ_{1} λ_{3}, λ_{2} λ_{3}, λ_{3})}{K (λ_{1}, λ_{2} λ_{3}, λ_{3})} > 1.$ (10)

Our result immediately follows on combining (9) and (10). Hence, the proof. Alternatively, it can be verified from the following Corollary (see, Sundt (2000) for details).

Corollary 3.1

The elements of a random vector with infinitely divisible distribution on $N_{m}$ are non-negatively correlated, where $N_{m}$ denote the set of all $m \times 1$ vectors where all elements are non-negative integers.

Therefore, X and Y are negative quadrant dependent, i.e. for every pair of decreasing functions $ℓ_{1} (.)$ and $ℓ_{2} (.),$ it follows that $C o v (ℓ_{1} (X), ℓ_{2} (Y)) \leq 0.$ Moreover, in view of the fact that

S_{X, Y} (x, y) - S_{X} (x) S_{Y} (y) = F_{X, Y} (x, y) - F_{X} (x) F_{Y} (y),

one may readily observe that $S_{X, Y} (x, y) \leq S_{X} (x) S_{Y} (y) .$

Lemma 3.1

If $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ then for any $0 < λ_{3} \leq 1,$ the following holds (for any $(m_{1}, m_{2}) \in R^{+},$

$\sum_{n = 0}^{\infty} \frac{m_{1}^{n}}{n!} \exp [m_{2} λ_{3}^{n}] = \sum_{n = 0}^{\infty} \frac{m_{2}^{n}}{n!} \exp [m_{1} λ_{3}^{n}] .$

Proof.

Since the above is an identity, it can be rewritten as (i.e. we need to show equivalently)

$\sum_{n = 0}^{\infty} \frac{m_{1}^{n}}{n!} \exp [- m_{1} λ_{3}^{n}] = \sum_{n = 0}^{\infty} \frac{m_{2}^{n}}{n!} \exp [- m_{2} λ_{3}^{n}] .$

Next, the left-hand side of the above can be written as

$\sum_{n = 0}^{\infty} \frac{m_{1}^{n}}{n!} \exp [- m_{1} λ_{3}^{n}] = \sum_{n = 0}^{\infty} \frac{m_{1}^{n}}{n!} {\{\exp [λ_{3}^{n}]\}}^{- m_{1}} = \exp (m_{1}),$

because, for $0 < λ_{3} \leq 1,$ and as n increases and goes to infinity, $\exp [λ_{3}^{n}] = \exp (0) = 1.$ By a similar argument, the right-hand side will be equal to $\exp (m_{2}) .$ Then, on rearrangement of these two terms on both sides of the identity, we get the desired result. Hence, the proof.

Lemma 3.1 will be utilized in establishing the result in Theorem 3.3 which derives the moment generating function (m.g.f.) and the probability generating function (p.g.f.) for the bivariate joint p.m.f. in (1). Notice that similar version of this result has independently observed by [3] [see Exercise problem 4.2(b), page 100].

Theorem 3.2

If $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ then the joint the joint factorial moment will be given by

$E (X_{(r)} Y_{(s)}) = λ_{1}^{r} λ_{2}^{s} λ_{3}^{r s} \frac{K (λ_{1}, λ_{2}, λ_{3})}{K (λ_{1} λ_{3}^{s}, λ_{2} λ_{3}^{r}, λ_{3})} .$

Proof.

Simple and thus excluded.

Theorem 3.3

If $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ then the joint probability generating function (p.g.f.) and the joint moment generating function (m.g.f.) will be

$G_{X, Y} (s_{1}, s_{2}) = \frac{K (λ_{1}, λ_{2}, λ_{3})}{K_{2} (λ_{1} s_{1}, λ_{2} s_{2}, λ_{3})} = \frac{K (λ_{1}, λ_{2}, λ_{3})}{K_{1} (λ_{1} s_{1}, λ_{2} s_{2}, λ_{3})},$

for $| s_{1} | < 1, | s_{2} | < 1.$

$M_{X, Y} (t_{1}, t_{2}) = \frac{K (λ_{1}, λ_{2}, λ_{3})}{K_{2} (λ_{1} \exp (t_{1}), λ_{2} \exp (t_{2}), λ_{3})} = \frac{K (λ_{1}, λ_{2}, λ_{3})}{K_{1} (λ_{1} \exp (t_{1}), λ_{2} \exp (t_{2}), λ_{3})} .$

Proof.

Using Lemma 3.1, our result immediately follows. Therefore, for brevity, the details are excluded.

Corollary 3.2

From the bivariate p.g.f. we can obtain the corresponding probabilities by the following formula:

$P (X = x, Y = y) = \frac{1}{x! y!} \frac{\partial^{x + y}}{\partial s_{1}^{x} \partial s_{2}^{y}} G_{X, Y} (s_{1}, s_{2}) |_{s_{1} = 0, s_{2} = 0} .$

For example,

Setting $s_{1} = 0, s_{2} = 0,$ $P (X = x, Y = y) = K (λ_{1}, λ_{2}, λ_{3}) K_{2}^{- 1} (0, 0, λ_{3}) = K (λ_{1}, λ_{2}, λ_{3}) .$

Next, consider
$P (X = 1, Y = 1) = \frac{\partial^{2}}{\partial s_{1} \partial s_{2}} G_{X, Y} (s_{1}, s_{2}) |_{s_{1} = 0, s_{2} = 0} .$
Now,
$\frac{\partial^{2}}{\partial s_{1} \partial s_{2}} G_{X, Y} (s_{1}, s_{2}) = (λ_{1} λ_{2} λ_{3}) \exp (λ_{1} λ_{3} s_{1} + λ_{2} λ_{3}^{t + 1} s_{2}),$
where $t = x - 1.$

Therefore,
$P (X = 1, Y = 1) = \frac{\partial^{2}}{\partial s_{1} \partial s_{2}} G_{X, Y} (s_{1}, s_{2}) |_{s_{1} = 0, s_{2} = 0} = (λ_{1} λ_{2} λ_{3}) K (λ_{1}, λ_{2}, λ_{3})$

Notice that these individual joint probabilities exactly matches with the joint probabilities that can be obtained from (1) by substituting $(x, y) = {(0, 0), (1, 1) \dots} .$ Using the joint p.g.f. different moments and product moments can be obtained.

Theorem 3.4

If $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ then we have the following results.

$P (X + Y = n) = K (λ_{1}, λ_{2}, λ_{3}) \sum_{i \geq 0}^{n} (\binom{n}{i}) λ_{1}^{i} λ_{2}^{n - i} λ_{3}^{i (n - i)} .$ In particular $X + Y \sim P (λ_{1} + λ_{2}),$

if $λ_{3} = 1.$

$P (X = x ∣ X + Y = n) = \frac{(\binom{n}{x}) {(\frac{λ_{1} λ_{3}^{n - x}}{λ_{2}})}^{x}}{\sum_{i \geq 0}^{n} (\binom{n}{i}) {(\frac{λ_{1} λ_{3}^{n - x}}{λ_{2}})}^{i}} .$

The regression of X on Y is given by $E (X | Y = y) = λ_{1} λ_{3}^{y},$ and The regression of Y on X is given by $E (Y | X = x) = λ_{2} λ_{3}^{x} .$

Proof.

Part (a) can be obtained from the joint p.g.f. given in Theorem 3.2. Subsequently, part (b) can be obtained on using (a). Proof of part(c) can be obtained immediately by the information that states that both the conditionals are Poisson with respective parameters. Precisely,

Since, $X | Y = y \sim P o i s s o n (λ_{1} λ_{3}^{y}),$ therefore, the regression of X on Y will be obtained as $X = E (X | Y = y) = λ_{1} λ_{3}^{y} .$

Similarly, since, $Y | X = x \sim P o i s s o n (λ_{2} λ_{3}^{y}),$ therefore, the regression of Y on X will be obtained as $Y = E (Y | X = x) = λ_{2} λ_{3}^{y} .$

Theorem 3.5

If $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ then we have the following stochastic ordering results of the family of univariate Poisson distribution arising from the bivariate distribution in (1).

If $λ_{1} > λ_{2},$ then for any $0 < λ_{3} < 1,$ X is stochastically larger than Y. This also implies that under this parametric restriction, Y is smaller than X in the hazard rate order, mean residual life order, and the likelihood ratio order.

If $λ_{1} < λ_{2},$ then for any $0 < λ_{3} < 1,$ Y is stochastically larger than X. This also implies that under this parametric restriction, X is smaller than Y in the hazard rate order, mean residual life order, and the likelihood ratio order.

If $λ_{1} < λ_{2},$ then for any $0 < λ_{3} < min {λ_{1}, λ_{2}} < 1,$ Y is stochastically larger than X. This also implies that under this parametric restriction, X is smaller than Y in the hazard rate order, mean residual life order, and the likelihood ratio order.

If $λ_{1} > λ_{2},$ then for any $1 > λ_{3} > max {λ_{1}, λ_{2}},$ X is stochastically larger than Y. This also implies that under this parametric restriction, Y is smaller than X in the hazard rate order, mean residual life order, and the likelihood ratio order.

Proof.

The proof is quite simple and hence, the details avoided.

Theorem 3.6

If $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ then the marginals of X and Y exhibit over-dispersion.

Proof.

For over-dispersion, we need to show that the variance is greater than the mean. We start with the mean and variance of X. The case of Y can be similarly dealt. Regarding the mean and variance of X, the condition of over-dispersion reduces to establish that

$λ_{1} K (λ_{1}, λ_{2}, λ_{3}) [\frac{1}{K_{1}} + λ_{1} \{\frac{1}{K (λ_{1}, \sqrt{λ_{2}} λ_{3}, λ_{3})} - \frac{K (λ_{1}, λ_{2}, λ_{3})}{K_{1}^{2}}\}] > \frac{λ_{1} K (λ_{1}, λ_{2}, λ_{3})}{K_{1}},$

which is, equivalently can be written as

$\frac{1}{K (λ_{1}, \sqrt{λ_{2}} λ_{3}, λ_{3})} > \frac{K (λ_{1}, λ_{2}, λ_{3})}{K_{1}^{2}} .$ (11)

Next, note that (11) is true since $V a r (X) \geq 0.$ Hence, the proof.

Theorem 3.7

$(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ belongs to three parameter exponential family.

Proof.

The joint p.m.f. of $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ will be a member of the 3-parameter exponential family if its p.m.f. can be expressed in the form

$P (X = x, Y = y) = {(x! y!)}^{- 1} \exp [\sum_{j = 1}^{3} η_{j} (λ_{1}, λ_{2}, λ_{3}) T_{j} (x, y) - A (λ_{1}, λ_{2}, λ_{3})] .$ (12)

From (13), it is easy to observe that the proposed distribution belongs to the exponential family by rewriting the p.m.f. as

$P (X = x, Y = y) = [h (x, y)] \times \exp [x \log λ_{1} + y \log λ_{2} + x y \log λ_{3} - \log K^{- 1} (λ_{1}, λ_{2}, λ_{3})],$ (13)

and identifying

$\begin{aligned} h (x, y) = (x! y!)^{- 1}, \\ η_{1} (λ_{1}, λ_{2}, λ_{3}) = λ_{1}, η_{2} (λ_{1}, λ_{2}, λ_{3}) = λ_{2}, η_{3} (λ_{1}, λ_{2}, λ_{3}) = λ_{3}, \\ T_{1} (x, y) = x, T_{2} (x, y) = y, T_{3} (x, y) = x y, \\ a n d A (λ_{1}, λ_{2}, λ_{3})) = \log K^{- 1} (λ_{1}, λ_{2}, λ_{3}) . \end{aligned}$

Thus, based on a sample of size m from BPD, $(\sum x, \sum y, \sum x y)$ is completely sufficient for $(λ_{1}, λ_{2}, λ_{3}) .$

Distributions belonging to exponential family enjoy many properties. For example mean, variance, co-variance and moment generating functions can be easily derived using differentiation's of $A (λ_{1}, λ_{2}, λ_{3})$ . Moreover using Lehmann Scheffe result, it may be possible to derive UMVUE of the parameters, provided we get hold of function of T that is unbiased for the parameter. Even otherwise one can derive MVUE implementing bias correction.

Theorem 3.8

If $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ then the modal value of the distribution will be as follows

For strictly integer-valued parameters, the modal value will be at $(λ_{1} + λ_{3} - 1, λ_{2} + λ_{3} - 1) .$

For any set of parameters that contain non-integers, the modal value will be at $(⌊*⌋ λ_{1} + λ_{3} - η_{1}, ⌊*⌋ λ_{2} + λ_{3} - η_{2}),$ where $⌊*⌋ x$ is the greatest integer function, and $η_{i} = 0, 1$ is an indicator function.

Proof.

Straightforward and hence omitted.

Theorem 3.9

If $(X, Y) \sim B P D (λ_{1}, λ_{2}, λ_{3}),$ then

$P (X < Y) = λ_{2} {[K (λ_{1}, λ_{2}, λ_{3})]}^{2} (\sum_{j = 0}^{\infty} \sum_{x = 0}^{j} (\frac{\exp (λ_{1} λ_{3}^{j + 1})}{j!}) (\frac{\exp (λ_{2} λ_{3}^{x})}{x!})) .$

Proof.

Observe that

$\begin{aligned} P (X < Y) & = \sum_{j = 0}^{\infty} P (Y = j + 1) P (X \leq j) \\ = \sum_{j = 0}^{\infty} (K (λ_{1}, λ_{2}, λ_{3}) (\frac{λ_{2}^{j + 1} \exp (λ_{1} λ_{3}^{j + 1})}{j!})) \\ \times (\sum_{x = 0}^{j} (K (λ_{1}, λ_{2}, λ_{3}) \frac{λ_{1}^{x} \exp (λ_{2} λ_{3}^{x})}{x!})) \\ = λ_{2} {[K (λ_{1}, λ_{2}, λ_{3})]}^{2} (\sum_{j = 0}^{\infty} \sum_{x = 0}^{j} (\frac{λ_{2}^{j} \exp (λ_{1} λ_{3}^{j + 1})}{j!}) (\frac{λ_{1}^{x} \exp (λ_{2} λ_{3}^{x})}{x!})) . \end{aligned}$ (14)

Note: Observe that since $0 < λ_{3} \leq 1,$ one can obtain a upper bound inequality of $P (X < Y)$ by setting $λ_{3} = 1,$ which will be as follows:

P (X < Y) \leq λ_{2} \exp (3 λ_{1} + 2 λ_{2}) (\sum_{j = 0}^{\infty} \frac{λ_{2}^{j}}{j!} \sum_{x = 0}^{j} \frac{\exp (- λ_{1}) λ_{1}^{x}}{x!}),

on using the fact that $K (λ_{1}, λ_{2}, 1) = \exp (λ_{1} + λ_{2}) .$ Furthermore, on using the inequality result related to a Poisson distribution in Sort [23], we may additionally write the above inequality as

\begin{aligned} P (X < Y) & \leq λ_{2} \exp (3 λ_{1} + 2 λ_{2}) (\sum_{j = 0}^{\infty} \frac{λ_{2}^{j}}{j!} \sum_{x = 0}^{j} \frac{\exp (- λ_{1}) λ_{1}^{x}}{x!}) \\ > λ_{2} \exp (3 λ_{1} + 2 λ_{2}) (\sum_{j = 0}^{\infty} \frac{λ_{2}^{j}}{j!} [1 - \frac{\exp (- H (λ_{1}, j))}{max {2, \sqrt{4 π H (λ_{1}, j)}}}], \end{aligned}

where $H (λ_{1}, j)$ is the KL divergence between two Poisson distributed random variables with respective means $λ_{1}$ and j. $R = P (X < Y)$ is known as the stress- strength reliability in engineering where the random variables X and Y, respectively, represent the stress and strength associated with a system. This measure is also useful in a probabilistic assessment of inequality in two phenomena X and Y. While in most cases X and Y are assumed to be independent in real life there may be dependence between X and Y. Stress-strength reliability with both variables having independent Poisson distribution was discussed in [4]. As such, the result of Theorem 9 has the potential to be used in such contexts.

4. Statistical inference

4.1. Maximum likelihood estimation

In this section, we consider the maximum-likelihood estimation of the unknown parameters of a BPD $(λ_{1}, λ_{2}, λ_{3})$ model based on a random sample of size m, namely $C = (x_{1}, y_{1}), \dots, (x_{m}, y_{m}) .$ The proposed bivariate Poisson model has three parameters. It is observed that the MLEs of the unknown parameters can be obtained by solving a three-dimensional optimization problem. However, we have used Mathematica to evaluate the estimates and the corresponding performance of the MLE in this case. For notational simplicity, we will indicate the parameter vector by $Δ = (λ_{1}, λ_{2}, λ_{3}) .$ Based on the random sample $C$ as mentioned above and using (2), the log-likelihood function can be written as

ℓ (Δ | C) = m K (λ_{1}, λ_{2}, λ_{3}) + \log λ_{1} \sum_{i = 1}^{m} x_{i} + \log λ_{2} \sum_{i = 1}^{m} y_{i} + \log λ_{3} \sum_{i = 1}^{m} x_{i} y_{i} .

(15)

The MLEs of the unknown parameters can be obtained by maximizing (15) with respect to Δ. It requires solving the following three non-linear equations

\frac{\partial ℓ (Δ | C)}{\partial λ_{1}} = 0, \frac{\partial ℓ (Δ | C)}{\partial λ_{2}} = 0, \frac{\partial ℓ (Δ | C)}{\partial λ_{3}} = 0.

(16)

Clearly, they cannot be obtained in explicit forms. Newton–Raphson method may be used to solve these non-linear equations. One may consider an appropriate EM algorithm in this case as an alternative. However, as described later on, we will consider a copula-based estimation and simulation for a BPD $(λ_{1}, λ_{2}, λ_{3})$ for varying parameter choices.

4.2. Likelihood ratio test for independence of the random variables

As suggested earlier, if $λ_{3} = 1,$ then X and Y are independent. Thus, we may conduct the test of independence of the two random variables X and Y by testing the null hypothesis

H_{0} : λ_{3} = 1 v s H_{a} : λ_{3} \neq 1 .

The likelihood ratio statistic is given by

Λ = \frac{max_{Δ \in Ω_{0}} L (Δ | C)}{max_{Δ \in Ω} L (Δ | C)},

where $L (.)$ stands for the likelihood function, $Ω_{0}$ is the space of parameters under $H_{0},$ and Ω is the space of parameters with no restrictions. It is well known that for large sample sizes, we have

- 2 \log Λ \sim_{}^{a s y m p} χ_{r - r_{0}}^{2},

where, in our testing procedure, r = 3 is the number of free parameters under the alternative hypothesis and $r_{0} = 2$ is the number of free parameters under the null hypothesis. Using the methodologies describe subsection 4.1, it is possible to implement this test and subsequently test the independence of the random variables X and Y. In a similar way, one can develop of testing the following hypothesis to check for symmetry (see item 3 on section 2)

H_{0} : λ_{1} = λ_{2} v s H_{a} : λ_{1} \neq λ_{2} .

4.3. Copula-based simulations

In order to assess the importance and efficacy of the proposed distribution, we consider a copula-based simulation study. We generate samples of size 10000 from two copulas which allow to define a negative correlation, more precisely (i) a Bivariate Normal (BN) copula with correlation ρ, and (ii) a Farlie–Gumbel–Morgenstern (FGM) copula with the dependence parameter α; both combined with two marginal Poisson distributions with parameters $m_{1}$ and $m_{2}$ , respectively. The purpose here is to understand how the bivariate distribution described in (1) may be used to model bivariate data arising from a BN or FGM copulas with Poisson marginals.

The simulation from a copula-based distribution is quite straightforward when the marginals are continuous. However, when the marginals are discrete or a mixture of discrete and continuous, a more careful approach is required, for reference, see [11,25]. In this section, we use the BN and the FGM copulas combined with marginal Poisson distributions with parameters $m_{1}$ and $m_{2}$ and corresponding cumulative distribution functions (c.d.f.'s) given by $F_{1} (x_{1})$ and $F_{2} (x_{2}) .$ This approach generates the following joint distribution:

F (x_{1}, x_{2}) = C_{θ} (u_{1} = F_{1} (x_{1}), u_{2} = F_{2} (x_{2})),

where $C_{θ}$ stands for the copula considered (BN or FGM) with the association parameter θ. To generate sample from this distribution, we have to first sample the vector $\underline{u} = (u_{1}, u_{2})$ and then, using the inverses $F_{1}^{- 1} (.)$ and $F_{1}^{- 1} (.),$ obtain $\underline{x} = (x_{1}, x_{2}) = (F_{1}^{- 1} (u_{1}), F_{2}^{- 1} (u_{2}))$ . In Appendix A.1.4 of [26], the authors present an example of a method that may be used to simulate discrete dependent Poisson variables using a specific copula. For illustrative purposes, we have provided a scenario of simulating discrete dependent Poisson variables with means $m_{1}$ and $m_{2}$ from a bivariate Normal copula in the Appendix. For further details on discrete copulas characterization and simulation see [18,19,25] and the references cited therein. An interesting review of multivariate distributions for count data derived from the Poisson distribution can be found in [11]. For the simulations here, Mathematica software is used to implement the methodology.

In our simulation study, for the BN copula, the correlation parameter ρ varied between $- 0.9$ to $0.2,$ and for the FGM copula, the parameter α varied also from $- 0.9$ to 0.2. The MLEs for $λ_{1}$ , $λ_{2}$ and $λ_{3}$ were obtained using the procedure described in Subsection 4.1, using the expression in (13). To test the fit of the BPD distribution to the data generated from the BN and FGM copulas, we consider the $χ^{2}$ goodness-of-fit test for bivariate discrete distributions following the suggestions in [16], namely we considered the ordered expected-frequencies procedure described in Section 3 of the same reference. From Tables 1 to 4, we may observe that the BPD distribution may be a useful tool to model bivariate data, which may be characterized by a BN or FGM copula with a small negative correlation in the range $(- 0.2, 0) .$ When the negative correlation is weaker, (smaller than $- 0.2$ ) the null hypothesis, which states that the population distribution is BPD, is rejected.

Table 1. Analysis of the fit of the BPD distribution to the data generated from a Binormal copula, with correlation ρ, of two Poisson distributions with parameters $m_{1} = 2$ and $m_{2} = 3$ .

		Correlation
ρ	${{\hat{λ}}_{1}, {\hat{λ}}_{2}, {\hat{λ}}_{3}}$	BPD	Samp.	p-value
$- 0.9$	${3.97, 4.75, 0.76}$	−0.54	−0.82	≈ 0.00
$- 0.8$	${3.81, 4.61, 0.78}$	−0.51	−0.72	≈ 0.00
$- 0.7$	${3.63, 4.46, 0.79}$	−0.48	−0.64	≈ 0.00
$- 0.6$	${3.51, 4.36, 0.81}$	−0.45	−0.58	≈ 0.00
$- 0.5$	${3.22, 4.14, 0.84}$	−0.39	−0.46	≈ 0.00
$- 0.4$	${3.04, 3.96, 0.86}$	−0.34	−0.38	≈ 0.00
$- 0.3$	${2.76, 3.69, 0.89}$	−0.26	−0.28	≈ 0.00
$- 0.2$	${2.47, 3.49, 0.93}$	−0.18	−0.19	0.21
$- 0.1$	${2.24, 3.25, 0.96}$	−0.10	−0.10	0.76
0.1	${2.03, 3.02, 1.00}$	0.00	0.10	≈ 0.00
0.2	${4.94, 10.13, 1.0}$	0.00	0.19	≈ 0.00

Open in a new tab

Table 4. Analysis of the fit of the BPD distribution to the data generated from a FGM copula, with correlation ρ, of two Poisson distributions with parameters $m_{1} = 5$ and $m_{2} = 10$ .

		Correlation
ρ	${{\hat{λ}}_{1}, {\hat{λ}}_{2}, {\hat{λ}}_{3}}$	BPD	Samp.	p-value
$- 0.9$	${7.31, 12.10, 0.96}$	−0.26	−0.29	≈ 0.00
$- 0.8$	${7.18, 11.93, 0.96}$	−0.25	−0.27	≈ 0.00
$- 0.7$	${6.73, 11.60, 0.97}$	−0.21	−0.23	≈ 0.00
$- 0.6$	${6.58, 11.48, 0.97}$	−0.19	−0.20	0.02
$- 0.5$	${6.06, 11.03, 0.98}$	−0.14	−0.14	0.02
$- 0.4$	${6.02, 11.01, 0.98}$	−0.13	−0.14	0.38
$- 0.3$	${5.64, 10.58, 0.99}$	−0.09	−0.09	0.26
$- 0.2$	${5.58, 10.52, 0.99}$	−0.08	−0.08	0.34
$- 0.1$	${5.38, 10.35, 0.99}$	−0.05	−0.05	0.48
0.1	${5.01, 9.93, 1.00}$	0.00	0.04	0.13
0.2	${4.98, 9.96, 1.00}$	0.00	0.06	0.12

Open in a new tab

Table 2. Analysis of the fit of the BPD distribution to the data generated from a Binormal copula, with correlation ρ, of two Poisson distributions with parameters $m_{1} = 5$ and $m_{2} = 10$ .

		Correlation
ρ	${{\hat{λ}}_{1}, {\hat{λ}}_{2}, {\hat{λ}}_{3}}$	BPD	Samp.	p-value
$- 0.9$	${11.3, 15.04, 0.92}$	−0.57	−0.87	≈ 0.00
$- 0.8$	${10.8, 14.69, 0.92}$	−0.54	−0.78	≈ 0.00
$- 0.7$	${10.1, 14.15, 0.93}$	−0.49	−0.67	≈ 0.00
$- 0.6$	${9.63, 13.85, 0.93}$	−0.46	−0.58	≈ 0.00
$- 0.5$	${8.91, 13.36, 0.94}$	−0.41	−0.49	≈ 0.00
$- 0.4$	${8.12, 12.77, 0.95}$	−0.35	−0.40	≈ 0.00
$- 0.3$	${7.38, 12.16, 0.96}$	−0.28	−0.30	≈ 0.00
$- 0.2$	${6.71, 11.57, 0.97}$	−0.21	−0.21	0.44
$- 0.1$	${5.82, 10.80, 0.98}$	−0.11	−0.11	0.97
0.1	${5.00, 9.96, 1.00}$	0.00	0.09	≈ 0.00
0.2	${5.01, 9.97, 1.00}$	0.00	0.20	≈ 0.00

Open in a new tab

Table 3. Analysis of the fit of the BPD distribution to the data generated from a FGM copula, with correlation ρ, of two Poisson distributions with parameters $m_{1} = 2$ and $m_{2} = 3$ .

		Correlation
ρ	${{\hat{λ}}_{1}, {\hat{λ}}_{2}, {\hat{λ}}_{3}}$	BPD	Samp.	p-value
$- 0.9$	${2.69, 3.68, 0.90}$	−0.25	−0.26	≈ 0.00
$- 0.8$	${2.65, 3.59, 0.91}$	−0.23	−0.24	≈ 0.00
$- 0.7$	${2.57, 3.52, 0.92}$	−0.20	−0.20	0.02
$- 0.6$	${2.43, 3.44, 0.93}$	−0.17	−0.17	≈ 0.00
$- 0.5$	${2.36, 3.34, 0.95}$	−0.13	−0.14	0.07
$- 0.4$	${2.31, 3.30, 0.95}$	−0.12	−0.12	0.16
$- 0.3$	${2.23, 3.19, 0.96}$	−0.09	−0.09	0.05
$- 0.2$	${2.16, 3.15, 0.97}$	−0.06	−0.06	0.65
$- 0.1$	${2.09, 3.10, 0.99}$	−0.03	−0.03	0.58
$0.1$	${2.02, 3.01, 1.00}$	0.00	0.05	≈ 0.00
$0.2$	${2.00, 3.02, 1.00}$	0.00	0.07	≈ 0.00

Open in a new tab

5. Real data applications

5.1. English premier football league data

Here, we consider data from the English Premier Football League [available in the web page – https://datahub.io/sports-data/english-premier-league#data –] for five(5) seasons,2014/15, 2015/16, 2016/17, 2017/18 and 2018/19. Each season giving us 380 pairs of counts, namely the goals scored in each game in the form (Full Time Home Team Goal, Full time Away Team Goal) with negative correlations. We conjecture that the BPD distribution proposed in this paper will provide a reasonably good fit to model this data. In Table 5, we present a descriptive summary of the data from each of the five (5) seasons, and provide plots of the observed histograms for these bivariate data in Figure 7 for each of the 5 seasons, respectively. As expected, pairwise negative correlations can be observed for each of the 5 seasons.

Table 5. Descriptive analysis of the data of goals scored, (Home team, Away team), for the seasons 2014/15,2015/16, 2016/17, 2017/18 and 2018/19 of the English Premier Football League.

Seasons		Min	$q_{0.25}$	Median $= q_{0.5}$	Mean	$q_{0.75}$	Max	Correlation
2014/15	Home team	0	1.0	1.0	1.5	2.0	8.0
								−0.030
	Away team	0	0	1.0	1.1	2.0	6.0
2015/16	Home team	0	1.0	1.0	1.5	2.0	6.0
								−0.022
	Away team	0	0	1.0	1.2	2.0	6.0
2016/17	Home team	0	1.0	1.0	1.6	2.0	6.0
								−0.132
	Away team	0	0	1.0	1.2	2.0	7.0
2017/18	Home team	0	1.0	1.0	1.5	2.0	7.0
								−0.130
	Away team	0	0	1.0	1.1	2.0	6.0
2018/19	Home team	0	1.0	1.0	1.6	2.0	6.0
								−0.178
	Away team	0	0	1.0	1.3	2.0	6.0

Open in a new tab

Figure 7. — Histograms for the bivariate data of goals scored, (*Home team*, *Away team*), for the seasons $2014 / 15, 2015 / 16, 2016 / 17, 2017 / 18$ and 2018/19 for the English Premier Football League.

The results from the data fitting of the BPD distribution to the data for the seasons 2014/15, 2015/16, 2016/17,2017/18 and 2018/19 is presented in the Table 6 reveal that, with the exception of one season namely, $2017 / 18,$ the BPD distribution provides an adequate fit to the bivariate pair data arising from the rest of the for seasons.

Table 6. Analysis of the fit of the BPD distribution to the data of goals scored, (Home team, Away team), for the seasons $2014 / 15, 2015 / 16, 2016 / 17, 2017 / 18$ and 2018/19 for the English Premier Football League.

		Correlation
Season	${{\hat{λ}}_{1}, {\hat{λ}}_{2}, {\hat{λ}}_{3}}$	BPD	data	p-value
2014/15	${1.52, 1.13, 0.97}$	−0.03	−0.03	0.36
2015/16	${1.52, 1.24, 0.99}$	−0.02	−0.02	0.05
2016/17	${1.82, 1.43, 0.89}$	−0.15	−0.13	0.27
2017/18	${1.75, 1.37, 0.89}$	−0.15	−0.13	0.0009
2018/19	${1.86, 1.55, 0.86}$	−0.19	−0.18	0.10

Open in a new tab

5.2. Data of counts of surface and interior faults in 100 lenses

Aitchison and Ho [2] have analyzed a bivariate count data (Table I from [2])that originated as counts of surface faults (X) and interior faults (Y) of lenses. The same data (Table VI from [15]) was also studied by [15]. In Table 2 of [2] and Table VII of [15], the both the authors considered five different bivariate distributions to model data of counts of surface and interior faults in 100 lenses. The results are compared based on the log-likelihood, AIC, and BIC criteria. In this subsection, we have re-analyzed this dataset since it is negatively correlated, more precisely $ρ = - 0.175$ . In Figure 8, we present the histogram of the data, and in Table 7, we present the observed values of the log-likelihood, AIC and BIC obtained when this data set is fitted with a BPD distribution proposed in this paper. Comparing our findings with those in [2] and [15], it is observed that the BPD distribution in (1) presents better results, in terms of the AIC and BIC measures than McKendrick–Wicksell bivariate Poisson distribution [2], Bivariate Poisson distribution proposed by Holgate [10] and bivariate geometric [15]. It is important here to note that the other distributions compared in the said papers which apparently provide a better fit than the BPD all have four or more parameters and hence more parsimony as expected.

Table 7. AIC and BIC values for BPD distribution fit of the data of counts of surface and interior faults in 100 lenses in [2,15].

${{\hat{λ}}_{1}, {\hat{λ}}_{2}, {\hat{λ}}_{3}}$	log-likelihood	AIC	BIC
${2.69, 3.68, 0.90}$	−445.0	896.0	903.8

Open in a new tab

Figure 8. — Data of counts of surface and interior faults in 100 lenses.

5.3. Seeds and plants grown data

As a last example of application, we consider the data about seeds and plants grown in [22] and also in Table 1 in [14]. The data set reports the number of seeds and plants grown over a plot of size five square feet. The sample correlation is negative and equal $- 0.094,$ thus this is the kind of scenario in which the proposed BPD distribution may be useful to model the data. In Figure 9 we present the histogram of the data and in Table 8, we present the estimated values of $λ_{1}$ , $λ_{2}$ and $λ_{3}$ , together with the correlation measures and with the p-value of $χ^{2}$ goodness-of-fit test for the BPD distribution.

Table 8. Analysis of the fit of the BPD distribution to the data of seeds and plants grown.

	Correlation
${{\hat{λ}}_{1}, {\hat{λ}}_{2}, {\hat{λ}}_{3}}$	BPD	data	p-value
${1.85, 2.17, 0.96}$	−0.08	−0.09	0.97

Open in a new tab

Figure 9. — Histogram of the data of seeds and plants grown.

One may observe that the sample and population correlations are similar and that the p-value of the $χ^{2}$ goodness-of-fit test for the BPD distribution is close to 1, indicating an excellent fit of the BPD distribution to the data. In [14], the authors also perform the goodness-of-fit test for a bivariate Poisson distribution and also in the case of independence. However, since in the bivariate case, there are different ways of grouping the classes we will not compare our results with the ones in [14].

6. Conclusion

The statistical analysis of bivariate count data has proved to be challenging, because of the lack of a parametric class of distributions supporting a rich- enough correlation structure. Application of bivariate Poisson distribution as a tool to model bivariate count data is not new in the literature (see [2] and the references cited there in), but, several such earlier proposed models has the drawback of its inability to model data sets exhibiting negative correlation. For example, the bivariate Poisson distribution given by [5] and many other authors later on exhibit only a positive correlation by the very nature of the genesis of the distribution. However, in the present paper, we have considered a special type of bivariate Poisson distribution described by [3] in which the construction of a bivariate distribution starts with two given conditionals which belongs to the same family (in our case Poisson) with appropriate dependence parameters. We have established that under certain conditions of the parameters, this model will exhibit negative correlation, in fact always as suggested by [3]. That is one major objective for this study among others. We have also fitted the BPD distribution to seven data sets exhibiting negative correlation with a satisfactory level of performance as can be verified by the associated p-values. The other striking feature of this BPD distribution in (1) is that it is free from the limitation that the derivation of any structural properties including the derivation of marginal p.m.f.'s that involves evaluation of incomplete gamma function unlike other bivariate Poisson distributions that exist in the literature. It is important to see how the model in (1) can be generalized to the multivariate case. A substantive amount of work is required in this direction to see the applicability of such models in practice. However, extension to the multivariate version of this discrete distribution should be motivated from a real-world perspective.

Acknowledgements

We express our sincere thanks to the editor and an anonymous reviewer for making several useful suggestion on an earlier version of this manuscript which led to this improved one.

Appendix.

Here, we provide for illustrative purposes, as to how we can simulate random samples from a bivariate discrete distribution via a bivariate dependent copula, namely from a bivariate normal (Gaussian) copula. The bivariate normal (or Gaussian) copula with dependence parameter ρ is defined by (for details, see Section 2.3 of [17]):

C (u, v; ρ) = Φ_{2} (Φ^{- 1} (u), Φ^{- 1} (v); ρ),

(A1)

where

φ_{2} (x, y; ρ) = \frac{1}{2 π \sqrt{1 - ρ^{2}}} \exp (- \frac{x^{2} + y^{2} - 2 ρ x y}{2 (1 - ρ^{2})}), Φ_{2} (h, k; ρ) = \int_{- \infty}^{h} \int_{- \infty}^{k} φ_{2} (x, y; ρ) d y d x,

are the density and distribution function of the bivariate standard normal distribution with correlation parameter $ρ \in [- 1, 1] .$ We focus on simulating discrete Poisson variables using a method based on Devroye's technique of sequential search [7]. The algorithm is follows:

First, we draw correlated uniform random variables $(u, v)$ from the BVN copula in (A1) using a well known standard procedure given as follows:
1. Generate two independently distributed $N (0, 1)$ variables, say u and v.
2. Set $y_{1} = u,$ and $y_{2} = u \times ρ + v \sqrt{1 - ρ^{2}} .$
3. Set $u = Φ (y_{1}),$ and $v = Φ (y_{2}),$ where Φ is the cumulative distribution function of the standard normal distribution. Then, the pair $(u, v)$ are uniformly distributed variables drawn from the Gaussian copula $C (u, v; ρ) .$
Next, set the Poisson mean $m_{1}$ such that $P (Y_{1} = 0) = \exp (- m_{1}) .$
Set $Y_{1} = 0,$ $A_{0} = \exp (- m_{1}),$ and $W = A_{0} .$
If $u < W,$ then $Y_{1}$ remains equal to 0.
If $u > W,$ then proceed sequentially as follows: While $u > W,$ replace–(a) $Y_{1} \leftarrow Y_{1} + 1,$ (b) $A_{0} \leftarrow m_{1} \frac{A_{0}}{Y_{1}},$ (c) $W \leftarrow W + A_{0} .$ Continue this process until $u < W .$

These steps produce a simulated variable $Y_{1}$ with Poisson distribution with mean $m_{1} .$ To obtain draws of the second Poisson variable $Y_{2},$ (say) replace u and $m_{1}$ with v and $m_{2}$ and repeat the steps above. Consequently, the pair $(Y_{1}, Y_{2})$ are jointly distributed Poisson variables with means $m_{1}$ and $m_{2}$ .

Disclosure statement

No potential conflict of interest was reported by the authors.

References

1.Adamidis K. and Loukas S., Estimation in a truncated bivariate Poisson distribution using the EM algorithm, Commun. Stat. – Theory Meth. 25 (1996), pp. 2215–2222. doi: 10.1080/03610929608831833 [DOI] [Google Scholar]
2.Aitchison J. and Ho C.H, The multivariate Poisson-log normal distribution, Biometrika 76 (1989), pp. 643–653. doi: 10.1093/biomet/76.4.643 [DOI] [Google Scholar]
3.Arnold B.C., Castillo E., and Sarabia J.M., Conditional Specification of Statistical Models, Springer, New York, 1999. [Google Scholar]
4.Barbiero A, Inference on reliability of stress-Strength models for Poisson data, J. Quality and Reliab. Eng. 8 (2013), pp. 1–9. Available at http://dx.doi.org/ 10.1155/2013/530530. [DOI] [Google Scholar]
5.Campbell J.T, The Poisson correlation function, Proc. Edinburgh Math. Soc. (Series 2) 4 (1938), pp. 18–26. doi: 10.1017/S0013091500024135 [DOI] [Google Scholar]
6.Dahiya R.C., Estimation in a truncated bivariate Poisson distribution, Commun. Stat.-Theory and Meth. 6 (1977), pp. 113–120. doi: 10.1080/03610927708827476 [DOI] [Google Scholar]
7.Devroye L., Sample-based non-uniform random variate generation. Proceedings of the 18th conference on Winter simulation, (1986, December), pp. 260–265,
8.Hamdan M.A., Estimation in the truncated bivariate Poisson distribution, Technometrics 14 (1972), pp. 37–45. doi: 10.1080/00401706.1972.10488881 [DOI] [Google Scholar]
9.Johnson N.L., Kotz S., and Balakrishnan N., Discrete Multivariate Distributions., John Wiley & Sons, New York, 1997. [Google Scholar]
10.Holgate P, Estimation for the bivariate Poisson distribution, Biometrica 51 (1964), pp. 241–287. doi: 10.1093/biomet/51.1-2.241 [DOI] [Google Scholar]
11.Inouye D.I., Yang E., Allen G.I., and Ravikumar P., A review of multivariate distributions for count data derived from the Poisson distribution, WIREs Comput. Stat. 9 (2017), pp. e1398. doi: 10.1002/wics.1398 [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Kocherlakota S. and Kocherlakota K., Bivariate Discrete Distributions, New York, Marcel Dekker, 1992. [Google Scholar]
13.Kocherlakota S. and Kocherlakota K., Regression in the bivariate Poisson distribution, Commun. Stat. - Theory Meth. 30 (2001), pp. 815–825. doi: 10.1081/STA-100002259. [DOI] [Google Scholar]
14.Lakshminarayana J., Pandit S.N.N., and Srinivasa Rao K., On a bivariate Poisson distribution, Commun. Stat.–Theory Meth. 28 (1999), pp. 267–276. doi: 10.1080/03610929908832297 [DOI] [Google Scholar]
15.Lee H., Cha J.H., and Pulcini G, Modeling discrete bivariate data with applications to failure and count data, Quality Reliab. Eng. Int. 33 (2017), pp. 1455–1473. doi: 10.1002/qre.2118 [DOI] [Google Scholar]
16.Loukas S. and Kemp C., On the Chi-square goodness-of-fit statistic for bivariate discrete distributions, J. Royal Stat. Soc. Ser. D (The Statistician) 35 (1986), pp. 525–529. [Google Scholar]
17.Nelsen R.B, An Introduction to Copulas, 2nd ed. Springer, New York, 2006. [Google Scholar]
18.Nikoloulopoulos A.K., Copula-based models for multivariate discrete response data, in Copulae in Mathematical and Quantitative Finance, P. Jaworski, F. Durante, and W. Hardle, eds, 2013a, pp. 231–249,
19.Nikoloulopoulos A.K., On the estimation of normal copula discrete regression models using the continuous extension and simulated likelihood, in Copulae in Mathematical and Quantitative Finance, P. Jaworski, F. Durante, and W. Hardle, eds., 231–249. A. K. Nikoloulopoulos, J. Stat. Planning and Inference, 143(11): (2013a), pp. 1923–1937,
20.Obrechkoff N, Theory of Probability, Nauka i Izkustvo, Sofia, 1963. [Google Scholar]
21.Paul S.R. and Ho N.I., Estimation in the bivariate Poisson distribution and hypothesis testing concerning independence, Commun. Stat.-Theory and Meth. 18 (1989), pp. 1123–1133. doi: 10.1080/03610928908829955 [DOI] [Google Scholar]
22.Rao S., Experimental studies on the yield of groundnuts in coastal region. Technical Report, Andhra University, Visakhapatnam, 1990.
23.Short M., Improved inequalities for the Poisson and binomial distribution and upper tail quantile functions. ISRN Probability and Statistics, 2013, 2013.
24.Teicher H, On the multivariate Poisson distribution, Scand. Actuar. J. 1 (1954), pp. 1–9. doi: 10.1080/03461238.1954.10414190 [DOI] [Google Scholar]
25.Trivedi P.K. and Zimmer D.M., A note on identification of bivariate copulas for discrete count data, Econometrics 5 (2017), pp. 10. doi: 10.3390/econometrics5010010 [DOI] [Google Scholar]
26.Trivedi P.K. and Zimmer D.M, Copula Modeling: An Introduction for Practitioners, Foundations and Trends(R) in Econometrics, now publishers, Vol. 1, 2007. pp. 1–111.
27.Wesolowski J., A new conditional specification of the bivariate Poisson conditionals distribution. Technical Report, Mathematical Institute, Warsaw University of Technology, Warsaw, Poland, 1994. [PubMed]

[CIT0001] 1.Adamidis K. and Loukas S., Estimation in a truncated bivariate Poisson distribution using the EM algorithm, Commun. Stat. – Theory Meth. 25 (1996), pp. 2215–2222. doi: 10.1080/03610929608831833 [DOI] [Google Scholar]

[CIT0002] 2.Aitchison J. and Ho C.H, The multivariate Poisson-log normal distribution, Biometrika 76 (1989), pp. 643–653. doi: 10.1093/biomet/76.4.643 [DOI] [Google Scholar]

[CIT0003] 3.Arnold B.C., Castillo E., and Sarabia J.M., Conditional Specification of Statistical Models, Springer, New York, 1999. [Google Scholar]

[CIT0004] 4.Barbiero A, Inference on reliability of stress-Strength models for Poisson data, J. Quality and Reliab. Eng. 8 (2013), pp. 1–9. Available at http://dx.doi.org/ 10.1155/2013/530530. [DOI] [Google Scholar]

[CIT0005] 5.Campbell J.T, The Poisson correlation function, Proc. Edinburgh Math. Soc. (Series 2) 4 (1938), pp. 18–26. doi: 10.1017/S0013091500024135 [DOI] [Google Scholar]

[CIT0006] 6.Dahiya R.C., Estimation in a truncated bivariate Poisson distribution, Commun. Stat.-Theory and Meth. 6 (1977), pp. 113–120. doi: 10.1080/03610927708827476 [DOI] [Google Scholar]

[CIT0007] 7.Devroye L., Sample-based non-uniform random variate generation. Proceedings of the 18th conference on Winter simulation, (1986, December), pp. 260–265,

[CIT0008] 8.Hamdan M.A., Estimation in the truncated bivariate Poisson distribution, Technometrics 14 (1972), pp. 37–45. doi: 10.1080/00401706.1972.10488881 [DOI] [Google Scholar]

[CIT0009] 9.Johnson N.L., Kotz S., and Balakrishnan N., Discrete Multivariate Distributions., John Wiley & Sons, New York, 1997. [Google Scholar]

[CIT0010] 10.Holgate P, Estimation for the bivariate Poisson distribution, Biometrica 51 (1964), pp. 241–287. doi: 10.1093/biomet/51.1-2.241 [DOI] [Google Scholar]

[CIT0011] 11.Inouye D.I., Yang E., Allen G.I., and Ravikumar P., A review of multivariate distributions for count data derived from the Poisson distribution, WIREs Comput. Stat. 9 (2017), pp. e1398. doi: 10.1002/wics.1398 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0012] 12.Kocherlakota S. and Kocherlakota K., Bivariate Discrete Distributions, New York, Marcel Dekker, 1992. [Google Scholar]

[CIT0013] 13.Kocherlakota S. and Kocherlakota K., Regression in the bivariate Poisson distribution, Commun. Stat. - Theory Meth. 30 (2001), pp. 815–825. doi: 10.1081/STA-100002259. [DOI] [Google Scholar]

[CIT0014] 14.Lakshminarayana J., Pandit S.N.N., and Srinivasa Rao K., On a bivariate Poisson distribution, Commun. Stat.–Theory Meth. 28 (1999), pp. 267–276. doi: 10.1080/03610929908832297 [DOI] [Google Scholar]

[CIT0015] 15.Lee H., Cha J.H., and Pulcini G, Modeling discrete bivariate data with applications to failure and count data, Quality Reliab. Eng. Int. 33 (2017), pp. 1455–1473. doi: 10.1002/qre.2118 [DOI] [Google Scholar]

[CIT0016] 16.Loukas S. and Kemp C., On the Chi-square goodness-of-fit statistic for bivariate discrete distributions, J. Royal Stat. Soc. Ser. D (The Statistician) 35 (1986), pp. 525–529. [Google Scholar]

[CIT0017] 17.Nelsen R.B, An Introduction to Copulas, 2nd ed. Springer, New York, 2006. [Google Scholar]

[CIT0018] 18.Nikoloulopoulos A.K., Copula-based models for multivariate discrete response data, in Copulae in Mathematical and Quantitative Finance, P. Jaworski, F. Durante, and W. Hardle, eds, 2013a, pp. 231–249,

[CIT0019] 19.Nikoloulopoulos A.K., On the estimation of normal copula discrete regression models using the continuous extension and simulated likelihood, in Copulae in Mathematical and Quantitative Finance, P. Jaworski, F. Durante, and W. Hardle, eds., 231–249. A. K. Nikoloulopoulos, J. Stat. Planning and Inference, 143(11): (2013a), pp. 1923–1937,

[CIT0020] 20.Obrechkoff N, Theory of Probability, Nauka i Izkustvo, Sofia, 1963. [Google Scholar]

[CIT0021] 21.Paul S.R. and Ho N.I., Estimation in the bivariate Poisson distribution and hypothesis testing concerning independence, Commun. Stat.-Theory and Meth. 18 (1989), pp. 1123–1133. doi: 10.1080/03610928908829955 [DOI] [Google Scholar]

[CIT0022] 22.Rao S., Experimental studies on the yield of groundnuts in coastal region. Technical Report, Andhra University, Visakhapatnam, 1990.

[CIT0023] 23.Short M., Improved inequalities for the Poisson and binomial distribution and upper tail quantile functions. ISRN Probability and Statistics, 2013, 2013.

[CIT0024] 24.Teicher H, On the multivariate Poisson distribution, Scand. Actuar. J. 1 (1954), pp. 1–9. doi: 10.1080/03461238.1954.10414190 [DOI] [Google Scholar]

[CIT0025] 25.Trivedi P.K. and Zimmer D.M., A note on identification of bivariate copulas for discrete count data, Econometrics 5 (2017), pp. 10. doi: 10.3390/econometrics5010010 [DOI] [Google Scholar]

[CIT0026] 26.Trivedi P.K. and Zimmer D.M, Copula Modeling: An Introduction for Practitioners, Foundations and Trends(R) in Econometrics, now publishers, Vol. 1, 2007. pp. 1–111.

[CIT0027] 27.Wesolowski J., A new conditional specification of the bivariate Poisson conditionals distribution. Technical Report, Mathematical Institute, Warsaw University of Technology, Warsaw, Poland, 1994. [PubMed]

PERMALINK

A new bivariate Poisson distribution via conditional specification: properties and applications

Indranil Ghosh

Filipe Marques

Subrata Chakraborty

ABSTRACT

1. Introduction

2. Bivariate dependent Poisson distribution via conditional specification

Figure 2.

Figure 5.

Figure 3.

Figure 4.

Figure 1.

Figure 6.

3. Structural properties

Theorem 3.1

Proof.

Corollary 3.1

Lemma 3.1

Proof.

Theorem 3.2

Proof.

Theorem 3.3

Proof.

Corollary 3.2

Theorem 3.4

Proof.

Theorem 3.5

Proof.

Theorem 3.6

Proof.

Theorem 3.7

Proof.

Theorem 3.8

Proof.

Theorem 3.9

Proof.

4. Statistical inference

4.1. Maximum likelihood estimation

4.2. Likelihood ratio test for independence of the random variables

4.3. Copula-based simulations

Table 1. Analysis of the fit of the BPD distribution to the data generated from a Binormal copula, with correlation ρ, of two Poisson distributions with parameters m1=2 and m2=3.

Table 4. Analysis of the fit of the BPD distribution to the data generated from a FGM copula, with correlation ρ, of two Poisson distributions with parameters m1=5 and m2=10.

Table 2. Analysis of the fit of the BPD distribution to the data generated from a Binormal copula, with correlation ρ, of two Poisson distributions with parameters m1=5 and m2=10.

Table 3. Analysis of the fit of the BPD distribution to the data generated from a FGM copula, with correlation ρ, of two Poisson distributions with parameters m1=2 and m2=3.

5. Real data applications

5.1. English premier football league data

Table 5. Descriptive analysis of the data of goals scored, (Home team, Away team), for the seasons 2014/15,2015/16, 2016/17, 2017/18 and 2018/19 of the English Premier Football League.

Figure 7.

Table 6. Analysis of the fit of the BPD distribution to the data of goals scored, (Home team, Away team), for the seasons 2014/15,2015/16,2016/17,2017/18 and 2018/19 for the English Premier Football League.

5.2. Data of counts of surface and interior faults in 100 lenses

Table 7. AIC and BIC values for BPD distribution fit of the data of counts of surface and interior faults in 100 lenses in [2,15].

Figure 8.

5.3. Seeds and plants grown data

Table 8. Analysis of the fit of the BPD distribution to the data of seeds and plants grown.

Figure 9.

6. Conclusion

Acknowledgements

Appendix.

Disclosure statement

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Table 1. Analysis of the fit of the BPD distribution to the data generated from a Binormal copula, with correlation ρ, of two Poisson distributions with parameters $m_{1} = 2$ and $m_{2} = 3$ .

Table 4. Analysis of the fit of the BPD distribution to the data generated from a FGM copula, with correlation ρ, of two Poisson distributions with parameters $m_{1} = 5$ and $m_{2} = 10$ .

Table 2. Analysis of the fit of the BPD distribution to the data generated from a Binormal copula, with correlation ρ, of two Poisson distributions with parameters $m_{1} = 5$ and $m_{2} = 10$ .

Table 3. Analysis of the fit of the BPD distribution to the data generated from a FGM copula, with correlation ρ, of two Poisson distributions with parameters $m_{1} = 2$ and $m_{2} = 3$ .

Table 6. Analysis of the fit of the BPD distribution to the data of goals scored, (Home team, Away team), for the seasons $2014 / 15, 2015 / 16, 2016 / 17, 2017 / 18$ and 2018/19 for the English Premier Football League.