A model for bimodal rates and proportions

Roberto Vila; Lucas Alfaia; André FB Menezes; Mehmet N Çankaya; Marcelo Bourguignon

doi:10.1080/02664763.2022.2146661

. 2022 Nov 18;51(4):664–681. doi: 10.1080/02664763.2022.2146661

A model for bimodal rates and proportions

Roberto Vila ^a,^CONTACT, Lucas Alfaia ^a, André FB Menezes ^b, Mehmet N Çankaya ^c,^d, Marcelo Bourguignon ^e

PMCID: PMC10929684 PMID: 38476621

Abstract

The beta model is the most important distribution for fitting data with the unit interval. However, the beta distribution is not suitable to model bimodal unit interval data. In this paper, we propose a bimodal beta distribution constructed by using an approach based on the alpha-skew-normal model. We discuss several properties of this distribution, such as bimodality, real moments, entropies and identifiability. Furthermore, we propose a new regression model based on the proposed model and discuss residuals. Estimation is performed by maximum likelihood. A Monte Carlo experiment is conducted to evaluate the performances of these estimators in finite samples with a discussion of the results. An application is provided to show the modelling competence of the proposed distribution when the data sets show bimodality.

Keywords: Bimodal model, bimodality, bounded data, beta distribution, maximum likelihood, regression model

1. Introduction

The need for modelling and analysing the bimodal bounded data, especially for data on the unit interval, occurs in many fields of real life, such as bioinformatics [12], image classification [16], transaction at a car dealership [26] and so on. In such situations, in order to apply probabilistic modelling for these phenomena, under a parametric paradigm, probability distributions limited to $(0, 1)$ are indispensable. The unimodal beta model is the most widely used model in the literature to describe data in the unit interval, especially because of its flexibility and fruitful properties [13]. However, despite its broad sense applicability in many fields, the beta distribution is not suitable to model bimodal data on the unit interval.

In general, one uses mixtures of distributions to describe the bimodal data. For example, the studies [26] and [25] consider finite mixtures of beta regression models to analyze the priming effects in judgements of imprecise probabilities. However, in general, mixtures of distributions may suffer from identifiability problems in the parameter estimation; see Refs. [14,15]. Thus, new mixture-free models which have the capacity to accommodate both unimodal and bimodal data are very important. The nature of phenomena can show bimodality due to many reasons, such as economical policies, uncertainty of social movement and its effects on the economy [28,30].

Since the structure of phenomena depends on many factors, it is reasonable to expect that the non-identically distributed data can occur in the data observed by the experimenter. For example, bimodality was introduced by Elal-Olivero [8], Domma et al. [6] and Vila and Çankaya [28] is a necessary probabilistic model to perform an efficient fitting on the non-identically distributed data set or the mixed data set. If we have the mixed data set, the mixed form of Beta and Weibull in Vila and Çankaya [28] should be necessary to model the data set efficiently, because the mixing proportions $π_{1}$ and $π_{2} = (1 - π_{1})$ in the bimodal case cannot be estimated accurately. The analytical expression of the mixed distribution can lead to problem while the optimization of the maximum likelihood estimation method according to parameters from parametric models, such as Beta, Weibull, etc. and the mixing parameters $π_{1}$ and $π_{2}$ of Beta from separate populations Beta₁ and Beta₂ is performed. At least, we can come across the numerical error while performing computation. The original working principle of phenomena can depend on the probabilistic model such as the bimodal beta (Bbeta) or the bimodal Weibull (BWeibull) in Vila and Çankaya [28]. On the other hand, the parameters in the mixed form of two Beta distributions, i.e. Beta₁ and Beta₂, are $α_{1}$ , $α_{2}$ , $β_{1}$ , $β_{2}$ , $π_{1}$ and $π_{2}$ . However, Bbeta distribution includes four parameters which are α, β, ρ and δ. Bbeta has less parameters when compared with the mixed form of Beta. Further, since we have the exact expression for the cumulative distribution function of Bbeta, it is advantageous for us to generate the bimodal artificial data sets, which can be used to check whether or not a data set in the system can be modelled by Bbeta with the estimated parameters. In other words, the results and outputs on the unit interval data can be modelled and tested by using the proposed distribution.

Variations of the beta model can be found in Ferrari and Cribari-Neto [9], Ospina and Ferrari [21], Bayes et al. [4], Hahn [11], among others. However, all the models cited above are not suitable for capturing bimodality. Recently, probabilistic models for modelling bimodality on the positive real line were discussed by various authors. Olmos et al. [20] introduced recently a bimodal extension of the unit-Birnbaum-Saunders distribution. Vila et al. [29] proposed the bimodal gamma distribution. Vila and Çankaya [28] considered a bimodal Weibull distribution. Recently, [17] proposed a family of bimodal distributions generated by distributions with positive support. Despite this, to the best of our knowledge, a specific parametric model to describe bimodality data observed of the unit interval has never been considered in the literature recently. Despite this, to the best of our knowledge, a specific regression bimodal model to unit interval data with a regression structure for the parameters has never been considered in the literature. Martnez-Flrez et al. [18] considered a transformation in a random variable that follows a unit-bimodal Birnbaum-Saunders (UBBS for short) distribution only in the case of identically and independently variables.

Based on the above discussion and motivated by the presence of bimodality in proportion responses, we develop a model for double-bounded response variables. In particular, we extended the usual beta distribution using a quadratic transformation technique used to generate bimodal functions [8,28]. The approach, therefore, appears to be a new development for the literature. We discuss several properties of the proposed model, such as bimodality, real moments, hazard rate, entropies and identifiability. Furthermore, we study the effects of the explanatory variables on the response variable using a regression model.

In what follows, we list some of the main contributions and advantages of the proposed model.

We introduce a new family of distributions that is flexible version of the usual beta distribution so that it is capable of fitting bimodal as well as unimodal data. We provide general properties of the proposed model;
We propose an extended version of the quadratic transformation technique used to generate bimodal functions;

The rest of the article proceeds as follows. In Sections 2 and 3, we present the new distribution and derive some of its properties. Then in Section 4, we present the main properties of the bimodal beta, which include entropies, stochastic representation and identifiability. Section 5 presents the bimodal beta regression model. Also, the estimation method for the model parameters and diagnostic measures are discussed. In Section 6, some numerical results of the estimators and the empirical distribution of the residuals are presented with a discussion of the results. A real-life application related to the proportion of votes that Jair Bolsonaro received in the second turn of Brazilian elections in 2018 is analysed in Section 7. Section 8 summarizes the main findings of the paper.

2. The bimodal beta distribution

In this section, the bimodal beta (Bbeta) distribution is introduced and its density is derived. Moreover, some results on the bimodality properties are obtained. We say that a random variable (r.v.) X has a Bbeta distribution with parameter vector $θ_{δ} = (α, β, ρ, δ)$ , $α > 0, β > 0$ , $ρ ⩾ 0$ and $δ \in R$ , denoted by $X \sim Bbeta (θ_{δ})$ , if its probability density function (PDF) is given by

\begin{aligned} f (x; θ_{δ}) = {\begin{cases} \frac{ρ + (1 - δ x)^{2}}{Z (θ_{δ}) B (α, β)} x^{α - 1} (1 - x)^{β - 1}, & 0 < x < 1, \\ 0, & otherwise, \end{cases} \end{aligned}

(1)

where

Z (θ_{δ}) = 1 + ρ - 2 δ \frac{α}{α + β} + δ^{2} \frac{α (α + 1)}{(α + β) (α + β + 1)}

(2)

denotes the normalization constant and $B (α, β) = \int_{0}^{1} t^{α - 1} (1 - t)^{β - 1} d t$ is the beta function. When $δ = 2$ , $α = β = 1$ and $ρ = 0$ , we have the U-quadratic distribution on $(0, 1)$ . When $δ = 0$ , we obtain the classical beta distribution with parameter vector $θ_{0} = (α, β, ρ, 0) := (α, β)$ . The parameters α, β (which appear as exponents of the r.v.) and ρ control the shape of the distribution. The uni- or bimodality is controlled by the parameter δ. Note that for α, β and $δ \neq 0$ fixed, the parameter ρ also controls the unimodality or bimodality of the distribution; see Subsection 2.1. From Figure 1, we note some different shapes of the Bbeta PDF for different combinations of parameters. Figure 1(a,b) represents L shape and its bimodal form and bell shaped case of beta distribution, respectively.

Unlike Figure 1(b), Figure 2(a,b) shows that, a peak can be major peak and the other one can be minor peak.

The asymptotic behaviour of the PDF (1) is as follows:

\begin{aligned} f (0^{+}; θ_{δ}) = lim_{\begin{matrix} x \to 0, \\ x > 0 \end{matrix}} f (x; θ_{δ}) = {\begin{cases} \frac{β (ρ + 1)}{1 + ρ - \frac{2 δ}{1 + β} + \frac{2 δ^{2}}{(1 + β) (2 + β)}}, & α = 1, \\ 0, & α > 1, \\ + \infty, & α < 1, \end{cases} \end{aligned}

(3)

and

\begin{aligned} f (1^{-}; θ_{δ}) = lim_{\begin{matrix} x \to 1, \\ x < 1 \end{matrix}} f (x; θ_{δ}) = {\begin{cases} \frac{α [ρ + (1 - δ)^{2}]}{1 + ρ - \frac{2 δα}{α + 1} + \frac{δ^{2} α (α + 1)}{(α + 1) (α + 2)}}, & β = 1, \\ 0, & β > 1, \\ + \infty, & β < 1. \end{cases} \end{aligned}

(4)

This asymptotic behaviour of the Bbeta PDF was expected, since the bimodal beta distribution is defined in terms of the classical beta. It is clear that when $δ = 0$ ; $f (0^{+}; θ_{δ}) = β$ for $α = 1$ and $f (1^{-}; θ_{δ}) = α$ for $β = 1$ .

If $X \sim Bbeta (θ_{δ})$ , the cumulative distribution function (CDF) (see Figure 3), the survival function (SF) and the hazard rate function (HR) of X are, respectively, given by

\begin{aligned} F (x; θ_{δ}) = \frac{1}{Z (θ_{δ})} [(1 + ρ) I_{x} (α, β) - 2 δ \frac{B_{x} (α + 1, β)}{B (α, β)} + δ^{2} \frac{B_{x} (α + 2, β)}{B (α, β)}], \end{aligned}

(5)

\begin{aligned} S (x; θ_{δ}) = \frac{1}{Z (θ_{δ})} \sum_{i = 0}^{2} c_{i} [\frac{B (α + i, β)}{B (α, β)} - \frac{B_{x} (α + i, β)}{B (α, β)}] and \end{aligned}

(6)

\begin{aligned} H (x; θ_{δ}) = \frac{[ρ + (1 - δ x)^{2}] x^{α - 1} (1 - x)^{β - 1}}{\sum_{i = 0}^{2} c_{i} [B (α + i, β) - B_{x} (α + i, β)]}, \end{aligned}

(7)

where $I_{x} (α, β) = B_{x} (α, β) / B (α, β)$ is the incomplete beta function ratio, $B_{x} (α, β) = \int_{0}^{x} t^{α - 1} (1 - t)^{β - 1} d t$ is the incomplete beta function, and $c_{0} = 1 + ρ$ , $c_{1} = - 2 δ$ , $c_{2} = δ^{2}$ . For more details on the derivation of these formulas, see Section 3.

Figure 3. — The CDF of the bimodal beta distribution for different values of parameters. Due to bimodality, as seen in the figure, it is natural to expect the CDF graph to present up to three inflection points.

2.1. Bimodality properties

To state the following result that guarantees the uni- or bimodality of the Bbeta distribution, we define the following cubic polynomial:

p_{3} (x) = a_{3} x^{3} + a_{2} x^{2} + a_{1} x + a_{0} = 0,

(8)

where $a_{3} = - δ^{2} (α + β)$ , $a_{2} = δ [α (δ + 2) + 2 β + δ - 2]$ , $a_{1} = - [α (2 δ + ρ + 1) + (β - 2) (ρ + 1)]$ and $a_{0} = (α - 1) (ρ + 1)$ .

Theorem 2.1 Uni- or bimodality —

Let $X \sim Bbeta (θ_{δ})$ such that $α > 1, β > 1$ , $(δ - 1)^{2} + ρ > 0$ and $δ > 0$ .

(i)
If $p_{3} (x)$ has a single positive zero then the Bbeta distribution is unimodal.

(ii)
If $p_{3} (x)$ has exactly three zeros in $(0, 1)$ then the Bbeta distribution is bimodal.

Proof.

A simple computation shows that

$f^{'} (x; θ_{δ}) = \frac{x^{α - 2} (1 - x)^{β - 2}}{Z (θ_{δ}) B (α, β)} p_{3} (x),$ (9)

where $p_{3} (x)$ is given in (8). Under the conditions stated in the theorem, we have $a_{3} < 0$ , $a_{2} > 0$ , $a_{1} < 0$ and $a_{0} > 0$ . By definition, the boundary points are never critical points, then we exclude the analysis at these points.

Since $p_{3} (0) = a_{0} > 0$ and $p_{3} (1) = a_{3} + a_{2} + a_{1} + a_{0} = (1 - β) [(δ - 1)^{2} + ρ] < 0$ because $β > 1$ and $(δ - 1)^{2} + ρ > 0$ , the Intermediate Value Theorem guarantees that there is at least one root in the interval $(0, 1)$ . Further, by Descartes rule of signs (see, e.g. Refs. [31] and [10]), $p_{3} (x)$ has one or three roots in the interval $(0, 1)$ .

Assume that $p_{3} (x)$ has a single zero. In this case, $f (x; θ_{δ})$ has a single critical point, denoted by $x_{0}$ . Since, for $α > 1$ and $β > 1$ , $f (0^{+}; θ_{δ}) = 0$ and $f (1^{-}; θ_{δ}) = 0$ , see limits in (3) and (4); it follows that $f (x; θ_{δ})$ increases on $(0, x_{0})$ and decreases on $(x_{0}, 1)$ . That is, $x_{0}$ is a global maximum point of $f (x; θ_{δ})$ . This proves Item (i).

On the other hand, if $p_{3} (x)$ has exactly three zeros in $(0, 1)$ then $f (x; θ_{δ})$ has three critical points $x_{1}, x_{2}$ and $x_{3}$ . Without loss of generality, let us assume that $x_{1} < x_{2} < x_{3}$ . Again, since, for $α > 1$ and $β > 1$ , $f (0^{+}; θ_{δ}) = 0$ and $f (1^{-}; θ_{δ}) = 0$ , it follows that $f (x; θ_{δ})$ increases on the intervals $(0, x_{1})$ and $(x_{2}, x_{3})$ , and decreases on $(x_{1}, x_{2})$ and $(x_{3}, 1)$ . In other words, $x_{1}$ and $x_{3}$ are two maximum points and $x_{2}$ is the unique minimum point. Then the statement in Item (ii) follows.

Thus we have completed the proof of the theorem.

Remark 2.1

By considering α, β, δ and ρ as in the Table 1, it is clear that the conditions of Theorem 2.1 are satisfied. Then, depending on the number of roots of $p_{3} (x)$ , Theorem 2.1 guarantees the uni- or bimodality (U- or B) of PDF $f (x; θ_{δ})$ . These results are compatible with Figure 1(b).

Again, using the values of Figure 2(a ,b), it can be verified that the conditions of Theorem 2.1 are satisfied, which allows concluding the bimodality of the PDF $f (x; θ_{δ})$ . This contrasts the shape of PDF shown in Figure 2.

Table 1.

Roots of the polynomial $p_{3} (x)$ and shapes of the PDF bimodal beta using the values of the parameters of Figure 1(b).

α	β	δ	ρ	$a_{3}$	$a_{2}$	$a_{1}$	$a_{0}$	Real roots of $p_{3} (x)$ in $(0, 1)$	Shape
2	2	2	0.25	$- 16$	24	$- 10.5$	1.25	$x = 0.19, x = 0.5, x = 0.81$	B
2	2	2	1.5	$- 16$	24	$- 13$	2.5	x = 0.5	U
2	2	2	0	$- 16$	24	$- 10$	1	$x = 0.15, x = 0.5, x = 0.85$	B
2	2	2	0.5	$- 16$	24	$- 11$	1.5	$x = 0.25, x = 0.5, x = 0.75$	B
2	2	2	2	$- 16$	24	$- 14$	3	x = 0.5	U

Open in a new tab

Theorem 2.2 Bimodality; case $ρ = 0$ —

If $X \sim Bbeta (θ_{δ})$ , $α > 1, β > 1$ , $ρ = 0$ , $δ > 1$ , and

${[δ (α + 1) + α + β - 2]}^{2} > 4 δ (α + β) (α - 1),$ (10)

then the Bbeta distribution is bimodal with maximum points

$\begin{aligned} x_{max, \pm} & = \frac{1}{δ} + \frac{δ (α + 1) - (α + β + 2)}{2 δ (α + β)} \\ \pm \frac{\sqrt{{[δ (α + 1) + α + β - 2]}^{2} - 4 δ (α + β) (α - 1)}}{2 δ (α + β)} \end{aligned}$

and minimum point $x = 1 / δ$ , where $0 < x_{max, -} < x = 1 / δ < x_{max, +} < 1$ .

Proof.

Taking $ρ = 0$ in (9), we have

$f^{'} (x; θ_{δ}) = \frac{x^{α - 2} (1 - x)^{β - 2}}{Z (θ_{δ}) B (α, β)} (1 - δx) {(α + β) δ x^{2} - [δ (α + 1) + α + β - 2] x + (α - 1)} .$

A direct calculus shows that $f^{'} (x; θ_{δ}) = 0$ if and only if (excluding the boundary points) $x = 1 / δ$ and

$x_{\pm} = \frac{δ (α + 1) + α + β - 2 \pm \sqrt{{[δ (α + 1) + α + β - 2]}^{2} - 4 δ (α + β) (α - 1)}}{2 δ (α + β)} = x_{max, \pm} .$

Hence, under condition (10), it follows that the equation $f^{'} (x; θ_{δ}) = 0$ has three roots $x = 1 / δ, x_{-}$ and $x_{+}$ within the interval $(0, 1)$ , where $x_{-} < x = 1 / δ < x_{+}$ .

Since, for $α > 1$ and $β > 1$ , $f (0^{+}; θ_{δ}) = 0$ and $f (1^{-}; θ_{δ}) = 0$ , see limits in (3) and (4); the bimodality of the Bbeta distribution is guaranteed, where $x_{-} = x_{max, -}$ and $x_{+} = x_{max, +}$ are two maximum points and $x = 1 / δ$ is the unique minimum point.

Remark 2.2

Let $α = β = δ = 2$ and $ρ = 0$ . It is clear that the condition (10) is satisfied. Then, by Theorem 2.2, the Bbeta distribution is bimodal with maximum points $x_{max, -} = (2 - \sqrt{2}) / 4 \approx 0.15$ and $x_{max, +} = (2 + \sqrt{2}) / 4 \approx 0.85$ , and minimum point $x = 1 / 2$ ; which is compatible with Figure 1(b).

Proposition 2.3

The Bbeta PDF $f (x; θ_{δ})$ is symmetric at the point $x = 1 / 2$ whenever $α = β$ and $δ = 2$ .

Proof.

A simple algebraic manipulation shows that, if $α = β$ and $δ = 2$ then $f (0.5 - x; θ_{δ}) = f (0.5 + x; θ_{δ})$ , $\forall 0 < x < 1$ . Then the proof follows.

Theorem 2.3

If $X \sim Bbeta (θ_{δ})$ , $α = β > 1$ , $ρ < 1 / (α - 1)$ and $δ = 2$ , then the Bbeta distribution is bimodal with maximum points

$x_{max, \pm}^{*} = \frac{1}{2} \pm \frac{\sqrt{ρ α (1 - α) + α}}{2 α}$

and minimum point $x = 1 / 2$ , where $0 < x_{max, -}^{*} < x = 1 / 2 < x_{max, +}^{*} < 1$ . Moreover, the maximum values coincide, that is, $f (x_{max, -}^{*}; θ_{δ}) = f (x_{max, +}^{*}; θ_{δ})$ .

Proof.

As a by-product of proof of the Theorem 2.1, we have $f^{'} (x; θ_{δ}) = 0$ if and only if x is a zero of polynomial $p_{3} (x)$ defined in (8). Setting $α = β > 1$ and $δ = 2$ in polynomial $p_{3} (x)$ , we get $f^{'} (x; θ_{δ}) = 0$ if and only if

$p_{3}^{*} (x) = - 8 α x^{3} + 12 α x^{2} - [α (5 + ρ) + (α - 2) (ρ + 1)] x + (α - 1) (ρ + 1) = 0.$

Note that the above polynomial can be written as $p_{3}^{*} (x) = 2 (x - (1 / 2)) [4 α x^{2} - 4 αx + (α - 1) (ρ + 1)] .$ Then, it is clear that $x = 1 / 2$ and $x_{max, \pm}^{*}$ are critical points of $f (x; θ_{δ})$ , where $0 < x_{max, -}^{*} < x = 1 / 2 < x_{max, +}^{*} < 1$ . Note that the restriction $ρ < 1 / (α - 1)$ guarantees that the discriminant of the quadratic polynomial implicit in $p_{3}^{*} (x)$ is positive.

By using that $α > 1$ and $β > 1$ , and by following the same steps as in the final paragraph of proof of the Theorem 2.2, we guarantee bimodality of the Bbeta distribution.

Finally, the identity $f (x_{max, -}^{*}; θ_{δ}) = f (x_{max, +}^{*}; θ_{δ})$ follows from Proposition 2.3.

Remark 2.4

By considering α, β, δ and ρ as in the Table 2, it is clear that the restriction $ρ < 1 / (α - 1)$ is satisfied. Then, Theorem 2.3 guarantees the bimodality (B) of PDF $f (x; θ_{δ})$ with minimum point $x = 1 / 2$ and points (and values) of maximum specified in this table. These results are compatible with Figures 1(b) and 2(a)–(b).

Table 2.

Modes, maximum values and shapes of the PDF bimodal beta using the parameter values in Figure 1 (b) and Figure 2(a)–(b).

α	β	δ	ρ	$x_{max, -}^{*}$	$x_{max, +}^{*}$	$f (x_{max, -}^{*}; θ_{δ})$	$f (x_{max, +}^{*}; θ_{δ})$	Shape
2	2	2	0.25	0.19	0.81	1.30	1.30	B
2	2	2	0	0.15	0.85	1.87	1.87	B
2	2	2	0.5	0.25	0.75	1.21	1.21	B

Open in a new tab

3. Moments

In this section, some closed expressions for truncated moments and real moments of the Bbeta distribution are obtained. Other properties as raw moments, mean residual life function and moment generating function were also analysed in Section I of the Supplementary Material.

Theorem 3.1

If $X \sim BBeta (θ_{δ})$ then, for $0 ⩽ a < b ⩽ 1$ and $r > - α$ ,

$E (X^{r} 1_{{a ⩽ X ⩽ b}}) = \frac{1}{Z (θ_{δ})} \sum_{i = 0}^{2} c_{i} [\frac{B_{b} (α + r + i, β)}{B (α, β)} - \frac{B_{a} (α + r + i, β)}{B (α, β)}],$

where $c_{0} = 1 + ρ$ , $c_{1} = - 2 δ$ , $c_{2} = δ^{2}$ , and $B_{x} (α, β)$ is the incomplete beta function.

Proof.

By using definition of expectation and definition of Bbeta density, we have

$E (X^{r} 1_{{a ⩽ X ⩽ b}}) = \frac{1}{Z (θ_{δ})} \sum_{i = 0}^{2} c_{i} E (Y^{r + i} 1_{{a ⩽ Y ⩽ b}}), Y \sim Bbeta (θ_{0}) .$

Since $E (Y^{r + i} 1_{{a ⩽ Y ⩽ b}}) = [B_{b} (α + r + i, β) - B_{a} (α + r + i, β)] / B (α, β),$ the proof of the theorem follows.

Taking r = 0, b = x and a = 0 in Theorem 3.1, we get the formula (5) for the CDF. Letting r = 0, b = 1 and a = x in Theorem 3.1, we get the formula (6) for the SF. By combining the formula (6) of CDF and definition of the Bbeta distribution, we obtain the formula (7) for the HR.

Taking r = 1, a = x and b = 1 in Theorem 3.1, we get a closed formula for the mean residual life function, see Corollary 1.1 of the Supplementary Material.

Corollary 3.1 Real moments —

If $X \sim Bbeta (θ_{δ})$ and $r > - α$ , then

$E (X^{r}) = \frac{1}{Z (θ_{δ})} [(1 + ρ) \frac{B (α + r, β)}{B (α, β)} - 2 δ \frac{B (α + r + 1, β)}{B (α, β)} + δ^{2} \frac{B (α + r + 2, β)}{B (α, β)}] .$

Proof.

By taking b = 1 and a = 0 in Theorem 3.1 we have the following:

$E (X^{r}) = \frac{1}{Z (θ_{δ})} \sum_{i = 0}^{2} c_{i} \frac{B (α + r + i, β)}{B (α, β)},$

where $c_{0} = 1 + ρ$ , $c_{1} = - 2 δ$ and $c_{2} = δ^{2}$ .

As a consequence of the above corollary, the closed expressions for the standardized moments, variance, skewness and kurtosis of the bimodal beta r.v. X are easily obtained.

4. Further properties

In this section, we consider some properties of the Bbeta distribution, such as stochastic representation and identifiability. For reasons of space, entropy measures such as Tsallis [27], quadratic [23] and Shannon [24] ones were studied in Section II of the Supplementary Material.

4.1. Stochastic representation

Let W be a discrete r.v. with the following probability function $P (W = w_{k}) = π_{k}$ , k = 0, 1, 2, where

π_{0} = \frac{1 + ρ}{Z (θ_{δ})}, π_{1} = - \frac{2 αδ}{Z (θ_{δ}) (α + β)}, π_{2} = \frac{α (α + 1) δ^{2}}{Z (θ_{δ}) (α + β) (α + β + 1)}, for δ < 0,

and $Z (θ_{δ})$ is as in (2). Notice that $π_{0} + π_{1} + π_{2} = 1$ .

Let's consider the following three r.v.'s: $Y_{0; α, β} \sim Beta (α, β)$ , $Y_{1; α + 1, β} \sim Beta (α + 1, β)$ and $Y_{2; α + 2, β} \sim Beta (α + 2, β)$ . Then we define a new r.v. X as follows:

X = Y_{0; α, β} 1_{{W = w_{0}}} + Y_{1; α + 1, β} 1_{{W = w_{1}}} + Y_{2; α + 2, β} 1_{{W = w_{2}}},

(11)

where W is independent of $Y_{0; α, β}$ , $Y_{1; α + 1, β}$ and $Y_{2; α + 2, β}$ .

Proposition 4.1 Stochastic representation for $δ < 0$ —

If X admits the form (11), then $X \sim B beta (θ_{δ})$ . Conversely, if $X \sim B beta (θ_{δ})$ then X is as in (11).

Proof.

Using the law of total probability and the definition of X, we get

$\begin{aligned} P (X ⩽ x) & = P (Y_{0; α, β} ⩽ x | W = w_{0}) π_{0} + P (Y_{1; α + 1, β} ⩽ x | W = w_{1}) π_{1} \\ + P (Y_{2; α + 2, β} ⩽ x | W = w_{2}) π_{2} \\ = P (Y_{0; α, β} ⩽ x) π_{0} + P (Y_{1; α + 1, β} ⩽ x) π_{1} + P (Y_{2; α + 2, β} ⩽ x) π_{2}, \end{aligned}$

where in the last line we used the independence of W with respect to variables $Y_{1; α, β}$ , $Y_{2; α + 1, β}$ and $Y_{3; α + 2, β}$ . Since $P (Y_{k; α + k, β} ⩽ x) = I_{x} (α + k, β)$ , k = 0, 1, 2, the above equality becomes

$\frac{1}{Z (θ_{δ})} [(1 + ρ) I_{x} (α, β) - 2 δ \frac{B_{x} (α + 1, β)}{B (α, β)} + δ^{2} \frac{B_{x} (α + 2, β)}{B (α, β)}] .$

But, by (5), the right-hand side is equal to the CDF $F (x; θ_{δ})$ .

Then we have completed the proof.

4.2. Identifiability

A simple observation shows that the bimodal beta PDF $f (x; θ_{δ})$ in (1), with parameter vector $θ_{δ} = (α, β, ρ, δ)$ , can be written as a finite (generalized) mixture of three beta distributions with different shape parameters, i.e.

f (x; θ_{δ}) = π_{0} f (x; α, β) + π_{1} f (x; α + 1, β) + π_{2} f (x; α + 2, β), 0 < x < 1,

(12)

where $π_{0}$ , $π_{1}$ and $π_{2}$ are constants (that depends only on $θ_{δ}$ ) given in Proposition 4.1, and $f (x; α, β) = x^{α - 1} (1 - x)^{β - 1} / B (α, β)$ , 0<x<1, ( $α > 0, β > 0$ ), denotes the standard beta PDF.

Unlike Proposition 4.1, here δ can be non-negative. In principle, mixing non-negative weights are not necessary since mixtures can be PDF even if some of weights are negative.

Let $B$ be the family of beta distributions, as follows:

B = {F : F (x; α, β) = \int_{0}^{x} f (y; α, β) d y, α > 0, β > 0, 0 < x < 1} .

Write $H_{B}$ as the class of all finite mixtures of $B$ . It is well-known that $H_{B}$ is not identifiable; see the main Theorem of Ahmad and Al-Hussaini [1]. Let $H_{B^{*}}$ be the class of all finite mixtures of $B$ with the restriction that the shape parameters β are pairwise different (that is, $β_{i} \neq β_{j}$ for $i \neq j$ ). As a consequence of the main result of Atienza et al. [3], it is a simple task to prove that the class $H_{B^{*}}$ is identifiable; see, e.g. Proposition 3.2.2 of de Alencar [5] or Proposition 1.2 in the Appendix of Alfaia [2].

The following result proves the identifiability of bimodal beta distribution.

Proposition 4.2

The mapping $θ_{δ} = (α, β, ρ, δ) ⟼ f (\cdot; θ_{δ})$ , where the β's are pairwise different, is one-to-one.

Proof.

Let us suppose that $f (x; θ_{δ_{i}}) = f (x; θ_{δ_{j}})$ for all 0<x<1, where $θ_{δ_{i}} = (α_{i}, β_{i}, ρ_{i}, δ_{i})$ and $θ_{δ_{j}} = (α_{j}, β_{j}, ρ_{j}, δ_{j})$ . In other words, by (12),

$\begin{aligned} π_{0; i} f (x; α_{i}, β_{i}) + π_{1; i} f (x; α_{i} + 1, β_{i}) + π_{2; i} f (x; α_{i} + 2, β_{i}) \\ = π_{0; j} f (x; α_{j}, β_{j}) + π_{1; j} f (x; α_{j} + 1, β_{j}) + π_{2; j} f (x; α_{j} + 2, β_{j}), \end{aligned}$

where $π_{k; i}$ and $π_{k; j}$ , k = 0, 1, 2, are defined as in Proposition 4.1. Since $H_{B^{*}}$ is identifiable, we have $π_{k; i} = π_{k; j}$ , for k = 0, 1, 2, and $α_{i} = α_{j}$ , $β_{i} = β_{j}$ . Hence, from equalities $π_{k; i} = π_{k; j}$ , k = 0, 1, 2, immediately follows that $ρ_{i} = ρ_{j}$ and $δ_{i} = δ_{j}$ . Therefore, $θ_{δ_{i}} = θ_{δ_{j}}$ , and the proof follows.

5. Regression model, estimation and diagnostic analysis

Let $X_{1}, \dots, X_{n}$ be n independent random variables, where each $X_{i}$ , $i = 1, \dots, n$ , follows the PDF given in (1). We assume that the parameters $α_{i}$ and $β_{i}$ satisfy the following functional relations:

g_{1} (α_{i}) = η_{1 i} = w_{i}^{⊤} γ and g_{2} (β_{i}) = η_{2 i} = z_{i}^{⊤} ζ,

(13)

where $γ = (γ_{1}, \dots, γ_{p})^{⊤}$ and $ζ = (ζ_{1}, \dots, ζ_{q})^{⊤}$ are vectors of unknown regression coefficients which are assumed to be functionally independent, $γ \in R^{p}$ and $ζ \in R^{q}$ , with p + q<n, $η_{1 i}$ and $η_{2 i}$ are the linear predictors, and $w_{i} = (w_{i 1}, \dots, w_{ip})^{⊤}$ and $z_{i} = (z_{i 1}, \dots, z_{iq})^{⊤}$ are observations on p and q known regressors, for $i = 1, \dots, n$ . Furthermore, we assume that the covariate matrices $W = (w_{1}, \dots, w_{n})^{⊤}$ and $Z = (z_{1}, \dots, z_{n})^{⊤}$ have rank p and q, respectively. The link functions $g_{1} : R \to R^{+}$ and $g_{2} : R \to R^{+}$ in (13) must be strictly monotone, positive and at least twice differentiable, such that $α_{i} = g_{1}^{- 1} (w_{i}^{⊤} γ)$ and $β_{i} = g_{2}^{- 1} (z_{i}^{⊤} ζ)$ , with $g_{1}^{- 1} (\cdot)$ and $g_{2}^{- 1} (\cdot)$ being the inverse functions of $g_{1} (\cdot)$ and $g_{2} (\cdot)$ , respectively. There are several possible choices for the link functions $g_{1} (\cdot)$ and $g_{2} (\cdot)$ . For example, one can use the logarithmic specification $g_{j} (\cdot) = \log (\cdot)$ , square root $g_{j} (\cdot) = \sqrt{\cdot}$ , or identity $g_{j} (\cdot) = \cdot$ (with special attention to the sign of the estimates), j = 1, 2. In this paper, we consider the log link, $g_{j} (\cdot) = \log (\cdot)$ , since it is the most used for positive parameters.

The log-likelihood function for $θ_{δ} = (γ, ζ, ρ, δ)$ based on a sample of n independent observations is given by

ℓ (θ_{δ}) = \sum_{i = 1}^{n} ℓ (α_{i}, β_{i}, ρ, δ),

(14)

where $ℓ (α_{i}, β_{i}, ρ, δ) = - \log Z (θ_{δ}) - \log B (α_{i}, β_{i}) + \log [ρ + (1 - δ x_{i})^{2}] + (α_{i} - 1) \log x_{i} + (β_{i} - 1) \log (1 - x_{i}), i = 1, \dots, n,$ and $Z (θ_{δ})$ is as in (2).

The maximum likelihood estimator (MLE) ${\hat{θ}}_{δ} = ({\hat{γ}}^{⊤}, {\hat{ζ}}^{⊤}, \hat{ρ}, \hat{δ})^{⊤}$ of $θ_{δ} = (γ^{⊤}, ζ^{⊤}, ρ, δ)^{⊤}$ is obtained by the maximization of the log-likelihood function (14). However, it is not possible to derive analytical solution for the MLE $\hat{θ}$ , hence we resort to numerical solution using some optimization algorithm, such as Newton-Raphson and quasi-Newton.

Under mild regularity conditions and when n is large, the asymptotic distribution of the MLE ${\hat{θ}}_{δ} = ({\hat{γ}}^{⊤}, {\hat{ζ}}^{⊤}, \hat{ρ}, \hat{δ})^{⊤}$ is approximately multivariate normal (of dimension p + q + 2) with mean vector $θ_{δ} = (γ^{⊤}, ζ^{⊤}, ρ, δ)^{⊤}$ and variance covariance matrix $K^{- 1} (θ_{δ})$ where $K (θ_{δ}) = E [- \partial ℓ (θ_{δ}) / \partial θ_{δ} \partial θ_{δ}^{⊤}],$ is the expected Fisher information matrix. Unfortunately, there is no closed form expression for the matrix $K (θ_{δ})$ . Nevertheless, a consistent estimator of the expected Fisher information matrix is given by $J ({\hat{θ}}_{δ}) = - \partial ℓ (θ_{δ}) / \partial θ_{δ} \partial θ_{δ}^{⊤} |_{θ_{δ} = {\hat{θ}}_{δ}},$ which is the estimated observed Fisher information matrix. Therefore, for large n, we can replace $K (θ_{δ})$ by $J ({\hat{θ}}_{δ})$ .

Let $θ_{δ_{r}}$ be the r-th component of $θ_{δ}$ . The asymptotic $100 (1 - φ) %$ confidence interval for $θ_{δ_{r}}$ is given by ${\hat{θ}}_{δ_{r}} \pm z_{φ / 2} SE ({\hat{θ}}_{δ_{r}}), r = 1, \dots, p + q + 2,$ where $z_{φ / 2}$ is the $φ / 2$ upper quantile of the standard normal distribution and $SE ({\hat{θ}}_{δ_{r}})$ is the asymptotic standard error of $θ_{δ_{r}}$ . Note that $SE ({\hat{θ}}_{δ_{r}})$ is the square root of the r-th diagonal element of the matrix $J^{- 1} ({\hat{θ}}_{δ})$ .

Residuals are widely used to check the adequacy of the fitted model. To check the goodness of fit of the Bbeta model, we propose to use the randomized quantile residuals introduced by Dunn and Smyth [7]. Let $F (x_{i}; θ_{δ})$ be the cumulative distribution function of the Bbeta distribution, as defined in (5), in which the regression structures are assumed as in (13). The randomized quantile residual is given by

r_{i} = Φ^{- 1} (F (x_{i}; {\hat{θ}}_{δ})), i = 1, \dots, n,

where $Φ^{- 1} (\cdot)$ is the standard normal distribution function. If the assumed model for the data is well adjusted, these residuals have standard normal distribution [7].

6. Simulation study

In this section, Monte Carlo simulations are performed (i) to evaluate the finite-sample behaviour of the maximum likelihood estimates of the regression coefficients and (ii) to investigate the empirical distribution of the randomized quantile residuals.

The Monte Carlo experiments were carried out by considering the following regression structure

\begin{aligned} \log (α_{i}) & = γ_{0} + γ_{1} w_{i}, \\ \log (β_{i}) & = ζ_{0} + ζ_{1} z_{i}, i = 1, \dots, n, \end{aligned}

i.e. $g_{j} (\cdot) = \log (\cdot), j = 1, 2$ , where the true values of the parameters were chosen to be the same with the values of the estimated parameters for the case in which we use the application part of regression, i.e. $γ_{0} = - 1.8, γ_{1} = 5.9, ζ_{0} = 3.8, ζ_{1} = - 2.4, ρ = 0.1$ and $δ = 2.4$ . The covariate values of $w_{i}$ and $z_{i}$ were generated from the standard uniform distribution. The sample size considered was n = 50, 100, 200 and 300. All simulations were conducted in R [22] using the BFGS algorithm available in the optim() function. For each scenario, the Monte Carlo experiment was repeated 5000 times.

The Bbeta distribution is easily simulated from (5) as follows: if U has a uniform $U (0, 1)$ distribution, the solution of the non-linear equation $X = F^{- 1} (U; θ_{δ})$ has the $X \sim Bbeta (θ_{δ})$ distribution, where $F^{- 1} (\cdot; \cdot)$ is the inverse functions of $F (\cdot; \cdot)$ . To simulate data from this non-linear equation, we can use the programming language R through f.inv() function [22].

In the rest of this section, a small simulation study is presented to observe the finite sample performance of the proposed estimators from a regression approach. For such evaluation, the estimated bias and the estimated mean squared error (MSE) were calculated. The results are presented in Table 3 and Figure 4.

Table 3.

The estimated values for bias and mean squared error of the maximum likelihood estimators of $γ_{0}, γ_{1}, ζ_{0}, ζ_{1}, ρ$ and δ, and some values of sample size n.

n	The estimated bias						The estimated MSE
	$γ_{0}$	$γ_{1}$	$ζ_{0}$	$ζ_{1}$	δ	ρ	$γ_{0}$	$γ_{1}$	$ζ_{0}$	$ζ_{1}$	δ	ρ
50	0.212	0.106	0.132	0.299	0.177	1.306	0.234	0.634	0.417	0.839	0.488	0.235
100	0.213	0.099	0.114	0.254	0.120	0.938	0.192	0.475	0.276	0.558	0.183	0.091
200	0.202	0.093	0.095	0.215	0.081	0.543	0.157	0.390	0.181	0.381	0.068	0.006
300	0.195	0.091	0.088	0.200	0.061	0.414	0.139	0.353	0.152	0.313	0.037	0.003

Open in a new tab

Figure 4. — Box plots from 5000 simulated estimates of $γ_{0}, γ_{1}, ζ_{0}, ζ_{1}, ρ$ and δ for different sample sizes.

Table 3 presents the bias and MSE for the MLEs of $γ_{0}, γ_{1}, ζ_{0}, ζ_{1}, ρ$ and δ. Based on the results at these tables, we find that the estimates are convergent to their corresponding values of parameters. As expected, increasing the sample size n reduces substantially both bias and MSE. The previous findings are confirmed by the box plots shown in Figure 4.

6.1. Residuals

The second simulation study was performed to examine how well the distributions of the randomized quantile residuals are approximated by the standard normal distribution. The evaluation of the randomized quantile residuals were based on the normal probability plots of the mean order statistics and descriptive statistics. The results are presented in Table 4 and Figure 4 of the Supplementary Material.

Table 4.

Descriptive measures of the randomized quantile residuals for the bimodal beta model for different sample sizes.

n	Mean	StdDev	Skewness	Kurtosis
50	−0.001	0.999	0.028	2.854
100	−0.002	0.999	0.054	2.976
200	−0.003	0.997	0.077	3.002
300	−0.003	0.997	0.084	3.025

Open in a new tab

In Table 4, we present the mean, standard deviation (StdDev), skewness and kurtosis of the randomized quantile residuals. For all scenarios, that is, the residuals have approximately zero mean and unit standard deviation, have skewness close to zero, and the kurtosis is near three.

7. Real data application

In this section, to evaluate the applicability of the proposed model, a real data set with bimodality is considered. In particular, a real-life application related to the proportion of votes that Jair Bolsonaro received in the second turn of Brazilian elections in 2018 is analysed. We compared the potentiality of the Bbeta regression with the traditional beta regression model. In order to estimate the parameters of model, we adopt the MLE method (as discussed in Section 5). The asymptotic standard errors were computed using the observed Fisher information matrix. The required numerical evaluations for data analysis were implemented using the R software [22].

The goal of this data analysis is to describe the proportion of votes that Jair Bolsonaro received in the second turn of Brazilian elections in 2018 for all 5.565 cities, and it is available at https://dadosabertos.tse.jus.br. The response variable $X_{i}$ is the proportion of votes given the municipal human development (mhdi). The MHDI is used as explanatory variable since it is an important measure to guide authorities to assess progress and social reality as well as to define public policy priorities and comparisons of different cities [19]. Figure 5 plots the histogram with density estimated the response variable used in the application and the scatter plot of municipal human development against proportion of votes. From Figure 5, we can see that the response variable has bimodality. Furthermore, there is evidence of a proportion of votes trend with increased municipal human development. The correlation coefficient between the proportion of votes and MHDI is 0.8290.

Figure 5. — Estimated PDF and scatter plot of municipal human development against proportion of votes.

To explain this proportion of votes we consider the bimodal beta regression model, defined as

\begin{aligned} Y_{i} & \sim Bbeta (θ_{δ}), \\ \log (α_{i}) & = γ_{0} + γ_{1} {mhdi}_{i}, \\ \log (β_{i}) & = ζ_{0} + ζ_{1} {mhdi}_{i}, \end{aligned}

where $i = 1, 2, \dots, 5.565$ cities and ${mhdi}_{i}$ is municipal human development of cities i. For comparison purposes the beta regression model was fitted, assuming that

\begin{aligned} Y_{i} & \sim beta (μ_{i}, ϕ_{i}), \\ logit (μ_{i}) & = β_{0} + β_{1} {mhdi}_{i}, \\ \log (ϕ_{i}) & = γ_{0} + γ_{1} {mhdi}_{i} . \end{aligned}

and the unit-bimodal Birnbaum-Saunders (UBBS) regression model was fitted, assuming that

\begin{aligned} Y_{i} & \sim UBBS (α_{i}, β_{i}, δ), \\ \log (α_{i}) & = ν_{0} + ν_{1} {mhdi}_{i}, \\ \log (β_{i}) & = η_{0} + η_{1} {mhdi}_{i} . \end{aligned}

Table 5 shows the estimated parameters and standard errors. Table 6 shows Akaike information criterion (AIC) and Bayesian information criterion (BIC) for the fitted models. In general, it is expected that the better model fitting the data presents the smallest values for the quantities which are AIC and BIC. Based on the AIC and BIC criteria, the model which provides a better fit in this data set is the Bbeta regression model. This claim is also supported by the residual plots with simulated envelopes shown in Figure 6.

Table 5.

Maximum likelihood estimates and standard errors (SE) for the fit of the bimodal beta, beta and unit-bimodal Birnbaum-Saunders models in the proportion of votes.

Model	Parameter	Estimate	SE
Bbeta	$γ_{0}$	−1.8999	0.1963
	$γ_{1}$	5.9471	0.3044
	$ζ_{0}$	3.8341	0.1915
	$ζ_{1}$	−2.4232	0.2862
	ρ	0.1096	0.0090
	δ	2.4092	0.0351
beta	$β_{0}$	−7.5343	0.0749
	$β_{1}$	11.1820	0.1105
	$γ_{0}$	1.0029	0.1675
	$γ_{1}$	2.5214	0.2528
UBBS	$ν_{0}$	−0.5721	0.1001
	$ν_{1}$	−0.1035	0.1436
	$η_{0}$	5.0120	0.0257
	$η_{1}$	−8.0601	0.0381
	δ	0.6405	0.0990

Open in a new tab

Table 6.

Information criteria for the fitted models.

Models	AIC	BIC
Bbeta	−8786	−8746
beta	−8238	−8212
UBBS	−8115	−8082

Open in a new tab

Figure 6. — Half-normal plot of randomized quantile residuals with simulated envelope for the fit of beta and bimodal beta.

8. Concluding remarks

When modeling responses with bimodal bounded to the unit interval, despite its broad sense applicability in many fields, the beta distribution is not suitable. In this paper, the well-known two-parameter beta distribution is extended by introducing two extra parameters, thus defining the bimodal beta (Bbeta) distribution, based on a quadratic transformation technique used to generate bimodal functions [8,28], which generalizes the beta distribution. We provide a mathematical treatment of the new distribution, including bimodality, moments, entropies, stochastic representation and identifiability. We allow a regression structure for the parameters α and β. The estimation of the model parameters is approached by maximum likelihood and its good performance has been evaluated by means of Monte Carlo simulations. Furthermore, we have proposed residuals for the proposed model and conducted a simulation study to establish their empirical properties in order to evaluate their performances. The proposed model was fitted to the proportion of votes that Jair Bolsonaro received in the second turn of Brazilian elections in 2018. As expected, the Bbeta model outperforms the beta regression in the presence of bimodality. Further, Bbeta is capable to fit well when compared with UBBS.

Supplementary Material

Supplemental Material

CJAS_A_2146661_SM1389.pdf^{(244.2KB, pdf)}

Acknowledgments

The authors would like to thank the reviewers for all useful and helpful comments on an earlier version of our manuscript, which resulted in this improved version.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

1.Ahmad K.E. and Al-Hussaini E.K., Remarks on the non-identifiability of mixtures of distributions, Ann. Inst. Stat. Math. 34 (1982), pp. 543–544. [Google Scholar]
2.Alfaia L.M., A Distribuição Beta Bimodal: Propriedades e Aplicaçães, UnB, Brasília, 2021. [Google Scholar]
3.Atienza N., Garcia-Heras J., and Munoz-Pichardo J.M., A new condition for identifiability of finite mixture distributions, Metrika 63 (2006), pp. 215–221. [Google Scholar]
4.Bayes C.L, Bazán J.L., and Catalina G., A new robust regression model for proportions, Bayesian Anal. 7 (2012), pp. 841–866. [Google Scholar]
5.de Alencar E.R., Discriminante não-linear Para Mistura De Distribuições Beta, UnB, Brasília, 2018. [Google Scholar]
6.Domma F., Popović B.V., and Nadarajah S., An extension of Azzalini's method, J. Comput. Appl. Math. 278 (2015), pp. 37–47. [Google Scholar]
7.Dunn P.K. and Smyth G.K., Randomized quantile residuals, J. Comput. Graph. Stat. 5 (1996), pp. 236–244. [Google Scholar]
8.Elal-Olivero D., Alpha-skew-normal distribution, Proyecciones J. Math. 29 (2010), pp. 224–240. [Google Scholar]
9.Ferrari S. and Cribari-Neto F, Beta regression for modelling rates and proportions, J. Appl. Stat. 31 (2004), pp. 799–815. [Google Scholar]
10.Griffiths L., Introduction to the Theory of Equations, J. Wiley, 1947. [Google Scholar]
11.Hahn E.D., Regression modelling with the tilted beta distribution: A Bayesian approach, Can. J. Stat. 49 (2021), pp. 262–282. [Google Scholar]
12.Ji Y., Wu C., Liu P., Wang J., and Coombes K.R, Applications of beta-mixture models in bioinformatics, Bioinformatics 21 (2005), pp. 2118–2122. [DOI] [PubMed] [Google Scholar]
13.Johnson N.L., Kotz S., and Balakrishnan N., Continuous Univariate Distributions, Vol. 2nd ed., 2, John Wiley & Sons Inc., New York, 1995. [Google Scholar]
14.Lin T.I., Lee J.C., and Hsieh W.J., Robust mixture models using the skew-t distribution, Stat. Comput. 17 (2007a), pp. 81–92. [Google Scholar]
15.Lin T.I., Lee J.C., and Yen S.Y., Finite mixture modeling using the skew-normal distribution, Stat. Sin. 17 (2007b), pp. 909–927. [Google Scholar]
16.Ma Z. and Leijon A., Beta mixture models and the application to image classification. Proceedings of IEEE International Conference on Image Processing (ICIP), 2045–2048, 2009.
17.Martínez-Flórez M., Martínez E., Tovar-Falón R., and Gómez H.W., A family of bimodal distributions generated by distributions with positive support, J. Appl. Stat. 49 (2022), pp. 3614–3637. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Martínez-Flórez M., Olmos N.M., and Venegas O., Unit-bimodal Birnbaum-Saunders distribution with applications, Commun. Stat. -- Simul. Comput. (2022), pp. 1–20. 10.1080/03610918.2022.2069260 [DOI] [Google Scholar]
19.Menezes A.F.B. and Furriel W.O., Beta and simplex regression models in the analysis of the municipal human development index 2010, Rev. Bras. Biom. 37 (2019), pp. 394–408. [Google Scholar]
20.Olmos N.M., Martínez-Flórez G., and Bolfarine H., Bimodal Birnbaum-Saunders distribution with applications to non-negative measurements, Commun. Stat. -- Theory Methods 46 (2017), pp. 6240–6257. [Google Scholar]
21.Ospina R. and Ferrari S.L.P., Inflated beta distributions, Stat. Pap. 51 (2008), pp. 111–126. [Google Scholar]
22.R Core Team , R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/ (2021).
23.Rao C.R., Quadratic entropy and analysis of diversity, Sankhya Ser. A. 72 (2010), pp. 70–80. [Google Scholar]
24.Shannon C.E., A mathematical theory of communication, Bell Syst. Tech. J. 27 (1948). 379–423. 623–656. [Google Scholar]
25.Smithson M., Merkle E.C., and Verkuilen J., Beta regression finite mixture models of polarization and priming, J. Educ. Behav. Stat. 36 (2011), pp. 804–831. [Google Scholar]
26.Smithson M. and Segale C., Partition priming in judgments of imprecise probabilities, J. Stat. Theory Pract. 3 (2009), pp. 169–181. [Google Scholar]
27.Tsallis C., Possible generalization of Boltzmann-Gibbs statistics, J. Stat. Phys. 52 (1988), pp. 479–487. [Google Scholar]
28.Vila R. and Çankaya M.N., A bimodal Weibull distribution: properties and inference, J. Appl. Stat. 49 (2022), pp. 3044–3062. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Vila R., Ferreira L., Saulo H., Prataviera F., and Ortega E.M.M., A bimodal gamma distribution: properties, regression model and applications, Statistics 54 (2020), pp. 469–493. [Google Scholar]
30.Wong M.C., Bubble Value At Risk: A Countercyclical Risk Management Approach, John Wiley & Sons, Singapore, 2013. [Google Scholar]
31.Xue J.Loop Tiling for Parallelism, The Springer International Series in Engineering and Computer Science, Springer, New York, 2012. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material

CJAS_A_2146661_SM1389.pdf^{(244.2KB, pdf)}

[CIT0001] 1.Ahmad K.E. and Al-Hussaini E.K., Remarks on the non-identifiability of mixtures of distributions, Ann. Inst. Stat. Math. 34 (1982), pp. 543–544. [Google Scholar]

[CIT0002] 2.Alfaia L.M., A Distribuição Beta Bimodal: Propriedades e Aplicaçães, UnB, Brasília, 2021. [Google Scholar]

[CIT0003] 3.Atienza N., Garcia-Heras J., and Munoz-Pichardo J.M., A new condition for identifiability of finite mixture distributions, Metrika 63 (2006), pp. 215–221. [Google Scholar]

[CIT0004] 4.Bayes C.L, Bazán J.L., and Catalina G., A new robust regression model for proportions, Bayesian Anal. 7 (2012), pp. 841–866. [Google Scholar]

[CIT0005] 5.de Alencar E.R., Discriminante não-linear Para Mistura De Distribuições Beta, UnB, Brasília, 2018. [Google Scholar]

[CIT0006] 6.Domma F., Popović B.V., and Nadarajah S., An extension of Azzalini's method, J. Comput. Appl. Math. 278 (2015), pp. 37–47. [Google Scholar]

[CIT0007] 7.Dunn P.K. and Smyth G.K., Randomized quantile residuals, J. Comput. Graph. Stat. 5 (1996), pp. 236–244. [Google Scholar]

[CIT0008] 8.Elal-Olivero D., Alpha-skew-normal distribution, Proyecciones J. Math. 29 (2010), pp. 224–240. [Google Scholar]

[CIT0009] 9.Ferrari S. and Cribari-Neto F, Beta regression for modelling rates and proportions, J. Appl. Stat. 31 (2004), pp. 799–815. [Google Scholar]

[CIT0010] 10.Griffiths L., Introduction to the Theory of Equations, J. Wiley, 1947. [Google Scholar]

[CIT0011] 11.Hahn E.D., Regression modelling with the tilted beta distribution: A Bayesian approach, Can. J. Stat. 49 (2021), pp. 262–282. [Google Scholar]

[CIT0012] 12.Ji Y., Wu C., Liu P., Wang J., and Coombes K.R, Applications of beta-mixture models in bioinformatics, Bioinformatics 21 (2005), pp. 2118–2122. [DOI] [PubMed] [Google Scholar]

[CIT0013] 13.Johnson N.L., Kotz S., and Balakrishnan N., Continuous Univariate Distributions, Vol. 2nd ed., 2, John Wiley & Sons Inc., New York, 1995. [Google Scholar]

[CIT0014] 14.Lin T.I., Lee J.C., and Hsieh W.J., Robust mixture models using the skew-t distribution, Stat. Comput. 17 (2007a), pp. 81–92. [Google Scholar]

[CIT0015] 15.Lin T.I., Lee J.C., and Yen S.Y., Finite mixture modeling using the skew-normal distribution, Stat. Sin. 17 (2007b), pp. 909–927. [Google Scholar]

[CIT0016] 16.Ma Z. and Leijon A., Beta mixture models and the application to image classification. Proceedings of IEEE International Conference on Image Processing (ICIP), 2045–2048, 2009.

[CIT0017] 17.Martínez-Flórez M., Martínez E., Tovar-Falón R., and Gómez H.W., A family of bimodal distributions generated by distributions with positive support, J. Appl. Stat. 49 (2022), pp. 3614–3637. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0018] 18.Martínez-Flórez M., Olmos N.M., and Venegas O., Unit-bimodal Birnbaum-Saunders distribution with applications, Commun. Stat. -- Simul. Comput. (2022), pp. 1–20. 10.1080/03610918.2022.2069260 [DOI] [Google Scholar]

[CIT0019] 19.Menezes A.F.B. and Furriel W.O., Beta and simplex regression models in the analysis of the municipal human development index 2010, Rev. Bras. Biom. 37 (2019), pp. 394–408. [Google Scholar]

[CIT0020] 20.Olmos N.M., Martínez-Flórez G., and Bolfarine H., Bimodal Birnbaum-Saunders distribution with applications to non-negative measurements, Commun. Stat. -- Theory Methods 46 (2017), pp. 6240–6257. [Google Scholar]

[CIT0021] 21.Ospina R. and Ferrari S.L.P., Inflated beta distributions, Stat. Pap. 51 (2008), pp. 111–126. [Google Scholar]

[CIT0022] 22.R Core Team , R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/ (2021).

[CIT0023] 23.Rao C.R., Quadratic entropy and analysis of diversity, Sankhya Ser. A. 72 (2010), pp. 70–80. [Google Scholar]

[CIT0024] 24.Shannon C.E., A mathematical theory of communication, Bell Syst. Tech. J. 27 (1948). 379–423. 623–656. [Google Scholar]

[CIT0025] 25.Smithson M., Merkle E.C., and Verkuilen J., Beta regression finite mixture models of polarization and priming, J. Educ. Behav. Stat. 36 (2011), pp. 804–831. [Google Scholar]

[CIT0026] 26.Smithson M. and Segale C., Partition priming in judgments of imprecise probabilities, J. Stat. Theory Pract. 3 (2009), pp. 169–181. [Google Scholar]

[CIT0027] 27.Tsallis C., Possible generalization of Boltzmann-Gibbs statistics, J. Stat. Phys. 52 (1988), pp. 479–487. [Google Scholar]

[CIT0028] 28.Vila R. and Çankaya M.N., A bimodal Weibull distribution: properties and inference, J. Appl. Stat. 49 (2022), pp. 3044–3062. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0029] 29.Vila R., Ferreira L., Saulo H., Prataviera F., and Ortega E.M.M., A bimodal gamma distribution: properties, regression model and applications, Statistics 54 (2020), pp. 469–493. [Google Scholar]

[CIT0030] 30.Wong M.C., Bubble Value At Risk: A Countercyclical Risk Management Approach, John Wiley & Sons, Singapore, 2013. [Google Scholar]

[CIT0031] 31.Xue J.Loop Tiling for Parallelism, The Springer International Series in Engineering and Computer Science, Springer, New York, 2012. [Google Scholar]

PERMALINK

A model for bimodal rates and proportions

Roberto Vila

Lucas Alfaia

André FB Menezes

Mehmet N Çankaya

Marcelo Bourguignon

Abstract

1. Introduction

2. The bimodal beta distribution

Figure 1.

Figure 2.

Figure 3.

2.1. Bimodality properties

Theorem 2.1 Uni- or bimodality —

Proof.

Remark 2.1

Table 1.

Theorem 2.2 Bimodality; case ρ=0 —

Proof.

Remark 2.2

Proposition 2.3

Proof.

Theorem 2.3

Proof.

Remark 2.4

Table 2.

3. Moments

Theorem 3.1

Proof.

Corollary 3.1 Real moments —

Proof.

4. Further properties

4.1. Stochastic representation

Proposition 4.1 Stochastic representation for δ<0 —

Proof.

4.2. Identifiability

Proposition 4.2

Proof.

5. Regression model, estimation and diagnostic analysis

6. Simulation study

Table 3.

Figure 4.

6.1. Residuals

Table 4.

7. Real data application

Figure 5.

Table 5.

Table 6.

Figure 6.

8. Concluding remarks

Supplementary Material

Acknowledgments

Disclosure statement

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Theorem 2.2 Bimodality; case $ρ = 0$ —

Proposition 4.1 Stochastic representation for $δ < 0$ —