A Bayesian nonparametric model for bounded directional data on the positive orthant of the unit sphere

Emiliano Geneyro; Gabriel Núñez-Antonio

doi:10.1080/02664763.2022.2156485

. 2022 Dec 14;51(4):721–739. doi: 10.1080/02664763.2022.2156485

A Bayesian nonparametric model for bounded directional data on the positive orthant of the unit sphere

Emiliano Geneyro ¹, Gabriel Núñez-Antonio ^1,^CONTACT

PMCID: PMC10896154 PMID: 38414804

Abstract

Directional data appears in several branches of research. In some cases, those directional variables are only defined in subsets of the K-dimensional unit sphere. For example, in some applications, angles as measured responses are limited on the positive orthant. Analysis on subsets of the K-dimensional unit sphere is challenging and nowadays there are not many proposals that discuss this topic. Thus, from a methodological point of view, it is important to have probability distributions defined on bounded subsets of the K-dimensional unit sphere. Specifically, in this paper, we introduce a nonparametric Bayesian model to describe directional variables restricted to the first orthant. This model is based on a Dirichlet process mixture model with multivariate projected Gamma densities as kernel distributions. We show how to carry out inference for the proposed model based on a slice sampling scheme. The proposed methodology is illustrated using simulated data sets as well as a real data set.

Keywords: Multivariate projected gamma distribution, Dirichlet process mixture, circular data, spherical data

1. Introduction

In recent years, interest in analyzing data representing directions has increased. These type of measurements are known as directional data and appear in several areas of knowledge such as biology, geology, meteorology, ecology and environmental sciences. Directional data are related to unit vectors in the $(K + 1)$ -dimensional space $R^{K + 1}$ . Thus, these kinds of data can be represented by K angles and their natural sample space is the K-dimensional unit sphere, $S^{K}$ . In the cases where K = 1 and K = 2, directional data are named circular data and spherical data, respectively. Since the unit sphere is topologically different from the Euclidean space, a proper analysis of directional data requires appropriate statistical methods that consider inherent properties from the corresponding sample spaces. Contributions range from graphical methods to the development of new statistical models to describe directional observations. For a survey, the reader is referred to [3,6,10,13,16,18,19,21,27,28,30].

Although there has been an increasing development in directional data modeling, most recent proposals have focused on defining new parametric models on the entire unit sphere and the development of probability distributions restricted to subsets of the unit sphere has been overlooked. In many instances, there are applications relating angle measurements on subsets of the unit sphere. An example of the previous situation is axial data, where the corresponding sample space turns out to be the interval $(0, π]$ . In the same context, there are phenomena where the interest lies in angles defined only on the positive orthant of the unit sphere. Some applications of the latter can be found in the analysis of incidence and refraction angles of the archerfish [33]. In sports science, particularly in baseball, the study of angles related to achieving home runs is relevant [32]; in kinesiology, the relation of the extension angle of the human knee with the possibility of suffering injuries is a topic of general interest [29]; in environmental sciences, the tilt angle inclination of photovoltaic cells is analyzed to optimize power generation. In phenology, the analysis of the behavior of solar-to-sensor angles of backscatter light is important in the study of its correlation with some phenological metrics [24]. On the other hand, directional data present characteristics such as multimodality, which could not be well fitted by standard parametric models. There are undoubtedly multimodal parametric models (see, for example, [2], [15] and [22]), which can be fitted for describing multimodal behaviors of directional data. However, the fitting of the above models may require particular procedures, even more when sequential processes are applied. In general instances, where the dataset has high skewness or kurtosis and several modes, it may be preferable to consider semiparametric or nonparametric models as an alternative (see, for example, [21] and [26]).

From a theoretical point of view, there are several basic approaches to generate circular distributions. One way is defining a distribution directly on the unit sphere, as with the von Mises–Fisher or Kent or Fisher–Bingham distributions. Another method is based on projecting a multivariate distribution initially defined on the $(K + 1)$ -dimensional space onto the corresponding unit sphere $S^{K}$ . Let $Y$ be a random $(K + 1)$ -dimensional vector such that P $r (Y = 0) = 0$ . Then $U = Y | | Y | |^{- 1}$ is a random point on the $K -$ dimensional unit sphere, $S^{K}$ . Two important instances are those in which $Y$ has a $(K + 1)$ -variate Normal distribution or a Gamma distribution. In the first case, $U$ is said to have a multivariate projected Normal distribution and, in the second case, $U$ is said to have a multivariate projected Gamma distribution. While the projected Normal distribution has received a lot of attention, the projected Gamma distribution has only been studied by Núñez-Antonio and Geneyro [22], who mentioned that the family of projected Gamma distributions is appropriate for analyzing particular directional data defined only on the positive orthant.

In this paper, we propose a new nonparametric model to describe directional data restricted to the first orthant of the K-dimensional unit sphere; that means, directional data whose corresponding angles belong to the interval $(0, π / 2]$ . The proposed model is a Dirichlet process (DP) mixture model based on multivariate projected Gamma distributions. As discussed below, this model allows greater flexibility to perform analyses for directional random variables defined in the first orthant with behaviors such as multimodality or high skewness and kurtosis.

The rest of this paper proceeds as follows. In Section 2, the multivariate projected Gamma model is briefly reviewed. In Section 3, the proposed model based on the Dirichlet process mixture of multivariate projected Gamma densities is presented, as well as the way to perform Bayesian inferences. Section 4 illustrates our proposal with several simulated examples and a real dataset on solar-to-sensor angles of backscatter light. Finally, Section 5 provides some concluding remarks.

2. The projected gamma distribution

As mentioned above, one technique used to define directional distributions is radially projecting on the unit sphere $S^{K}$ a multivariate distribution initially defined on $R^{K + 1}$ . Since that technique does not establish any additional restrictions on the original multivariate distribution, radial projection can be used to define distributions on specific subsets of $S^{K}$ . Following this idea, Núñez-Antonio and Geneyro [22] have developed the multivariate projected Gamma distribution which is a model only defined on the positive orthant. They show how to carry out inference based on a Gibbs sampling scheme after the introduction of suitably chosen latent variables. The details of that Gibbs sampler are given in Núñez-Antonio and Geneyro [22], but we briefly describe the basis of the algorithm here. The starting point is the joint density

\begin{aligned} f_{Y} (y | α, β) & = \prod_{k = 1}^{K + 1} Ga (y_{k} | α_{k}, β_{k}), \\ \equiv G a_{K + 1} (y | α, β) \end{aligned}

(1)

where $Y = (y_{1}, y_{2}, \dots, y_{K + 1})^{t}$ is a vector of dimension $(K + 1)$ , $α = (α_{1}, \dots, α_{K + 1})^{t}$ , $β = (β_{1}, \dots, β_{K + 1})^{t}$ and $Ga (y_{k} | α_{k}, β_{k})$ is a Gamma density given by $Ga (y_{k} | α_{k}, β_{k}) = \frac{β_{k}^{α_{k}}}{Γ (α_{k})} y_{k}^{α_{k} - 1} \exp (- β_{k} y_{k})$ . By applying a spherical coordinates transformation $Y \to (R, Θ)$ , and integrating out R, the variable $Θ = (θ_{1}, \dots, θ_{K})^{t}$ has a multivariate projected Gamma distribution with density defined as

\begin{aligned} {PG}_{K} (θ | α, β) & = \frac{Γ (A) β_{K + 1}^{α_{K + 1}}}{(B)^{A} Γ (α_{K + 1})} (\prod_{j = 1}^{K} \frac{β_{j}^{α_{j}}}{Γ (α_{j})} (\cos θ_{j})^{α_{j} - 1} (\sin θ_{j})^{(\sum_{h = j + 1}^{K + 1} α_{h}) - 1}) \\ I_{(0, π / 2]^{K}} (θ) \end{aligned}

(2)

where

A = \sum_{j = 1}^{K + 1} α_{j} and B = β_{1} \cos θ_{1} + \sum_{m = 2}^{K} (β_{m} \cos θ_{m} \prod_{j = 1}^{m - 1} \sin θ_{j}) + β_{K + 1} \prod_{j = 1}^{K} \sin θ_{j} .

The vector parameters $(α, β)^{t}$ are not identifiable, and Núñez-Antonio and Geneyro [22] propose to consider an additional constraint given by $β = (1, β_{2}, β_{3}, \dots, β_{K + 1})^{t}$ . Despite that restriction, the model maintains enough flexibility to describe several behaviors on the positive orthant.

Given a sample of directional vectors ${θ_{1}, θ_{2}, \dots, θ_{n}}$ , where $θ_{i} = (θ_{i 1}, θ_{i 2}, \dots, θ_{i K})$ , to make inferences on parameters $(α, β)$ , Núñez-Antonio and Geneyro [22] propose to consider $R_{i}$ , $i = 1, \dots, n$ , as suitable latent variables so that each vector $y_{i} = (r_{i}, θ_{i})$ follows a distribution as (1). It can be shown that the conditional posterior distribution of $R_{i}$ is

f (r_{i} | θ_{i}, α, β) = Ga (\sum_{j = 1}^{K + 1} α_{j}, \cos θ_{i 1} + \sum_{m = 2}^{K} (β_{m} \cos θ_{im} \prod_{j = 1}^{m - 1} \sin θ_{ij}) + β_{K + 1} \prod_{j = 1}^{K} \sin θ_{ij}) .

Then considering the complete data $D_{n} = {(θ_{i}, r_{i})}_{i = 1}^{n}$ , posterior conditional distributions for each component of the vectors $α$ and $β$ are obtained and a Gibbs sampler can be carried out.

3. Directional DP mixture model

In this section, we introduce a new nonparametric model for directional variables defined only in the first orthant. The model is based on the DP mixture model of multivariate projected Gamma distributions. Firstly, we provide a brief introduction to DP mixture models.

3.1. DP mixture model

From a Bayesian framework, a nonparametric model is a model defined on an infinite-dimensional parameter space. The application of the nonparametric Bayesian methodology was limited by its computational complexity. It was not until the last decade of the 20th century, with computational development and advances in Markov chain Monte Carlo methods (MCMC), that this methodology attracted the interest of researchers because of its application in several areas. In order to consider infinite-dimensional initial distributions, it is necessary to define random probability measures (RPM). One of the most important RPM is the Dirichlet process (DP), which was proposed by Ferguson in 1973 [8]. The DP is denoted as $DP (M, G_{0})$ , where M>0 is a precision parameter and $G_{0}$ is a base measure. An important result related to a DP is the stick-breaking construction proposed by Sethuraman [31], where any $G \sim DP (M, G_{0})$ can be represented as

\begin{aligned} G (\cdot) & = \sum_{h = 1}^{\infty} w_{h} δ_{m_{h}} (\cdot), m_{h} i . i . d G_{0}, \end{aligned}

(3)

\begin{aligned} w_{1} = v_{1}, w_{h} & = v_{h} \prod_{1 \leq l < h} (1 - v_{l}), v_{h} i . i . d Beta (1, M) . \end{aligned}

(4)

It is well known that random distribution functions chosen from a DP are almost surely discrete [4]. In order to extend these kind of priors to the case of absolutely continuous random distribution functions, DP mixture models (DPM) were first considered by Lo [17]. Thus, absolutely continuous random density functions can be considered as

f_{G} (x) = \int f (x | γ) dG (γ)

where $G \sim DP (M, G_{0})$ , and $f (x | γ)$ is a density belonging to a family of continuous densities indexed by $γ$ . From a hierarchical approach, DPM models can be defined as

\begin{aligned} x | γ_{j} & \sim f (\cdot | γ_{j}) \\ γ_{j} | G & \sim G \\ G & \sim DP (M, G_{0}) . \end{aligned}

Hence, if the stick-breaking representation of G is considered, then the DPM model can be represented as

f (x) = \int f (x | γ) dG (γ) = \sum_{j = 1}^{\infty} w_{j} f (x | γ_{j}) .

(5)

3.2. The projected gamma DP mixture model

Consider a multivariate Gamma DP mixture model given by

\begin{aligned} Y | α, β & \sim G a_{K + 1} (y | α, β) \\ α, β | G & \sim G \\ G & \sim DP (M, G_{0}), \end{aligned}

where $G a_{K + 1} (y | α, β)$ is a multivariate Gamma distribution as defined in (1) and $G_{0}$ is specified by

G_{0} (α, β) = \prod_{k = 1}^{K + 1} Ga (α_{k} | c_{k}, d_{k}) \prod_{l = 2}^{K + 1} Ga (β_{l} | a_{l}, b_{l}) .

Here, we assume that $a_{l}$ , $b_{l}$ , $c_{k}$ and $d_{k}$ for all ls and ks are known and we set a prior distribution $M \sim Ga (a_{0}, b_{0})$ for the concentration parameter of the DP. Therefore, from (5) the density of $Y$ can be expressed as

f (y) = \sum_{j = 1}^{\infty} w_{j} G a_{K + 1} (y | α_{j}, β_{j}) .

Now, if we define a directional variable $Θ$ by projecting $Y$ onto the unit sphere as outlined in Section 2, the directional variable $Θ$ follows an infinite mixture of multivariate projected Gamma distributions,

f (θ | α, β) = \sum_{j = 1}^{\infty} w_{j} P G_{K} (θ | α_{j}, β_{j}) .

We call that model the projected Gamma DP mixture model.

3.3. Bayesian inference

Suppose we are given a sample of data $θ_{i} = (θ_{i 1}, θ_{i 2}, \dots, θ_{i K})$ for $i = 1, 2, \dots, n$ as previously defined. Following the idea described in Section 2 we can consider complete data ${y_{1}, \dots, y_{n}}$ for carrying out inferences, where $y_{i} = (y_{i 1}, y_{i 2}, \dots, y_{i (K + 1)})$ . That means, the completed data has a DPM of multivariate Gamma distributions as previously defined and we can use nonparametric inference techniques for this type of models. In order to do that, we use a slice sampling algorithm developed by Kalli et al. [14]. The basic idea of this approach is to introduce further latent variables so that, conditional on these, each $y_{i}$ can be represented as being generated from a single multivariate Gamma distribution. Details of the original slice sampler algorithm can be found in Walker [34], but we describe the basic ideas of that algorithm for our model.

Given the stick-breaking representation for $G \sim DP (M, G_{0})$ , we can write

f (y | w, α, β) = \sum_{j = 1}^{\infty} w_{j} G a_{K + 1} (y | α_{j}, β_{j}) .

Hence, firstly, if latent variables $u_{i}$ are introduced such that the joint density of $(y_{i}, u_{i})$ given $(w, α, β)$ is given by

f (y, u | w, α, β) = \sum_{j = 1}^{\infty} I (u < w_{j}) G a_{K + 1} (y | α_{j}, β_{j}),

then, integrating over $u$ we obtain our density $f (y | w, α, β)$ . In addition, given u, the number of components is finite with the indexes being $A_{u} = {j : w_{j} > u}$ . Thus,

f (y | u, w, α, β) = \sum_{j = 1}^{\infty} N_{u}^{- 1} G a_{K + 1} (y | α_{j}, β_{j}),

where $N_{u}$ is the size of $A_{u}$ , which is a finite set. Secondly, as it is usually done in mixture settings, for each $i = 1, \dots, n$ , indicator variables $d_{i}$ are introduced which indicates which of these finite number of components provide the i-th observation. Then, given $d_{i}$ , the variables $y_{i}$ follow a multivariate Gamma distribution $G a_{K + 1} (\cdot | α_{d_{i}}, β_{d_{i}})$ , where $w_{d_{i}}$ is the corresponding mixture weight of the $d_{i}$ -th component. Thus, the completed likelihood function is proportional to

f ({y_{i}, u_{i}, d_{i}}_{i = 1}^{n} | w, α, β) = \prod_{i = 1}^{n} I (u_{i} < w_{d_{i}}) G a_{K + 1} (y_{i} | α_{d_{i}}, β_{d_{i}}),

(6)

and this allows a simple Gibbs sampling scheme for inferences. However, as Kalli et al. [14] pointed out, the Walker's algorithm [34] has some problems and proposed to use a general class of slice sampler, which in our case is defined by the following completed likelihood

f ({y_{i}, u_{i}, d_{i}}_{i = 1}^{n} | w, α, β) = \prod_{i = 1}^{n} ξ_{d_{i}}^{- 1} I (u_{i} < ξ_{d_{i}}) w_{d_{i}} G a_{K + 1} (y_{i} | α_{d_{i}}, β_{d_{i}}),

where $ξ_{1}, ξ_{2}, \dots$ is any positive decreasing sequence, which plays a role between balance of efficiency and computational time.

Since the construction of the weights $w$ (defined as in (4)) depends directly on sampling of the variables $v$ , the latter will be used in the description of sampling process. Thus, to generate samples of the joint posterior distribution using a Gibbs sampler algorithm, the variables that need to be sampled at each step are $α_{j}, β_{j}, v_{j}$ and $d_{i}, u_{i}$ , for $j = 1, \dots$ and $i = 1, \dots, n$ , respectively. In this paper, we take $ξ_{j} = e^{- j}$ , then ξ and v are conditionally independent and our Gibbs sampler for completed data model turns out to be

(0)
Set initial values for $d_{1}, d_{2}, \dots, d_{n}$ and $r_{1}, r_{2}, \dots, r_{n}$ .
(1)
Applying the standard k-dimensional spherical coordinate transformation $y \to (r, θ)$ , construct $y_{i} = (r_{i}, θ_{i})$ ∀ $i = 1, \dots, n$ . See, for example, Blumenson [5].
(2)
Simulate $(α_{j}, β_{j})$ from $π (α_{j}, β_{j} | \dots) \propto G_{0} (α, β) \prod_{d_{i} = j} G a_{K + 1} (y_{i} | α_{d_{i}}, β_{d_{i}})$
(3)
Update $v_{j}$ from $π (v_{j}) \propto Be (1 + \sum_{i = 1}^{n} I (d_{i} = j), M + \sum_{i = 1}^{n} I (d_{i} > j))$
(4)
Simulate $u_{i} \sim Unif (0, ξ_{d_{i}})$
(5)
Simulate $d_{i}$ from $P (d_{i} = q | \dots) \propto ξ_{q}^{- 1} I (q : ξ_{q} > u_{i}) w_{q} G a_{K + 1} (y_{i} | α_{q}, β_{q})$
(6)
Update $M | d$ .
(7)
Update $r_{i}$ from $f (r_{i} | θ_{i}, α_{d_{i}}, β_{d_{i}})$ . ∀ $i = 1, \dots, n .$
(8)
Repeat from (1) until convergence.

Here $I (H)$ is the indicator function on the set H and $Be (\cdot, \cdot)$ represents a Beta distribution.

It should be noticed that the set of all $α_{j}, β_{j}, v_{j}$ variables to be sampled is infinite. However, as Kalli et al. [14] show, it is only necessary to obtain samples up to an appropriate number N, at each step of the algorithm. Consider the set of ${d_{i}}_{i = 1}^{n}$ , such that $d_{i}$ is the largest integer that meets $u_{i} < ξ_{d_{i}} = e^{- d_{i}}$ . In this case, since the sequence of $ξ_{j}$ is positive decreasing, it is possible to obtain all the necessary $d_{i}$ . Thus, the set of integers q for the simulation of $P (d_{i} = q | \cdot)$ in step 5 can be gotten. Therefore $N = \max {d_{i}}$ can be defined, where $d_{i} = d_{i} + ⌊ - \ln (Unif (0, 1)) ⌋$ and $⌊ \cdot ⌋$ represents the floor function.

Step 6 can be carried out following a scheme introduced in Escobar and West [7]. Firstly, we can sample a latent variable $η | M \sim B (M + 1, n)$ and then

M | η, d \sim ψ Ga (a_{0} + N, b_{0} - \log η) + (1 - ψ) Ga (a_{0} + N - 1, b_{0} - \log η)

where the weight is such that $ψ / (1 - ψ) = (a_{0} + N - 1) / (n (b_{0} - \log η))$ .

Once convergence is reached, an approximation to the posterior predictive distribution for the directional variable $θ$ can be gotten. As will be shown in the examples, we have observed that a good approximation can be obtained by estimating the predictive density using

\hat{f} (θ | θ_{1}, \dots, θ_{n}) = \frac{1}{T} \sum_{r = 1}^{T} P G_{K} (θ | α^{(r)}, β^{(r)})

(7)

where T is the number of iterations in the algorithm and $(α^{(r)}, β^{(r)})$ are the projected Gamma parameters of the related mixture components which are sampled at each step of the algorithm.

4. Illustrations

In this section, we use five simulated examples and one real data set to illustrate the performance of the proposed methodology. In all of these examples, we impose non-informative prior assumptions by setting for all models $a_{0} = 2$ , $b_{0} = 8$ , and for the projected Gamma parameters, we set $c_{m} = a_{m} = 1$ and $d_{m} = b_{m} = 0.05$ , for all necessary subscripts m. We use the R language and environment ([1]) to simulate the corresponding data sets and to carry out all of the analyses in this section.

Example 4.1

In this example, we examine data simulated from a finite mixture of three univariate projected Gamma densities $P G_{1} (θ | α, β)$ . Specifically, we consider a data sample of angles θ of size n = 500 from the following model:

$f (θ) = 0.2 P G_{1} (θ | α_{1}, β_{1}) + 0.5 P G_{1} (θ | α_{2}, β_{2}) + 0.3 P G_{1} (θ | α_{3}, β_{3}),$

where $α_{1} = (5, 8)$ , $α_{2} = (20, 10)$ , $α_{3} = (8, 12)$ , $β_{1} = (1, 6)$ , $β_{2} = (1, 0.5)$ and $β_{3} = (1, 0.5)$ . That model produces a trimodal data set. Figure 1 shows a linear histogram of that data set and the true density.

We apply our Bayesian nonparametric approach to analyze that data. Using the simulation algorithm described in the previous section, convergence diagnostics led us to stop the process after 150, 000 iterations, discarding the first 120, 000 as burn-in. From the remaining 30, 000 iterations, we kept one observation every 30 iterations as part of the final sample. Figure 2 compares the true density with the estimated predictive density using (7). We can observe that the proposed model captures the multimodality of the data.

Figure 1. — True density (solid line) and data sample (histogram) for Example 4.1.

Figure 2. — True density (solid black line); estimated density (dashed blue line); 0.95 pointwise credible bands (dot-dashed red lines).

In addition, in order to have an idea of the variability of the process, at each s-th iteration in convergence state, a posterior realization of the random measure for θ as a finite mixture of projected Gamma distributions is obtained. Thus, once these realizations $f^{(s)}$ are available, it is possible to obtain 95% joint pointwise intervals all over the interval $(0, π / 2]$ . In this work, we will call these intervals as pointwise credible bands. For this example, the corresponding 0.95 pointwise credible bands for $f (θ)$ are shown in Figure 2.

In general, the estimator for the predictive density and the pointwise credible bands capture the shape of the true density and describe this type of data qualitatively well.

As a measure of performance for our proposal, we also calculate the number of non-empty components derived from our approach. In each convergence iteration of the algorithm, we get the number of non-empty components, and the mode of these 1000 estimates was J = 3. That estimate (the mode) matches the actual number of components. Table 1 shows the corresponding estimated probability for the number of components obtained through our proposal.

Table 1.

Estimated probability of non-empty components for data from Example 4.1.

	Non-empty components
	$J = 3$	$J = 4$	$J = 5$	$J = 6$
Probability	0.581	0.342	0.070	0.007

Open in a new tab

In the framework of DPM models, the number of components used to estimate the predictive distribution does not necessarily coincide with the number of modes or groups (see, for example, [11] and [20]). In this example, both numbers are equal. This result was foreseen since the data modes are well defined by the data set sample. In addition, the data set was directly sampled from a mixture of projected Gamma densities.

It is worth pointing out that our proposed methodology does not require that observations follow a mixture of projected Gamma distributions. In the next two examples, we analyze the performance of our proposal to describe data sets defined in the first quadrant, but whose distributions are different from a mixture of univariate projected Gamma distributions.

Example 4.2

For this second example, we consider circular observations (univariate angles) from a mixture of truncated von Mises distributions (see, [9]). Specifically, we use the truncated von Mises density defined by

$tvM (θ | μ, κ, a, b) = \frac{\exp {κ \cos (θ - μ)}}{\int_{a}^{b} \exp {κ \cos (θ - μ)} d θ} .$

Using an acceptance-rejection algorithm, we simulate a sample of size n = 500 of circular data from the next model:

$\begin{aligned} f (θ) & = 0.15 tvM (θ | μ = π / 12, κ = 130, 0, π / 2) + 0.5 tvM (θ | μ = π / 4, κ = 25, 0, π / 2) \\ + 0.35 tvM (θ | μ = 9 π / 24, κ = 100, 0, π / 2) . \end{aligned}$

Figure 3 shows a histogram for these data and the true density. It can be noticed that the components for the model have a significant overlap between its corresponding densities, and particularly the mode associated with the second component is dimmed despite the weight assigned to this component is the largest. We carry out our Bayesian nonparametric proposal. The algorithm was implemented and stopped after 200,000 iterations, discarding the first 160, 000 as burn-in. From the remaining 40, 000, we kept one observation every 40 iterations.

The true density with the posterior predictive density $\hat{f} (θ)$ and, the corresponding 0.95 pointwise credible bands are shown in Figure 4. It can be noticed our projected Gamma DP mixture model performs well, despite the fact that observations follow a mixture of truncated von Mises distributions.

Figure 3. — True density (solid line) and histogram for data set of Example 4.2.

Figure 4. — Example 4.2. True density of truncated von Mises mixture (solid black line); estimated predictive density (dashed blue line); 0.95 pointwise credible bands (dot-dashed red lines).

Example 4.3

In this example, we analyze circular observations (univariate angles) from a mixture of truncated projected normal distributions (tPN). Following the works of Fernandez-Gonzalez [9] and Núñez-Antonio and Gutiérrez-Peña [23], we defined a truncated projected normal distribution as:

$tPN (θ | μ, λ, a, b) \propto C [1 + \frac{d b Φ (d b)}{ϕ (d b)}] I_{(a, b)} (θ),$

where $μ = (μ_{1}, μ_{2})^{t}$ is a vector defined on the truncated interval $(a, b)$ and a, $b \in (0, 2 π]$ , $λ \in R^{+}$ and, $ϕ (\cdot)$ and $Φ (\cdot)$ are the standard normal density function and the standard normal cumulative distribution function, respectively. Here $d = \sqrt{\cos^{2} θ + λ \sin^{2} θ}$ , and

$\begin{aligned} b & = \frac{μ_{1} \cos θ + λ μ_{2} \sin θ}{\cos^{2} θ + λ \sin^{2} θ} \\ C & = \frac{λ^{1 / 2} \exp {- \frac{1}{2} (μ_{1}^{2} + λ μ_{2}^{2})}}{2 π d^{2}} \end{aligned}$

For this example, using an acceptance-rejection algorithm we simulated a sample of size n = 500 of angles $θ_{i} \in (0, π / 2]$ from the following model

$\begin{aligned} f (θ) & = 0.2 tPN (θ | μ_{1}, λ_{1}, 0, π / 2) + 0.5 tPN (θ | μ_{2}, λ_{2}, 0, π / 2) \\ + 0.3 tPN (θ | μ_{3}, λ_{3}, 0, π / 2), \end{aligned}$

where $μ_{1} = (13, 4)^{t}$ , $μ_{2} = (3, 5)^{t}$ , $μ_{3} = (5, 10)^{t}$ , $λ_{1} = 2$ , $λ_{2} = 0.6$ and $λ_{3} = 4$ .

We ran our algorithm for 150, 000 iterations. We discarded the first 120, 000 as burn-in. From the remaining 30, 000 iterations, we kept one observation every 30 iterations. Figure 5 compares the true density and the posterior predictive estimator of $f (θ)$ . In addition, the corresponding 0.95 pointwise credible bands are shown. It can be noticed that our proposal based on the DPM of projected Gamma model describes that type of data quite well.

Figure 5. — True density (solid black line); estimated density (dashed blue line); 0.95 pointwise credible bands (dot-dashed red lines).

Results from Example 4.2 and Example 4.3 show our proposed model is able to describe data sets whose probability distribution is not necessary a mixture of projected Gamma densities.

Example 4.4

In this example, we examine data from a mixture of three bivariate projected Gamma densities, $P G_{2} (θ | α, β)$ . From Section 2, the density $P G_{2} (θ | α, β)$ is given by

$\begin{aligned} {PG}_{2} (θ | α, β) \\ = \frac{Γ (\sum_{i = 1}^{3} α_{i}) β_{1}^{α_{1}} β_{2}^{α_{2}} β_{3}^{α_{3}} (\cos θ_{1})^{α_{1} - 1} (\cos θ_{2})^{α_{2} - 1} (\sin θ_{1})^{(α_{2} + α_{3}) - 1} (\sin θ_{2})^{α_{3} - 1}}{Γ (α_{1}) Γ (α_{2}) Γ (α_{3}) (β_{1} \cos θ_{1} + β_{2} \sin θ_{1} \cos θ_{2} + β_{3} \sin θ_{1} \sin θ_{2})^{(α_{1} + α_{2} + α_{3})}} \\ I_{(0, π / 2]^{2}} (θ), \end{aligned}$

where $θ = (θ, ϕ)$ , $α = (α_{1}, α_{2}, α_{3})$ and $β = (β_{1} = 1, β_{2}, β_{3})$ . We simulated a sample of size n = 1000 of spherical data $θ = (θ, ϕ)$ from the next model

$f (θ) = 0.2 {PG}_{2} (θ | α_{1}, β_{1}) + 0.5 {PG}_{2} (θ | α_{2}, β_{2}) + 0.3 {PG}_{2} (θ | α_{3}, β_{3}),$

where $α_{1} = (6.9, 11, 19.2)$ , $β_{1} = (1, 5.3, 9.2)$ , $α_{2} = (4.5, 8.7, 7.8)$ , $β_{2} = (1, 1.3, 2.2)$ and $α_{3} = (4.8, 13, 10)$ , $β_{3} = (1, 3.2, 1.3)$ .

Figure 6(a) shows the corresponding contour plots in the $(θ, ϕ)$ -plane for the true density. It can be appreciated that variability components produce three modes. We run our Bayesian nonparametric proposal, convergence diagnostics led us to stop the algorithm after 300, 000 iterations, discarding the first 240, 000 as burn-in. From the remaining 60, 000, we kept one observation every 60 iterations as part of the final realizations. Figure 6(b) shows the contour plots on the $(θ, ϕ)$ -plane for the corresponding posterior predictive density $\hat{f} (θ, ϕ)$ . It can be noticed how closely $\hat{f} (θ, ϕ)$ and the true model resemble each other. Particularly, $\hat{f} (θ, ϕ)$ captures the three-modal behavior of the actual density. In addition, the contour plot of the $\hat{f} (θ, ϕ)$ together with the observed data are represented in Figure 7(a,b) on the $(θ, ϕ)$ -plane and on the first orthant of the unit sphere $S^{2}$ , respectively.

Figure 7. — Data set and contour plots for Example 4.4. (a) Data set (blue dots) and estimated density contour plot (solid black lines) in $(θ, ϕ)$ plane. (b) Data set (blue dots) and contour plot of estimated density over $S^{2}$ (solid black lines).

Like the Example 4.1, we also estimate the true number of components. In each convergence iteration of the algorithm, we calculate the number of non-empty components, and the mode of these 1000 estimates was J = 3. In this case, that estimate exactly matches the correct number of components, too. Table 2 shows the corresponding estimated probability for the numbers of components obtained through our proposal.

Table 2.

Estimated probability of non-empty components for data from Example 4.4.

	Non-empty components
	$J = 3$	$J = 4$	$J = 5$
Probability	0.509	0.458	0.033

Open in a new tab

Results of this section show that the proposed projected Gamma DP mixture model is appropriate for analyzing directional data only defined on the first orthant of $S^{k}$ .

Example 4.5

In the mixture model approach, a potential problem in density estimation is overfitting. That means, the estimation carried out with a model could collect noise, inducing non-existent modes in the estimation. In order to analyze that issue, we simulated a sample of size n = 500 from a projected Gamma distribution, $P G_{1} (θ | α, β)$ , with $α = (5, 8)$ and $β = (1, 6)$ . Figure 8 shows the corresponding histogram of the simulated sample. Initially, from the histogram features, a multimodal model could be probable for the true model from which the sample was obtained. However, it is worth mentioning that the true model, $P G_{1} (θ | α, β)$ , is unimodal.

Figure 8. — Simulated dataset sampled from an unimodal model (histogram) and 100 estimated predictive densities (red points) obtained from the proposed approach.

In order to analyze a possible overfitting for the predictive density estimated from our approach, we run the proposed algorithm 100 times. At each of these repetitions, we estimated the predictive density. In all cases, the estimated predictive distribution obtained was unimodal, adequately describing the data set. The previous analysis suggests our approach does not induce overfitting. Figure 8 shows the corresponding 100 estimated predictive densities.

4.1. Real data example

Phenology is the study of the life cycle events of plants and animals caused by environmental changes. These studies are very important for ecological monitoring and analysis of the impact of climate change on the environment. In recent years, technological advances have made it possible to obtain a large amount and variety of data. In particular, satellite-derived metrics allowed the phenological analysis to be expanded to a broader scale.

One of these metrics is the Normalized Difference Vegetation Index (NDVI), which is an indicator of the amount of vegetation and its health in a specific area. This indicator mathematically compares the amount of absorbed visible red light and reflected near-infrared light. In this context, backscatter light is considered when the satellite sensor is aligned with the incident illumination and is reflected at a phase angle lower than $π / 2$ , which is called backscatter solar-to-sensor angle. The analysis of behavior of this angle is important in the study of its correlation with some phenological metrics, such as NDVI peaks [24]. A database of 243, 422 backscatter solar-to-sensor angles in a pinyon-juniper ecosystem in Grand Canyon National Park was released for public access by the U.S. Geological Survey [25].

For this example, we apply our methodology for analyzing a sample of size n = 2000 of backscatter solar-to-sensor angles from the database cited above. The sample obtained has special characteristics such as multimodality. We fitted a DPM of projected Gamma model and produced the estimators for the predictive density and the 0.95 pointwise credible bands, which are shown in Figure 9 along with the histogram for this data set. It can be noticed that these estimators adequately describe the general behavior of this type of data. Remarkably, the variability described from the pointwise credible bands in the first half of the data could lead to a model with two, three or four modes, as a probable model to the predictive distribution. In general, measures of goodness-of-fit for describing real phenomena through inferences based on models are useful.

Figure 9. — Estimated density (dashed blue line) and 0.95 pointwise credible bands (dot-dashed red lines) for sample of backscatter solar-to-sensor angle.

As an objective measure of comparison, we compute the logarithm of the pseudo marginal likelihood (LPML) for some existing circular distributions, which is a goodness-of-fit measure originally suggested by Geisser and Eddy [12]. The considered models are shown in Table 3. Here $PN (θ | μ)$ denotes a projected normal model, $tPN (θ | μ, λ, 0, π / 2)$ and $tvM (θ | μ, κ, 0, π / 2)$ , a projected normal model and a von Mises truncated on the interval $(0, π / 2]$ , respectively. The Model 1 and Model 2 are models defined all over (unbounded) the unit circle (see Núñez-Antonio et al. [21]). On the other hand, Model 1, Model 3, Model 5 and Model 6 are nonparametric models, which can describe multimodal and asymmetric behaviors. In addition, we must mention the fitting of Model 1, Model 5 and Model 6 become a model with only two probable modes (the last two modes from the real data histogram, see Figure 10) for the predictive distribution. Table 3 shows the corresponding value of the statistics LPML for each of these models. It can be seen the worst fitting (smaller LPML value) is obtained from Model 2, which is an unimodal and symmetric model. The next better model is the projected Gamma model (Model 4), which can describe some symmetric, asymmetric, and unimodal or bimodal patterns of behavior (see Núñez-Antonio and Geneyro [22]). Despite Model 1, Model 3, Model 5 and Model 6 being nonparametric models, the best fitting according to the LPML is obtained from Model 3. It should be mentioned that our proposed methodology is quite robust regarding the sensitivity of the LPML, under different non-informative prior specifications for the parameters $α$ and $β$ of the projected Gamma DP mixture model (Model 3). That means, the corresponding obtained values of the statistics LPML result in a range from 371 to 377. Finally, it is worth mentioning that the fit of the projected Gamma DP mixture model (Model 3) is approximately 20% more computationally expensive than the truncated projected normal DP mixture model (Model 5) and comparable to the truncated von Mises DP mixture model (Model 6).

Table 3.

LPML goodness-of-fit measures for real data.

Model	$f (θ)$	Type	LPML
Model 1:	$\sum_{j = 1}^{\infty} w_{j} PN (θ \| μ_{j})$	unbounded (multimodal)	344.04
Model 2:	$PN (θ \| μ)$	unbounded (unimodal)	231.56
Model 3:	$\sum_{j = 1}^{\infty} w_{j} P G_{1} (θ \| α_{j}, β_{j})$ (our model)	bounded (multimodal)	373.90
Model 4:	$P G_{1} (θ \| α, β)$	bounded	286.50
Model 5:	$\sum_{j = 1}^{\infty} w_{j} tPN (θ \| μ_{j}, λ_{j}, 0, π / 2)$	bounded (multimodal)	348.87
Model 6:	$\sum_{j = 1}^{\infty} w_{j} tvM (θ \| μ_{j}, κ_{j}, 0, π / 2)$	bounded (multimodal)	335.49

Open in a new tab

Figure 10 shows the corresponding predictive distribution and pointwise credible bands for each of the proposed models for making inferences about the backscatter solar-to-sensor angles. In addition, Table 4 shows the corresponding estimated probabilities of non-empty components for all nonparametric models. As mentioned above, initially a model with two or three or four modes could be appropriated to describe the backscatter solar-to-sensor angles. However, an analysis of the results from Table 3, Table 4 and Figure 10 leads to select a four-mode model as the best model. In this case, the best fitting is obtained from the Model 3. Thus, the best model to describe the backscatter solar-to-sensor angles turns out to be the projected Gamma DP mixture model.

Table 4.

Estimated probability of non-empty components of DPM models for the real data example.

		Non-empty components
		$J = 3$	$J = 4$	$J = 5$	$J = 6$
Model 1:	$\sum_{j = 1}^{\infty} w_{j} PN (θ \| μ_{j})$	0.170	0.681	0.124	0.025
Model 3:	$\sum_{j = 1}^{\infty} w_{j} P G_{1} (θ \| α_{j}, β_{j})$	–	0.988	0.012	–
Model 5:	$\sum_{j = 1}^{\infty} w_{j} tPN (θ \| μ_{j}, λ_{j}, 0, π / 2)$	0.856	0.135	0.009	–
Model 6:	$\sum_{j = 1}^{\infty} w_{j} tvM (θ \| μ_{j}, κ_{j}, 0, π / 2)$	0.282	0.435	0.220	0.063

Open in a new tab

Once a projected Gamma DP mixture model has been adjusted and selected to describe the backscatter solar-to-sensor angles, some inferences can be made. It is possible to conclude that there are four angle values that may be correlated with NDVI index value peaks and consequently indicate possible different vegetation health status in the pinyon-juniper ecosystem in Grand Canyon National Park. These clustering values are given by the modes of the final predictive distribution, which are determinated by 0.183, 0.341, 0.593 and 0.814 radians. Thus, once these clustering values have been defined, appropriate correlation studies in regions around these values can be carried out by ecological researchers.

On the other hand, as is pointed out in [24], a goal of phenological studies related with NDVI is to detect timing changes of life cycle events during a year or from year-to-year. From the specifical angular regions described previously, a range of possible NDVI index values associated with these regions can be calculated. Then, it is possible to decide about the existence of atypical data in future samples of NDVI index and analyze the possible causes of these atypical records.

The results obtained in this section suggest our proposed model can be suitable for analyzing real directional data defined on the first orthant of the K-dimensional unit sphere.

5. Concluding remarks

This work introduces a new Bayesian nonparametric approach for analyzing directional data defined only on the first orthant of the K-dimensional unit sphere, $S^{K}$ . The proposed model is based on the Dirichlet Process mixture where a multivariate projected Gamma density is considered as base distribution.

Our nonparametric proposal considers a particular multivariate projected Gamma distribution where an independent structure defines the corresponding unprojected multivariate distribution. Initially, this choice of unprojected multivariate distribution seems restrictive for describing data with complex behaviors. However, the proposed model is able to describe data sets generated even by mixtures from different distributions, such as truncated von Mises and truncated projected Normal distributions.

Some extensions of the proposed approach are possible. Firstly, from a theoretical frame, it would be interesting to develop an extension of the model without the independent structure originally contemplated. Secondly, due to the close relationship that exists between the first orthant of the K-dimensional unit sphere and the corresponding K-dimensional unit simplex, it would be interesting to develop a methodology that allows the analysis of compositional data through the approach proposed in this paper. Work is currently in progress for these problems.

Funding Statement

The work of the first author was supported by CONACYT, Mexico. The second author was partially supported from CONACYT, through Sistema Nacional de Investigadores, Mexico. The support received from the Department of Mathematics of the Metropolitan Autonomous University, Iztapalapa Unit is also gratefully acknowledged. Finally, the authors are grateful to the anonymous reviewers for their detailed and insightful comments.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

1.R Core Team R: A language and environment for statistical computing, in R Foundation for statistical computing, Vienna, Austria. https://www.R-project.org. Accessed 2022.
2.Abe T. and Pewsey A., Symmetric circular models through duplication and cosine perturbation, Comput. Stat. Data Anal. 55 (2011), pp. 3271–3282. [Google Scholar]
3.Arnold B.C. and SenGupta A., Recent advances in the analyses of directional data in ecological and environmental sciences, Environ. Ecol. Stat. 13 (2006), pp. 253–256. [Google Scholar]
4.Blackwell D., Discreteness of Ferguson selections, Ann. Stat. 1 (1973), pp. 356–358. [Google Scholar]
5.Blumenson L.E., A derivation of n-dimensional spherical coordinates, Am. Math. Mon. 67 (1960), pp. 63–66. [Google Scholar]
6.D'Elia A., Borgioli C., and Scapini F., Orientation of sandhoppers under natural conditions in repeated trials: An analysis using longitudinal directional data, Estuar. Coast. Shelf Sci. 53 (2001), pp. 839–847. [Google Scholar]
7.Escobar M.D. and West M., Bayesian density estimation and inference using mixtures, J. Amer. Stat. Assoc. 90 (1995), pp. 577–588. [Google Scholar]
8.Ferguson T.S., A Bayesian analysis of some nonparametric problems, Ann. Stat. 1 (1973), pp. 209–230. [Google Scholar]
9.Fernandez-Gonzales P., Bielza C., and Larrañaga P., Univariate and bivariate truncated von mises distributions, Prog. Artif. Intell. 6 (2017), pp. 171–180. [Google Scholar]
10.Fisher N.I., Statistical Analysis of Circular Data, Cambridge University Press, Cambridge, 1995. [Google Scholar]
11.Frühwirth-Schnatter S. and Malsiner-Walli G., From here to infinity: Sparse finite versus Dirichlet process mixtures in model-based clustering, Adv. Data Anal. Classif. 13 (2019), pp. 33–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Geisser S. and Eddy W.F., A predictive approach to model selection, J. Am. Stat. Assoc. 74 (1979), pp. 153–160. [Google Scholar]
13.Jammalamadaka S.R. and SenGupta A., Topics in Circular Statistics, World Scientific, Singapore, 2001. [Google Scholar]
14.Kalli M., Griffin J.E., and Walker S.G., Slice sampling mixture models, Stat. Comput. 21 (2011), pp. 93–105. [Google Scholar]
15.Kim S. and SenGupta A., Multimodal exponential families of circular distributions with application to daily peak hours of PM2. 5 level in a large city, J. Appl. Stat. 48 (2020), pp. 3193–3207. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Lee A., Circular data, Wiley Interdiscip Rev. Comput. Stat. 2 (2010), pp. 477–486. [Google Scholar]
17.Lo A.Y., On a class of Bayesian nonparametric estimates: I. Density estimates, Ann. Stat. 12 (1984), pp. 351–357. [Google Scholar]
18.Mardia K.V., Statistics of Directional Data, Academic Press, London, 1972. [Google Scholar]
19.Mardia K.V. and Jupp P.E., Directional Statistics, Wiley, Chichester, 2000. [Google Scholar]
20.Miller J.W. and Harrison M.T., A simple example of Dirichlet process mixture inconsistency for the number of components, Adv. Neural Inf. Process Syst. 26 (2013), pp. 199–206. [Google Scholar]
21.Núñez-Antonio G., Ausín M.C., and Wiper M.P., Bayesian nonparametric models of circular variables based on Dirichlet process mixtures of normal distributions, J. Agric. Biol. Environ. Stat. 20 (2015), pp. 47–64. [Google Scholar]
22.Núñez-Antonio G. and Geneyro E., A multivariate projected gamma model for directional data, Comm. Stat. Simul. Comput. 50 (2019), pp. 2721–2742. doi: 10.1080/03610918.2019.1612910 [DOI] [Google Scholar]
23.Núñez-Antonio G. and Gutiérrez-Peña E., A Bayesian analysis of directional data using the projected normal distribution, J. Appl. Stat. 32 (2005), pp. 995–1001. [Google Scholar]
24.Norris J.R. and Walker J.J., Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States, Remote Sens. Environ. 249 (2020), pp. 112013. [Google Scholar]
25.Norris J.R. and Walker J.J., Data release associated with the journal article: Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States, U.S. Geological Survey data release, 2017. Available at 10.5066/P9LNQL6L. [DOI]
26.Oliveira M., Crujeiras R.M., and Rodríguez-Casal A., A plug-in rule for bandwidth selection in circular density estimation, Comput. Stat. Data Anal. 56 (2012), pp. 3898–3908. [Google Scholar]
27.Paine P.J., Preston S.P., Tsagris M., and Wood A.T., An elliptically symmetric angular Gaussian distribution, Stat. Comput. 28 (2018), pp. 689–697. [Google Scholar]
28.Pewsey A., Neuhäuser M., and Ruxton G.D., Circular Statistics in R, University Press, Oxford, 2013. [Google Scholar]
29.Poletto P.R., Santos H.H., Salvini T.F., Coury H.J.C.G., and Hansson G.A., Peak torque and knee kinematics during gait after eccentric isokinetic training of quadriceps in healthy subjects, Braz. J. Phys. Ther. 12 (2008), pp. 331–337. [Google Scholar]
30.Presnell B., Morrison S.P., and Littell R.C., Projected multivariate linear models for directional data, J. Am. Stat. Assoc. 93 (1998), pp. 1068–1077. [Google Scholar]
31.Sethuraman J., A constructive definition of Dirichlet priors, Stat. Sin. 4 (1994), pp. 639–650. [Google Scholar]
32.Sheinin D. and Emamdjomeh A., These days in baseball, every batter is trying to find an angle. The Washington Post (2017). Available at https://www.washingtonpost.com/graphics/sports/mlb-launch-angles-story/.
33.Vailati A., Zinnato L., and Cerbino R., How Archer fish achieve a powerful impact: Hydrodynamic instability of a pulsed jet in Toxotes Jaculatri, PLoS ONE 7 (2012), pp. e47867. 10.1371/journal.pone.0047867 [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Walker S.G., Sampling the Dirichlet mixture model with slices, Comm. Stat. Simul. Comput. 36 (2007), pp. 45–54. [Google Scholar]

[CIT0001] 1.R Core Team R: A language and environment for statistical computing, in R Foundation for statistical computing, Vienna, Austria. https://www.R-project.org. Accessed 2022.

[CIT0002] 2.Abe T. and Pewsey A., Symmetric circular models through duplication and cosine perturbation, Comput. Stat. Data Anal. 55 (2011), pp. 3271–3282. [Google Scholar]

[CIT0003] 3.Arnold B.C. and SenGupta A., Recent advances in the analyses of directional data in ecological and environmental sciences, Environ. Ecol. Stat. 13 (2006), pp. 253–256. [Google Scholar]

[CIT0004] 4.Blackwell D., Discreteness of Ferguson selections, Ann. Stat. 1 (1973), pp. 356–358. [Google Scholar]

[CIT0005] 5.Blumenson L.E., A derivation of n-dimensional spherical coordinates, Am. Math. Mon. 67 (1960), pp. 63–66. [Google Scholar]

[CIT0006] 6.D'Elia A., Borgioli C., and Scapini F., Orientation of sandhoppers under natural conditions in repeated trials: An analysis using longitudinal directional data, Estuar. Coast. Shelf Sci. 53 (2001), pp. 839–847. [Google Scholar]

[CIT0007] 7.Escobar M.D. and West M., Bayesian density estimation and inference using mixtures, J. Amer. Stat. Assoc. 90 (1995), pp. 577–588. [Google Scholar]

[CIT0008] 8.Ferguson T.S., A Bayesian analysis of some nonparametric problems, Ann. Stat. 1 (1973), pp. 209–230. [Google Scholar]

[CIT0009] 9.Fernandez-Gonzales P., Bielza C., and Larrañaga P., Univariate and bivariate truncated von mises distributions, Prog. Artif. Intell. 6 (2017), pp. 171–180. [Google Scholar]

[CIT0010] 10.Fisher N.I., Statistical Analysis of Circular Data, Cambridge University Press, Cambridge, 1995. [Google Scholar]

[CIT0011] 11.Frühwirth-Schnatter S. and Malsiner-Walli G., From here to infinity: Sparse finite versus Dirichlet process mixtures in model-based clustering, Adv. Data Anal. Classif. 13 (2019), pp. 33–64. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0012] 12.Geisser S. and Eddy W.F., A predictive approach to model selection, J. Am. Stat. Assoc. 74 (1979), pp. 153–160. [Google Scholar]

[CIT0013] 13.Jammalamadaka S.R. and SenGupta A., Topics in Circular Statistics, World Scientific, Singapore, 2001. [Google Scholar]

[CIT0014] 14.Kalli M., Griffin J.E., and Walker S.G., Slice sampling mixture models, Stat. Comput. 21 (2011), pp. 93–105. [Google Scholar]

[CIT0015] 15.Kim S. and SenGupta A., Multimodal exponential families of circular distributions with application to daily peak hours of PM2. 5 level in a large city, J. Appl. Stat. 48 (2020), pp. 3193–3207. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0016] 16.Lee A., Circular data, Wiley Interdiscip Rev. Comput. Stat. 2 (2010), pp. 477–486. [Google Scholar]

[CIT0017] 17.Lo A.Y., On a class of Bayesian nonparametric estimates: I. Density estimates, Ann. Stat. 12 (1984), pp. 351–357. [Google Scholar]

[CIT0018] 18.Mardia K.V., Statistics of Directional Data, Academic Press, London, 1972. [Google Scholar]

[CIT0019] 19.Mardia K.V. and Jupp P.E., Directional Statistics, Wiley, Chichester, 2000. [Google Scholar]

[CIT0020] 20.Miller J.W. and Harrison M.T., A simple example of Dirichlet process mixture inconsistency for the number of components, Adv. Neural Inf. Process Syst. 26 (2013), pp. 199–206. [Google Scholar]

[CIT0021] 21.Núñez-Antonio G., Ausín M.C., and Wiper M.P., Bayesian nonparametric models of circular variables based on Dirichlet process mixtures of normal distributions, J. Agric. Biol. Environ. Stat. 20 (2015), pp. 47–64. [Google Scholar]

[CIT0022] 22.Núñez-Antonio G. and Geneyro E., A multivariate projected gamma model for directional data, Comm. Stat. Simul. Comput. 50 (2019), pp. 2721–2742. doi: 10.1080/03610918.2019.1612910 [DOI] [Google Scholar]

[CIT0023] 23.Núñez-Antonio G. and Gutiérrez-Peña E., A Bayesian analysis of directional data using the projected normal distribution, J. Appl. Stat. 32 (2005), pp. 995–1001. [Google Scholar]

[CIT0024] 24.Norris J.R. and Walker J.J., Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States, Remote Sens. Environ. 249 (2020), pp. 112013. [Google Scholar]

[CIT0025] 25.Norris J.R. and Walker J.J., Data release associated with the journal article: Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States, U.S. Geological Survey data release, 2017. Available at 10.5066/P9LNQL6L. [DOI]

[CIT0026] 26.Oliveira M., Crujeiras R.M., and Rodríguez-Casal A., A plug-in rule for bandwidth selection in circular density estimation, Comput. Stat. Data Anal. 56 (2012), pp. 3898–3908. [Google Scholar]

[CIT0027] 27.Paine P.J., Preston S.P., Tsagris M., and Wood A.T., An elliptically symmetric angular Gaussian distribution, Stat. Comput. 28 (2018), pp. 689–697. [Google Scholar]

[CIT0028] 28.Pewsey A., Neuhäuser M., and Ruxton G.D., Circular Statistics in R, University Press, Oxford, 2013. [Google Scholar]

[CIT0029] 29.Poletto P.R., Santos H.H., Salvini T.F., Coury H.J.C.G., and Hansson G.A., Peak torque and knee kinematics during gait after eccentric isokinetic training of quadriceps in healthy subjects, Braz. J. Phys. Ther. 12 (2008), pp. 331–337. [Google Scholar]

[CIT0030] 30.Presnell B., Morrison S.P., and Littell R.C., Projected multivariate linear models for directional data, J. Am. Stat. Assoc. 93 (1998), pp. 1068–1077. [Google Scholar]

[CIT0031] 31.Sethuraman J., A constructive definition of Dirichlet priors, Stat. Sin. 4 (1994), pp. 639–650. [Google Scholar]

[CIT0032] 32.Sheinin D. and Emamdjomeh A., These days in baseball, every batter is trying to find an angle. The Washington Post (2017). Available at https://www.washingtonpost.com/graphics/sports/mlb-launch-angles-story/.

[CIT0033] 33.Vailati A., Zinnato L., and Cerbino R., How Archer fish achieve a powerful impact: Hydrodynamic instability of a pulsed jet in Toxotes Jaculatri, PLoS ONE 7 (2012), pp. e47867. 10.1371/journal.pone.0047867 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0034] 34.Walker S.G., Sampling the Dirichlet mixture model with slices, Comm. Stat. Simul. Comput. 36 (2007), pp. 45–54. [Google Scholar]

PERMALINK

A Bayesian nonparametric model for bounded directional data on the positive orthant of the unit sphere

Emiliano Geneyro

Gabriel Núñez-Antonio

Abstract

1. Introduction

2. The projected gamma distribution

3. Directional DP mixture model

3.1. DP mixture model

3.2. The projected gamma DP mixture model

3.3. Bayesian inference

4. Illustrations

Example 4.1

Figure 1.

Figure 2.

Table 1.

Example 4.2

Figure 3.

Figure 4.

Example 4.3

Figure 5.

Example 4.4

Figure 6.

Figure 7.

Table 2.

Example 4.5

Figure 8.

4.1. Real data example

Figure 9.

Table 3.

Figure 10.

Table 4.

5. Concluding remarks

Funding Statement

Disclosure statement

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases