Skip to main content
Journal of Applied Statistics logoLink to Journal of Applied Statistics
. 2022 Jan 24;50(6):1358–1377. doi: 10.1080/02664763.2022.2028130

Compound truncated Poisson gamma distribution for understanding multimodal SAR intensities

A D C Nascimento a,CONTACT, Leandro C Rêgo b, Jonas W A Silva a
PMCID: PMC10071979  PMID: 37025275

Abstract

In recent years, many works have addressed the proposal of new probability models by theoretic and applied reasons. Specifically, mixture models have been indicated to describe phenomena whose resulting data impose high flexibility. One drawback of these tools is the high number of parameters involved, which implies hard inference procedures. To outperform this gap, we propose a new model that is able to describe multimodal behaviors with only three parameters, called compound truncated Poisson gamma (CTrPGa) distribution. Some properties of the CTrPGa law are derived and discussed: hazard, characteristic and cumulative functions and ordinary moments. Beyond, moment estimation, maximum likelihood estimation (via the expectation maximization algorithm) and empirical characteristic function methods for CTrPGa parameters are furnished. The first of them may be reduced to solve one nonlinear equation, which facilitates its use. We perform a simulation analysis to compare the performance of the three estimation methods studied. Moreover, since the gamma distribution and its mixture versions are commonly used to characterize synthetic aperture radar (SAR) intensities, we perform some real experiments with SAR imagery. The results present evidence that our model is a reasonable assumption that can be taken into account in the pre-processing step of such images.

Keywords: Compound models, gamma distribution, multimodal data, ordinary moments

1. Introduction

Several phenomena produce multimodal data in practice Cobb [5]; for instance, in image processing El-Zaart and Ziou [11] and in voice recognition Povey et al. [31]. Mixture models have been employed for describing these data McLachlan and Peel [23]. Bouguila and Ziou [2] proposed a new finite mixture model based on the generalized Dirichlet distribution. They also applied their proposal to image object recognition. Yang and Liu [37] have presented a new mixture density model based on the concept of group. Mignotte et al. [24] have proposed an unsupervised segmentation based on a hierarchical Markov random field model and Yang and Krishnan [36] provided an image segmentation method based on mixture and spatial information. However, such models impose difficult to inference and optimization (crucial steps in their adjustment) procedures because the presence of a high number of parameters. To solve this issue, we propose a three-parameter probability distribution which may describe data having multiple modes.

The practical focus of this paper is to study features in the remote sensing context. Synthetic aperture radar (SAR) images have been employed as an efficient source for solving remote sensing issues. Each pixel of a SAR image is associated with a complex element whose squared norm is known as intensity. To detail the last statement, as discussed by Delignon and Pieczynski [8], the back-scattered field defined on SAR illuminated areas is determined as

F=k=1NFkC, (1)

where Fk is the kth scatter in a resolution cell and N is the number of scatters in one resolution cell that is addressed in two senses: (i) if N is large and constant, F follows the circular Gaussian distribution and the associated intensity I=||F||2 has exponential law [16]; (ii) N can be understood as a random variable following the Poisson(λ) distribution. We aim to advance in (ii). Note that E(N)=λ governs the number of parcels in (1). The intensity of F is

I=||F||2=l=1NFlFl+2Re[l<jFlFj], (2)

where |||| is the Euclidean norm, E() is the expected value and () is the conjugate of a complex argument. Here, we assume that the second parcel in (2) is null [3, p. 47]. The intensity of the SAR signal plays an important role because it carries physical properties of the under-inspection surface. Several imagery post-processing methods – like boundary detection Nascimento et al. [26], classification Ferreira et al. [12] and segmentation Ferreira and Nascimento [7] – are based on the intensity probability distribution. In the processing of such images, two important issues are: (i) SAR returns receive the interference of the speckle noise what makes their processing difficult and, when working with any image, (ii) it is expected some scenarios involve several texture kinds (or objects) and, therefore, the associated empirical distributions are multimodal. Here, we suggest as a pre-processing probabilistic assumption to describe SAR intensities by the sum of a random number N, following the truncated Poisson (TrP) model, of gamma (Ga) distributed independent random variables, called compound truncated Poisson gamma (CTrPGa) distribution.

One way to generate probability distributions is by means of the compounding N method proposed by Karlis and Xekalaki [20]. In this method, a phenomenon is modeled by the random sum:

SN=X1+X2++XN, (3)

where both N and {Xi;i=1,,N} are independent random variables. In what follows, we assume that S=SN. Equation (3) has been widely used by its analytic form arising from several natural phenomena. Revfeim [32] derived the compound Poisson exponential for describing the total precipitation per day, where the number of days with precipitation is Poisson distributed and the precipitation amount follows the exponential distribution. Panger [28] discussed that the compound Poisson and the negative binomial models are extensively used in risk economic theory. Thompson [33] performed an application of the compound Poisson distribution to model the total amount of monthly rainfall. Özel and İnal [18,19] derived a procedure to obtain the compound Poisson process probability function and applied it to describe an aftershock sequence in Turkey and the number of fatalities in Groningen. Recently, Nascimento et al. [27] and Vasconcelos et al. [35] have also modeled SAR imagery using the compound truncated Poisson normal (CTrPN) and compound truncated Poisson Cauchy (CTrPCa) distributions, respectively.

In this paper, we address three aims. First, we introduce the CTrPGa model to fit intensities extracted from SAR images. It is worth to mention the Ga distribution is the ‘mother’ law in multiplication modeling to describe intensities Frery et al. [14], being the Ga more adherent to SAR imagery physical formation than Gaussian and Cauchy distributions that were studied by Nascimento et al. [27] and Vasconcelos et al. [35], respectively, by proposing the CTrPN and CTrPCa distributions. Our model has three parameters: one shape parameter, one of scale and one that indicates the number of parcels in (3) as latent information to observed intensity. The CTrPGa distribution is able to describe amodal, unimodal, and multimodal phenomena. Some of its properties are derived and discussed: characteristic and cumulant generator functions, ordinary moments and skewness and kurtosis measures. Second, we propose three estimation methods for CTrPGa parameters: moment method (MM), maximum likelihood estimation (MLE) via the expectation maximization (EM) algorithm and empirical characteristic function (ECF) methods. We rewrite the first of them such that the associated estimation system can be reduced to one nonlinear equation in terms of one parameter. Moreover, we compare these three estimation methods through a simulation analysis. Third, applications to real SAR data are made. The performance of our model is quantified and compared with other fits due to the generalized beta-generated Lindley (GBGL) (Lima et al. [21]) and Weibull–Poisson (WP) (Lu and Shi [22]) distributions (recently proposed to describe lifetime data) and other five models, G0 (Frery et al. [14]), K (Blacknell [1]), beta generalized normal (BGN)  (Cintra et al. [4]), gamma generalized normal (GaGN)  (Cordeiro et al. [6]) and CTrPN (Nascimento et al. [27]) distributions, which are commonly used in the SAR literature.

This paper is outlined as follows. In Section 2, we define the CTrPGa model and present some of its properties. Three estimation methods are discussed in Section 3. In Section 4, a simulation study is made to compare these estimation methods. Section 5 addresses two applications to real data. Conclusions are presented in Section 6.

2. Gamma-based stochastic sum

Nascimento et al. [27] have proposed the compound N family, defined as S=j=1NXj and having cumulative distribution function (cdf)

FS(x)=k=1P(N=k)FSk(x), (4)

where FSk is cdf of the sum of a k-points random sample, Sk=i=1kXk.

Consider the problem of deriving the random sum of gamma distributions with the number of terms following the TrP law. Thus, assuming NTrP(λ) and XiGa(α,β) as inputs to (4), we get that SkGa(kα,β) and

FS(x;λ,α,β)=k=1[λkk!(eλ1)][γ(kα,βx)Γ(kα)]=1eλ1k=1λkγ(kα,βx)k!Γ(kα), (5)

where λ,α,β>0, α and β are the scale and shape parameters, respectively, γ(δ,z)=[Γ(δ)]10ztδ1etdt denotes the lower incomplete gamma function and Γ() is the gamma function.

This case is denoted by SCTrPGa(λ,α,β). By differentiating (5), one has that the CTrPGa probability density function (pdf) is given by

fS(x;λ,α,β)=(1eλ1)k=1(λβα)kxkα1eβxk!Γ(kα). (6)

From (5) and (6), the CTrPGa hazard rate function (hrf) is given by

hS(x;λ,α,β)=fS(x;λ,α,β)1FS(x;λ,α,β)=(1eλ1)k=1(λβα)kxkα1eβxk!Γ(kα)1(1eλ1)k=1λkγ(kα,βx)k!Γ(kα).

Figures 1 and 2 present, respectively, some cases of the CTrPGa pdf and hrf. It is noticeable that the proposed model is able to describe multimodal, asymmetric and symmetric density behaviors (Figure 1). Moreover, CTrPGa hrf (Figure 2) may assume three standard patterns (increasing, decreasing and upside-down bathtub shapes).

Figure 1.

Figure 1.

Pdf curves for several parametric points.

Figure 2.

Figure 2.

CTrPGa hrf curves for several parametric points.

Based on Theorem 2.1 and on Equations (6) and (7) in [27], the following kind moment quantities hold: If SCTrPGa(λ,α,β), its characteristic function (CF), moment generate function (mgf) and cumulative generate function are given by, respectively,

φS(t)=eλ(ββit)α1eλ1,MS(t)=eλ(ββt)α1eλ1andKS(t)=log(eλ(ββt)α1eλ1).

The following proposition may be verified from the CTrPGa mgf.

Proposition 2.1

Let SCTrPGa(λ,α,β), its mean and variance are given by

E(S)=λαeλβ(eλ1)andVar(S)=λαeλβ2(eλ1)2[eλ(α+1)α(1+λ)1],

respectively.

It is possible to check easily that the expression for the variance is positive, since ex>1+x for all x>0.

Asymmetry and kurtosis measures of the CTrPGa distribution are given by, respectively,

SK=κ3κ23/2andK=κ4κ22,

where κ2=Var(S) and κ3 and κ4 are expressed in (7) and (8), respectively,

κ3=λαeλβ3(eλ1)3[(eλ1)2[(α+1)(α+2)+λα(3(α+1)+λα)]3λαeλ(eλ1)[α(1+λ)+1]+2λ2α2e2λ] (7)

and

κ4=λαeλβ4(eλ1)[(α+1)(α+2)[α(1+λβα+1)+3]+λα[α(6α+15)+3λα(α+1)+9]+βα+1λ2α2[3(α+1)+λα]]λ2α2e2λβ4(eλ1)2[4(α+1)(α+2)+λα[3(α+1)+λα]+3[α(1+λ)+1]2]+6λ3α3e3λβ4(eλ1)4[2(eλ1)[α(1+λ)+1]λαeλ]. (8)

Details of these results are presented in Appendix 1.

3. Estimation procedures for CTrPGa parameters

This section presents three estimation procedures for the CTrPGa parameters.

3.1. Moment estimation

Let S1,S2,,Sn be a random sample draw from SCTrPGa(λ,α,β). The moment estimator, θ^=(λ^,α^,β^), for θ=(λ,α,β) is obtained from the following system: for j = 1, 2, 3,

E(Sj)|θ=θ^=1ni=1nSij.

In particular, we use

{E(S)|θ=θ^=1ni=1nSiS¯E(S2)|θ=θ^=1ni=1nSi2S2¯E(S3)|θ=θ^=1ni=1nSi3S3¯

From Proposition 1 in [27], the next identities follow:

  1. From S¯=E(S)|θ=θ^,
    S¯=λ^α^eλ^β^(eλ^1)α^=S¯(β^(eλ^1)λ^eλ^). (9)
  2. From S2¯=E(S2)|θ=θ^,
    S2¯=λ^S¯β^(eλ^1)eλ^β2^(eλ^1)λ^eλ^[(S¯β^(eλ^1)λ^eλ^+1)+λ^(S¯β^(eλ^1)λ^eλ^)]=S¯β^(S¯β^(eλ^1)(1+λ^)+λ^eλ^λ^eλ^)β^=S¯S2¯S¯2(1eλ^)(1+λ^1). (10)
    Applying (10) in (9),
    α^=S¯2(eλ^1)λ^eλ^[S2¯S¯2(1eλ^)(1+λ^1)].
  3. From S3¯=E(S3)|θ=θ^,
    (1eλ^)λ^S3¯=α^β^3[(α^+1)(α^+2)+λ^α^(3α^+λ^α^+3)]. (11)

For more details, see Appendix 1. Thus, applying (9) and (10) in Equation (11) one obtains one nonlinear equation in λ^. By solving this equation, one obtains λ^. Using such results in (9) and in (10), one obtains α^ and β^, respectively.

In this paper, we use the nonparametric boostrap method [10] to estimate standard errors associated with the proposed moment estimators. A routine to find λ^,α^ and β^ for a given data set is detailed in Algorithm 1.

3.1.

3.2. Maximum likelihood estimation via the EM algorithm

Dempster et al. [9] pioneered a unified algorithm, termed expectation maximization (EM), which aims to obtain maximum likelihood estimates from incomplete data. As follows, we present a summary of the Dempster, Laird, and Rubin's theory. Let x and y be two data sets. The first is complete and understood as a random sample (independent, identically distributed) of a population variable with pdf f(x;θ). On the other hand, the second is an incomplete random sample from a variable equipped with density g(x;θ). Let X and Y be sample spaces due to x and y, respectively. Thus, the pdf of y can be defined as

g(y˙;θ)Y(y˙)f(x˙;θ)dx˙,

where Y(y˙) is the subsample space of X determined by y˙=y(x˙) and y˙ and x˙ are possible outcomes of y and x, respectively.

Setting Lc(θ)logf(x;θ) and L(θ)logg(x;θ), the EM algorithm consists in obtaining the maximum of L(θ) by means of an interactive process in terms of Lc(θ). As Lc(θ) is not observed, it is replaced by the following conditional expectation and temporary parametric values:

θ^(k+1)=argmaxθThetaE[Lc(θ)|y,θ^(k)].

Therefore, the EM algorithm can be understood in two steps:

  • E-step
    To obtain the conditional expectation
    Q(θ;θ^(k))E[Lc(θ)|y,θ^(k)].
  • M-step
    To find θ^(k+1) such that
    Q(θ^(k+1);θ^(k))Q(θ;θ^(k))forallθΘ.

Both steps above must be repeated until they converge in a specified sense. In what follows, we develop the EM algorithm for CTrPGa parameters.

Let SCTrPGa(λ,α,β) and s˙=(s1,,sn) be the observed sample, a possible outcome from a random sample s=(S1,,Sn) drawn of S. Note that a non-observed sample, k˙=(k1,,kn), is embedded in the observable one. Assume also that the last sample is an outcome of k=(K1,,Kn), a random sample from NTrP(λ). As defined previously, setting θ=(λ,α,β), one has that Lc(θ)=Lc(θ;s˙,k)=fs,k(s˙,k;θ) and the function Q(;) of the E-step evaluated in θ0=(λ0,α0,β0), Q(;θ0), is determined by

Q(θ;θ0)=E(Lc(θ)k,θ0)=kn=1k1=1{log[i=1nfSki|Ki(si|ki;θ)P(Ki=ki;θ)]}×P(k=k˙|s=s˙;θ0)=kn=1k1=1{log[i=1nfSki(si;θ)P(Ki=ki;θ)]}×i=1nP(Ki=ki|Si=si;θ0)=kn=1k1=1{i=1n[logfSki(si;θ)+logP(N=ki;θ)]}×i=1nP(N=ki|S=si;θ0),

where

P(N=ki|Si=si;θ0)=fSki(si;θ0)P(N=ki;θ0)fS(si;θ0)=β0kiα0Γ(kiα0)sikiα01eβsi(λ0ki(eλ01)ki!)j=1(λ0j(eλ01)j!)β0jα0Γ(jα0)sjjα01eβsj.

Specifically, one has that

Q(θ;θ0)=kn=1k1=1{i=1n[αkilog(β)logΓ(αki)+(αki1)log(xi)βxi+kilog(λ)log(eλ1)log(ki!)]}j=1nP(N=kj|S=sj;θ0). (12)

Proposition 3.1 summarizes the EM algorithm for the CTrPGa parameters. Details of these results are presented in Appendix 2.

Proposition 3.1

Let SCTrPGa(λ,α,β) and s˙=(s1,,sn) be the observed sample, a possible outcome from a random sample s=(S1,,Sn) drawn of S. Estimates, θ^=(λ^,α^,β^), which maximize Q(θ;θ0) in Equation (12) are defined by the following set of nonlinear equations:

λ^1eλ^=1ni=1nE(N|S=si;θ0),β^=α^ns¯i=1nE(N|S=si;θ0)

and α^ is a solution of the nonlinear equation

i=1nE(Nψ(α^N)|θ0,S=si)logα^i=1nE(N|θ0,S=si)c=0,

where

c=i=1nlog(si)E(N|θ0,S=si)[i=1nE(N|θ0,S=si)]×[logi=1nsilogi=1nE(N|θ0,S=si)]

such that

E(N|θ0,S=si)=i=1kifSki(si|θ0)P(N=ki|θ0)fS(si|θ0)=ki=1kiβ0α0kiΓ(α0ki)siα0ki1eβ0siλ0ki(eλ01)ki!j=1β0α0jΓ(α0j)siα0j1eβ0siλ0j(eλ01)j!

and

E(Nψ(α^N)|θ0,S=si)=i=1kiψ(α^ki)fSki(si|θ0)P(N=ki|θ0)fS(si|θ0).

Algorithm 2 describes the steps for this estimation.

3.2.

3.3. Minimum distance between empirical and theoretical characteristic functions method

Several works have addressed estimation methods based on the ECF. Parzen [29] introduced the ECF-based inference methods. The application of this method for independent data has been made by Paulson et al. [30], Feuerverger and McDunnough [13], Tran [34], Yu [38], among others.

The idea behind this process is minimizing a distance measure between the ECF and the theoretical CF. Let F(x;θ) be the cdf of a random variable X with support X. The CF of X is defined as (for i=1)

c(r;θ)=E[exp(irX)]=Xexp(irx)dF(x;θ),

and the ECF is

cn(r)=1nj=1nexp(irXj)=exp(irx)dFn(x),

where i=1 and {Xj}j=1n represents an independent and identically distributed sample and Fn(x) is the empirical cdf. The support of r may be utilized under two approaches: First the continuous support such that the next distance is used:

D1(θ;x)=|cn(r)c(r)|2g(r)dr,

where g(r) is a continuous weight function. Second the discrete support such the following distance is used:

D2(θ;x)=(VnVθ)Q(θ)(VnVθ),

where

Vn=(Re[cn(r1)],,Re[cn(rq)],Im[cn(r1)],,Im[cn(rq)]),Vθ=(Re[c(r1;θ)],,Re[c(rq;θ)],Im[c(r1;θ)],,Im[c(rq;θ)]),

Re[] and Im[] are real and imaginary parts of a complex number and r1,r2,, and rq is a set of points defined in the support of r. In this work, we use the discrete approach. According to Cojocaru and Doray [17], given a set of points from the support of r: r1,r2,,rq, the efficient estimator for θ, θ^, is given by θ^=argminθΘ[D2(θ;x)], where Θ is the parametric space and, for our case, Q(θ)=I. Algorithm 3 describes the steps for this estimation.

3.3.

4. Simulation study

Now we consider a Monte Carlo simulation study having one thousand replicas. This simulation aims to compare the three estimation methods discussed in the previous section. We use the MM estimates as initial points for the EM and ECF estimation algorithms. To that end, we use (λ,α,β)= (1,1,2), (1,3,2), (1,1,4), (1,2,4) and (2,1,4) and sample sizes n = 50, 100 and 150. As figures of merit, we employ the mean of estimates, the mean square error (MSE) and the fit mean absolute error (FMAE).

In order to choose the best r sequence in the EFC method, several studies were carried out searching for the best set of points. As a conclusion of this study, we note that for the majority of cases, the best results were obtained in interval (0.001,30). In what follows, Algorithm 4 describes the steps of the simulation study.

4.

Table 1 exhibits values of the figures of merit for the three estimation methods. In general, results point out the use of the EM algorithm outperforms the moment and ECF methods. For example, for n = 100 and (λ,α,β)=(1,1,2), the MSE(α^) in method EM with MM is equal to 0.0236, while in the MM and ECF with MM methods, 0.0331 and 0.0821, respectively.

Table 1.

Results for the used figures of merit for three estimation methods.

  MM EM with MM ECF with MM
Estimates 50 100 150 50 100 150 50 100 150
(λ,α,β)=(1,1,2)
λ^ 1.3268 1.3134 1.3716 1.3188 1.2949 1.3682 1.2444 1.2579 1.3753
α^ 0.8279 0.8731 0.8875 0.9729 0.9817 0.9687 0.9491 0.9388 0.9342
β^ 1.7645 1.8730 1.9706 2.0472 2.0381 2.1420 2.1681 2.0383 2.0547
MSE(λ^) 0.3312 0.2827 0.2930 0.3286 0.2799 0.2917 0.9998 0.9998 0.9216
MSE(α^) 0.0617 0.0331 0.0263 0.0478 0.0236 0.0176 0.1592 0.0821 0.0556
MSE(β^) 0.2178 0.1172 0.0940 0.1749 0.0951 0.1057 0.3044 0.1586 0.1737
FMAE 0.0171 0.0095 0.0066 0.0148 0.0083 0.0065 0.0232 0.0138 0.0092
(λ,α,β)=(1,1,4)
λ^ 1.3268 1.3134 1.3716 1.3113 1.2940 1.3632 1.0660 1.2548 1.4923
α^ 0.8279 0.8731 0.8875 0.9745 0.9872 0.9720 0.9324 0.9293 0.9074
β^ 3.5289 3.7460 3.9412 4.1018 4.0845 4.3089 4.1826 4.0468 4.1318
MSE(λ^) 0.3312 0.2827 0.2930 0.3260 0.2793 0.2909 0.9998 0.9998 0.9998
MSE(α^) 0.0617 0.0331 0.0263 0.0487 0.0237 0.0174 0.1615 0.1106 0.0756
MSE(β^) 0.8711 0.4687 0.3759 0.7022 0.3993 0.4390 1.1677 0.6472 0.5998
FMAE 0.0683 0.0379 0.0265 0.0587 0.0338 0.0266 0.0997 0.0574 0.0393
(λ,α,β)=(1,2,4)
λ^ 1.5302 1.3746 1.2117 1.5114 1.3637 1.2069 1.0517 1.0252 1.0135
α^ 1.8056 1.8741 1.9247 2.0204 2.0470 2.0460 1.9497 1.9661 1.9665
β^ 4.3269 4.3600 4.2445 4.8949 4.7295 4.4666 4.0939 4.0077 3.9937
MSE(λ^) 0.3400 0.1763 0.0841 0.3148 0.1705 0.0812 0.1846 0.1255 0.0932
MSE(α^) 0.0520 0.0241 0.0111 0.0277 0.0161 0.0082 0.0292 0.0131 0.0076
MSE(β^) 0.4293 0.3115 0.1700 1.2053 0.8501 0.4013 0.7359 0.5100 0.3527
FMAE 0.0016 0.0009 0.0004 0.0012 0.0007 0.0003 0.0012 0.0006 0.0003
(λ,α,β)=(2,1,4)
λ^ 1.3268 1.3134 1.3716 1.3148 1.2940 1.3663 2.4429 3.0749 2.9974
α^ 0.8279 0.8731 0.8875 0.9734 0.9857 0.9712 0.9378 0.7513 0.7362
β^ 2.6467 2.8095 2.9559 3.0739 3.0610 3.2222 3.6641 3.3947 3.4998
MSE(λ^) 0.3312 0.2827 0.2930 0.3240 0.2801 0.2913 2.9632 3.5119 2.8363
MSE(α^) 0.0617 0.0331 0.0263 0.0487 0.0238 0.0176 0.2191 0.2129 0.1774
MSE(β^) 0.4900 0.2637 0.2115 0.3946 0.2201 0.2417 1.4927 1.0203 0.7025
FMAE 0.0384 0.0213 0.0149 0.0330 0.0189 0.0149 0.0360 0.0189 0.0143

However, the difference between the MM and MLE via EM procedures tend to be inexpressive when the sample size increases. In fact, for n = 150 and (λ,α,β)=(1,1,4), for example, the FMAE in the MM and MLE via EM procedures have values equal to 0.0265 and 0.0266, respectively. Thus, due to its simplicity and since the sample size is large, we employ the MM in the applied study in the following section.

5. Application to real data

Now, we are in a position of applying the CTrPGa model to real data obtained from SAR images. The SAR system has been successfully used to solve remote sensing issues [25]. Its search can be justified by its ability at working under several weather conditions and at furnishing high spatial-resolution images. SAR data arise from the next dynamic: To send polarized (in horizontal, ‘H’, and, vertical, ‘V’, directions) pulses towards an under-study area and, after, capturing them (also in H and V directions). Each entry of produced images is associated with a complex vector, where each component represents one among three polarization channels: horizontal-horizontal, ‘HH’, vertical-horizontal (by physical reasons, similar to the horizontal-vertical channel), ‘VH’, and vertical-vertical, ‘VV’. In this paper, we wish to describe an HH channel feature, denoted as

ZHH=|ZHH|exp(iϕHH)C,

where |ZHH|2=ZHHZHH represents the HH intensity – with () being the conjugate operator – and ϕHH is the intensity phase. In what follows, we want to model SAR intensities by means of S with cdf in (5).

A Foulum (Denmark) image [15] – captured on 1998 by the EMISAR sensor under the number of looks eight – is used. Figure 3(a) exhibits the used SAR image at the HH channel and a selected region for analysis. A brief descriptive analysis is given in Table 2. Values of coefficient of variation, mean and median indicate an asymmetric empirical distribution having high-variability; while, the sample skewness and kurtosis suggest positively skewed and leptokurtic behaviors.

Figure 3.

Figure 3.

Real SAR images. (a) EMISAR HH intensities. (b) UVASAR VV intensities.

Table 2.

Descriptive analysis of real intensity data (CV, K and SK represent sample coefficient of variation, kurtosis and skewness, respectively).

Data Mean Median CV % K SK Size
EMISAR 0.0002582 0.0002223 47.46007 9.883381 2.508514 441
UVASAR 0.01934499 0.01728884 52.88208 3.302645 0.822702 271

A Los Angeles, CA (USA) image [15] – captured in 2015 by the UVASAR sensor under the number of looks four – is also employed. Figure 3(b) exhibits the SAR image used at the VV channel and a selected region for analysis. A brief descriptive analysis is given in Table 2. Values of coefficient of variation, mean and median indicate a quasi symmetric empirical distribution having high-variability; while, the sample skewness and kurtosis are near to the ones due to the normal law.

We aim to analyze Foulum SAR intensities within the selected rectangle according to (at least) two regions, representing wheat crops. Further, we wish to describe forest regions combined with urban targets in the Los Angeles SAR image. For this case, the CTrPGa will also be compared with the BGN( α,β,μ,σ,s) [4], G0(α,γ,L) [14], K(α,λ,L) [1], GBGL(λ,a,b,c) [21] CTrPN(λ,μ,σ) [27] WP(α,β,λ) [22] and GaGN(a,μ,σ,s) [6] laws having densities, respectively, given by

fBGN(x;α,β,μ,σ,s)=1σB(α,β)[Φs(xμσ)]α1[1Φs(xμσ)]β1×ϕs(xμσ),fG0(x;α,γ,L)=LLΓ(Lα)xL1γαΓ(L)Γ(α)(γ+Lx)Lα,fK(x;α,λ,L)=2λLΓ(α)Γ(L)(λLx)α+L21KαL(2λLx),fGBGL(x;λ,a,b,c)=cλ2(1+x)(1+λ)B(a,b)eλx[1eλx(1+λx1+λ)]ac1×{1[1eλx(1+λx1+λ)]c}b1,fCTrPN(x;λ,μ,σ)=1σ(1eλ1)k=1k1/2λkk!ϕ(xkμσk),fWP(x;α,λ,β)=αβλ1eλxα1eλβxα+eβxα

and

fGaGN(x;a,μ,σ,s)=ϕs(xμσ)σΓ(a){log[1Φs(xμσ)]}a1,

where ϕs() and Φs() are functions defined in [4], B(a,b) is the beta function, Ka(b) represents the second kind modified Bessel function with argument b at order a and ϕ() is the standard normal density. We do not use the CTrPCa distribution proposed in [35] in this analysis because it does not furnish reasonable estimates and fit for these scenarios.

As observed in Section 4, MM estimates for the CTrPGa model have a similar performance when compared to more sophisticated estimation methods and can be obtained resolving one nonlinear equation in terms of a single parameter. Thus, since it is a method that can be more easily used by practitioners, we compared its performance in this application with the ML estimates of the other seven models. Fits in Figure 4(a–d) suggest a qualitative comparison favorable to the CTrPGa model, which presented a better fitting when compared to the BGN, G0, K, GBGL, CTrPN, WP and GaGN distributions.

Figure 4.

Figure 4.

Plots of empirical densities (gray curve) vs. fitted pdf and cdf of BGN (dashed curves), G0 (dot curves), K (dashes and dot curves), GBGL (long dashed curves) and CTrPN (solid curves) distributions. (a) Fitted pdfs for EMISAR data. (b) Fitted cdfs for EMISAR data. (c) Fitted pdfs for UVASAR data. (d) Fitted cdfs for UVASAR data.

In a quantitative analysis, parameter estimates involved in both applications are presented in Tables 3 and 5. It is noticeable that all fits were acceptable in respect to point estimates.

Table 3.

ML estimates for the BGN(s,μ,σ,α,β), G0(α,γ,L), K(α,λ,L), GBGL(λ,a,b,c), CTrPN(λ,μ,σ), WP(α,λ,β) and GaGN(α,μ,σ,s) distributions and MM estimates for CTrPGa(λ,α,β).

Model Estimated parameters
CTrPGa 0.169 26.834 1.196 ×105
  (0.060) (7.628) (36192.89)
BGN 0.672 1.857 ×104 2.185 ×105 1.720 0.645
  (9.156 ×102) (8.00 ×106) (1.140 ×105) (0.448) (0.291)
G0 −10.564 0.002 41.741
  (0.797) (0.001) (6.998)
K 0.327 46.367 0.622
  (0.162) (22.321) (0.282)
GBGL 56.308 98.444 144.182 0.208
  (0.826) (0.102) (0.152) (0.001)
CTrPN 0.331 0.220 ×103 4.508 ×105
  (3.947 ×102) (2.481 ×106) (1.886 ×106)
WP 1.509 478.817 481.405
  (0.126) (33.556) (2.528)
GaGN 0.9737 2.20 ×104 2.94 ×105 0.6647
  (0.216) (1.05 ×105) (1.26 ×105) (0.125)

Note: Standard errors are in parenthesis.

Table 5.

ML estimates for the BGN(s,μ,σ,α,β), G0(α,γ,L), K(α,λ,L), GBGL(λ,a,b,c), CTrPN(λ,μ,σ), WP(α,λ,β) and GaGN(α,μ,σ,s) distributions and MM estimates for CTrPGa(λ,α,β).

Model Estimated Parameters
CTrPGa (ok) 3.390 10.674 1971.879
  (0.306) (3.743) (291.373)
BGN (ok) 1.591 0.003 0.006 3.474 0.173
  (2.187 ×104) (1.562 ×105) (6.032 ×105) (0.501) (0.012)
G0 (ok) −13.524 0.244 4.446
  (0.023) (0.013) (0.016)
K (ok) 2.96 146.591 37.797
  (0.286) (8.731) (4.926)
GBGL (ok) 2.964 9.1784 45.604 0.551
  (0.371) (0.215) (0.305) (0.009)
CTrPN (ok) 0.096 0.0181 0.0079
  (0.0220) (5.95 ×104) (3.46 ×104)
WP (ok) 1.9994 47.9637 44.3172
  (0.162) (8.520) (9.958)
GaGN (ok) 0.0951 0.0394 0.0126 3.2050
  (0.007) (0.012) (0.006) (0.571)

Note: Standard errors are in parenthesis.

Tables 4 and 6 present values for six goodness-of-fit measures: Kolmogorov–Smirnov ( dKS and its pvalueKS), Anderson–Darling ( A), Cramer–Von-Mises ( W) statistics, Akaike information criterion (AIC) and Bayesian information criterion (BIC). Based on the KS test, the CTrPGa distribution was the only adequate alternative, adopting the nominal level of 6% for Foulum data. With respect to A and W measures for Foulum data, the CTrPGa law outperforms all other distributions; while the CTrPN yields the best performance in terms of AIC and BIC, followed very closely by the CTrPGa. For Los Angeles data, all considered models were acceptable taking a nominal level of 10% according the KS test, being the smallest KS statistics value furnished by the CTrGa fit. The smallest values for A and W statistics were provided by the BGN and K fits, respectively. The CTrPGa fit yielded the best performance in terms of AIC and BIC.

Table 4.

GoF measures for the fitted CTrPN, BGN, GBGL, G0, K, WP, and GaGN models on EMISAR real data.

  Performance for different models
GoF measures CTrPGa BGN G0 K GBGL CTrPN WP GaGN
pvalueKS 0.1434 0.0413 2.03 ×105 <108 5.58 ×108 0.0502 2.2 ×1016 7.34 ×107
W 0.2346 0.3434 1.8848 3.0403 3.2316 0.3138 4.888 1.1426
A 1.7836 2.5443 11.9719 18.5387 19.5913 2.4711 28.432 8.4423
AIC −7110.957 −7110.677 −7040.178 −5338.857 −6961.375 −7126.182 −6661.929 −7107.565
BIC −7110.902 −7110.539 −7040.123 −5338.802 −6961.283 −7113.915 −6649.662 −7091.208

Table 6.

Goodness-of-fit measures for the fitted CTrPN, BGN, GBGL, G0, K, WP, and GaGN models on UAVSAR real data.

Goodness-of-fit measures Performance for different models
  CTrPGa BGN G0 K GBGL CTrPN WP GaGN
pvalueKS 0.9100 0.8920 0.7085 0.4071 0.6665 0.2038 0.7575 0.1232
W 0.0861 0.0360 0.0703 0.0359 0.0475 0.2790 0.1014 0.2603
A 0.5624 0.2501 0.4643 0.2517 0.3139 1.9604 0.6757 1.7320
AIC −1754.152 −1752.372 −1753.456 −1748.517 −1754.144 −1719.296 −1751.885 −1733.008
BIC −1743.435 −1734.588 −1742.739 −1737.801 −1739.886 −1708.490 −1741.079 −1718.600

6. Conclusions

In this paper, we have introduced a new three-parameter model named compound truncated Poisson gamma distribution. Some of its mathematical properties have been also derived: cumulative and characteristic functions, ordinary moments and skewness and kurtosis measures. A simple moment-based estimation procedure which involves finding the root of only one nonlinear equation is proposed as well. Further, we propose two other estimation methods based on the MLE via EM and on the ECF. It is worth emphasizing that the MM estimation procedure is computationally easier for CTrPGa users who may, for example, need to process images from neighborhoods of every image pixel. The MM estimation method was observed to have similar performance to the other two more sophisticated estimation methods. The CTrPGa model has been employed successfully as a descriptor of SAR intensities, outperforming seven other models.

Supplementary Material

Supplemental Material

Funding Statement

The authors would like to acknowledge the financial support from CAPES (financing code 001) and the first and second authors also would like to thank the financial support from CNPq under grants 310918/2018-9 and 307556/2017-4, respectively.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

  • 1.Blacknell D., Comparison of parameter estimators for K-distribution, IEE Proc. Radar Sonar Navig. 141 (1994), pp. 45–52. [Google Scholar]
  • 2.Bouguila N. and Ziou D., A hybrid SEM algorithm for high-dimensional unsupervised learning using a finite generalized Dirichlet mixture, IEEE. Trans. Image. Process. 15 (2006), pp. 2657–2668. [DOI] [PubMed] [Google Scholar]
  • 3.Brockwell P. and Davis R., Time Series: Theory and Methods, Springer, New York, 1991. [Google Scholar]
  • 4.Cintra R.J., Rêgo L.C., Cordeiro G.M., and Nascimento A.D.C., Beta generalized normal distribution with an application for SAR image processing, Statistics 48 (2012), pp. 1–16. [Google Scholar]
  • 5.Cobb L., Stochastic catastrophe models and multimodal distributions, Behav. Sci. 23 (1978), pp. 360–374. [Google Scholar]
  • 6.Cordeiro G.M., Cintra R.J., Rêgo L.C., and Nascimento A.D.C., The gamma generalized normal distribution: A descriptor of SAR imagery, J. Comput. Appl. Math. 347 (2019), pp. 257–272. [Google Scholar]
  • 7.de A. Ferreira J. and Nascimento A.D.C., Shannon entropy for the GI0 model: A new segmentation approach, IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 13 (2020), pp. 2547–2553. [Google Scholar]
  • 8.Delignon Y. and Pieczynski W., Modeling non-Rayleigh speckle distribution in SAR images, IEEE. Trans. Geosci. Remote. Sens. 40 (2002), pp. 1430–1435. [Google Scholar]
  • 9.Dempster A.P., Laird N.M., and Rubin D.B., Maximum likelihood from incomplete data via the EM algorithm, J. R. Stat. Soc. Ser. B (methodol.) 39 (1977), pp. 1–38. [Google Scholar]
  • 10.Efron B., Bootstrap methods: Another look at the jackknife, Ann. Stat. 7 (1979), pp. 1–26. [Google Scholar]
  • 11.El-Zaart A. and Ziou D., Statistical modelling of multimodal SAR images, Int. J. Remote Sens. 28 (2007), pp. 2277–2294. [Google Scholar]
  • 12.Ferreira J.A., Coelho H., and Nascimento A.D.C., A family of divergence-based classifiers for polarimetric synthetic aperture radar (PolSAR) imagery vector and matrix features, Int. J. Remote Sens. 42 (2021), pp. 1201–1229. [Google Scholar]
  • 13.Feuerverger A. and McDunnough P., On the efficiency of empirical characteristic function procedures, J. R. Stat. Soc. Ser. B (Methodol.). 43 (1981), pp. 20–27. [Google Scholar]
  • 14.Frery A.C., Muller H.J., Yanasse C.C.F., and Sant' Anna S.J.S., A model for extremely heterogeneous clutter, IEEE. Trans. Geosci. Remote Sens. 35 (1997), pp. 648–659. [Google Scholar]
  • 15.Frery A.C., Nascimento A.D.C., and Cintra R.J., Analytic expressions for stochastic distances between relaxed complex Wishart distributions, IEEE. Trans. Geosci. Remote Sens. 52 (2014), pp. 1213–1226. [Google Scholar]
  • 16.Goodman J.W., Laser Speckle and Related Phenomena, Springer-Verlag, Germany, 1975. [Google Scholar]
  • 17.Groparu-cojocaru I. and Doray L.G., Inference for the generalized normal Laplace distribution, Commun. Stat. – Simul. Comput. 42 (2013), pp. 1989–1997. [Google Scholar]
  • 18.Özel G. and İnal C., The probability function of a geometric Poisson distribution, J. Stat. Comput. Simul. 80 (2010), pp. 479–487. [Google Scholar]
  • 19.Özel G. and İnal C., The probability function of the compound Poisson process and an application to aftershock sequence in Turkey, Environmetrics 19 (2008), pp. 79–85. [Google Scholar]
  • 20.Karlis D. and Xekalaki E., Mixed Poisson distributions, Int. Stat. Rev. 73 (2005), pp. 35–58. [Google Scholar]
  • 21.Lima M.C.S., Cordeiro G.M., Nascimento A.D.C., and Silva K.F., A new model for describing remission times: The generalized beta-generated Lindley distribution, Anais Da Academia Brasileira De Ciências 89 (2017), pp. 1343–1367. [DOI] [PubMed] [Google Scholar]
  • 22.Lu W. and Shi. D., A new compounding life distribution: The Weibull-Poisson distribution, J. Appl. Stat. 39 (2012), pp. 21–38. [Google Scholar]
  • 23.McLachlan G. and Peel D., Finite Mixture Models, Wiley Series in Probability and Statistics, Wiley, 2004. [Google Scholar]
  • 24.Mignotte M., Collet C., Perez P., and Bouthemy P., Sonar image segmentation using an unsupervised hierarchical MRF model, IEEE. Trans. Image. Process. 9 (2000), pp. 1216–1231. [DOI] [PubMed] [Google Scholar]
  • 25.Muller K.V., SAR correlation imaging and anisotropic scattering, Inverse Probl. Imaging 12 (2018), pp. 697–731. [Google Scholar]
  • 26.Nascimento A.D.C., Horta M.M., Frery A.C., and Cintra R.J., Comparing edge detection methods based on stochastic entropies and distances for PolSAR imagery, IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 7 (2014), pp. 648–663. [Google Scholar]
  • 27.Nascimento A.D.C., Rêgo L.C., and A. Nascimento R.L.B., Compound truncated Poisson normal distribution: Mathematical properties and moment estimation, Inverse Problems Imaging 13 (2019), pp. 787–803. [Google Scholar]
  • 28.Panjer H.H., Recursive evaluation of a family of compound distributions, Astin Bull. 12 (1981), pp. 22–26. [Google Scholar]
  • 29.Parzen E., On estimation of a probability density function and mode, Ann. Math. Stat. 33 (1962), pp. 1065–1076. [Google Scholar]
  • 30.Paulson A.S., Holcomb E.W., and Leitch R.A., The estimation of the parameters of the stable laws, Biometrika 62 (1975), pp. 163–170. [Google Scholar]
  • 31.Povey D., Burget L., Agarwal M., Akyazi P., Kai F., Ghoshal A., Glembek O., Goel N., Karafiát M., Rastrow A., Rose R.C., Schwarz P., and Thomas S., The subspace gaussian mixture model – A structured model for speech recognition, Comput. Speech. Lang. 25 (2011), pp. 404–439. [Google Scholar]
  • 32.Revfeim K.J.A., An initial model of the relationship between rainfall events and daily rainfalls, J. Hydrol. (Amst). 75 (1984), pp. 357–364. [Google Scholar]
  • 33.Thompson C.S., Homogeneity analysis of rainfall series: An application of the use of a realistic rainfall model, J. Climatol. 4 (1984), pp. 609–619. [Google Scholar]
  • 34.Tran K.C., Estimating mixtures of normal distributions via empirical characteristic function, Econom. Rev. 17 (1998), pp. 167–183. [Google Scholar]
  • 35.Vasconcelos J.M., Cintra R.J., Nascimento A.D.C., and Rêgo L.C., The compound truncated Poisson Cauchy model: A descriptor for multimodal data, J. Comput. Appl. Math. 378 (2020), pp. 112887. [Google Scholar]
  • 36.Yang X. and Krishnan S.M., Image segmentation using finite mixtures and spatial information, Image. Vis. Comput. 22 (2004), pp. 735–745. [Google Scholar]
  • 37.Yang X. and Liu J., Mixture density estimation with group membership functions, Pattern. Recognit. Lett. 23 (2002), pp. 501–512. [Google Scholar]
  • 38.Yu J., Empirical characteristic function estimation and its applications, Econom. Rev. 23 (2004), pp. 93–123. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material

Articles from Journal of Applied Statistics are provided here courtesy of Taylor & Francis

RESOURCES