Skip to main content
Journal of Research of the National Institute of Standards and Technology logoLink to Journal of Research of the National Institute of Standards and Technology
. 1994 Jul-Aug;99(4):563–569. doi: 10.6028/jres.099.055

The Measurement of Averages and Extremes of Environmental Variables

C W Anderson 1, K F Turkman 2
PMCID: PMC8345294  PMID: 37405285

Abstract

The relationship between the statistics of environmental measurements averaged over different time scales is related to extreme levels of the variables. Results on the asymptotic joint distributions of extreme averages over different time periods are treated.

Keywords: averages, extremes, extremal index, Poisson random measures

1. Introduction

Measurements of many environmental variables such as wind speeds, rainfall and the concentrations of atmospheric and aquatic pollutants are generally duration-specific: the actual quantities measured are averages over a specific time interval rather than instantaneous values. Thus concentrations of ozone are typically measured as parts per 100 million averaged over an hour, and wind speeds are routinely recorded as hourly or daily mean speeds. In practice however, the scientists wishing to understand the environmental processes which lie behind such measurements, and the regulatory body which monitors pollution, often wish to deal with characteristics measured over some other time interval: peak concentrations over a day, for example, or high wind gust values, which in practice correspond to peak 3–5 s averages. There is therefore a need to understand the relationship between the statistics of environmental measurements averaged over different time scales, and to relate these statistics to extreme levels of the variables. In this paper, results on the asymptotic joint distributions of extreme averages over different time periods will be treated. These results will make it possible, for example, to link long historical data series containing information about extremes of daily rainfall (sometimes extending back to the early years of the century) to the shorter series of extreme hourly rainfall which have been recorded only in the past 20 or 30 years. Thus important historical information could properly be taken account of in the estimation of floods, something recognised as higly desirable by hydrologists. Another area of application of the results is in the study of the dispersal of airborne pollutants. Here, it is known (see, for example Fakrell and Robins [9]) that instruments used to measure the concentration of pollutants dispersing in a turbulent flow cannot resolve the finest scales present in such flows. Measurements of concentration are therefore invariably obtained only as averages of the characteristics of primary physical interest, and so a statistical theory which links extremes of averages over different ranges would be of great value to scientists working in the area (Mole [10]). One related area is the study of joint distributions of averages and maxima of random sequences. Interest in such distributions is motivated by analysis of extreme winds. For purposes of building design or public safety, it is often important to estimate the speed of the most extreme wind likely to occur at a particular location over a period of years and to do this it is natural to apply the methods of extreme value theory to data on maximum gusts. The precision of estimates obtained may be low due to the limited amount of relevant data—often no more than 10 or 20 years, so it is desirable to try to improve the precision by introducing into the estimation procedure other information relevant to extreme winds. It is natural, thefore, to ask whether the data on gusts could be augmented by that on hourly means, as gusts and means are evidently related. One source of guidance here may be provided by the limit properties of the joint distribution of means and the maxima.

In Sec. 2, we give a summary of the results that exist on the joint limiting forms of sums and maxima of stationary sequences and in Sec. 3 we give some results on the asymptotic joint distributions of extreme averages over different time-periods of sequences which have moving average representations. Possible solutions for the general stationary case are also indicated.

2. Extremes and Averages

Let {Xi} be a stationary sequence of random variables with marginal distribution function P(Xix) = F(x) and let

Sn=i=1nXi,Mn=max1inXn,n=1,2,

We study here the joint limiting distributions of

(S¯n,M¯n)=(Snnbnan,Mndncn) (1)

as n → ∞ for suitable constants an > 0, cn > 0, bn and dn.

Case 1: Light Tailed Case

Assume that Var (Xi) < ∞ and FD (Λ) or FDα), α>2 or FDα), α>0. where Λ(.), Φα(.), Ψα(.) are respectively the Gumbel, Frechet and Weibull distributions.

Then for the associated iid sequence (Chow and Teugels [5])

(S¯n,M¯n)d(N,Λ),

or

(S¯n,M¯n)d(N,Φα),α>2,(S¯n,M¯n)d(N,Ψα),α>0,

where the limit components are independent.

Can dependence amongst Xi modify this limiting independence? As the following theorem (Anderson and Turkman [1]) shows, under quite weak conditions, dependence does not affect the limiting distribution.

Theorem 2.1

Assume that {Xi} is strong mixing and has positive extremal index and for some an, cn and dn,

S¯n=SnandN(0,1),
M¯n=(Mndn)cndG,

where G =Λ or G = Φα, for some α>2 or Gα, for α>0. Assume further that {Xi} satisfies the condition

limkklimnsupD(an,un)=0,

where

D(an,un)=j=1n/k=E[|exp(itan1l=1ljXt)1|[χ(Xl>un)] (2)

Then S¯n and M¯n are asymptotically independent. Local dependence condition D′(an, un) is quite weak and satisfied, for example, by m-dependent sequences and by Gaussian sequences with summable covariances.

Case 2: Heavy Tailed Case

Assume that

1F(x)=pxαL(x),F(x)=qxαL(x)

where L(x) is a slowly varying function and 0<α<2. Then for the associated iid sequence (Chow and Teugels [5])

(S¯n,M¯n)d(U,V),

where

E(eitυχ(Vυ))=Wα(t,p)expυeitkwdwα.

Here χ(A) denotes the indicator function of the event A, Wα(t, p) is the characteristic function of a stable law of index α and parameter p, and k is a constant depending on α and p. Note that for the heavy tailed case, U and V are dependent. Can the type of local dependence of the X-sequence make a difference to the Chow-Teugels limit? It can! If large values are cancelled by large negative values, then sums and maxima can be asymptotically independent. We show this by constructing an example:

Let {Yi} be a stationary sequence with

1FY(y)=1y4,

ϵ<1, γ≥1 and for ν>0 such that ϵ + ν< 1, let

Xi={Yiwith probabilityYitvYi1with probability1Yi1v

Then it can be shown that {Xi} is stationary, 1-dependent and 1−Fx(x) = ϵ(ϵ + ν)−1x−ϵ, Fx(−x) = x−ϵ. Hence the limit distribution of (S¯n,M¯n) for the associated iid sequence is the Chow-Teugels limit with α=ϵ and p = ϵ(2ϵ+ν)−1. The components of this limit are dependent. However, (S¯n,M¯n) of the dependent {Xi} process can be shown to be asymptotically independent due the cancellation of large positive values by large negative values values, thus showing that local dependence may make a difference on the limit distribution. However, if we rule out this type of cancellation, then the limit distribution is not affected by the dependence in {Xi}, as the following theorem demonstrates (Anderson and Turkman [4]). One possible local condition which rules out this type of cancellation is Davis’ [6] D′(an) condition, which we assume in the theorem. This restrictive technical condition also rules out clustering of large and small values above and below certain thresholds. Types of processes which satisfy this condition (and others which we need in the theorem) can be seen in Davis [6].

Let {Ei} be an iid unit mean exponential sequence, Γj=i=1jEi and {δi} iid taking values +1 and −1 with probabilities p and q respectively. Then

Theorem 2.2

  1. If 0<α<1, p>0 and conditions D(an), D′(an) of Davis [6] hold then
    (S¯n,M¯n)d(j=1δjΓj1/α,ΓD1/α),
    where D = min {j : δj = 1}.
  2. If 1≤α <2, p>0 and conditions D(an), D′ (an) and Dn(an) of Davis [6] hold then
    (S˜n,M¯n)d(j=1{δjΓj1/α(pq)E(ΓD1/α(0,1))},ΓD1/α).
  3. Under the conditions of (i) and (ii),
    E[eitS¯aχ(M¯nx)]dWα(t,p)Φα(x)eh(t,x),
    where
    h(t,x)=x[eit(Cδ)1/αy1]d(yn),
    and
    δ=p2αα,
    and C is a positive constant. Note that the value of the limit does not depend on the dependence structure of {Xi}.

These results seem to be discouraging for statistical applications. For example, for sequences with finite variance, the independence of (U, V) does not offer a basis for the use of average wind speeds in inferences about gusts, contrary to the evidence shown in data. This may be due to:

  1. Time intervals are not long enough in practice for asymptotic results to give adequate approximations,

  2. The correlation structure of the data is not well represented by our mixing and local dependence conditions,

  3. Residual seasonality remains in the data. Based on these possible deviations, statistical models establishing connection between means and extreme events are suggested in Anderson and Turkman [3].

3. Extremes of Averages Over Different Time Scales

The specific problem to be adressed in this section is as follows: Let X, represent the instantaneous value of the environmental variable at time t, and denote by XT,t a moving average of {Xt} over the range T:

XT,t=1Ti=0T1Xti, (3)

Then we are interested in the paired series {XT,t, XS,t} for different fixed S and T and in particular, in the joint distributional properties of extremes of the pair {XT,t, XS,t}.

We will give results only for sequences with the heavy tailed distributions. The light tailed case is more complicated, since in this case large values of the moving averages may occur due to the contribution of several relatively large values of the sequence in contrast to the heavy tailed case when large values of the moving averages are dominated by the largest value of the sequence. The techniques to be used to study these questions will be developments of those used by Davis and Resnick [8].

Suppose that {Xi}i=1 are iid random variables with

P(Xi>x)xαL(x),

where L(x) is slowly varying as x→∞, that is XDα). Take constants an such that

nP(Xi>anx)xα,

Consider a point process Pn which puts points at (kn,an1Xk),k=1,2,. Hence Pn is the random point measure on sets in R+ × R

k=1(kn,an1Xk)(),

where

x(A)={1ifxA0if not.

It is known that Pn converges weakly as n→∞ to a point process P, a Poisson process with mean measure μ. on R + × (R+ −{0}). (Davis and Resnick call this Poisson measure, the Poisson random measure PRM (µ) and consider a more general case which involve the left tail of the distribution as much as the right tail. Here due to the special simple form of the moving averages, we restrict ourselves to the space (0, ∞).)

Here

dμ=dt×αxu1x(0,)dx.

Hence

k=1(kn,an1Xk)()k=1(lk,jk)(),

where {(tk,jk)}k=1 are the Points of P.

Davis and Resnick [8] show that, correspondingly, for {XT,t} with same normalization

k=1(kn,an1XT,k)()k=1l=1T(lk,jkci)()

on R + ×(R+−{0}), where

ci{1Ti=1,2,,T.0otherwise

and (tk, jk) are as above.

Results for the point process generated simultaneously by {XT,t} and {Xs,i} processes can be obtained with a straightforward generalization:

k=1(kn,an1XT,kan1χs,i)()k=1xi=1T(lk,jkci,jkci)() (4)

on R+×(R+−{0})2, where

c¯i={1Si=1,2,,S.0otherwise

The joint limit distribution of MT,n and MS,n now can easily be calculated since

P(MT,nanx,MS,nany)=P(k=1(1n,an1XT,k,an1Xs,1)((0,1]×((,x]×(,y])c)=0)P(k=1i=1T(lk,jkci,jkci)((0,1]×((,x]×(,y])c)=0)=P(k=1(tk,T1/k,S1jk)((0,1]×((,x]×(,y])c)=0)=P(j=1(tkjk)((0,1]×(TxΛTs,))=0)=exp[μ((0,1]×(TxΛSy,))]=exp[TxSyq(x)dx]={exp[p(TxΛSy)α]ifTxΛSy>00if not (5)

Special Cases

[1] S = T. Then

P(an1MT,nx,an1MS,ny)=P(an1MT,nxΛy),

so from theorem 3.1 of Davis and Resnick [8], we should have

P(an1MT,nx,an1MS,ny)=ep(T(xΛy))α.

Hence the result Eq. (5) is consistent with the existing results on this special case.

[2] S = 1 <T, so MS,n = maxin Xi, and the result Eq. (5) says that

P(an1MT,nx,an1MS,ny)ep(TxΛy)α.

Note that the above distribution is the joint limit distribution of the maximum of the {Xi} process and the maximum of a moving average of it.

The above set up is a very simple case. The immediate question is: what if the {Xi} are themselves a dependent sequence?

We can get a partial answer by taking {Xi} to be itself a moving average:

Xi=j=1ajZij,

say, where Zi satisfy the same conditions as the Xi and aj≥0 (for simplicity)Thus

XT,i={X}*{1T},

and

X={Z}*{a}.

Hence

XT,i={Z}*{a}*{1T}={Z}*{d},

say, where

d={a}*{1T}.

Note that for any l = 1,2,…,

dl=1Ti=1TΛl1ali,i=1,2,

As before

P(an1MT,nx,an1MS,ny)P(k=1i=1(ik,fkd0jkd¯i)((0,1]×((,x]×(,y])c)=0), (6)

where

d¯i=1Si=1SΛI1ali.

Note that

{i=1(lk,jkd¯ijkd1)((0,1]×((,x]×(,y])c)=0}{jk(d¯i,d¯i)(,x]×(,y]},

for every i for which one of the components is non-zero. This is the case iff jkd+x and jkd+y, where d+ = max dì and d+ = max di. Hence the limit of Eq. (6) is given by

P(k=1jk(xd+Λyd+,)=0)=exd+Λyd+q(x)dx={ep(xd+Λyd+)e0if(xd+Λyd+)>0otherwise (7)

Hence we see that this kind of dependence in the underlying process does not change the form of the limit distribution. It would be interesting to obtain similar results for general stationary sequences. This could be done by using characterization of the limit point processes for the sequence of point processes with points (t/n,an1Xit=1,2,,n) given by Davis and Hsing [7] when {Xt} is a stationary sequence with regularly varying tails. In their paper, Davis and Hsing show that when {Xi} satisfies a proper mixing condition then

Nn=t=1n(ln,an1Xi))()

converges to a point process N of the form

N=i=1j=1(SnPtQij),

where

t=1j=1(St,Pt)

is PRM(ν) with

dv=dt×γαxα1dx,x>0,

(This is slightly weaker form of Davis-Hsing limit, since we consider the convergence only on R+ −{0} not involving the left tail.) They also show that under the proper mixing condition, the convergence of Nn to N is equivalent to:

For rn → ∞, rn/n →0 as n → ∞, and kn = [n/rn],

limnknP(max1irnXj>anx)=γxα,

and

P(j=1rnXj/max1irnXJ./max1itnXj>anx)Q(.)

Here Q(.) is the distribution of the iid point processes

{j=1Qtj()}t=1.

From this basic result it may be possible to obtain the limiting form of

Nnt=t=1n(tn,an1XT,t,an1XS,t)(),

where

XT,t=1Tj=1TXtj,

and

XS,t=1Sj=1SXtj.

One would expect that if the above sequence of point processes converges, then the limit point process should be of the form

Nnt=1j=1i=1(St,ci,PiQij,citPiQtj),

where

ci={1Ti=1,2,T0otherwiseci={1Si=1,2,S0otherwise

Straightforward adjustment of Resnick’s [11] arguments which involve consecutive application of the continuous mapping theorem is not possible, since these arguments use the convergence of

nP(an1X1>δ,an1X2>δ)0,δ>0

which is clearly not satisfied by most sequences with strong local dependence.

Biography

About the authors: C. W. Anderson is a lecturer in the Department of Probability and Statistics of the University of Sheffield, UK and K. F. Turkman is a professor in the Department of Statistics and Operations Research of the University of Lisbon, Portugal.

4. References

  • 1.Anderson CW, Turkman KF. The joint limiting distributions of sums and maxima in stationary sequences. J Appl Probab. 1991;28:33–44. [Google Scholar]
  • 2.Anderson CW, Turkman KF. Limit distributions for sums and maxima. J Appl Probab. 1991;28:715–716. [Google Scholar]
  • 3.Anderson CW, Turkman KF. Limiting joint distributions of sums and maxima in a statistical context. Theoriya Veroyatnostei ce Prim. 1992;37:352–354. [Google Scholar]
  • 4.Anderson CW, Turkman KF. Sums and maxima of stationary sequences with heavy tailed distributions. 1992. submitted for publication. [Google Scholar]
  • 5.Chow TL, LTeugels J. The sums and the maximum of i.i.d. random variables. Proc Second Prague Symp Asymptotic Statistics. 1978:81–92. [Google Scholar]
  • 6.Davis RA. Stable limits for partial sums of dependent random variables. Ann Probab. 1983;11:262–269. [Google Scholar]
  • 7.Davis RA, Hsing T. Point process and partial sums convergence for weakly dependent random variables with infinite variance. 1993. to be published. [Google Scholar]
  • 8.Davis RA, Resnick S. Limit theory for moving averages of random variables with regulary varying tail probabilities. Ann Probab. 1985;13:179–195. [Google Scholar]
  • 9.Fakrell JE, Robins AG. Concentration fluctuations and fluxes in plumes from point sources in a turbulent boundary layer. L Fluid Mech. 1982;117:1–26. [Google Scholar]
  • 10.Mole N. A model of instrument smoothing and tresholding in measurements of turbulent dispersion. Atmos Environ. 1990;24:313–323. [Google Scholar]
  • 11.Resnick SI. Extreme values, Regular Variation, and Point Processes. Springer-Verlag; New York: 1987. pp. 231–237. [Google Scholar]

Articles from Journal of Research of the National Institute of Standards and Technology are provided here courtesy of National Institute of Standards and Technology

RESOURCES