Abstract
The relationship between the statistics of environmental measurements averaged over different time scales is related to extreme levels of the variables. Results on the asymptotic joint distributions of extreme averages over different time periods are treated.
Keywords: averages, extremes, extremal index, Poisson random measures
1. Introduction
Measurements of many environmental variables such as wind speeds, rainfall and the concentrations of atmospheric and aquatic pollutants are generally duration-specific: the actual quantities measured are averages over a specific time interval rather than instantaneous values. Thus concentrations of ozone are typically measured as parts per 100 million averaged over an hour, and wind speeds are routinely recorded as hourly or daily mean speeds. In practice however, the scientists wishing to understand the environmental processes which lie behind such measurements, and the regulatory body which monitors pollution, often wish to deal with characteristics measured over some other time interval: peak concentrations over a day, for example, or high wind gust values, which in practice correspond to peak 3–5 s averages. There is therefore a need to understand the relationship between the statistics of environmental measurements averaged over different time scales, and to relate these statistics to extreme levels of the variables. In this paper, results on the asymptotic joint distributions of extreme averages over different time periods will be treated. These results will make it possible, for example, to link long historical data series containing information about extremes of daily rainfall (sometimes extending back to the early years of the century) to the shorter series of extreme hourly rainfall which have been recorded only in the past 20 or 30 years. Thus important historical information could properly be taken account of in the estimation of floods, something recognised as higly desirable by hydrologists. Another area of application of the results is in the study of the dispersal of airborne pollutants. Here, it is known (see, for example Fakrell and Robins [9]) that instruments used to measure the concentration of pollutants dispersing in a turbulent flow cannot resolve the finest scales present in such flows. Measurements of concentration are therefore invariably obtained only as averages of the characteristics of primary physical interest, and so a statistical theory which links extremes of averages over different ranges would be of great value to scientists working in the area (Mole [10]). One related area is the study of joint distributions of averages and maxima of random sequences. Interest in such distributions is motivated by analysis of extreme winds. For purposes of building design or public safety, it is often important to estimate the speed of the most extreme wind likely to occur at a particular location over a period of years and to do this it is natural to apply the methods of extreme value theory to data on maximum gusts. The precision of estimates obtained may be low due to the limited amount of relevant data—often no more than 10 or 20 years, so it is desirable to try to improve the precision by introducing into the estimation procedure other information relevant to extreme winds. It is natural, thefore, to ask whether the data on gusts could be augmented by that on hourly means, as gusts and means are evidently related. One source of guidance here may be provided by the limit properties of the joint distribution of means and the maxima.
In Sec. 2, we give a summary of the results that exist on the joint limiting forms of sums and maxima of stationary sequences and in Sec. 3 we give some results on the asymptotic joint distributions of extreme averages over different time-periods of sequences which have moving average representations. Possible solutions for the general stationary case are also indicated.
2. Extremes and Averages
Let {Xi} be a stationary sequence of random variables with marginal distribution function P(Xi ≤ x) = F(x) and let
We study here the joint limiting distributions of
| (1) |
as n → ∞ for suitable constants an > 0, cn > 0, bn and dn.
Case 1: Light Tailed Case
Assume that Var (Xi) < ∞ and F ∈ D (Λ) or F ∈ D (Φα), α>2 or F ∈ D (Ψα), α>0. where Λ(.), Φα(.), Ψα(.) are respectively the Gumbel, Frechet and Weibull distributions.
Then for the associated iid sequence (Chow and Teugels [5])
or
where the limit components are independent.
Can dependence amongst Xi modify this limiting independence? As the following theorem (Anderson and Turkman [1]) shows, under quite weak conditions, dependence does not affect the limiting distribution.
Theorem 2.1
Assume that {Xi} is strong mixing and has positive extremal index and for some an, cn and dn,
where G =Λ or G = Φα, for some α>2 or G =Ψα, for α>0. Assume further that {Xi} satisfies the condition
where
| (2) |
Then and are asymptotically independent. Local dependence condition D′(an, un) is quite weak and satisfied, for example, by m-dependent sequences and by Gaussian sequences with summable covariances.
Case 2: Heavy Tailed Case
Assume that
where L(x) is a slowly varying function and 0<α<2. Then for the associated iid sequence (Chow and Teugels [5])
where
Here χ(A) denotes the indicator function of the event A, Wα(t, p) is the characteristic function of a stable law of index α and parameter p, and k is a constant depending on α and p. Note that for the heavy tailed case, U and V are dependent. Can the type of local dependence of the X-sequence make a difference to the Chow-Teugels limit? It can! If large values are cancelled by large negative values, then sums and maxima can be asymptotically independent. We show this by constructing an example:
Let {Yi} be a stationary sequence with
ϵ<1, γ≥1 and for ν>0 such that ϵ + ν< 1, let
Then it can be shown that {Xi} is stationary, 1-dependent and 1−Fx(x) = ϵ(ϵ + ν)−1x−ϵ, Fx(−x) = x−ϵ. Hence the limit distribution of for the associated iid sequence is the Chow-Teugels limit with α=ϵ and p = ϵ(2ϵ+ν)−1. The components of this limit are dependent. However, of the dependent {Xi} process can be shown to be asymptotically independent due the cancellation of large positive values by large negative values values, thus showing that local dependence may make a difference on the limit distribution. However, if we rule out this type of cancellation, then the limit distribution is not affected by the dependence in {Xi}, as the following theorem demonstrates (Anderson and Turkman [4]). One possible local condition which rules out this type of cancellation is Davis’ [6] D′(an) condition, which we assume in the theorem. This restrictive technical condition also rules out clustering of large and small values above and below certain thresholds. Types of processes which satisfy this condition (and others which we need in the theorem) can be seen in Davis [6].
Let {Ei} be an iid unit mean exponential sequence, and {δi} iid taking values +1 and −1 with probabilities p and q respectively. Then
Theorem 2.2
- If 0<α<1, p>0 and conditions D(an), D′(an) of Davis [6] hold then
where D = min {j : δj = 1}. - If 1≤α <2, p>0 and conditions D(an), D′ (an) and Dn(an) of Davis [6] hold then
- Under the conditions of (i) and (ii),
where
and
and C is a positive constant. Note that the value of the limit does not depend on the dependence structure of {Xi}.
These results seem to be discouraging for statistical applications. For example, for sequences with finite variance, the independence of (U, V) does not offer a basis for the use of average wind speeds in inferences about gusts, contrary to the evidence shown in data. This may be due to:
Time intervals are not long enough in practice for asymptotic results to give adequate approximations,
The correlation structure of the data is not well represented by our mixing and local dependence conditions,
Residual seasonality remains in the data. Based on these possible deviations, statistical models establishing connection between means and extreme events are suggested in Anderson and Turkman [3].
3. Extremes of Averages Over Different Time Scales
The specific problem to be adressed in this section is as follows: Let X, represent the instantaneous value of the environmental variable at time t, and denote by XT,t a moving average of {Xt} over the range T:
| (3) |
Then we are interested in the paired series {XT,t, XS,t} for different fixed S and T and in particular, in the joint distributional properties of extremes of the pair {XT,t, XS,t}.
We will give results only for sequences with the heavy tailed distributions. The light tailed case is more complicated, since in this case large values of the moving averages may occur due to the contribution of several relatively large values of the sequence in contrast to the heavy tailed case when large values of the moving averages are dominated by the largest value of the sequence. The techniques to be used to study these questions will be developments of those used by Davis and Resnick [8].
Suppose that are iid random variables with
where L(x) is slowly varying as x→∞, that is X ∈ D (Φα). Take constants an such that
Consider a point process Pn which puts points at . Hence Pn is the random point measure on sets in R+ × R
where
It is known that Pn converges weakly as n→∞ to a point process P, a Poisson process with mean measure μ. on R + × (R+ −{0}). (Davis and Resnick call this Poisson measure, the Poisson random measure PRM (µ) and consider a more general case which involve the left tail of the distribution as much as the right tail. Here due to the special simple form of the moving averages, we restrict ourselves to the space (0, ∞).)
Here
Hence
where are the Points of P.
Davis and Resnick [8] show that, correspondingly, for {XT,t} with same normalization
on R + ×(R+−{0}), where
and (tk, jk) are as above.
Results for the point process generated simultaneously by {XT,t} and {Xs,i} processes can be obtained with a straightforward generalization:
| (4) |
on R+×(R+−{0})2, where
The joint limit distribution of MT,n and MS,n now can easily be calculated since
| (5) |
Special Cases
[1] S = T. Then
so from theorem 3.1 of Davis and Resnick [8], we should have
Hence the result Eq. (5) is consistent with the existing results on this special case.
[2] S = 1 <T, so MS,n = maxi≤n Xi, and the result Eq. (5) says that
Note that the above distribution is the joint limit distribution of the maximum of the {Xi} process and the maximum of a moving average of it.
The above set up is a very simple case. The immediate question is: what if the {Xi} are themselves a dependent sequence?
We can get a partial answer by taking {Xi} to be itself a moving average:
say, where Zi satisfy the same conditions as the Xi and aj≥0 (for simplicity)Thus
and
Hence
say, where
Note that for any l = 1,2,…,
As before
| (6) |
where
Note that
for every i for which one of the components is non-zero. This is the case iff jkd+≤x and jkd+≤y, where d+ = max dì and d+ = max di. Hence the limit of Eq. (6) is given by
| (7) |
Hence we see that this kind of dependence in the underlying process does not change the form of the limit distribution. It would be interesting to obtain similar results for general stationary sequences. This could be done by using characterization of the limit point processes for the sequence of point processes with points given by Davis and Hsing [7] when {Xt} is a stationary sequence with regularly varying tails. In their paper, Davis and Hsing show that when {Xi} satisfies a proper mixing condition then
converges to a point process N of the form
where
is PRM(ν) with
(This is slightly weaker form of Davis-Hsing limit, since we consider the convergence only on R+ −{0} not involving the left tail.) They also show that under the proper mixing condition, the convergence of Nn to N is equivalent to:
For rn → ∞, rn/n →0 as n → ∞, and kn = [n/rn],
and
Here Q(.) is the distribution of the iid point processes
From this basic result it may be possible to obtain the limiting form of
where
and
One would expect that if the above sequence of point processes converges, then the limit point process should be of the form
where
Straightforward adjustment of Resnick’s [11] arguments which involve consecutive application of the continuous mapping theorem is not possible, since these arguments use the convergence of
which is clearly not satisfied by most sequences with strong local dependence.
Biography
About the authors: C. W. Anderson is a lecturer in the Department of Probability and Statistics of the University of Sheffield, UK and K. F. Turkman is a professor in the Department of Statistics and Operations Research of the University of Lisbon, Portugal.
4. References
- 1.Anderson CW, Turkman KF. The joint limiting distributions of sums and maxima in stationary sequences. J Appl Probab. 1991;28:33–44. [Google Scholar]
- 2.Anderson CW, Turkman KF. Limit distributions for sums and maxima. J Appl Probab. 1991;28:715–716. [Google Scholar]
- 3.Anderson CW, Turkman KF. Limiting joint distributions of sums and maxima in a statistical context. Theoriya Veroyatnostei ce Prim. 1992;37:352–354. [Google Scholar]
- 4.Anderson CW, Turkman KF. Sums and maxima of stationary sequences with heavy tailed distributions. 1992. submitted for publication. [Google Scholar]
- 5.Chow TL, LTeugels J. The sums and the maximum of i.i.d. random variables. Proc Second Prague Symp Asymptotic Statistics. 1978:81–92. [Google Scholar]
- 6.Davis RA. Stable limits for partial sums of dependent random variables. Ann Probab. 1983;11:262–269. [Google Scholar]
- 7.Davis RA, Hsing T. Point process and partial sums convergence for weakly dependent random variables with infinite variance. 1993. to be published. [Google Scholar]
- 8.Davis RA, Resnick S. Limit theory for moving averages of random variables with regulary varying tail probabilities. Ann Probab. 1985;13:179–195. [Google Scholar]
- 9.Fakrell JE, Robins AG. Concentration fluctuations and fluxes in plumes from point sources in a turbulent boundary layer. L Fluid Mech. 1982;117:1–26. [Google Scholar]
- 10.Mole N. A model of instrument smoothing and tresholding in measurements of turbulent dispersion. Atmos Environ. 1990;24:313–323. [Google Scholar]
- 11.Resnick SI. Extreme values, Regular Variation, and Point Processes. Springer-Verlag; New York: 1987. pp. 231–237. [Google Scholar]
