Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2021 Dec 10;118(50):e2118893118. doi: 10.1073/pnas.2118893118

Taylor’s law and heavy-tailed distributions

W Brent Lindquist a,1, Svetlozar T Rachev a,1
PMCID: PMC8685895  PMID: 34893544

Lionel Roy Taylor’s law of the mean (TLM), also known as Taylor’s law (13), can be described succinctly as

sample variance a (sample mean)b,a>0, b>0, [1]

for a sample of positive elements. Careful consideration of TLM (2, 4, 5) raises two unanswered fundamental questions in natural and statistical science.

  • 1)

    Is TLM a unique phenomenon in nature, or should alternative extensions and refinements of TLM, exploiting higher moments and various measures for dependence (association), be explored?

  • 2)

    What are the proper measures for the location, spread, asymmetry, and dependence (association) for random samples with infinite mean?

Applying non-Gaussian (stable) limit theorems, Cohen et al. (4) show that the limiting distribution of [sample variance]/[sample mean]b exists even if the sample is drawn from a distribution with support on R+ having infinite mean. A similar limiting result was obtained by Drton and Xiao (6) and Pillai and Meng (7) with the Cauchy distribution as the limiting distribution. To describe how truly bewildering these results are, consider the following scenario from finance. Take a long-only portfolio consisting of n>1 risky assets, with a one-period, random portfolio return given by

rp=i=1nwi ri, wi0,i=1nwi=1. [2]

Assume the asset one-period returns r = (r1,, rn) have a joint distribution determined by

(r1, ,rn)=d(X1Y1,,XnYn), [3]

where Xn = (X1,, Xn) and Yn = (Y1,, Yn) are two independent and identically distributed (iid) Gaussian vectors with zero mean and covariance matrix Σ=[σij], with σii>0, i,j = 1,,n. The Drton and Xiao (6)–Pillai and Meng (7) theorem states that rp has a standard Cauchy distribution with probability density function (pdf)

frp(x)=1π(1+x2),xR, [4]

for any choice of Σ. In [4], the portfolio weights (w1,,wn) can be random as long as they are independent of (Xn,Yn). A portfolio allocation with random weights can occur if the portfolio manager has prior information not directly related to the fundamentals of the underlying assets, a scenario occurring frequently under active portfolio management. The Drton and Xiao (6)–Pillai and Meng (7) result indicates that such active portfolio management is meaningless when the portfolio return rp has a distribution with infinite mean as in [4]. One concludes then that portfolio diversification is meaningless when the asset returns do not have a (finite) mean value, leading to a tantamount problem for portfolio risk budgeting (811).

It is our firm opinion that the Drton and Xiao (6)–Pillai and Meng (7) and Cohen et al. (4) results are fundamental, as they shed further light on fascinating statistical phenomena occurring when dealing with samples from a distribution with infinite mean.

Is TLM Unique

We address question 1 by starting with a reminder of the usual estimators for sample mean and variance (12).

Mean is “a measure for location or central value for a continuous variable. … For a sample of observations x1,x2,,xn the measure is calculated as x¯=n1i=1nxi. Most useful when the data have a symmetric distribution and do not contain outliers” (ref. 12, p. 274).

Variance is, “in a population, the second moment about the mean. An unbiased estimator of the population value is provided by s2 given by s2=(n1)1i=1n(xix¯)2 where x1,x2,,xn are the n sample observations and x¯ is the sample mean” (ref. 12, p. 445).

Note, particularly, the qualification that the mean estimator is most useful for data that are symmetrically distributed and have no outliers. Consider the statistic Wn for the Taylor’s law ratio,

Wn=sample variance(sample mean)b=(n1)1i=1n(XiX¯)2(n1i=1nXi)b. [5]

When the sample Xn = (X1,,Xn) consists of iid random variables (RVs) with regularly varying tails having tail index 0 < α < 1, neither does the sample variance in the numerator of [5] represent a measure for the dispersion of Xn nor does X¯ in the denominator represent a measure for location in Xn. It is truly remarkable that the ratio Wn has a limit in distribution.

In PNAS, Brown et al. (5) extend the relationship in [5] to a more general form Mh2/(Mh1)b where Mh is the h th central moment, and the exponent has the dependence b=b(h1,h2,α). We suggest an approach that is framed in terms of an operator limit theorem (1317) for the first four central moments. Let X n(α)= (X1(α),,Xn(α)) be an iid random sample drawn from a continuous, unimodal distribution function FX(α) having regularly varying tails with tail index α > 0. Denote the first four central moments as (we are interested in n)

X¯n,α=n1i=1nXi(α), sn,α2=n1i=1n(Xi(α)X¯n,α)2 ,γn,α=n1i=1n(Xi(α)X¯n,α)3 ,κn,α=n1i=1n(Xi(α)X¯n,α)4 .

The operator limit theorem would determine normalizing constants an(i,α)>0 and bn(i,α)R, i =1, 2, 3, 4, governing the vector limit

[an(1,α)X¯n,αbn(1,α)an(2,α)sn,α2bn(2,α)an(3,α)γn,αbn(3,α)an(4,α)κn,αbn(4,α)]dG(α)=[G1(α)G2(α)G3(α)G4(α)] . [6]

Equally importantly, the theorem should address the rate of convergence to the vector of limiting distributions. A functional limit theorem [a Donsker (18)–Prokorov (19) type invariance principle] should also be proved. The theorem will have to consider separate ranges for the tail index: 0 <α < 1 (the mean of X¯n,α does not exist), 1 < α < 2 (the mean of sn,α2 does not exist), 2 < α < 3 (the mean of γn,α does not exist), 3 < α < 4 (the mean of κn,α does not exist), and 4 < α [where the limiting distribution can be obtained from the functional central limit theorem (18, 19)], as well as at the critical values α = 1, 2, 3, 4.

TLM Moment Measures

To address question 2 for heavy-tailed distributions, robust versions of these four central moments must be considered (2023), and TLM must be reformulated in terms of the robust moments. One possible set of robust estimators might be (for brevity, we drop the superscript (α)):* the median, m(Xn); the interquartile range IQR(Xn); the medcouple (24)*

γn(medcouple)=m{h(Xi;n,Xj;n)}, Xi;nm(Xn)Xj;n, [7]

where X1;nXn;n denotes the ordered sample, and the function h() is defined in Brys et al. (24); and (25)

κn(robust)=(FX(inv)(3/8)FX(inv)(1/8))+(FX(inv)(7/8)FX(inv)(5/8))FX(inv)(6/8)FX(inv)(2/8) . [8]

Then the usual form of the TLM would have the robust estimator

Wn(robust)=IQR(Xn)(m(Xn))b , [9]

where the constant b>0 can be estimated from a robust form of the regression model

ln(IQR(Xn))=b0+b ln(m(Xn))+εn, [10]

over the given sample. The ratio [9] is robust with respect to strictly increasing, one-to-one, continuous mappings on R to R, making the statistic useful when FX has heavy tails. It would be of interest to study the limiting behavior (26, 27) of Wn(robust) and the corresponding convergence rate.

Footnotes

The authors declare no competing interest.

See companion article, “Taylor’s law of fluctuation scaling for semivariances and higher moments of heavy-tailed data,” 10.1073/pnas.2108031118.

*For samples with small size n, it is important to carefully define the estimators in the case when the observations are tied.

References

  • 1.Taylor R. A. J., Taylor’s Power Law (Elsevier, Cambridge, MA, 2019). [Google Scholar]
  • 2.Brown M., Cohen J. E., de la Pena V. H., Taylor’s law, via ratios, for some distributions with infinite mean. J. Appl. Probab. 54, 657–669 (2017). [Google Scholar]
  • 3.Eisler Z., Imre B., Kertész J., Fluctuation scaling in complex systems: Taylor’s law and beyond. Adv. Phys. 57, 89–142 (2008). [Google Scholar]
  • 4.Cohen J. E., Davis R. A., Samorodnitsky G., Heavy-tailed distributions, correlations, kurtosis and Taylor’s law of fluctuation scaling. Proc. R. Soc. Lond. A Math. Phys. Sci. 476, 20200610 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Brown M., Cohen J. E., Tang C.-F., Yam S. C. P., Taylor’s law of fluctuation scaling for semivariances and higher moments of heavy-tailed data. Proc. Natl. Acad. Sci. U.S.A. 118, e2108031118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Drton M., Xiao H., Wald tests of singular hypotheses. Bernoulli 22, 38–59 (2016). [Google Scholar]
  • 7.Pillai N. S., Meng X.-L., An unexpected encounter with Cauchy and Lévy. Ann. Stat. 44, 2089–2097 (2016). [Google Scholar]
  • 8.Nešlehová J., Embrechts P., Chavez-Demoulin V., Infinite mean models and the LDA for operational risk. J. Oper. Risk 1, 3–25 (2006). [Google Scholar]
  • 9.Campbell R. A. J., Forbes C. S., Koedijk K. G., Kofman P., Increasing correlations or just fat tails? J. Empir. Finance 15, 287–309 (2008). [Google Scholar]
  • 10.Peters G., Targino R., Shevchenko P. V., Understanding operational risk capital approximations: first and second orders. SSRN [Preprint] (2013). 10.2139/ssrn.2980465(Accessed 15 April 2021). [DOI]
  • 11.Chavez-Demoulin V., Embrechts P., Hofert M., An extreme value approach for modeling operational risk losses depending on covariates. J. Risk Insur. 83, 735–776 (2016). [Google Scholar]
  • 12.Everitt B., Skrondal A., The Cambridge Dictionary of Statistics (Cambridge University Press, Cambridge, United Kingdom, 2002), vol. 106. [Google Scholar]
  • 13.Jurek Z. J., Mason J. D., Operator-Limit Distributions in Probability Theory (John Wiley, New York, NY, 1993). [Google Scholar]
  • 14.Maejima M., Limit theorems related to a class of operator-self-similar processes. Nagoya Math. J. 142, 161–181 (1996). [Google Scholar]
  • 15.Paulauskas V., Rachev S. T., Cointegrated processes with infinite variance innovations. Ann. Appl. Probab. 8, 775–792 (1998). [Google Scholar]
  • 16.Meerschaert M. M., Scheffler H. P., Limit Distributions for Sums of Independent Random Vectors: Heavy Tails in Theory and Practice (John Wiley, New York, NY, 2001). [Google Scholar]
  • 17.Kozubowski T. J., Meerschaert M. M., Panorska A. K., Scheffler H.- P., Operator geometric stable laws. J. Multivariate Anal. 92, 298–323 (2005). [Google Scholar]
  • 18.Donsker M., An invariance principle for certain probability limit theorems. Mem. Am. Math. Soc. 6, 1–10 (1951). [Google Scholar]
  • 19.Prokhorov Y., Convergence of random processes and limit theorems in probability theory. Theory Probab. Appl. 1, 157–214 (1956). [Google Scholar]
  • 20.Huber P. J., Robust Statistics (Wiley Series in Probability and Mathematical Statistics, John Wiley, New York, NY, 2004), vol. 523. [Google Scholar]
  • 21.Hoaglin D. C., Mosteller F., Tukey J. W., Exploring Data Tables, Trends, and Shapes (Wiley Series in Probability and Mathematical Statistics, John Wiley, New York, NY, 1985), vol. 62. [Google Scholar]
  • 22.Rousseeuw P. J., Croux C., Alternatives to the median absolute deviation. J. Am. Stat. Assoc. 88, 1273–1283 (1993). [Google Scholar]
  • 23.Maronna R. A., Martin R. D., Yohai V. J., Salibián-Barrera M., Robust Statistics: Theory and Methods (with R) (John Wiley, New York, NY, 2019). [Google Scholar]
  • 24.Brys G., Hubert M., Struyf A., A robust measure of skewness. J. Comput. Graph. Stat. 13, 996–1017 (2004). [Google Scholar]
  • 25.Moors J. J. A., Wagemakers R. Th. A., Coenen V. M. J., Heuts R. M. J., Janssens M. J. B. T., Characterizing systems of distributions by quantile measures. Stat. Neerl. 50, 417–430 (1996). [Google Scholar]
  • 26.van der Vaart A. W., Wellner J. A., “Weak convergence” in Weak Convergence and Empirical Processes, van der Vaart A. W., Wellner J. A., Eds. (Springer, New York, NY, 1996), pp. 16–28. [Google Scholar]
  • 27.Shorack G. R., Wellner J. A., Empirical Processes with Applications to Statistics (Society for Industrial and Applied Mathematics, Philadelphia, PA, 2009). [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES