Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2010 Aug 26;5(8):e12256. doi: 10.1371/journal.pone.0012256

Statistic Complexity: Combining Kolmogorov Complexity with an Ensemble Approach

Frank Emmert-Streib 1,*
Editor: Enrico Scalas2
PMCID: PMC2928735  PMID: 20865047

Abstract

Background

The evaluation of the complexity of an observed object is an old but outstanding problem. In this paper we are tying on this problem introducing a measure called statistic complexity.

Methodology/Principal Findings

This complexity measure is different to all other measures in the following senses. First, it is a bivariate measure that compares two objects, corresponding to pattern generating processes, on the basis of the normalized compression distance with each other. Second, it provides the quantification of an error that could have been encountered by comparing samples of finite size from the underlying processes. Hence, the statistic complexity provides a statistical quantification of the statement ‘Inline graphic is similarly complex as Inline graphic’.

Conclusions

The presented approach, ultimately, transforms the classic problem of assessing the complexity of an object into the realm of statistics. This may open a wider applicability of this complexity measure to diverse application areas.

Introduction

Complex systems is the study of interactions of simple building blocks that result in a collective behavior or properties absent in the elementary components of the system itself. Due to the fact that this problem does not fit into one of the traditional research fields, it is connected to various of these, for instance physics, biology, chemistry or econometrics [1][5]. Many measures, properties or characteristics of a multitude of different complex systems from these fields has been studied to date [6][8], however, the complexity of an object may have received the most attention. This property of complex systems has fascinated generations of scientists [9][11] trying to quantify such a notation. Very coarsely speaking, an object is said to be ‘complex’ when it does not match patterns regarded as simple, as López-Ruiz et al. [12] describe it in their article. Over the last decades, many approaches have been suggested to define the complexity of an object quantitatively [9], [11], [13][19]. An intrinsic problem with such a measure is that there are various ways to perceive and, hence, characterize complexity leading to complementing complexity measures [20]. For example, Kolmogorov complexity [9], [11], [21] is based on algorithmic information theory considering objects as individual symbol strings, whereas the measures effective measure complexity (EMC) [17], excess entropy [22], predictive information [23] or thermodynamic depth [18] relate objects to random variables and are ensemble based. Interestingly, despite considerable differences among all these complexity measures Inline graphic they all have in common that they assign a complexity value to each individual object Inline graphic under consideration, Inline graphic. In this paper we will assume that Inline graphic corresponds to a string sequence of a certain length and its components assume values from a certain domain, e.g., Inline graphic or Inline graphic. It is of importance to note that there is a conceptually different measure recently introduced by Vitányi et al. that evaluates the complexity distance among two objects Inline graphic and Inline graphic instead of their absolute values. This measure is called the normalized compression distance (NCD) [24], Inline graphic, and is based on Kolmogorov complexity [10].

The purpose of this paper is to introduce a new measure of complexity we call statistic complexity that is not only different to all other complexity measures introduced so far, but also connects directly to statistics, specifically, to statistical inference [25], [26]. More precisely, we introduce a complexity measure with the following properties. First, the measure is bivariate comparing two objects, corresponding to pattern generating processes, on the basis of the normalized compression distance with each other. Second, this measure provides the quantification of an error that could have encountered by comparing samples of finite size from the underlying processes. Hence, the statistic complexity provides a statistical quantification of the statement ‘Inline graphic is similarly complex as Inline graphic’.

This paper is organized as follows. In the next section we describe the general problem in more detail and introduce our complexity measure. Then we present numerical results and provide a discussion. We finish with conclusions and an outlook.

Methods

Currently, a commonly acknowledged, rigorous mathematical definition of the complexity of an object is not available. Instead, when complexity measures are suggested they are normally assessed by their behavior with respect to three qualitative patterns, namely simple, random (chaotic) and complex patterns. Qualitatively, a complexity measure is considered good if: (1) the complexity of simple and random objects is less than the complexity value of complex objects [17], (2) the complexity of an object does not change if the system size changes. For example, Kolmogorov complexity has the desireable property to remain unchanged if the system size doubles, i.e., Inline graphic, however, it cannot distinguish random from complex pattern because in both cases the compressibility of an object is low resulting in high values of Inline graphic. We want to add a third property to the above criteria: (3) A complexity measure should quantify the uncertainty of the complexity value. As motivation for this property we just want to mention that there is a crucial difference between an observed object Inline graphic and its generating process Inline graphic [23]. If the complexity of Inline graphic should be assessed, based on the observation Inline graphic only, this assessment may be erroneous. This error may stem from the limited (finite) size of observations. Also, the possibility of measurement errors would be another source derogating the ability of an error-free assessment. For this reason, the major objective of this article is to introduce a complexity measure possessing all three properties listed above that assesses the complexity classes of the underlying processes instead of individual objects.

We start by pointing out that criteria (1) provides a relative statement connecting different objects. That means the complexity of an object is always related to the complexity of another object [20] leading to relative statements like ‘Inline graphic is similarly complex as Inline graphic’. Hence, a numerical value Inline graphic without knowledge of any other complexity value for other objects has no meaning at all. For reasons of mathematical rigor, we propose to include this implicit reference point into a proper definition of complexity. This implies that a fundamental complexity measure needs to be bivariate, Inline graphic, instead of univariate comparing two processes Inline graphic and Inline graphic. As a side note, we remark that all complexity measures suggested so far we are aware of are univariate measures [13], [14], [16][18], [22], [23] with respect to the context set above, except for the normalized compression distance (NCD) [24], [27]. However, a practical problem of the NCD is that Kolmogorov complexity, on which it is based, is not computable but only upper semi-computable [27]. Li et al. introduced in [27] a normalized and universal metric called normalized information distance (NID) which can be approximated by,

graphic file with name pone.0012256.e026.jpg (1)

the normalized compression distance [27]. Here, Inline graphic denotes the compression size of string Inline graphic and Inline graphic the compression size of the concatenated stings Inline graphic and Inline graphic. Practically, the quantities Inline graphic are obtained by compressors like gzip or bzip2, see [28], [29] for details.

Criteria (3) of a complexity measure stated above acknowledges the fact that an assessment of an object's complexity cannot be without uncertainty or error in case only finite information about this object is available. That means, for a complexity measure to be applicable to real objects (rather than pure mathematical ones) it has to be statistic in order to deal appropriately with incomplete information. Based on these considerations, the statistic complexity measure we suggest is defined by the following procedure visualized in Fig. 1:

Figure 1. Visualization of the problem and the construction of the test statistic from observations.

Figure 1

The double headed arrows represent comparisons of entities. Red indicates that this comparison cannot be performed because the two entities are hidden (unobservable) whereas blue indicates a feasible comparison.

  1. Estimate the empirical distribution function Inline graphic (We indicate estimated entities by Inline graphic and refer to the ensemble by Inline graphic.) of the normalized compression distance from Inline graphic samples, Inline graphic, from objects Inline graphic and Inline graphic of size Inline graphic generated by process Inline graphic (Here Inline graphic means that Inline graphic is generated (or drawn) from process (distribution) Inline graphic.).

  2. Estimate the empirical distribution function Inline graphic of the normalized compression distance from Inline graphic samples, Inline graphic, from objects Inline graphic and Inline graphic of size Inline graphic from two different processes, Inline graphic and Inline graphic.

  3. Determine Inline graphic and Inline graphic.

  4. Define, Inline graphic, as statistic complexity

This procedure corresponds to a two-sided, two-sample Kolmogorov-Smirnov (KS) test [30], [31] based on the normalized compression distance [24], [27] obtaining distances among observed objects. The statistic complexity corresponds to the p-value of the underlying null hypotheses, Inline graphic, and, hence, assumes values in Inline graphic. The null hypothesis is a statement about the null distribution of the test statistic Inline graphic, and because the distribution functions are based on the normalized compression distances among objects Inline graphic and Inline graphic, drawn from the processes Inline graphic and Inline graphic, this leads to a statement about the distribution of normalized compression distances. Hence, verbally, Inline graphic can be phrased as ‘in average, the compression distance of objects from Inline graphic to objects from Inline graphic equals the compression distance of objects only taken from Inline graphic’. It is important to emphasize that this equality holds in average and, thus needs to be connected to two ensembles Inline graphic and Inline graphic. If the alternative hypothesis, Inline graphic, is true this equality does no longer hold implying differences in the underlying processes Inline graphic and Inline graphic, leading to differences in the NCDs. From the formulation of the hypotheses, tested by the statistic complexity, it is apparent that we are following closely the guiding principle expressed by López-Ruiz et al. [12] as cited at the beginning of this paper, because Inline graphic is intrinsically a comparative measure. As a side note regarding the choice of the null hypothesis we want to remark that substituting Inline graphic with Inline graphic may encounter problems in cases where the complexity value of objects in Inline graphic is systematically shifted compared to the complexity value of objects in Inline graphic. In this case, the distributions Inline graphic and Inline graphic could be similar, although, the complexity of elements in Inline graphic and Inline graphic are different. Practically, this may correspond to a pathological case rarely encountered in practice, however, conceptually, such a null hypothesis is apparently less stringent.

Regarding the notation and interpretation of the above procedure it is important to note the following. First, the entities Inline graphic and Inline graphic refer to values of the NCD. For example, Inline graphic whereas Inline graphic and Inline graphic are observable objects that are identically and independently (iid) generated from a process Inline graphic, Inline graphic. Because Inline graphic and Inline graphic are generated from the same process Inline graphic, the resulting distribution function Inline graphic is only indexed by this process. The Inline graphic entities are obtained similarly, however, in this case Inline graphic and Inline graphic are objects generated from two different processes, namely Inline graphic and Inline graphic. For this reason the distribution function is indexed by these two processes, Inline graphic. Second, we use the notation, Inline graphic, to indicate that Inline graphic is generated from a process Inline graphic, but also that Inline graphic is drawn from Inline graphic. The first meaning is clear if thinking of Inline graphic as a model for a complex system, e.g., a cellular automata or a stochastic process. The latter emphasizes the fact that such a process, even if deterministic, becomes random with respect to, e.g., random initial conditions and, hence, effectively is a stochastic process. Third, for reasons of conceptual simplicity we require all objects to have the same size Inline graphic. This condition may be relaxed to allow objects of varying sizes but it may require additional technical consideration. On a technical note, the above defined statistic complexity has the very desirable property that the power reaches asymptotically Inline graphic for Inline graphic and Inline graphic [32]. This means, for infinite many observations the error of the test to falsely accept the null hypotheses when in fact the alternative is true becomes zero. This limiting property is important to hold, because in this case all information about the system is available and, hence, it would be implausible if for such circumstances no error-free decision could be achieved. Formally, this property can be stated as Inline graphic for Inline graphic and Inline graphic. Finally, we would like to note that despite the fact that statistic complexity is a statistical test, it borrows part of its strength from the NCD respectively Kolmogorov complexity on which this is based on. Hence, it unites various properties from very different concepts.

Results

In the following we provide different numerical examples for data frequently used when studying complexity measures. This allows a direct comparison of ours with different measures.

The first characteristic of the statistic complexity we study is the influence of the size Inline graphic of objects on Inline graphic. Table 1 shows the results for comparing patterns generated by different rules of one-dimensional cellular automata. Column one represents the reference process, Inline graphic, and column two corresponds to Inline graphic. The third and fifty column shows the averaged p-values obtained for cellular automata of length Inline graphic respectively Inline graphic - column four and six provide the variances for the corresponding p-values. For the simulation results shown in Table 1 we generated spatiotemporal patterns for one-dimensional CA for Inline graphic (space) and Inline graphic (time), an alphabet of size Inline graphic and a Inline graphic neighborhood with periodic boundary conditions. As burn-in time we used Inline graphic time steps. Each of these spatiotemporal pattern Inline graphic, with Inline graphic and Inline graphic, is transformed to its difference pattern Inline graphic (Here Inline graphic with Inline graphic corresponds to a row vector of length Inline graphic.) resulting in a string (object) of length Inline graphic to be applicable for the NCD. Here, the operator Inline graphic means concatenation of strings. See [29] for numerical details for the application of NCD. The results in Table 1 show that the p-values remain in the same order of magnitude if the size of an object Inline graphic is doubled meaning that the overall quantitative assessment of two processes Inline graphic and Inline graphic - based on sampled objects thereof - by the measure Inline graphic is invariant to extensions of the size Inline graphic. Next we demonstrate that the statistic complexity is capable to differentiate between random and complex objects. For this reason we compare rule Inline graphic, producing random patterns, with rule Inline graphic, Inline graphic, both random, and rule Inline graphic, which is complex because it is capable of universal computation. From Table 1 one can see that the p-values correspond with our expectations giving high values for Inline graphic and Inline graphic and low values for Inline graphic. In addition we compare rule Inline graphic with rules Inline graphic and Inline graphic, classified according to Wolfram as random, and obtain very low p-values, suggesting significant differences among those patterns. The crucial point here is that not all CA rules that produce chaotic patterns are indistinguishable from each other. In [33] the growth exponent of the roughness along other measures have been used to obtain several subclasses for CA rules leading to chaotic behavior. Comparing our results with their classification reveals that actually rule Inline graphic and Inline graphic are in different subclasses whereas rule Inline graphic is classified together with rule Inline graphic and Inline graphic. Last, we compare rule Inline graphic with a periodic pattern, rule Inline graphic, and obtain also in this case a clear distinction. In summary, Inline graphic can not only distinguish between simple and complex patterns but finds also meaningful substructures among chaotic patterns if rule Inline graphic is used as reference process.

Table 1. Results for one-dimensional CA (Inline graphic, Inline graphic, Inline graphic (third and fourth column) and Inline graphic (fifth and sixth column)) averaged over Inline graphic runs.

X Y T = 100 T = 200
CA rule CA rule Inline graphic Inline graphic Inline graphic Inline graphic
30 30 0.593 0.075 0.684 0.102
30 90 0.617 0.102 0.575 0.139
30 225 0.388 0.131 0.632 0.086
30 73 0.002 0.001 0.002 0.001
30 54 0.002 0.000 0.001 0.001
30 22 0.002 0.000 0.001 0.001
30 33 0.001 0.001 0.002 0.000
30 110 0.002 0.001 0.002 0.001

First column: process Inline graphic. Second column: process Inline graphic. Sample size is Inline graphic.

Next, we apply our measure to the logistic map and compare the results with the Lyapunov exponent (Inline graphic). The results are summarized in Fig. 2. We calculate the time series for various values of Inline graphic (x-axis) in the intervall Inline graphic (Inline graphic was varied in step sizes of Inline graphic and sample size was Inline graphic.). Inline graphic assumes negative values in Inline graphic and Inline graphic indicating a nonchaotic behavior of the logistic map for these values of Inline graphic. The vertical dashed line separates positive from negative values. The p-values of the statistic complexity (blue line, cross symbols) are obtained for each value of Inline graphic by averaging over Inline graphic time series each of length Inline graphic (After waiting a transient period of Inline graphic steps.). As reference process, Inline graphic, we use a logistic map with Inline graphic, which corresponds to a periodic behavior. From Fig. 2 one can see that there are essentially two types of p-values, ones that are not zero and ones that are close to zero. For example, using a significance level of Inline graphic (dotted horizontal line) one obtains that significant values correspond to positive Lyapunov exponents and non-significant values to negative Lyapunov exponents. Again, we want to emphasize that the p-values do not provide a yes or no answer if the logistic map, for a given Inline graphic value, is chaotic or nonchaotic but the correct interpretation is that low p-values provide strong evidence against the null hypotheses whereas high p-values do not allow to reject the null hypotheses. Because we use Inline graphic as reference - for which the logistic map shows periodic (nonchaotic) behavior - this is a similar though not identical question. The results for the logistic map allow a comparison with a well studied system. As demonstrated by our results shown in Fig. 2, for an appropriately chosen reference process, Inline graphic, there is a clear correspondence between the statistic complexity and the Lyapunov exponent. This property is certainly desirable to hold because it may allows to connect to traditional contributions in the field beyond the logistic map. The possibility of such a connection, despite the seemingly different methods underlying the statistic complexity respectively the Lyapunov exponent, can be attributed to the parametric form of our complexity measure allowing a flexibility that is entirely missing in other measures. More importantly, this flexibility is not imposed into the measure but follows naturally from a consequent interpretation of complexity as a referential measure [12] implying imperatively the existence of a reference process Inline graphic against which another process Inline graphic is quantitatively compared.

Figure 2. Lyapunov exponent (Inline graphic - red line, plus symbol) and p-values (blue line, cross symbol) of the logistic map in dependence on Inline graphic.

Figure 2

The dotted horizontal line corresponds to a significance level of Inline graphic and the dashed line to Inline graphic.

Discussion

The complexity measure introduced in this paper has several properties that are different to all other measures proposed so far. First, Inline graphic is a bivariate measure allowing to make comparative statements, instead of absolute ones. This may appear as a disadvantage first, however, as López-Ruiz et al. [12] point out, we inevitably compare patterns with each other to make a decision about their complexity (See also the comparative discussion on page 909 in [17] about the three patterns shown in Fig. 1.) [20]. Second, we do not make assumptions with respect to the size of patterns to which our measure can be applied, instead, principally, we allow patterns of any finite or infinite size Inline graphic. For example, measures like EMC or excess entropy are based on block entropies of varying order Inline graphic and the final measure is obtained in the limit for Inline graphic against infinity. Strictly, such measures require an infinite amount of data. Third, due to the fact that statistic complexity allows the comparison of patterns of any size Inline graphic with finite sample sizes Inline graphic and Inline graphic the result of the comparison may be erroneous. The KS test, underlying Inline graphic, allows a quantification of such an error statistically. Because this error can be quantified in dependence on Inline graphic, Inline graphic and Inline graphic, there is no need to assume limiting properties. At this point we would like to re-emphasize that the term statistic complexity has been chosen to underline the involvement of a test statistic in our measure on which the complexity value is based. For this reason other complexity measures that have been named statistical complexity [12], [34], [35] are not similar to our measure at all due to the fact that none of these measures uses a test statistic or a statistical test. Hence, they are actually not related to statistics (the field). An alternative name for these measures would be probabilistic complexity, which would make this difference more obvious. The fourth point relates to the empirical distribution functions. The reason for their introduction is, besides the fact that they allow a connection to the KS test, they allow the introduction of two ensembles, one for the process Inline graphic and one for processes Inline graphic. These ensembles compensate that the classic Kolmogorov complexity is not related to any ensemble but only to one string. Further, the ensembles induce a probabilistic interpretation of the deterministic NCD with respect to the underlying processes that generate the patterns. This is in accordance with [17] emphasizing the importance of complexity measures being probabilistic. Taken together, this allows a quantifiable approximation, in dependence on Inline graphic, Inline graphic and Inline graphic, of the underlying processes Inline graphic and Inline graphic with respect to the information they provide about their complexity, in form of the real observable patterns.

From an applied point of view, the direct connection of statistic complexity with statistical inference allows a confirmatory analysis of the complexity of objects. Due to the fact that the uncertainty of a complexity comparison is inherently provided by our measure, it is applicable to (real) objects from a multitude of different application domains. In the future we are planing to investigate the complexity of biological pathways in the context of cancer and other complex diseases [37]. A further potential direction would be an analysis of different goodness-of-fit tests. For example, it would be interesting to study a Cramér-von Mises or an Anderson-Darling test, instead of a Kolmogorov-Smirnov test [36]. Other tests may have advantages in different application areas or specific experimental conditions, although, a Kolmogorov-Smirnov test was sufficient with respect to the applications studied in this paper.

Acknowledgments

For the statistical simulations R [38] has been used.

Footnotes

Competing Interests: The author has declared that no competing interests exist.

Funding: The author has no support or funding to report.

References

  • 1.Bar-Yam Y. Dynamics of Complex Systems. Perseus Books; 1997. [Google Scholar]
  • 2.Nicolis G, Prigogine I. Exploring Complexity. Freeman; 1989. [Google Scholar]
  • 3.Prokopenko M, Boschetti F, Ryan A. An information-theoretic primer on complexity, self-organization, and emergence. Complexity. 2009;15:11–28. [Google Scholar]
  • 4.Schuster H. Complex Adaptive Systems. Scator Verlag; 2002. [Google Scholar]
  • 5.Wolfram S. Statistical mechanics of cellular automata. Phys Rev E. 1983;55:601–644. [Google Scholar]
  • 6.Dehmer M. A novel method for measuring the structural information content of networks. Cybernetics and Systems. 2008;39:825–842. [Google Scholar]
  • 7.Dehmer M, Barbarini N, Varmuza K, Graber A. A large scale analysis of information-theoretic network complexity measures using chemical structures. PLoS ONE. 2009;4:e8057. doi: 10.1371/journal.pone.0008057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Watts D, Strogatz S. Collective dynamics of ‘small-world’ networks. Nature. 1998;393:440–442. doi: 10.1038/30918. [DOI] [PubMed] [Google Scholar]
  • 9.Kolmogorov AN. Three approaches to the quantitative definition of ‘information’. Problems of Information Transmission. 1965;1:1–7. [Google Scholar]
  • 10.Li M, Vitányi P. An Introduction to Kolmogorov Complexity and Its Applications. Springer; 1997. [Google Scholar]
  • 11.Solomonoff R. A preliminary report on a general theory of inductive inference. 1960. Technical Report V-131, Zator Co., Cambridge, Ma.
  • 12.López-Ruiza R, Mancinib H, Calbet X. A statistical measure of complexity. Physics Letters A. 1995;209:321–326. [Google Scholar]
  • 13.Badii R, Politi A. Complexity: Hierarchical Structures and Scaling in Physics. 1997. Cambridge University Press, Cambridge.
  • 14.Bennett C. Logical depth and physical complexity. In: Herken R, editor. The Universal Turing Machine– a Half-Century Survey. Oxford University Press; 1988. pp. 227–257. [Google Scholar]
  • 15.Crutchfield JP, Young K. Inferring statistical complexity. Phys Rev Lett. 1989;63:105–108. doi: 10.1103/PhysRevLett.63.105. [DOI] [PubMed] [Google Scholar]
  • 16.Gell-Mann M, Lloyd S. Information measures, effective complexity, and total information. Complexity. 1998;2:44–52. [Google Scholar]
  • 17.Grassberger P. Toward a quantitative theory of self-generated complexity. Int J Theor Phys. 1986;25:907–938. [Google Scholar]
  • 18.Lloyd S, Pagels H. Complexity as thermodynamic depth. Annals of Physics. 1988;188:186–213. [Google Scholar]
  • 19.Zurek W, editor. Complexity, Entropy and the Physics of Information. 1990. Addison-Wesley, Redwood City.
  • 20.Grassberger P. Problems in quantifying self-generated complexity. Helvetica Physica Acta. 1989;62:489–511. [Google Scholar]
  • 21.Chaitin G. On the length of programs for computing finite binary sequences. Journal of the ACM. 1966:547–569. [Google Scholar]
  • 22.Crutchfield J, Packard N. Symbolic dynamics of noisy chaos. Physica D. 1983;7:201–223. [Google Scholar]
  • 23.Bialek W, Nemenman I, Tishby N. Predictability, complexity, and learning. Neural Computation. 2001;13:2409–2463. doi: 10.1162/089976601753195969. [DOI] [PubMed] [Google Scholar]
  • 24.Cilibrasi R, Vitányi P. Clustering by compression. IEEE Transactions Information Theory. 2005;51:1523–1545. [Google Scholar]
  • 25.Casella G, Berger R. Statistical Inference. Duxbury Press; 2002. [Google Scholar]
  • 26.Mood A, Graybill F, Boes D. Introduction to the Theory of Statistics. McGrw-Hill; 1974. [Google Scholar]
  • 27.Li M, Chen X, Li X, Ma B, Vitányi P. The similarity metric. IEEE Transactions on Information Theory. 2004;50:3250–3264. [Google Scholar]
  • 28.Cebrian M, Alfonseca M, Ortega A. Common pitfalls using the normalized compression distance: What to watch out for in a compressor. Communications in Information and Systems. 2005;5:367–384. [Google Scholar]
  • 29.Emmert-Streib F. Exploratory analysis of spatiotemporal patterns of cellular automata by clustering compressibility. Physical Review E. 2010;81:026103. doi: 10.1103/PhysRevE.81.026103. [DOI] [PubMed] [Google Scholar]
  • 30.Conover W. Practical Nonparametric Statistics. 1999. John Wiley & Sons, New York.
  • 31.Smirnov N. Estimate of deviation between empirical distribution functions in two independent samples. Bulletin Moscow University. 1939;2:3–16. [Google Scholar]
  • 32.Milbrodt H, Strasser H. On the asymptotic power of the two-sided kolmogorov-smirnov test. Journal of Statistical Planning and Inference. 1990;26:1–23. [Google Scholar]
  • 33.Mattos T, Moreira J. Universality Classes of Chaotic Cellular Automata. Brazilian Journal of Physics. 2004;34:448–451. [Google Scholar]
  • 34.Crutchfield J, Feldman D. Statistical complexity of simple one-dimensional spin systems. Phys Rev E. 1997;55:R1239. [Google Scholar]
  • 35.Feldman D, Crutchfield J. Measures of statistical complexity: Why? Physics Letters A. 1998;238:244–252. [Google Scholar]
  • 36.Lehman E. Testing Statistical Hypotheses. Springer; 2005. [Google Scholar]
  • 37.Emmert-Streib F. The chronic fatigue syndrome: A comparative pathway analysis. Journal of Computational Biology. 2007;14:961–972. doi: 10.1089/cmb.2007.0041. [DOI] [PubMed] [Google Scholar]
  • 38.R Development Core Team. R: A Language and Environment for Statistical Computing. 2008. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0.

Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES