Mathew & Perreault [1] analyse cross-cultural data from the Western North American Indian (WNAI) dataset [2] in order to compare ‘the relative effect of environment and cultural history’ on behavioural variation across 172 societies. This endeavour is inspired by many other evolutionary studies of human cultural variation [3–7]. Mathew and Perreault conclude that ‘social learning operating over multiple generations [is] the main mode by which humans acquire their behaviour’ (p. 5). Our own investigation of cultural macroevolution in the WNAI [8] motivated us to attempt to reconstruct their analyses. we found their paper to be undermined by questionable analytical choices, and computational and data-handling problems. We draw this conclusion having used the information in the Methods and electronic supplementary material S1, S3, S4 and S6 in [1] to recreate those parts of their study that we were able to. In this commentary, we present the results of our examination and detail the serious methodological flaws that lead us to conclude that a complete re-analysis is required by Mathew and Perreault. We also comment briefly on their conceptual schema, which, in trying to find ‘the main mode of human adaptation’ [1], appears to set cultural transmission (i.e. social learning) in opposition to environmental adaptation.
Mathew and Perreault use logistic regression to model 457 present/absent behavioural traits as a function of three dimensions—E (local ecological conditions), P (phylogenetic or linguistic distance to other societies) and S (spatial or geographical distance to other societies). To judge the relative importance of the E, P and S classes of predictors, Mathew and Perreault compare sums of absolute values of regression coefficients across classes, for the best model of each behavioural trait. The ‘summed absolute values' metric is used for various purposes in model and feature selection [9,10]. The metric is problematic here, however, because it compounds statistical signal with different sizes of the E, P and S classes. To demonstrate, consider a null case in which none of the predictors in E, P or S are related to a trait and the regression coefficients resemble stochastic noise. The analyst must understand how a statistical metric would behave in such a case and choose an inference procedure that reliably distinguishes null from non-null cases. For concreteness, assume that the coefficients share a common Gaussian distribution with mean zero and variance σ2. The absolute value of a coefficient β then has expectation
and for a class containing M predictors the summed absolute values have expectation
. The null expectation therefore scales linearly with class size M, and larger classes of predictors will appear to have greater relative importance based on representation alone. Although model selection criteria such as AIC (discussed below) include a penalty for the number of predictors in a model, this does not mitigate the confounding effects of class size [11].
To see how summed absolute values depend on the number of predictors contributing to the sum, consider fig. 3 of [1], which appears to show a time-depth effect of language. The numbers of predictors in phylogenetic classes, from the Level 8 class down to the Level 3 class, follow a descending sequence: M = 8, 8, 6, 4, 4, 3 (see electronic supplementary material S7 in [1]). This sequence is mirrored by the stair-step feature of the overall effect of phylogeny and is wholly consistent with null expectations. Thus, what Mathew and Perreault interpret as greater relative importance for predictors incorporating deeper history may reflect only the fact that the deeper-level classes contain more predictors—interpretation of the figure is precluded by confounding effects of class size. The ratios of importance values shown in their fig. 2 are also consistent with null expectations. For example, with M = 6, 8 or 3 (for classes E, P or S, respectively), the null expectation of the relative importance of cultural history over ecology, (S + P)/E, is approximately 11/6, yielding 1.4 after tan−1 scaling (see their fig. 2 caption). The value 1.4 (not 1.0) is thus a more appropriate null reference for fig. 2a in [1]. Boxplot medians in fig. 2a stay relatively close to this value. As 11 of the 17 (65%) predictors are assigned to cultural history, it is not surprising that the overall effect for this combined class is larger than that for ecology in 70% of the behavioural traits. In short, the relative strengths of E, P and S cannot be discerned as Mathew and Perreault intend without a correction for the number of predictors in each class.
The overall effects reported by Mathew and Perreault are also troubled by numerical instability. For instance, the outcome ‘salt added to food’ (r_tech_162) has absolute sums of 4.29 × 1015, 3.20 × 1015 and 1.05 × 1016 for E, P and S, respectively (see electronic supplementary material S6 in [1]). Models for at least 29 outcomes have extreme effect sizes, even after dividing by class size M. Logistic regression coefficients predict changes in the log-odds of a binary outcome per unit changes in the independent variables. For reasons described by Gelman et al. [12], large-magnitude coefficient estimates call for special scrutiny and may indicate underlying problems such as non-convergence of model-fitting algorithms, variable separation or multicollinearity. In fact, in reproducing their results, we found that 24 of 457 models failed to converge; 12 of these were among the models with extreme effect sizes. Rather than addressing such problems directly, Mathew and Perreault take measures such as truncating axes (figure 1 above, and figs 1 and 3 in [1]) and rescaling coefficients by a tan−1 transformation (fig. 2 in [1]), the latter apparently to control the infinite relative importance values that result from division by zero.
Figure 1.
Boxplots of summed absolute standardized β coefficients for S, P and E: (a) reproduction of fig. 1b in [1], without outliers and truncated at a summed value of 80 (omits 35 S, 83 P and 54 E values > 80); (b) modified figure of the same data, including outliers and truncated at a summed value of 800 (omits 13 S, 31 P and 18 E values > 800).
During our reconstruction of the analyses, we also uncovered a critical flaw in Mathew and Perreault's use of principal component analyses (PCA) to compute P and S dimensions. To determine the predictors in P and S, Mathew and Perreault pass a matrix of pairwise distances (spatial or linguistic) to the prcomp function in the R language [13]. This function is designed to operate exclusively on datasets consisting of sampling units (here, cultures) in rows, and measured attributes (features) of these units in columns. In their columns, Mathew and Perreault use relational data—pairwise distances—where prcomp expects cultural attributes. In figure 2, we demonstrate the consequences of this for the spatial components. To visualize analogous consequences for the phylogenetic components is a more challenging problem we have not solved. Given the functionality of prcomp, however, we expect similar distortions because the components are calculated using relational distances along a language tree, not primary attributes of the individual cultures.
Figure 2.
Spatial warping produced by PCA on a pairwise distance matrix: (a) cultures plotted by Lat/Lon coordinates, with points shaded by a latitudinal gradient; (b) cultures plotted by spatial PC scores on dimensions one and two (data obtained from electronic supplementary material S1 in [1]), with shading scheme carried over from panel (a). PCA on the pairwise distance matrix introduces artificial nonlinear relationships between cultures and compresses spatially distinct cultures along a parabolic curve. The dashed horizontal line at PC2 = 10 demonstrates that cultures well separated by latitude may share the same value on PC2. Shading by a longitudinal gradient produces similar graphs. Spatial relationships adequately described by latitude and longitude together are obscured in the PC space.
One alternative to PCA is principal coordinates analysis (PCoA), a related method specifically for pairwise distance matrices [14]. The first two dimensions of a PCoA applied to spatial distances closely recreate spatial configurations at moderate scales (see example 5.2 in [15]). For discrete, low-variation distances like those produced by the language tree, PCoA may not be as suitable. A second alternative to PCA makes direct use of phylogenetic and spatial distances in a parametrized variance–covariance matrix [16]. Whatever the relationships are between cultural traits, phylogenetic and spatial distances, they may be disrupted by inappropriate use of PCA.
Mathew and Perreault use the glmulti package in R to screen all 131 071 possible models for each trait and then analyse the coefficients of just the single best model—the model with the lowest AIC value (AICmin). Model comparison is a powerful tool for evaluating alternative models given the data [12,17]. However, given so many predictors, the best model is often one of a large set of models with very similar AIC values (e.g. Δi < 4, where Δi is defined as AICi − AICmin), and our analysis of their data suggests this is true for most traits. As detailed by Burnham & Anderson [11], model averaging combines information from a set of top models, and thus incorporates more sample information and produces more robust inferences (e.g. about the relative effects of E, P and S). Moreover, the coefficients associated with each predictor can be interpreted in the context of other predictors, even if the single best model leaves out a particular component. Alternatively, rather than screening all possible models, Mathew and Perreault could compare a priori sets of models with different combinations of components (e.g. EPS, EP, ES, PS, E, P and S) [18].
The accumulation and transmission of cultural knowledge are hallmarks of the human species [19], and cross-cultural variation demonstrates strong regional and linguistic patterning [20]. Likewise, both abiotic and biotic environments clearly shape and constrain human cultural behaviour (see [21] for a review and [22] for a detailed study of resource-defence polygyny in the WNAI). We thus question the central framing of Mathew and Perreault's study—as a test for whether human behavioural variation ‘is due to variation in the ecological environment or to differences in cultural traditions' (see abstract of [1]). Their apparent posing of non-cultural mechanisms against cultural mechanisms is problematic in two main ways. First, it fails to acknowledge that social learning is itself an adaptation to the patterning of environmental change over time [23]. Second, we do not believe that anyone has seriously proposed that the decisions humans make regarding behavioural strategies emerge de novo with each generation (Mathew and Perreault's ‘single-generation adaptive response’ caricature) [24–26], nor have they ever done so [21]. Mathew and Perreault undoubtedly recognize the complexities here, and in places acknowledge the complementarity of perspectives. Nevertheless, we caution that framing a paper as a debate detracts from the more interesting challenge, which is to determine the relative influences of different transmission mechanisms, recognizing that they operate in addition to—not as alternatives to—adaptation [3,4,19,25]. Furthermore, careful modelling in this area indicates that given the complexity of interacting factors, we are still far from being able to use patterns in space and time to pinpoint the mechanisms responsible for cultural diversity [27].
For over a century, anthropologists have endeavoured to understand the origins and maintenance of cultural variation across human societies [28], and an evolutionary ecological perspective such as that of Mathew and Perreault has much to contribute [3–8]. Although such investigations have proved to be methodologically challenging, the field has prospered as many alternative quantitative approaches have been developed, employed and compared. Unfortunately, Mathew and Perreault's analyses are hampered by numerous problems that make it impossible to draw any reliable conclusions from their results. In conclusion, we believe that the statistical and computing mistakes in Mathew and Perreault are serious enough to merit a complete re-analysis of the data, particularly given the importance of understanding the multiple mechanisms responsible for the origins and maintenance of cultural variation across human societies.
Acknowledgements
We thank Bret Beheim, Barney Luttbeg, Richard McElreath and Pete Richerson for their insights, with the proviso that the views expressed here are our own.
Footnotes
The accompanying reply can be viewed at http://dx.doi.org/10.1098/rspb.2016.0177.
Competing interests
We declare we have no competing interests.
Funding
We received no funding for this study.
References
- 1.Mathew S, Perreault C. 2015. Behavioural variation in 172 small-scale societies indicates that social learning is the main mode of human adaptation. Proc. R. Soc. B 282, 20150061 ( 10.1098/rspb.2015.0061) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Jorgensen JG. 1980. Western Indians: comparative environments, languages, and cultures of 172 Western American Indian tribes. San Francisco, CA: WH Freeman. [Google Scholar]
- 3.Borgerhoff Mulder M, Nunn CL, Towner MC. 2006. Cultural macroevolution and the transmission of traits. Evol. Anthropol. Issues News Rev. 15, 52–64. ( 10.1002/evan.20088) [DOI] [Google Scholar]
- 4.Henrich J, McElreath R. 2003. The evolution of cultural evolution. Evol. Anthropol. Issues News Rev. 12, 123–135. ( 10.1002/evan.10110) [DOI] [Google Scholar]
- 5.Lipo CP, O'Brien MJ, Collard M, Shennan SJ (eds). 2006. Mapping our ancestors: phylogenetic approaches in anthropology and prehistory. New Brunswick, NJ: Transaction Publishers. [Google Scholar]
- 6.Mace R, Holden CJ, Shennan S (eds). 2005. The evolution of cultural diversity: a phylogenetic approach. London, UK: UCL Press. [Google Scholar]
- 7.Brown GR, Dickins TE, Sear R, Laland KN. 2011. Evolutionary accounts of human behavioural diversity: introduction. Phil. Trans. R. Soc. B 366, 313–324. ( 10.1098/rstb.2010.0267) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Towner MC, Grote MN, Venti J, Borgerhoff Mulder M. 2012. Cultural macroevolution on neighbor graphs vertical and horizontal transmission among Western North American Indian societies. Hum. Nat. 23, 283–305. ( 10.1007/s12110-012-9142-z) [DOI] [PubMed] [Google Scholar]
- 9.Gawalt B, Jia J, Miratrix L, El Ghaoui L, Yu B, Clavier S (eds). 2010. Discovering word associations in news media via feature selection and sparse classification. In MIR '10: Proc. of the Int. Conf. on Multimedia Information Retrieval, Philadelphia, PA, 29–31 March 2010. Alpha, NJ: Sheridan. [Google Scholar]
- 10.Tibshirani R. 1996. Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. B 58, 267–288. [Google Scholar]
- 11.Burnham KP, Anderson DR. 2002. Model selection and multimodel inference: a practical information-theoretic approach. New York, NY: Springer. [Google Scholar]
- 12.Gelman A, Jakulin A, Pittau MG, Su YS. 2008. A weakly informative default prior distribution for logistic and other regression models. Ann. Appl. Stat. 2, 1360–1383. ( 10.1214/08-AOAS191) [DOI] [Google Scholar]
- 13.R Core Team. 2014. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. See; http://www.R-project.org/. [Google Scholar]
- 14.Cox TF, Cox MAA. 2000. Multi-dimensional scaling, 2nd edn New York, NY: Chapman & Hall/CRC. [Google Scholar]
- 15.Krzanowski WJ, Marriott FHC. 1994. Multivariate analysis, part 1: distributions, ordination and inference. London, UK: Edward Arnold. [Google Scholar]
- 16.Freckleton RP, Jetz W. 2009. Space versus phylogeny: disentangling phylogenetic and spatial signal in comparative data. Proc. R. Soc. B 276, 21–30. ( 10.1098/rspb.2008.0905) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Towner MC, Luttbeg B. 2007. Alternative statistical approaches to the use of data as evidence for hypotheses in human behavioral ecology. Evol. Anthropol. Issues News Rev. 16, 107–118. ( 10.1002/evan.20134) [DOI] [Google Scholar]
- 18.Beheim BA, Bell AV. 2011. Inheritance, ecology and the evolution of the canoes of east Oceania. Proc. R. Soc. B 278, 3089–3095. ( 10.1098/rspb.2011.0060) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Mesoudi A, Whiten A, Laland KN. 2004. Perspective: Is human cultural evolution Darwinian? Evidence reviewed from the perspective of the origin of species. Evolution 58, 1–11. ( 10.1111/j.0014-3820.2004.tb01568.x) [DOI] [PubMed] [Google Scholar]
- 20.Burton ML, Moore CC, Whiting JWM, Romney AK. 1996. Regions based on social structure. Curr. Anthropol. 37, 87–123. ( 10.1086/204474) [DOI] [Google Scholar]
- 21.Winterhalder B, Smith EA. 2000. Analyzing adaptive strategies: human behavioral ecology at twenty-five. Evol. Anthropol. Issues News Rev. 9, 51–72. () [DOI] [Google Scholar]
- 22.Sellen DW, Hruschka DJ. 2004. Extracted-food resource-defense polygyny in native western North American societies at contact. Curr. Anthropol. 45, 707–714. ( 10.1086/425637) [DOI] [Google Scholar]
- 23.Boyd R, Richerson PJ. 1985. Culture and the evolutionary process. Chicago, IL: University of Chicago Press. [Google Scholar]
- 24.Borgerhoff Mulder M. 2013. Human behavioral ecology: necessary but not sufficient for the evolutionary analysis of human behavior. Behav. Ecol. 24, 1042–1043. ( 10.1093/beheco/ars223) [DOI] [Google Scholar]
- 25.Nettle D. 2009. Ecological influences on human behavioural diversity: a review of recent findings. Trends Ecol. Evol. 24, 618–624. ( 10.1016/j.tree.2009.05.013) [DOI] [PubMed] [Google Scholar]
- 26.Nettle D, Gibson MA, Lawson DW, Sear R. 2013. Human behavioral ecology: current research and future prospects. Behav. Ecol. 24, 1031–1040. ( 10.1093/beheco/ars222) [DOI] [Google Scholar]
- 27.Crema ER, Kerig T, Shennan S. 2014. Culture, space, and metapopulation: a simulation-based study for evaluating signals of blending and branching. J. Archaeol. Sci. 43, 289–298. ( 10.1016/j.jas.2014.01.002) [DOI] [Google Scholar]
- 28.Boas F. 1896. The limitations of the comparative method in anthropology. Science. 6, 901–908. ( 10.1126/science.4.103.901) [DOI] [PubMed] [Google Scholar]


