Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Aug 4.
Published in final edited form as: Q J Exp Psychol (Hove). 2012 Oct 25;66(6):1113–1130. doi: 10.1080/17470218.2012.730533

Causal Essentialism in Kinds

Woo-kyoung Ahn 1, Eric G Taylor 2, Daniel Kato 3, Jessecae K Marsh 4, Paul Bloom 5
PMCID: PMC4119874  NIHMSID: NIHMS603321  PMID: 23098315

Abstract

The current study examines Causal Essentialism, derived from Psychological Essentialism of concepts (Medin & Ortony, 1989). We examine whether people believe that members of a category share some underlying essence that is both necessary and sufficient for category membership and that also causes surface features. The main claim is that Causal Essentialism is restricted to categories that correspond to our intuitive notions of existing kinds, and hence is more attenuated for categories that are based on arbitrary criteria. Experiments 1 and 3 found that people overtly endorse causal essences in non-arbitrary kinds but are less likely to do so for arbitrary categories. Experiments 2 and 4 found that people were more willing to generalize a member’s known causal relations (or lack thereof) when dealing with a kind than when dealing with an arbitrary category. These differences between kinds and arbitrary categories were found across various domains—not only categories of living things, but also for artifacts. These findings have certain real-world implications, including how people make sense of mental disorders that are treated as real kinds.


Some categories, like “robins,” and “chairs,” are intuitively more appropriate than other categories, like “things that weigh about 2 pounds, are white, and have a smooth surface” and “a group of mental disorder patients whose last names began with F” (see, e.g., Bloom, 2004; Gelman, 2003; Macnamara, 1986; Markman, 1989; Murphy & Medin, 1985). What are the assumptions that people have about these natural categories—or kinds? In this paper, we argue that it is the belief in causal essences that critically distinguishes between kinds and arbitrary categories. In what follows, we first define what we mean by a causal essence with reference to theories of Psychological Essentialism, and then we explain why kinds may elicit beliefs in a causal essence more so than arbitrary categories.

Psychological Essentialism

It has been argued that many concepts are not mere collections of correlated features, but rather are groupings based on shared causal mechanisms anchored in a category’s essence (e.g., Murphy & Medin, 1985; Medin & Ortony, 1989). For instance, people’s concept of birds may involve naïve theories about how having wings, flying, laying eggs, etc. are causally related and how some bird essence (e.g., “bird DNA”) must be responsible for these features. From this perspective, essences are believed to make categories what they are and cause their surface features (Bloom, 2004; Gelman, 2003; Locke, 1894/1975; Medin & Ortony, 1989). This belief is referred to as Psychological Essentialism (Medin & Ortony, 1989).

Results from many previous studies are consistent with Psychological Essentialism. For example, the fact that natural kinds are categorized together based on internal, deeper features, despite differences in surface features (Ahn, 1998; Gelman & Wellman, 1991; Keil, 1989) supports the importance of essences/essential features. Essentialism has been credited with explaining why people will endorse an object as a piece of art regardless of its appearance as long as the artist’s intention was to create art, calling upon a deeper underlying feature of the category of art (Gelman & Bloom, 2000). Furthermore, inferences that would follow from belief in an essence (e.g., biological bases, immutability, inductive potency, etc.) have been documented with domains as varied as social categories and mental disorder categories (e.g., Haslam, Rothschild, & Ernst, 2000; see Dar-Nimrod & Heine, 2011 for review). This variety of evidence has been used to support the claim that Psychological Essentialism is a pervasive cognitive belief. (But see also Kalish, 2002; Strevens, 2000 for evidence against Psychological Essentialism).

The current study focuses on one specific aspect of Psychological Essentialism, namely whether people explicitly endorse a causal essence (Gelman, 2003). We examine whether people believe that members of the same kind share something that causes surface features of the kind, and that this thing is necessary and sufficient for category membership. We call this specific claim Causal Essentialism in order to distinguish it from other claims made under Psychological Essentialism. While belief in a causal relationship between an essence and surface features, or Causal Essentialism, is considered one of the most crucial elements of an essence (Dar-Nimrod & Heine, 2011; Medin & Ortony, 1989), few previous studies have directly measured whether people explicitly believe across a variety of categories that what causes surface features is an essence.

Causal Essentialism in Kinds

The main claim of the current study is that Causal Essentialism is restricted to kinds—to collections of individuals that fit with people’s intuitive notions of natural and non-arbitrary categories (see also Prasada, Hennfield, & Otap, 2012). For instance, a type of viral infection in humans and birds and leads to flu symptoms would have been considered an arbitrary category before 1997. However, the discovery of a common viral cause resulted in the establishment of this grouping as a true kind called avian flu. We postulate that since avian flu came into existence, people now believe that it has a particular causal essence, which causes its surface features and is necessary and sufficient for an instance of the flu to be referred to as avian flu.

We also suggest that even when people do not know precisely what these essences are (Medin & Ortony, 1989), they may serve as “placeholder” explanations for why kinds are coherent and real. That is, a person could infer that a certain category must have a causal essence simply because it is accepted as a kind; this essence, whatever it is, is necessary for explaining its coherence. For instance, in the early 1900’s it might have appeared to be a pure coincidence that a group of people tended to display “unstable interpersonal relationships, affective distress, marked impulsivity, and unstable self-image.” Yet, this is now a category listed in the Diagnostic and Statistical Manual of Mental Disorders (4th Ed., American Psychiatric Association, 2000) as Borderline Personality Disorder. In the case of Borderline Personality Disorder, however, the underlying cause is still unknown. But the fact that it has been accepted in an official manual might be sufficient to encourage people to assume that there must be a shared causal essence. The current study tests this possibility by experimentally manipulating whether categories are true kinds or arbitrary categories, and observing how such manipulations influence Causal Essentialism. We predict that simply believing that a certain category is a kind would be sufficient to trigger Causal Essentialism.

The current study also examines whether the assumption that a category is a kind leads to belief in shared causal mechanisms among category members. Recent studies (e.g., Ahn, Kim, Lassaline, & Dennis, 2000; Hadjichristidis, Sloman, Stevenson, & Over, 2004; Lassaline, 1996) have demonstrated that causal relations between category features determine which features are considered more central and immutable and which features are more projectable. A crucial premise behind these studies is that members of the same category share causal relations, and it is these shared causal relations that determine feature centrality or property projection. Surprisingly, however, few studies have verified the psychological validity of this premise.

We propose, then, that people believe that an underlying essence governs how the features of a category are causally connected to each other, and because category members share a causal essence, they would also share similar causal relations. While previous theorists have argued that essentialized categories are perceived to be homogeneous with respect to individual features (Gelman, 2003; Medin & Ortony, 1989; see also Gelman & Markman, 1987; Haslam, Rothschild, & Ernst, 2000; Yzerbyt, Corneille, & Estrada, 2001; for empirical demonstrations), few have assumed that they would be also homogeneous in terms of causal structures. The basis of our proposal is the idea that surface features caused by the different essences may be perceived to have different causal implications. For example, people may believe that vegetables that are genetically modified lead to fewer health benefits and taste worse than vegetables that are naturally produced. Even if both types of vegetables are in fact chemically identical, their causal implications may be thought to differ if they have different causal essences (for supporting evidence, see Rozin et al., 2004). Conversely, surface features derived from the same causal essences may lead to beliefs in shared causal structures among surface features.

Manipulating Kinds versus Arbitrary Categories

Having explained the core distinction between kinds and arbitrary categories, and the implications of that distinction for Causal Essentialism and beliefs in shared causal mechanisms, we now discuss how kinds and arbitrary categories are experimentally manipulated in our study. This discussion will also clarify features that would or would not distinguish between kinds and arbitrary categories.

In our experiments, we developed a number of artificial categories consisting of three features (see Figure 1 for sample stimuli), which are used for both kinds and arbitrary categories. To manipulate a certain category as a kind, we simply indicated it using a known superordinate kind. For instance, participants were told, “There is a kind of animal called an egoogle,” “There is a mental disorder called BLV,” where italics, although not used in the experimental materials, indicate the known superordinate kinds. In contrast, arbitrary categories are not generally accepted among people and could have been constructed by a person on an idiosyncratic basis. In order to make this feature explicit in our manipulations for Experiments 1 and 2, we used the same category features as the ones used for kinds, but designated a non-expert of a domain (someone who is highly unlikely to possess generally accepted or valid knowledge of the domain) as an inventor of the category. For example, we stated for an arbitrary category, “A high school student was searching for animals on the web using the google search engine. He labeled a group of animals displayed on the even-numbered pages ‘egoogles’.” For Experiments 3 and 4, we indicated arbitrariness of categories by noting that the shared features are mere statistical co-occurrence (see item (4) of Figure 1).

Figure 1.

Figure 1

Sample Stimuli used in Experiments 1–4.

Note that some dimensions do not necessarily distinguish between kinds and arbitrary categories. First, as shown in Figure 1, both kinds and arbitrary categories can be constructed based on correlated features. Correlated features can lead people to infer the existence of a common cause (a causal essence) because such featural co-occurrence would otherwise be too much of a coincidence (e.g., Gelman, 2003; Markman, 1989). Still, in order to demonstrate that the differences in Causal Essentialism can be obtained merely due to differences in the kind vs. arbitrary category distinction, we used the same correlated features for both kinds and arbitrary categories.

Second, we used labels (e.g., “egoogles”) for both kinds and arbitrary categories in Experiments 1 and 2. Past research has shown effects of labels on essentialist beliefs (e.g., Yamauchi, 2005). As shown in Figure 1 (items (1) and (3)), however, the labels were used in two different ways so that the core distinctions between the two types of categories would be intact. For arbitrary categories the labels (e.g., egoogles) were shorthand for the criteria (e.g., even number of pages of google search) so that we can retain the arbitrariness of the categories. For non-arbitrary kinds the labels were purely nominal and have no meaning as in other existing kinds. We predict that labels per se would not be responsible for differences in Causal Essentialism between the two types of categories, so the predicted differences would be obtained even though labels are used for both versions. (But see the General Discussion for further discussion of effects of different types of labels.)

Third, in previous research natural kinds tend to be equated with real categories that need to be discovered in nature, whereas artifacts or conventionally established categories tend to be equated with nominal kinds (e.g., Schwartz, 1979). Yet, the distinction between kinds and arbitrary categories is not necessarily based on whether categories or category members are natural or human-made (except that by definition arbitrary categories must be invented based on arbitrary criteria). For instance, arbitrary categories can consist of animals (e.g., animals displayed on the even-numbered pages of google search results). Also, non-arbitrary kinds can consist of human-made objects (e.g., chairs) and can be socially constructed (e.g., bachelors). For these reasons, in Experiments 1 and 2 we used a variety of domains for both kinds and arbitrary categories.

Predictions and Overview of Experiments

To recapitulate, we argue that people’s notions of kinds include beliefs in shared causal essences and causal mechanisms. Thus, we predict that merely knowing that a category of certain things is a kind is sufficient to increase people’s endorsement of causal essences and generalization of causal mechanisms across members of the category.

Throughout four experiments, we asked participants to make judgments about kinds and arbitrary categories. Arbitrary categories were constructed by indicating that non-experts invented those categories based on arbitrary criteria (Experiments 1 and 2) or statistical coincidence (Experiments 3 and 4). Then, we measured the extent to which people explicitly endorsed a causal essence (Experiments 1 and 3) and the extent to which they generalized the presence or absence of causal relationships found in a category member to all other category members (Experiments 2 and 4).

We predicted that Causal Essentialism would be observed in kinds and would be greatly attenuated for arbitrary categories even though they shared the same labels (Experiments 1 and 2) and correlational structures (Experiments 1–4). We predicted this finding would be true across various domains.

Experiment 1

Novel categories were developed from each of four domains: mental disorders, medical disorders, living things and artifacts. For each category within a domain, we developed two versions: a non-arbitrary kind version and an arbitrary category version. The goal of Experiment 1 was to empirically validate our intuition by asking participants to judge the extent to which they ascribe a causal essence to each category. That is, we tested whether merely mentioning a category as a kind is sufficient to trigger Causal Essentialism, whereas describing a category as constructed using a non-expert’s criterion is not.

Method

Thirty-one Yale University undergraduates participated in this experiment in partial fulfillment of an introductory psychology course’s requirements or for monetary compensation. Of these, 16 participated only in this experiment, and the other 15 participated in this experiment after completing other categorization experiments. There was no significant interaction effect involving these two groups of participants, so all data are collapsed in the following analyses.

Two categories, each consisting of three characteristic features, were developed from each of four domains: mental disorders, medical disorders, living things and artifacts (see Appendix A for a complete list of features for eight categories). The eight categories were divided into two sets (i.e., Set 1 and Set 2), with each set containing one category from each of the four domains. From Appendix A, Set 1 contained FFL (mental disorder), SS7 (medical disorder), egoogle (living things), and notodd (artifact), and Set 2 contained the rest of the categories in Appendix A. Half of the participants received kind versions of Set 1 and arbitrary category versions of Set 2, whereas the other half received kind versions of Set 2 and arbitrary category versions of Set 1. Presentation of kind/arbitrary category versions was blocked and counterbalanced such that half of the participants rated kinds first and the other half rated arbitrary categories first. Within each block, the order of the four categories was randomized across participants.

For each category, participants were asked whether they believed that there was a causal essence (i.e., defining feature that also causes the other features of the category members), whether or not they knew what that essence might be. For instance, for FFL, they were asked, “Do you think there is something that is shared by all and only FFL patients that also causes the other features of FFL patients (whether or not we know what that thing is)?”1

Participants answered each question by typing a number on an 8-point scale ranging from 1 (Strongly Disagree) to 8 (Strongly Agree). The experiment was programmed using the RSVP computer program and run on Macintosh computers. Before starting the task, participants were instructed, “Please read the descriptions of each given category carefully before you make a judgment. There are no right or wrong answers in this task; we are simply interested in your own thoughts. When you arrive at a decision, please enter your response in the box at the top of the screen.” Participants proceeded through the experiment at their own pace.

Results and Discussion

There was no effect of presentation order so all results are reported with data collapsed across the two orders. As predicted and shown in Figure 2, participants more strongly endorsed causal essences for the kind versions (M= 5.77, SD=1.07) than for the arbitrary category versions (M=3.65, SD=1.88), although both versions contained identical sets of characteristic features. A 2 (Category Type: kinds versus arbitrary category) X 4 (domains) repeated-measures ANOVA found a significant main effect of Category Type, F(1,30)=41.32, p<.01, η2=0.58. There was no significant main effect of Domain, p>.50.

Figure 2.

Figure 2

Mean Essentialism Ratings of Kind and Arbitrary category Versions Across Domains. Error bars are standard errors of the mean.

The interaction between Domain and Category Type was significant, F(3,90)=3.86, p<.05, η2=0.11. Pairwise t-tests found that this interaction effect was because the effect of Category Type was larger in the medical domain than in the other three domains, all p’s<.05, presumably because of lay medical knowledge about essences (e.g., viruses). No other between-domain differences were found. In fact, the difference between the kind and arbitrary category conditions was in the same direction in all four domains, with all p’s<.001. Furthermore, the mean ratings for the kind versions of all four domains were significantly greater than the mid-point of the scale (4.5), all t’s>3.87, all p’s<.001, indicating that participants endorsed causal essences for all four domains.

One limitation of Experiment 1 is that it is not clear whether the significant differences between the two types of categories were obtained because kinds are believed to be homogeneous, as we have argued, or because the arbitrary categories’ criteria could have discounted the likelihood of an essence. To explain, note that for arbitrary categories the common possession of an arbitrarily-selected, nominal criterion determines membership, and thus, explains why all members are included in the category, regardless of what features members happen to have in common. The explanation provided by the arbitrary defining criterion for why the category includes the members it does could have ruled out the need to posit the existence of some common, underlying causal essence. Thus, the mechanism here is akin to the well-known discounting principle where a known cause (e.g., being invented based on a certain criterion) discounts the likelihood that an alternative cause (e.g., an essence) is also true (Kelley, 1972).

On one hand, the use of arbitrary criteria to explain the grouping for arbitrary categories was practically unavoidable because participants would be highly unlikely to believe that a group of, say, animals, sharing three features are not the same species of animals. That is, it was methodologically needed in order to make the arbitrary categories truly arbitrary. Yet, to further ensure the generality of the effects of kinds, Experiments 3 and 4 use different stimuli for arbitrary categories, which do not involve explicitly stated arbitrary criteria.

Experiment 2

Experiment 2 used the same stimuli from Experiment 1 and tested the generalizability of causal mechanisms observed in a category member (either a causal-chain or lack of any causal relations) across all other members in the same category. For instance, given the egoogle description shown in passages (1) and (3) in Figure 1, participants in the causal condition were further told that in a particular egoogle, it was found that the three features formed a causal chain (e.g., “the fact that it eats weeds causes it to smell bad, and because it smells bad, it has no natural predators”). Participants in the non-causal condition were told that these features are completely unrelated to each other. Then, all participants were asked to estimate the likelihood that other members of the same category would display the same pattern of causal relationships or lack of causal relationships. We predicted that participants would generalize causal mechanisms more with true kinds than with arbitrary categories.

If generalization of causal structure depends on the type of categories, it would provide novel insights into the causal learning literature. Traditionally, covariation has been considered one of the most important cues to causality (e.g., Cheng, 1997). Given that both kind and arbitrary categories in our experiments contain identical patterns of correlation among features, there is no reason why a known causal structure would be more or less likely to be generalized to other exemplars simply because of the category that the exemplars belong to. Nonetheless, as mentioned in the introduction, we predict that Causal Essentialism, more likely to be present in kinds, would determine people’s willingness to generalize causal structure within a category. The reason for such a prediction is that features have different causal implications depending on what caused them. As alluded to earlier, naturally produced foods are believed to be safer, healthier, and tastier than foods that are created or modified through human intervention, even if people are told that the natural food is chemically identical to the artificial one (Rozin et al., 2004). Intended crimes are treated differently from the identical acts of crime that were not intended (Weiner, 1986), most likely because of differences in the causal implications for the future. Thus, if surface features are believed to share the same underlying causal essences, they may be also believed to share other causal relations as well.

Method

Ninety-six people from Amazon’s Mechanical Turk website (https://www.mturk.com) participated for $1.50. The benefits and reliability of experimental data collected from Mechanical Turk have been previously documented (Paolacci, Chandler, & Ipeirotis, 2010).

The two versions (kind versus arbitrary category) of the eight categories used in Experiment 1 were used. As in Experiment 1, for each scenario, participants first learned whether or not a category is a kind or an arbitrary category (e.g., the first row of light grey boxes in Figure 3), followed by the three characteristics features of the category (e.g., the second light grey box in Figure 3). Then, one member from the group who had all three symptoms was singled out and its causal mechanism was described (e.g., the third light grey box in Figure 3). In the “causal” versions, this member’s features were described as forming a causal chain. In the “non-causal” versions, the features were described as causally unrelated to one another. Participants were then told to consider other members of the same category (e.g., the fourth row in Figure 3), and to judge the likelihood that they would also display the same causal pattern or the same lack of causal relations (e.g., the last row in Figure 3) on a 9-point scale (1: very unlikely to 9: very likely). Thus, the design of Experiment 2 was a 2 (Category Type: kind or arbitrary category) X 2 (causal versus non-causal conditions). Although all four domains of categories were used across participants, this factor was not the main interest in this study and was not fully crossed with the two main independent variables, as explained below.

Figure 3.

Figure 3

Each of the four versions of the egoogle category used in Experiment 2. The versions differed in (i) whether the category was a kind or an arbitrary category, and (ii) whether or not the features of the category member singled out were described as causally related. Versions of the other categories differed similarly.

As in Experiment 1, the 8 categories were divided into Set 1 and Set 2. Half of the participants received the kind versions of Set 1 and the arbitrary category versions of Set 2, whereas the other half received the kind versions of Set 2 and the arbitrary category versions of Set 1. As in Experiment 1, presentation of kind/arbitrary category versions was blocked and counterbalanced.

Within each block/set, two of the items were causal descriptions and two were non-causal. The presentation of items within a set was blocked by the causal vs. non-causal factor. Half of the participants received the causal description for the medical disorders and living things domains, and non-causal description for the mental disorders and artifact domains. The other half received the opposite description type for each domain. To clarify with an example, one participant received the causal versions of the medical disorder FFL and the animal egoogle, but the non-causal versions of the mental disorder SS7 and the artifact notodd. Another participant received the opposite description type for each of these categories.

Results and Discussion

The results differed as a function of block, so we restricted our analyses to the data from the first block only. As a result, the kind versus arbitrary category became a between-subject variable.

Figures 4a and 4b show the mean generalization ratings separated by domain, for the causal and non-causal items, respectively. As predicted, participants were more likely to generalize causal relations for kinds (M=8.07, SD=1.15) than for arbitrary categories (M=6.35, SD=2.16). They were also more likely to generalize the lack of causal relations for kinds (M=6.48, SD=2.00) than for arbitrary categories (M=5.21, SD=2.20).

Figure 4.

Figure 4

Figure 4

Figures 4a and 4b. Mean Generalization Ratings of Kind and Arbitrary category Versions for the four domains, separately for the causal and non-causal items. Error bars are standard errors of the mean.

A 2 (Category Type: kind versus arbitrary category) X 2 (Causality: causal versus non-causal conditions) mixed ANOVA was conducted with Category Type as a between-subjects variable and causality as a within-subjects variable. The main effect of Category Type was significant, F(1,94)=32.40, p<0.01, η2=0.26, supporting the hypothesis that non-arbitrary kinds would lead to more causal generalization than arbitrary categories. The main effect of Causality was also significant, F(1,94)=59.06, p<0.01, η2=0.38, suggesting that people were overall more likely to generalize the causal chain explanation (M=7.17, SD=1.94) than the lack of any causal relations (M=5.82, SD=2.20). The interaction between Category Type and Causality was not significant, F(1,94)=1.61, p=0.21, η2=0.01.

Note that our design allows for a within-subjects test of the main effect causal vs. non-causal, but not a within-subjects test of the interaction between causality and domain. An item analysis testing the interaction between causality and domain would be also limited since there are only a small number of items. We simply note that the differences between kinds and arbitrary categories are all in the same direction across all four domains as shown in Figures 4a and 4b.

Experiment 3

So far, we have found that people are more willing to endorse a causal essence and generalize a causal mechanism of a single member to other members in a kind than in an arbitrary category. As discussed earlier, however, one may argue as we do that these differences indicate an effect of non-arbitrary kinds (i.e., a kind being more homogeneous in terms of causal structure), or alternatively, an effect of arbitrary categories we used (i.e., the presence of arbitrary category criterion, such as last name beginning with an F, reducing the homogeneity of categories in terms of causal structure). In order to rule out the second possibility, we used a different strategy in Experiments 3 and 4. To explain this new strategy, we first explain the methodological difficulties in developing arbitrary categories below.

The reason why an arbitrary category criterion was specified in the arbitrary category conditions of Experiments 1 and 2 was that without such an arbitrary criterion, people may automatically assume that the category is a true kind, since it is indeed too much of a coincidence that a group of things from a common superordinate category shares three features. Such a tendency would be particularly strong for living things or artifacts, since surface similarities strongly imply underlying shared essences (Medin & Ortony, 1989). In order to undermine such automatic inferences, we spelled out the arbitrary category criterion for participants.

In Experiments 3 and 4, we instead restricted our stimuli to the mental disorder domain, because this domain appears to be a plausible case in which a group of people can share three characteristic features without necessarily having to share an actual known category membership. That is, while people may easily believe that two plants that both have sticky leaves, absorbed iron, and produced red flowers are the same kinds of plants, two people who both have difficulty remembering new information, require excessive attention, and always choose solitary activities may not necessarily suffer from the same disorder because each of these symptoms could be caused by different disorders or no disorder at all. Thus, the domain of mental disorders allows us to create arbitrary categories without resorting to describing arbitrary category criteria. Furthermore, restricting the stimuli to the domain of mental disorders does not appear to greatly compromise the generality of the results, as we did not find significant domain differences between the mental disorder categories and living things or artifacts in Experiments 1 and 2.

Figure 1 shows a sample stimulus used in this experiment. In order to make the arbitrary categories as arbitrary as possible, we did not use a category label. Also, to make it clear that sharing three symptoms does not necessarily mean having the same mental disorder, we created the passage emphasizing that some people have only one of the three symptoms, some have two of the three symptoms, and finally, there may be a group of people who happen to have all three symptoms. In addition, we specified the sample size for both kinds and arbitrary categories to be 500 people so that the potential inferred category size was not a confound.

The symptoms used in Experiment 3 (as well as Experiment 4) were developed by Ahn, Novick, and Kim (2003), and are shown in Appendix B. Within each category, the three symptoms are taken from three different existing mental disorders according to the DSM-IV (APA, 2000) so that they are highly unlikely to activate concepts of existing mental disorders. This is an important control for the arbitrary category condition. To further ensure the arbitrariness of the collection of the three symptoms for the manipulation of arbitrary categories, the symptoms we used were previously judged to be unlikely to have causal relations among them (Ahn et al., 2003). In particular, this second measure was crucial in avoiding ceiling effects for Experiment 4 where generalizability of causal relations among symptoms was judged.

Method

Twenty people from Amazon’s Mechanical Turk website (https://www.mturk.com) participated for $1. For stimuli, four sets of three characteristic symptoms were taken from the implausible causal condition of Experiment 1 in Ahn et al. (2003; see Appendix B for the symptoms). Each set of symptoms was used to create two versions, which differed in whether or not the characteristic symptoms were described as being diagnostic of a known mental disorder (see the bottom of Figure 1 for an example).

All items began with a description of the characteristic symptoms. As before, in the kind versions, individuals were described as sharing a known mental disorder (e.g., “There is a mental disorder called BLV that about 500 people have.”) In the arbitrary category versions, the symptoms were described but there was no mention of a mental disorder. Instead, participants were walked through how there can be a group of people, all of whom happen to share three symptoms (see Figure 1). After participants read the description of each category, they made a causal essence judgment. Specifically, they were asked, “What is the likelihood that there is a single cause underlying these three symptoms that all and only [these individuals] have (whether or not we know what that cause is)?” Ratings were made on a scale of 1 (highly unlikely) to 9 (highly likely).

Each participant viewed all four sets, but which version was given for each set was counterbalanced. Ten participants received the kind versions of BLV and YNA shown in Appendix B, and the arbitrary category versions of the other two sets. The remaining ten participants received the opposite versions of each disorder. Within these groups, half of the participants viewed the kind items first, and the other half viewed the arbitrary category items first. Within each block (i.e., the kind or the arbitrary category), the order of the items was determined randomly for each participant.

Results

The results did not differ as a function of block, so the results were collapsed across the two orders. As predicted, participants were more likely to attribute a causal essence to individuals if the category was a kind (M=4.98, SD=1.67) than individuals with the same symptoms but without any indication that these individuals share a proper mental disorder (M=3.83, SD=1.71), t(19)=2.94, p<0.01, d=0.66.

Experiment 4

Using the stimuli developed for Experiment 3, Experiment 4 tested whether participants were more likely to generalize the causal structure (either the causal chain or no causal relations) observed in a single category member across the entire category as in Experiment 2.

Method

Seventy-four people from Amazon’s Mechanical Turk website (https://www.mturk.com) participated for $1. The same mental disorder symptoms and categories from Experiment 3 were used. As in Experiment 2, the design was a 2 (Category Type: kinds versus arbitrary category) X 2 (causal versus non-causal conditions). The manipulation of the Category Type was the same as in Experiment 3. The manipulation of causal versus non-causal conditions was the same as in Experiment 2. Thus, after reading about a given category, participants read about an individual who belonged to the category. The individual’s symptoms were either causally related, forming a causal chain (causal condition), or were not related to each other at all (non-causal condition; See Figure 5 for sample stimuli). Then, participants rated the likelihood that others with same three symptoms displayed the same pattern of inter-feature causal relationships (either the causal chain or the lack of any causal relations). Ratings were made on a scale of 1 (highly unlikely) to 9 (highly likely).

Figure 5.

Figure 5

Each of the four versions of the BLV category. The versions differed in (i) whether or not the individuals were described has sharing a known mental disorder, and (ii) whether or not the symptoms of the individual singled out were described as causally related. Versions of the other categories differed similarly.

Each participant viewed all four categories, but each with a different version. Thirty-eight participants received the kind versions of BLV and YNA, and the arbitrary category versions of the other two sets. The remaining thirty-six participants received the opposite versions of each disorder. Within these groups, half of the participants received the causal version of BLV and FFL, and the non-causal version of YNA and SSJ. The other half received the opposite versions of each disorder.

The items were blocked by the Category Type factor. Thirty-eight participants received the kind items first and the arbitrary category items second. The remaining thirty-six participants received the items in the reverse order. Items within blocks were always presented in the following order: BLV before FFL, and YNA before SSJ, such that half of the participants received the causal items first and the other half received the non-causal items first.

Results and Discussion

The results differed as a function of block, so we restricted our analyses to the data from the first block only, making Category Type a between-subject factor. As predicted and shown in Figure 6, participants were more likely to generalize causal relations for individuals with a common mental disorder (M=6.45, SD=1.77) than for individuals with the same symptoms but no known mental disorder (M=5.53, SD=2.56). They were also more likely to generalize the lack of causal relations for individuals with a common mental disorder (M=6.00, SD=2.18) than individuals with no known disorder (M=5.39, SD=2.23).

Figure 6.

Figure 6

Mean generalization ratings for the descriptions in Experiment 4. Error bars are standard errors of the mean.

A 2 (Category Type) x 2 (causal vs. non causal) mixed ANOVA with Category Type as a between-subjects variable and causality as a within-subjects variable revealed a significant main effect of Category Type, F(1,72)=4.16, p=0.045, η2=0.05, supporting the hypothesis that generalization would be higher for individuals with a known mental disorder. There was no significant main effect of causality, F(1,72)=0.74, p=0.39, η2=0.01, and no significant interaction, F(1,72)=0.20, p=0.66 η2=0.003, meaning that the effect of Category Type held for both generalizations of causal structure and the lack of causal structure.

General Discussion

Throughout four experiments, we found that when categories are described as true kinds, people are more willing to endorse Causal Essentialism—that members share some underlying essence that is both necessary and sufficient for category membership and that causes surface features. In addition, they were more willing to generalize a member’s known causal relations or lack thereof to other members of the same kind. Such inferences were much weaker for arbitrary categories. These differences between kinds and arbitrary categories were found across various domains.

Implications for Psychological Essentialism

These findings lend support to Psychological Essentialism by providing the first empirical evidence that people do explicitly ascribe causal essences to kinds across a variety of domains. These findings challenge Strevens’ (2000) minimal hypothesis. Strevens (2000) has argued that previous findings that people categorize based on deeper features such as intentionality or heritage (e.g., Gelman & Bloom, 2000, Gelman & Wellman, 1991) do not necessarily require postulation of an essence. Instead, he claims that these results could be fully explained by a minimal hypothesis that there is something that causes surface features, which does not have to be an essence in that it does not have to be a defining feature of the category. (See also, Hampton, Estes, & Simmons, 2007; Pothos & Hahn, 2000.) Contrary to this claim, however, our participants did overtly endorse causal essences for kinds.

We also found such endorsement across various types of kinds—not only with living kinds, but also artifacts. Previous studies (e.g., Diesendruck & Gelman, 1999) showed categorization of artifacts is more graded than that of living kinds, suggesting that artifacts are believed to lack defining features. (See also, Hampton, Estes, & Simmons, 2007; Pothos & Hahn, 2000.) However, our results show that people seem to believe that many types of kinds, including artifacts, have a necessary and sufficient essence that determines surface features (see also Bloom, 1996, 2004). This apparent contradiction is also present in the findings by Brooks, Squire-Graydon, and Wood (2007), where people endorsed defining features for family-resemblance categories, which clearly lacked defining features.

One cautionary remark is that although our findings replicated across several domains, we are not claiming that all kinds are equally essentialized. There are at least two reasons for this precaution.

First, Psychological Essentialism is a multi-faceted claim. For instance, Haslam et al. (2000) measured five dimensions associated with essentialism (naturalness, stability, discreteness of category boundaries, immutability, necessity of category features) and found large variance among social categories (e.g., gender, age, occupation, politics) along these measures. We only measured belief in causal essences defined as a common feature that causes other surface features, and we found similar effects across four domains often discussed in the literature. However, there may be differences in our essentialism scale for other domains that we did not examine, or there may be differences along other dimensions of essentialism that Haslam et al. (2000) described.

Second, we used artificial categories, which allowed us to strictly equate the number of shared features within each category. We found that when controlling for surface similarities, information about domains did not seem to have an effect powerful enough to lead to differences in inferring causal essences (with an exception of the results from the medical disorders in Experiment 1). Nonetheless, real-life categories may vary in homogeneity in terms of shared features, and differences in surface similarities can lead to different degrees of essentialism across different categories.

Real-life Implications

In addition to making novel empirical contributions in terms of theories of Psychological Essentialism, the current findings have numerous real-life implications. In a nutshell, the current studies demonstrate the assumptions that people would make upon learning that a new, real category has come into understanding and has become lexicalized, which is a strong sign that a category is broadly accepted as appropriate. For instance, nearly 100 new words are added to Merriam-Webster Collegiate Dictionary each year, including such words as “mouse potato” and “avian influenza” for the year 2006, and “green-collar” and “webisode” for the year 2009. As a category transforms from an arbitrary category status to a true kind, users may spontaneously infer that the category has a causal essence and that members share similar causal structures. (See Dar-Nimrod and Heine, 2011 for a recent review of consequences of “genetic essentialism” on prejudice and stereotypes involving social categories and mental illnesses.)

To make this implication more concrete, consider the classification system of mental disorders, the DSM. Since the manual came out in 1952, the number of mental disorders grew from about 60 in the first version (APA, 1952) to over 400 today (APA, 2000; see also Houts, 2002). For instance, Post-traumatic Stress Disorder was not officially recognized as a mental disorder until 1980 (DSM-III, APA, 1980). The next version of the DSM is expected to come out in 2013, and there are proposals for more new mental disorders, such as Premenstrual Dysphoria, Attenuated Psychosis Syndrome, and Catatonic Disorder, to name a few (http://www.dsm5.org/Pages/Default.aspx).

When a new mental disorder is added, people may subsequently make assumptions about the disorder and members of that category. The current study suggests that people are more likely to believe that the symptoms result from a single cause, that all patients with the disorder have this cause, and that the causal relations among symptoms are similar among patients with these disorders. If so, whether these assumptions are actually empirically warranted is an important issue to be considered by the committee that determines what mental disorders to be included in the manual.

Issues for Future Research

There are a number of open questions remaining for future research. In the current Experiments 1 and 2, we used labels for both arbitrary categories and kinds, but as discussed earlier, the relationship between the labels and the categories differed depending on the condition. The label-category relationship was strictly nominal for kinds such that the labels were not linked to any specific features of the categories, whereas for arbitrary categories, the labels were a kind of abbreviation of the arbitrary criteria used for constructing the categories (e.g., egoogle for even number pages of google search). The reason why we chose this approach for arbitrary categories is that if we had used nominal labels, it may have signaled to the participants that the categories were kinds.

This issue leads to a more theoretically interesting question of what factors would turn a certain grouping into a kind. As discussed above, nominal labels may signal that a grouping is a kind. Another factor that was incorporated in the current experimental manipulation was the expertise of the inventor of the categories; had the creator of the categories been a true expert of the domain, even arbitrary categories may have been thought of more so as kinds. Future research can examine more systematically which factors that were manipulated in the current study to distinguish between kinds versus arbitrary categories are necessary or sufficient for the effect, and whether there are any other factors that would activate the belief that a grouping is a kind.

Relatedly, correlated features have been suggested as one of the most powerful cues to a common cause (an essence), and thus, highlighting correlated structures may readily induce the belief that a grouping is a kind. In the current study where all categories contained correlated features, we were concerned about the possibility of inference to kinds based on correlated features so that we ended up with longer descriptions for arbitrary categories than for kinds, as we chose to explicate at length the arbitrariness of a category in order to prevent the inference. While we do not believe that the differences in lengths would necessarily have been a confound responsible for the observed differences in Causal Essentialism, it would be interesting to examine the validity of our original concern; namely, whether the extent to which features are correlated may moderate not only beliefs in causal essences, but also beliefs in kinds.

Another open question is which categories, arbitrary categories or kinds, serve as a baseline for Causal Essentialism. It is possible that people as a default believe categories have essences until they are provided evidence to the contrary. In this way, the differences we found between kinds and arbitrary categories in Causal Essentialism ratings may be because the use of an arbitrary distinction lowers essentialism beliefs. Alternatively, people may believe by default that categories lack a Causal Essence. In this case, providing the kind labels may elevate essentialism-like responding over baseline. Finally, it is possible that our design moved participants in both directions from a more neutral baseline, with the description of kinds elevating ratings and the description of arbitrary categories lowering ratings. This is an issue for future research to determine how people’s default essentialism beliefs interact with the arbitrary status of categories.

In addition, future research can examine the relationship between two sets of results we found with kinds; people believe that members in a true kind tend to share a causal essence and also causal structures. The exact mechanism between these two inferences is unclear. For instance, does the generalization of causal structures depend on inferences to causal essences or does generalizing a causal structure create the inference of a causal essence? Alternatively, these two inferences may be independent of each other, and not as we have assumed in the current research two characteristics of the same construct, Causal Essentialism.

Finally, while the current study found that making a category a kind led to Causal Essentialism, it is possible that inferring a causal essence may make a category more likely to be believed to be a kind. That is, people may sometimes infer causal essences of a category first, which in turn causes them to treat the category as a kind. Similarly, people may notice that members of a certain category share similar causal relations, which may then cause people to believe that the category is a kind. For instance, upon observing that two different groups of patients with somewhat different symptoms respond to the same kind of treatment, one may postulate that these patients actually share the same underlying disorder. This way, causal knowledge would play a critical role in determining what counts as kinds.

Supplementary Material

table for appendix a

Appendix A. Stimuli Used in Experiments 1 and 2.

table for appendix b

Appendix B. Stimuli Used in Experiments 3 and 4.

Footnotes

1

One may be concerned that asking about judgments of defining features and causality may be too demanding for participants, and they may have paid attention only to the first part of the questions (i.e., judgments of defining features). If so, this would work against our hypothesis because both kinds and arbitrary categories contain defining features (labels in this experiment). In addition, Ahn et al. (2006) used separate questions to specify different aspects of essentialism (albeit with less controlled stimuli) and found results similar to the current study.

Contributor Information

Woo-kyoung Ahn, Email: woo-kyoung.ahn@yale.edu, Department of Psychology, Yale University, 2 Hillhouse Avenue, New Haven, CT06520, U.S.A. 203-432-9626.

Eric G. Taylor, Email: ericgtaylor@gmail.com, Department of Psychology, Yale University, 2 Hillhouse Avenue, New Haven, CT06520, U.S.A

Daniel Kato, Email: daniel.d.kato@gmail.com, Columbia University, 701 West 168th Street, Room 1002, New York, NY 10032.

Jessecae K. Marsh, Email: jessecae.marsh@lehigh.edu, Department of Psychology, Lehigh University, 17 Memorial Drive East, Bethlehem, PA 18015

Paul Bloom, Email: paul.bloom@yale.edu, Department of Psychology, Yale University, 2 Hillhouse Avenue, New Haven, CT06520, U.S.A.

References

  1. Ahn W. Why are different features central for natural kinds and artifacts? The role of causal status in determining feature centrality. Cognition. 1998;69:135–178. doi: 10.1016/s0010-0277(98)00063-8. [DOI] [PubMed] [Google Scholar]
  2. Ahn W, Kim NS, Lassaline ME, Dennis MJ. Causal status as a determinant of feature centrality. Cognitive Psychology. 2000;41:1–55. doi: 10.1006/cogp.2000.0741. [DOI] [PubMed] [Google Scholar]
  3. Ahn W, Novick L, Kim NS. Understanding it makes it more normal. Psychonomic Bulletin and Review. 2003;10:746–752. doi: 10.3758/bf03196541. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 1. Washington, DC: Author; 1952. [Google Scholar]
  5. American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 3. Washington, DC: Author; 1980. [Google Scholar]
  6. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders. 4. Washington, DC: American Psychiatric Association; 2000. text revision. [Google Scholar]
  7. Atran S, Estin P, Coley JD, Medin DL. Generic Species and Basic Levels: Essence and Appearance in Folk Biology. Journal of Ethnobiology. 1997;17:22–45. [Google Scholar]
  8. Bloom P. Intention, history, and artifact concepts. Cognition. 1996;60:1–29. doi: 10.1016/0010-0277(95)00699-0. [DOI] [PubMed] [Google Scholar]
  9. Bloom P. Descartes’ Baby. New York: Norton; 2004. [Google Scholar]
  10. Brooks LR, Squire-Graydon R, Wood TJ. Diversion of attention in everyday concept learning: Identification in the service of use. Memory & Cognition. 2007;35:1–14. doi: 10.3758/bf03195937. [DOI] [PubMed] [Google Scholar]
  11. Cheng PW. From covariation to causation: A causal power theory. Psychological Review. 1997;104:367–405. [Google Scholar]
  12. Dar-Nimrod I, Heine SJ. Genetic essentialism: On the deceptive determinism of DNA. Psychological Bulletin. 2011;137:800–818. doi: 10.1037/a0021860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Diesendruck G, Gelman SA. Domain differences in absolute judgments of category membership: Evidence for an essentialist account of categorization. Psychonomic Bulletin and Review. 1999;6:338–346. doi: 10.3758/bf03212339. [DOI] [PubMed] [Google Scholar]
  14. Gelman SA. The essential child: origins of essentialism in everyday thought. New York: Oxford University Press; 2003. [Google Scholar]
  15. Gelman SA, Bloom P. Young children are sensitive to how an object was created when deciding what to name it. Cognition. 2000;76:91–103. doi: 10.1016/s0010-0277(00)00071-8. [DOI] [PubMed] [Google Scholar]
  16. Gelman SA, Markman EM. Categories and induction in young children. Cognition. 1986;23:183–209. doi: 10.1016/0010-0277(86)90034-x. [DOI] [PubMed] [Google Scholar]
  17. Gelman SA, Wellman HM. Insides and essences: Early understandings of the nonobvious. Cognition. 1991;38:213–244. doi: 10.1016/0010-0277(91)90007-q. [DOI] [PubMed] [Google Scholar]
  18. Hampton JA, Estes Z, Simmons S. Metamorphosis: Essence, appearance, and behavior in the categorization of natural kinds. Memory & Cognition. 2007;35:1785–1800. doi: 10.3758/bf03193510. [DOI] [PubMed] [Google Scholar]
  19. Hadjichristidis C, Sloman SA, Stevenson RJ, Over DE. Feature centrality and property induction. Cognitive Science. 2004;28:45–74. [Google Scholar]
  20. Haslam N, Rothschild L, Ernst D. Essentialist beliefs about social categories. British Journal of Social Psychology. 2000;39:113–127. doi: 10.1348/014466600164363. [DOI] [PubMed] [Google Scholar]
  21. Houts AC. Discovery, invention, and the expansion of the modern Diagnostic and Statistical Manuals of Mental Disorders. In: Beutler LE, Malik ML, editors. Rethinking the DSM. A psychological perspective. Washington, DC: American Psychological Association; 2002. pp. 17–68. [Google Scholar]
  22. Kalish CW. Essentialist to some degree: beliefs about the structure of categories. Memory & Cognition. 2002;30:340–352. doi: 10.3758/bf03194935. [DOI] [PubMed] [Google Scholar]
  23. Keil FC. Concepts, kinds, and cognitive development. Cambridge, MA: MIT Press; 1989. [Google Scholar]
  24. Kelley HH. Causal schemata and the attribution process. New York: General Learning Press; 1972. [Google Scholar]
  25. Lassaline ME. Structural alignment in induction and similarity. Journal of Experimental Psychology: Learning, Memory, and Cognition. 1996;22:754–770. doi: 10.1037//0278-7393.22.3.754. [DOI] [PubMed] [Google Scholar]
  26. Locke J. An essay concerning human understanding. Oxford University Press; Oxford: 1894/1975. [Google Scholar]
  27. Macnamara J. A border dispute: A place of logic in psychology. MIT Press; Cambridge, Mass: 1986. [Google Scholar]
  28. Malt BC, Sloman SA. Category essence or essentially pragmatic? Creator’s intention in naming and what’s really what. Cognition. 2007;105:615–648. doi: 10.1016/j.cognition.2006.10.001. [DOI] [PubMed] [Google Scholar]
  29. Markman EM. Categorization and naming in children: Problems of induction. MIT Press; Cambridge, MA: 1989. [Google Scholar]
  30. Medin DL, Ortony A. Psychological essentialism. In: Vosniadou S, Ortony A, editors. Similarity and analogical reasoning. Cambridge, MA: Cambridge University Press; 1989. pp. 179–196. [Google Scholar]
  31. Murphy GL, Medin DL. The role of theories in conceptual coherence. Psychological Review. 1985;92:289–316. [PubMed] [Google Scholar]
  32. Pothos EM, Hahn U. So concepts aren’t definitions, but do they have necessary or sufficient features? British Journal of Psychology. 2000;91:239–250. doi: 10.1348/000712600161925. [DOI] [PubMed] [Google Scholar]
  33. Prasada S, Hennfield L, Otap D. Conceptual and linguistic representations of kinds and classes. Cognitive Science. 2012 doi: 10.1111/j.1551-6709.2012.01254.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Putnam H. Is semantics possible? In: Schwartz SP, editor. Naming, necessity, and natural kinds. Cornell University Press; Ithaca, NY: 1977. [Google Scholar]
  35. Rozin P, Spranca M, Krieger Z, Neuhaus R, Surillo D, Swerdlin A, Wood K. Natural preference: Instrumental and ideational/moral motivations, and the contrast between foods and medicines. Appetite. 2004;43:147–154. doi: 10.1016/j.appet.2004.03.005. [DOI] [PubMed] [Google Scholar]
  36. Schwartz SP. Natural kind terms. Cognition. 1979;7:301–315. [Google Scholar]
  37. Strevens M. The essentialist aspect of naive theories. Cognition. 2000;74:149–175. doi: 10.1016/s0010-0277(99)00071-2. [DOI] [PubMed] [Google Scholar]
  38. Weiner B. An attributional theory of motivation and emotion. New York: Springer-Verlag; 1986. [Google Scholar]
  39. Yamauchi T. Labeling bias and categorical induction: Generative aspects of category information. Journal of Experimental Psychology: Learning, Memory and Cognition. 2005;31:538–553. doi: 10.1037/0278-7393.31.3.538. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

table for appendix a

Appendix A. Stimuli Used in Experiments 1 and 2.

table for appendix b

Appendix B. Stimuli Used in Experiments 3 and 4.

RESOURCES