Abstract
The Duchenne smile hypothesis is that smiles that include eye constriction (AU6) are the product of genuine positive emotion, whereas smiles that do not are either falsified or related to negative emotion. This hypothesis has become very influential and is often used in scientific and applied settings to justify the inference that a smile is either true or false. However, empirical support for this hypothesis has been equivocal and some researchers have proposed that, rather than being a reliable indicator of positive emotion, AU6 may just be an artifact produced by intense smiles. Initial support for this proposal has been found when comparing smiles related to genuine and feigned positive emotion; however, it has not yet been examined when comparing smiles related to genuine positive and negative emotion. The current study addressed this gap in the literature by examining spontaneous smiles from 136 participants during the elicitation of amusement, embarrassment, fear, and pain (from the BP4D+ dataset). Bayesian multilevel regression models were used to quantify the associations between AU6 and self-reported amusement while controlling for smile intensity. Models were estimated to infer amusement from AU6 and to explain the intensity of AU6 using amusement. In both cases, controlling for smile intensity substantially reduced the hypothesized association, whereas the effect of smile intensity itself was quite large and reliable. These results provide further evidence that the Duchenne smile is likely an artifact of smile intensity rather than a reliable and unique indicator of genuine positive emotion.
Keywords: Duchenne smile, facial expression, nonverbal behavior, emotion, FACS, Bayesian data analysis
I. Introduction
Ekman and colleagues [1]–[4] famously hypothesized that genuine positive emotion (e.g., amusement) is revealed by the Duchenne smile, a facial expression involving “smiling eyes” as well as a smiling mouth. In the language of the Facial Action Coding System (FACS) [5], which is often used to study facial expressions, the Duchenne smile is composed of the lip corner puller action unit (AU12) and the cheek raiser action unit (AU6). All smiles involve AU12, which pulls the lip corners toward the ears, but Duchenne smiles also involve AU6, which lifts the cheeks, narrows the eyes, and wrinkles the outer eye corners (hence the “smiling eyes”). Smiles without AU6 are called non-Duchenne smiles and are hypothesized to be disconnected from genuine positive emotion (e.g., false, miserable, or masking negative emotion [1]).
The Duchenne smile hypothesis has been very influential and is still commonly adopted in the social/medical sciences [6], [7] and in affective computing [8], [9]. It is often used to classify smiles as “real” or “fake” and to infer participants’ level of positive emotion from their face. It is also central to discrete emotion theory [10], which considers the Duchenne smile to be the universally recognized and produced facial expression (i.e., prototype) of positive emotion.
However, empirical support for the hypothesis has been mixed [11]. Some studies have supported it, finding that Duchenne smiles were more common in response to pleasant stimuli [2], [12], [13] and more associated with self-reported positive emotion [4], [14], [15] than non-Duchenne smiles. But others have cast doubt on it, finding that Duchenne smiles occur at high rates when participants feel negative emotion [2], [14], [16]–[22] or feign positive emotion [11], [23]–[27].
In attempting to explain these mixed results, Krumhuber and Manstead [11] proposed that AU6 may just be an artifact (i.e., side-effect) of high AU12 intensity. That is, intense smiles may be more likely to include AU6 regardless of what the smiling person is feeling. If this hypothesis is correct, then previously observed associations between AU6 and positive emotion may have actually been caused by an unmeasured association between AU12 intensity and positive emotion [28]. Later research supported the plausibility of this hypothesis. Gosselin et al. [29] instructed participants to activate AUs one-at-a-time and found that many of them added AU6 while trying to activate AU12. Moreover, the likelihood of adding AU6 increased with the intensity of AU12. This phenomenon also operated in reverse: many participants added AU12 while trying to activate AU6. These results suggest that there is a dependency between these AUs that encourages them to occur together, especially when the intensity of AU12 is high (and even in the absence of felt positive emotion).
The most direct test of the artifact hypothesis examined the facial behavior of 32 participants during two conditions: viewing amusing stimuli and trying to feign amusement [11]. Contrary to the Duchenne smile hypothesis, rates of Duchenne and non-Duchenne smiles were very similar in both conditions. So, AU6 did not discriminate between felt and false smiles. However, it did discriminate between weak and intense smiles; across both conditions, Duchenne smiles were much higher in AU12 intensity than non-Duchenne smiles. Thus, as predicted by the artifact hypothesis, the occurrence of AU6 was related to AU12 intensity but not to genuine positive emotion.
Several important questions still remain, however. First, this work only compared smiles during expressed amusement and feigned amusement; it did not compare smiles during positive and negative emotions. Thus, it is currently unknown whether Krumhuber and Manstead’s [11] finding will generalize to other contexts that the Duchenne smile hypothesis makes predictions about (e.g., negative emotions). Second, this work did not directly compare predictors in the same model (i.e., it did not statistically control for AU12 intensity). Thus, the relative importance of these variables and the exact extent to which AU12 intensity confounds these relationships is currently unknown. Finally, nearly all previous work dichotomized smiles as Duchenne or non-Duchenne without considering the intensity of AU6. However, Messinger et al. [30] argued that the intensity of AU6 contains important information about felt emotion. Thus, it is possible the relationship between AU6 and positive emotion would be strengthened by increasing the granularity with which AU6 is measured.
The current study addressed these gaps in the literature and made several novel contributions. First, we examined smiles during four different emotional contexts (i.e., amusement, embarrassment, fear, and pain), which allowed us to test whether the artifact hypothesis can generalize beyond feigned positive emotion to negative emotion. Including multiple emotions also allowed us to better assess the generalizability of our findings. Second, we directly compared predictors in two related models (i.e., predicting AU6 and amusement) by statistically controlling for AU12 intensity. We also did so using a much larger sample than previous research (N = 136) and a more sophisticated modeling approach (i.e., Bayesian multilevel regression with interactions). Finally, we represented AU6 as a continuum rather than a dichotomy, which allowed us to leverage the full extent of this variable’s predictive power [31].
We explored two related research questions. First, does AU6 predict how amused a smiling participant feels, even when controlling for AU12 intensity? If the Duchenne smile hypothesis is correct, then the answer should be yes. However, if the artifact hypothesis is correct, then the answer should be no (or at least the predictive power of AU6 should be largely reduced by controlling for AU12 intensity). Second, does self-reported amusement predict whether a smile contains AU6 (and to what degree), even when controlling for AU12 intensity? If the Duchenne smile hypothesis is correct, then amusement should be uniquely predictive of AU6. However, if the artifact hypothesis is correct, then AU12 intensity should be very related to AU6 and amusement should not be.
II. Methods
A. Data
1). BP4D+ Database:
To find examples of spontaneous facial behavior, we accessed the BP4D+ database [32], which includes video recordings and metadata from 140 participants during tasks meant to elicit different positive and negative emotions. In the amusement task, the participant was told a joke; in the embarrassment task, the participant was told to improvise a rhyming song and sing it loudly; in the fear task, darts were thrown at a dartboard located near the participant’s head; and in the pain task, the participant submerged their hand into ice water for as long as possible.
After each task, the participant rated how intensely they had felt 14 different emotions [32]; each emotion was rated on a six-point scale ranging from 0 (not at all) to 5 (extremely). In this study, we focused on participants’ ratings of amusement, which were their responses to the question, “How much did you feel happy, joyful, or amused?”
Facial actions in each task video were annotated by one of five expert coders who had passed the official FACS final test [5]. A period of around 15 s was selected from each task video to be annotated; this period corresponded to when the emotion elicitation was strongest (e.g., the period leading up to when the participant removed their hand from the ice water). The coders annotated each video frame during this period (at 25 fps) for the intensity of AU6 and AU12. Intensity was annotated using a six-point scale where 0 corresponded the absence of the AU and 1 through 5 corresponded to the official FACS intensity levels (i.e., “trace” through “maximum”).
A subset of task videos was coded by two or more coders to assess the reliability of the codes [32]. Inter-coder agreement was calculated using the ordinal-weighted S score (also called the κBP coefficient) [33], which adjusts for chance agreement and can accommodate scales with ordinal/ordered categories. Scores above 0.60 are often interpreted as “good” and scores above 0.80 are often interpreted as “very good” [33]. Inter-coder agreement was good for AU6 intensity (S = 0.70) and very good for AU12 intensity (S = 0.84).
2). Smile Events:
We used the BP4D+ FACS codes to identify all smile events, i.e., sequences of consecutive frames during which AU 12 was coded as having an intensity greater than zero. We excluded events from one participant who was an outlier in terms of age (i.e., 66 years old) but retained all other events, resulting in a sample of 751 smile events from 136 participants. The number of smile events per participant ranged from 3 to 11 (M = 5.5, SD = 2.0), and the duration of each event ranged from 0.1 s to 20.1 s (M = 6.0, SD = 5.3).
The sample was 60 % Female, 46 % White, 34 % Asian, 10 % Latino/Hispanic, and 7 % Black. Ages ranged from 18 to 30 years (M = 20.2, SD = 2.5). All participants were currently living in the USA and most had lived in the USA for many years (M = 15.3, SD = 7.1).
Consistent with previous research [11], we calculated each AU’s intensity level as its peak (i.e., maximum) value in each smile event. In order to facilitate interpretation and comparisons between predictors, the AU intensity variables and the amusement ratings were standardized to have zero mean and unit variance prior to analysis and visualization.
B. Statistical Modeling
1). Model Building:
To explore our research questions, we estimated two linear regression models. Model 1 regressed self-reported amusement on AU6 intensity, AU12 intensity, and the interaction of the two. This model allowed us to quantify how well amusement can be inferred from these variables and what the nature (i.e., magnitude and direction) of these relationships was. Model 2 regressed AU6 intensity on self-reported amusement and AU12 intensity; it allowed us to quantify how important each variable was in explaining the presence and intensity of AU6 in a smile. Both models also controlled for participant sex.
The regression models were estimated using a Bayesian multilevel modeling framework [34], [35]. Multilevel models were necessary to account for the hierarchical structure of the data (i.e., there were multiple observations/events from each participant). Using a single-level model could lead to incorrect conclusions because observations from the same participant are more similar than observations from different participants. As a result, the standard error estimates (which assume observations are independent) would be biased [36].
The models are depicted in Figures 1 and 2 using multilevel path diagrams [37]. Both models had the same two-level structure with observations (Level 1) clustered within participants (Level 2). The only difference between them was that AU6 was predicting amusement in Model 1 and amusement was predicting AU6 in Model 2. On Level 1, relationships between AU intensities and self-reported amusement were captured using regression pathways. In addition to direct pathways for each predictor, we also included the interaction of both predictors to allow for conditional effects (i.e., the effect of one predictor depending on the value of the other). On Level 2, intercepts were regressed on participant sex to control for sex differences in average levels of amusement and AU6. Additionally, random (i.e., participant-varying) effects were estimated to allow participants to have different average levels (i.e., intercepts) and different relationships between variables (i.e., slopes). Multilevel models that include random effects are more complex than models that do not, but are also more accurate and generalizable [38]. The regression coefficients that make up the primary results of these models capture the central tendency and variability in the random effects.
Fig. 1.

Path diagram of Model 1. Squares are observed variables, circles are random effects, triangles are intercepts, and arrows are regression paths. Level 1 includes the AU intensity variables as well as the amusement/positive emotion rating (PE) for smile t from participant i. Level 2 includes the sex (SEX) and random effects (β) for participant i.
Fig. 2.

Path diagram of Model 2. Squares are observed variables, circles are random effects, triangles are intercepts, and arrows are regression paths. Level 1 includes the AU intensity variables as well as the amusement/positive emotion rating (PE) for smile t from participant i. Level 2 includes the sex (SEX) and random effects (β) for participant i.
Bayesian methods were used to estimate the models because they offer many practical and interpretive advantages such as the ability to incorporate prior knowledge about parameters into the model and to estimate the probability of different parameter values [34], [35]. In brief, Bayesian methods combine existing knowledge about the probability of different model parameters (in the form of prior distributions) with observed data to generate updated knowledge about the parameters (in the form of posterior distributions). Statistical inferences can then be made using this updated knowledge. We estimated our models using the brms package [39] as an interface to the Stan platform [40]. Complete model details, including syntax, are provided in the supplemental materials.
2). Model Interpretation:
In interpreting our model results, we had two primary goals: to estimate the magnitude (i.e., size and sign) of each important effect, and to quantify the amount of precision (i.e., certainty) in these estimates. To accomplish these goals within a Bayesian framework, we followed recent guidelines from Kruschke [41], [42]. For each effect (i.e., regression coefficient), we represented its magnitude using a point estimate and its precision using an interval estimate.
Each point estimate is the single most credible value for the effect’s magnitude given the data and our model. For example, if our point estimate of the AU 6 intensity effect in Model 1 was 0.2, then this would mean that each one standard deviation increase in AU 6 intensity would be associated with a 0.2 standard deviation increase in self-reported amusement. The median of each effect’s posterior distribution was used as its point estimate (the median minimizes the posterior risk for the absolute-value loss function [43]).
Each interval estimate is the range of the most credible values for the effect’s magnitude given the data, our model, the amount of variability in the random effect, and the amount of sampling error. The more uncertainty there is in the magnitude of an effect, the wider its interval estimate will be. For example, if our interval estimate for the same effect described earlier was from 0.1 to 0.3, we could be more certain in its size and sign than if its interval estimate was instead from −0.2 to 0.6. The 95 % highest density interval (HDI) of each effect’s posterior distribution was used as its interval estimate (the 95 % HDI is the narrowest interval that contains 95 % of the posterior probability density [42]).
To provide information about the practical importance of each effect, we also used the region of practical equivalence (ROPE) technique [42]. We defined magnitudes within the range of −0.05 to 0.05 to be practically equivalent to zero1 (i.e., too small to be meaningful in practice). We then calculated the proportion of each HDI that fell within this range. The smaller this proportion is, the more confident we can be that the effect is large enough to be practically important. If desired, we can also use the HDI+ROPE decision rule to make discrete interpretations of the results [42]. The rule is simple: If the entire HDI falls within the ROPE (1.0), accept the null hypothesis and consider the effect to be practically equivalent to zero. If the entire HDI falls outside the ROPE (0.0), reject the null hypothesis and consider the effect large enough to be practically important. Otherwise, withhold a decision.
III. Results
The AU intensity variables were highly correlated with one another (r = 0.64), and self-reported amusement was weakly correlated with both AU6 intensity (r = 0.21) and AU12 intensity (r = 0.28). The results of Model 1 are summarized in Table I and depicted in Figure 3. The results of Model 2 are summarized in Table II and depicted in Figure 4. Complete results details are provided in the supplementary materials.
TABLE I.
Standardized Regression Coefficients from Model 1
| Parameter | Estimate | 95 % HDI | ROPE |
|---|---|---|---|
| Intercept | −0.07 | [ −0.20, 0.06 ] | .386 |
| Sex (Male) | 0.07 | [ −0.12, 0.26 ] | .320 |
| AU6 Intensity | 0.08 | [ −0.02, 0.17 ] | .285 |
| AU12 Intensity | 0.25 | [ 0.15, 0.33 ] | .000 |
| Interaction Effect | 0.07 | [ 0.01, 0.14 ] | .235 |
Note. The outcome variable in this model was amusement.
Fig. 3.

Marginal Effects Plot from Model 1 with 95% HDI. This model predicted the amount of self-reported amusement associated with each smile.
TABLE II.
Standardized Regression Coefficients from Model 2
| Parameter | Estimate | 95 % HDI | ROPE |
|---|---|---|---|
| Intercept | 0.04 | [ −0.06, 0.13 ] | .557 |
| Sex (Male) | −0.07 | [ −0.23, 0.07 ] | .342 |
| Amusement | 0.05 | [ −0.02, 0.11 ] | .536 |
| AU12 Intensity | 0.63 | [ 0.57, 0.70 ] | .000 |
| Interaction Effect | −0.01 | [ −0.08, 0.05 ] | .901 |
Note. The outcome variable in this model was AU6 intensity.
Fig. 4.

Marginal Effects Plot from Model 2 with 95% HDI. This model predicted the intensity level of AU6 (the cheek raiser) in each smile.
Model 1 found that, when holding either AU’s intensity constant, the other AU’s intensity was positively related to amusement. However, the effect of AU12 intensity was more than three times larger in magnitude than the effect of AU6 intensity. There was also some evidence of an interaction between the AU intensity variables. When AU12 intensity was low, the intensity of AU6 did not matter; but when AU12 intensity was high, higher AU6 intensity predicted more amusement. No reliable sex difference in amusement was apparent. When adopting the HDI+ROPE decision rule, only the AU12 intensity effect resulted in rejecting the null hypothesis. This model explained a little less than one-third of the variance in self-reported amusement, R2 = 0.30 [0.24, 0.35], which indicates that the intensities of AU6 and AU12 (as well as participant sex) constitute only a fraction of the information needed to successfully infer a participant’s level of amusement.
Model 2 found that, when holding AU12 intensity constant, there was almost no relationship between amusement and AU6 intensity. In contrast, when holding amusement constant, AU12 intensity was strongly related to AU6 intensity. Indeed, the latter effect was over twelve times larger in magnitude than the former. No reliable interaction between amusement and AU12 intensity was apparent, and no reliable sex difference in AU6 was apparent. When adopting the HDI+ROPE decision rule, only the AU12 intensity effect resulted in rejecting the null hypothesis. This model explained around half of the variance in AU6 intensity, R2 = 0.51 [0.46, 0.55]. Two pieces of information can be gleaned from this. First, it was easier to predict AU6 intensity using the study variables than it was to infer amusement. And second, because it was the only meaningful predictor in the model, this means that AU12 intensity explained around half the variance in AU6 intensity.
To see what the effects of AU6 and amusement would have been without controlling for AU12 intensity, we reran both models without it. (The syntax and results for these models are provided in the supplemental materials.) When ignoring AU12 intensity, the effect of AU6 intensity in predicting amusement was 0.22 [0.15, 0.29], ROPE = .000. This means that controlling for AU12 intensity reduced this effect by 64 % and changed the HDI+ROPE decision from reject to withhold. Similarly, when ignoring AU12 intensity, the effect of amusement in predicting AU6 intensity was 0.24 [0.16, 0.31], ROPE = .000. This means that controlling for AU12 intensity reduced this effect by 79 % and changed the HDI+ROPE decision from reject to withhold.
IV. Discussion
The Duchenne smile hypothesis [1]–[4] is that AU6 reveals the emotion underlying a smile and distinguishes felt smiles from false and negative emotion smiles. This hypothesis has become very popular in many research areas and is often used to justify the inference that a person is (or is not) feeling positive emotion based on the observation of a Duchenne (or non-Duchenne) smile. However, empirical support for this hypothesis has been called into question by researchers who have argued that AU6 may just be an artifact of high AU12 intensity. Thus, previously observed associations between AU6 and felt positive emotion may have been inflated by both variables’ correlations with AU12 intensity [11], [28].
The current study directly tested this possibility in two different ways and found strong support for it both times. When trying to infer positive emotion from the appearance of a smile, the predictive power of AU6 was reduced considerably by controlling for AU12 intensity. Similarly, when trying to predict whether a smile would include AU6 (and to what degree), the predictive power of self-reported amusement was almost completely erased by controlling for AU12 intensity.
There are three main implications of these findings. First, future research on the Duchenne smile hypothesis should always measure and consider AU12 intensity. As shown in our study, failing to do so can seriously distort estimates of the relationship between AU6 and emotion (e.g., inflating their magnitude by up to 79 %). Second, researchers should refrain from inferring the presence or absence of positive emotion on the basis of Duchenne smiles alone. We found that such an inference will often be incorrect, especially when the intensity of AU12 is low. Instead, future work should continue to explore other smile characteristics that have shown promise in revealing the meaning of smiles (e.g., symmetry and dynamics [27], [44], [45]). Given that the intensities of AU6 and AU12 only explained explained around 30 % of the variance in amusement, there is a lot of room for other smile characteristics to make contributions. Finally, researchers evaluating theories of emotion should consider the possibility that much of the evidence in favor of the Duchenne smile hypothesis may have been confounded by the unmeasured intensity of AU12 and therefore inflated [28]. Thus, theories that advance this hypothesis, such as discrete emotions theory [10], may have less evidentiary support than previously believed.
The reason for the strong association between AU6 and AU12 intensity is not completely understood, but it seems that an anatomical or psychomotor dependency between the facial muscles encourages AU6 and AU12 to co-occur [29]. Because the FACS manual [5] explicitly describes how to determine whether or not a strong AU12 includes AU6, we believe this association is real and not just a bias of the FACS coders.
In conclusion, the current study found that associations between felt positive emotion and Duchenne smiles are reduced when AU12 intensity is controlled for. These results replicate and extend previous work [11] by showing that this phenomenon generalizes beyond feigned smiles and applies to negative emotion smiles as well. We also quantified, for the first time, the extent of this reduction and found it to be substantial (i.e., 64 % when predicting amusement and 79 % when predicting AU6). These results were obtained even when This suggests that, rather than being a reliable indicator of positive emotion, AU6 may just be an artifact of intense smiles.
Acknowledgments
This material is based upon work partially supported by the National Science Foundation (1629716, 1629898, 1722822, 1734868) and National Institutes of Health (MH096951). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of National Science Foundation or National Institutes of Health, and no official endorsement should be inferred.
Footnotes
Supplemental Materials
Complete syntax and output for all analyses are provided through the Open Science Framework: https://osf.io/t7e2d
This range is conventionally used for standardized regression coefficients like ours because it corresponds to half of a “small” effect size [42, p. 277].
Contributor Information
Jeffrey M. Girard, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA
Gayatri Shandar, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA.
Zhun Liu, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA.
Jeffrey F. Cohn, Department of Psychology, University of Pittsburgh, Pittsburgh, PA
Lijun Yin, Department of Computer Science, Binghamton University, Binghamton, NY.
Louis-Philippe Morency, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA.
References
- [1].Ekman P and Friesen WV, “Felt, false, and miserable smiles,” Journal of Nonverbal Behavior, vol. 6, no. 4, pp. 238–252, 1982. [Google Scholar]
- [2].Ekman P, Davidson RJ, and Friesen WV, “The Duchenne smile: Emotional expression and brain physiology: II,” Journal of Personality and Social Psychology, vol. 58, no. 2, pp. 342–353, 1990. [PubMed] [Google Scholar]
- [3].Frank MG and Ekman P, “Not all smiles are created equal: The differences between enjoyment and nonenjoyment smiles,” Humor - International Journal of Humor Research, vol. 6, no. 1, 1993. [Google Scholar]
- [4].Frank MG, Ekman P, and Friesen WV, “Behavioral markers and recognizability of the smile of enjoyment,” Journal of Personality and Social Psychology, vol. 64, no. 1, pp. 83–93, 1993. [DOI] [PubMed] [Google Scholar]
- [5].Ekman P, Friesen WV, and Hager J, Facial Action Coding System: A Technique for the Measurement of Facial Movement. Salt Lake City, UT: Research Nexus, 2002. [Google Scholar]
- [6].Bernstein MJ, Young SG, Brown CM, Sacco DF, and Claypool HM, “Adaptive responses to social exclusion: Social rejection improves detection of real and fake smiles,” Psychological Science, vol. 19, no. 10, pp. 981–983, 2008. [DOI] [PubMed] [Google Scholar]
- [7].Gadassi R and Mor N, “Confusing acceptance and mere politeness: Depression and sensitivity to Duchenne smiles,” Journal of Behavior Therapy and Experimental Psychiatry, vol. 50, pp. 8–14, 2016. [DOI] [PubMed] [Google Scholar]
- [8].Hofmann J, Platt T, and Ruch W, “Laughter and smiling in 16 positive emotions,” IEEE Transactions on Affective Computing, vol. 8, no. 4, pp. 495–507, 2017. [Google Scholar]
- [9].Haddad KE, Dupont S, d’Alessandro N, and Dutoit T, “An HMM-based speech-smile synthesis system: An approach for amusement synthesis,” in IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, vol. 05, 2015, pp. 1–6. [Google Scholar]
- [10].Ekman P, “Are there basic emotions?” Psychological Review, vol. 99, no. 3, pp. 550–553, 1992. [DOI] [PubMed] [Google Scholar]
- [11].Krumhuber EG and Manstead ASR, “Can Duchenne smiles be feigned? New evidence on felt and false smiles,” Emotion, vol. 9, no. 6, pp. 807–820, 2009. [DOI] [PubMed] [Google Scholar]
- [12].Soussignan R and Schaal B, “Forms and social signal value of smiles associated with pleasant and unpleasant sensory experience,” Ethology, vol. 102, pp. 1020–1041, 1996. [Google Scholar]
- [13].Matsumoto D and Willingham B, “Spontaneous facial expressions of emotion of congenitally and noncongenitally blind individuals,” Journal of Personality and Social Psychology, vol. 96, no. 1, pp. 1–10, 2009. [DOI] [PubMed] [Google Scholar]
- [14].Ekman P, Friesen WV, and O’Sullivan M, “Smiles when lying,” Journal of Personality and Social Psychology, vol. 54, no. 3, pp. 414–414, 1988. [DOI] [PubMed] [Google Scholar]
- [15].Jakobs E, Manstead ASR, and Fischer AH, “Social motives, emotional feelings, and smiling,” Cognition & Emotion, vol. 13, no. 4, pp. 321–345, 1999. [Google Scholar]
- [16].Hess U and Bourgeois P, “You smile–I smile: Emotion expression in social interaction,” Biological Psychology, vol. 84, no. 3, pp. 514–520, 2010. [DOI] [PubMed] [Google Scholar]
- [17].Schneider K and Josephs I, “The expressive and communicative functions of preschool children’s smiles in an achievement-situation,” Journal of Nonverbal Behavior, vol. 15, no. 3, pp. 185–198, 1991. [Google Scholar]
- [18].Harris CR and Alvarado N, “Facial expressions, smile types, and self-report during humour, tickle, and pain,” Cognition and Emotion, vol. 19, no. 5, pp. 655–669, 2005. [Google Scholar]
- [19].Keltner D, “Signs of appeasement: Evidence for the distinct displays of embarrassment, amusement, and shame,” Journal of Personality and Social Psychology, vol. 68, no. 3, pp. 441–441, 1995. [Google Scholar]
- [20].Lee V and Beattie G, “The rhetorical organization of verbal and nonverbal behavior in emotion talk,” Semiotica, vol. 120, pp. 39–92, 1998. [Google Scholar]
- [21].Keltner D and Bonanno GA, “A study of laughter and dissociation: Distinct correlates of laughter and smiling during bereavement,” Journal of Personality and Social Psychology, vol. 73, no. 4, pp. 687–702, 1997. [DOI] [PubMed] [Google Scholar]
- [22].Papa A and Bonanno GA, “Smiling in the face of adversity: The interpersonal and intrapersonal functions of smiling,” Emotion, vol. 8, no. 1, pp. 1–12, 2008. [DOI] [PubMed] [Google Scholar]
- [23].Carroll JM and Russell JA, “Facial expressions in Hollywood’s portrayal of emotion.” Journal of Personality and Social Psychology, vol. 72, no. 1, pp. 164–176, 1997. [Google Scholar]
- [24].Gunnery SD, Hall JA, and Ruben MA, “The deliberate Duchenne smile: Individual differences in expressive control,” Journal of Nonverbal Behavior, vol. 37, no. 1, pp. 29–41, 2013. [Google Scholar]
- [25].Schmidt KL, Ambadar Z, Cohn JF, and Reed LI, “Movement differences between deliberate and spontaneous facial expressions: Zygomaticus major action in smiling,” Journal of Nonverbal Behavior, vol. 30, no. 1, pp. 37–52, 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Schmidt KL and Cohn JF, “Human facial expressions as adaptations: Evolutionary questions in facial expression research,” American Journal of Physical Anthropology, vol. 116, no. S33, pp. 3–24, 2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Schmidt KL, Bhattacharya S, and Denlinger R, “Comparison of deliberate and spontaneous facial movement in smiles and eyebrow raises,” Journal of Nonverbal Behavior, vol. 33, no. 1, pp. 35–45, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [28].Gunnery SD and Hall JA, “The expression and perception of the Duchenne smile,” in The Social Psychology of Nonverbal Communication, Kostic A and Chadee D, Eds. New York, NY: Palgrave Macmillan, 2015. [Google Scholar]
- [29].Gosselin P, Perron M, and Beaupré M, “The voluntary control of facial action units in adults,” Emotion, vol. 10, no. 2, pp. 266–271, 2010. [DOI] [PubMed] [Google Scholar]
- [30].Messinger DS, Cassel TD, Acosta SI, Ambadar Z, and Cohn JF, “Infant smiling dynamics and perceived positive emotion,” Journal of Nonverbal Behavior, vol. 32, no. 3, pp. 133–155, 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [31].DeCoster J, Iselin A-MR, and Gallucci M, “A conceptual and empirical examination of justifications for dichotomization,” Psychological Methods, vol. 14, no. 4, pp. 349–366, 2009. [DOI] [PubMed] [Google Scholar]
- [32].Zhang Z, Girard JM, Wu Y, Zhang X, Liu P, Ciftci U, Canavan S, Reale M, Horowitz A, Yang H, Cohn JF, Ji Q, and Yin L, “Multimodal spontaneous emotion corpus for human behavior analysis,” in IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 3438–3446. [Google Scholar]
- [33].Gwet KL, Handbook of Inter-Rater Reliability: The Definitive Guide to Measuring the Extent of Agreement among Raters, 4th ed Gaithersburg, MD: Advanced Analytics, 2014. [Google Scholar]
- [34].Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, and Rubin DB, Bayesian Data Analysis, 3rd ed Boca Raton, FL: CRC Press, 2014, vol. 1542. [Google Scholar]
- [35].McElreath R, Statistical Rethinking: A Bayesian Course with Examples in R and Stan. New York, NY: CRC Press, 2016. [Google Scholar]
- [36].Bliese PD and Hanges PJ, “Being both too liberal and too conservative: The perils of treating grouped data as though they were independent,” Organizational Research Methods, vol. 7, no. 4, pp. 400–417, 2004. [Google Scholar]
- [37].Curran PJ and Bauer DJ, “Building path diagrams for multilevel models,” Psychological Methods, vol. 12, no. 3, pp. 283–297, 2007. [DOI] [PubMed] [Google Scholar]
- [38].Barr DJ, Levy R, Scheepers C, and Tily HJ, “Random effects structure for confirmatory hypothesis testing: Keep it maximal,” Journal of Memory and Language, vol. 68, no. 3, pp. 255–278, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [39].Bürkner P-C, “Brms: An R Package for Bayesian Multilevel Models Using Stan,” Journal of Statistical Software, vol. 80, no. 1, pp. 1–28, August 2017. [Google Scholar]
- [40].Gelman A, Lee D, and Guo J, “Stan: A probabilistic programming language for Bayesian inference and optimization,” Journal of Educational and Behavioral Statistics, vol. 40, no. 5, pp. 530–543, 2015. [Google Scholar]
- [41].Kruschke JK and Liddell TM, “The Bayesian new statistics: Hypothesis testing, estimation, meta-analysis, and power analysis from a Bayesian perspective,” Psychonomic Bulletin & Review, vol. 25, no. 1, pp. 178–206, 2018. [DOI] [PubMed] [Google Scholar]
- [42].Kruschke JK, “Rejecting or accepting parameter values in Bayesian estimation,” Advances in Methods and Practices in Psychological Science, vol. 1, no. 2, pp. 270–280, 2018. [Google Scholar]
- [43].Jaynes ET, Probability Theory: The Logic of Science, 5th ed Cambridge University Press, 2007. [Google Scholar]
- [44].Hess U and Kleck RE, “Differentiating emotion elicited and deliberate emotional facial expressions,” European Journal of Social Psychology, vol. 20, no. 5, pp. 369–385, 1990. [Google Scholar]
- [45].Ambadar Z, Cohn JF, and Reed LI, “All smiles are not created equal: Morphology and timing of smiles perceived as amused, polite, and embarrassed/nervous,” Journal of Nonverbal Behavior, vol. 33, no. 1, pp. 17–34, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
