Abstract
The present study used a collaborative framework to integrate two long-term prospective studies: the Terman Life Cycle Study and the Hawaii Personality and Health Longitudinal Study. Using a five-factor personality-trait framework, teacher assessments of child personality were rationally and empirically aligned to establish similar factor structures across samples. Comparable items related to adult self-rated health, education, and alcohol use were harmonized, and data were pooled on harmonized items. A structural model was estimated, allowing paths to differ by sample. Harmonized child personality factors were then used to examine markers of physiological dysfunction in the Hawaii sample and mortality risk in the Terman sample. Harmonized conscientiousness predicted less physiological dysfunction in the Hawaii sample and lower mortality risk in the Terman sample. These results illustrate how collaborative, integrative work with multiple samples offers the exciting possibility that samples from different cohorts and ages can be linked together to directly test lifespan theories of personality and health.
Keywords: Integrative Data Analysis, Collaborative Studies, Lifespan Perspective, Child Personality, Five-Factor Model
A single research study necessarily has limitations, including its design, the sample characteristics, the measures used, the length of follow up (if any), and the historical period in which it takes place. However, when findings replicate across studies, we gain confidence in those findings, in the theories on which they are based, and indeed in psychological science as a whole. Although possible causal pathways often emerge from cross-sectional and short-term studies, longitudinal studies are especially valuable for understanding developmental processes, and considerable time, effort, and research funds have been spent during the past century developing longitudinal datasets that include information about individual differences in personality and later health outcomes (Greenhoot & Dowsett, 2012). Friedman, Kern, Hampson, and Duckworth (this issue) suggest that existing studies can be integrated to test theories of personality, health, and development across the lifespan. An important question is the extent to which studies can indeed be combined. In this article, we give an overview of integrative methods for longitudinal studies, and then examine the potential for--and challenges to--integrating and comparing findings from two lifespan studies: the Terman Life Cycle Study and the Hawaii Personality and Health Longitudinal Study.
Integrating Findings across Studies
There are a number of strategies for combining findings across studies, the most common being sequential independent replication, in which one study finds a relation between two constructs, and then other independent studies are performed (either by the same researchers or others) to replicate and extend those findings. For example, Friedman and colleagues (1993) found that children rated high on conscientiousness were at a lower risk of dying at any given age than children rated low on conscientiousness. This finding was then replicated in other studies with quite diverse participants (e.g., Christensen et al., 2002; Weiss & Costa, 2005; Wilson, Mendes de Leon, Bienias, Evans, & Bennett, 2004).
A second strategy involves meta-analytically combining standardized effect sizes from multiple studies to find an overall average effect and moderators of this effect (i.e, aggregate data meta-analysis, or AD). For example, by combining the findings from 20 studies that had examined conscientiousness and mortality, an overall significant protective effect of high conscientiousness was determined (Kern & Friedman, 2008). Conclusions from meta-analytic work are more robust than any single study, with greater precision and increased statistical power, although meta-analyses remain limited by the characteristics of the studies that are included (Cohn & Becker, 2003; Rosenthal & DiMatteo, 2001; Tak, Meijer, Manoharan, de Jonge, & Rosmalen, 2010).
A newly emerging third strategy involves integrative techniques, in which data are directly combined. Methods include (a) pooled data analysis, in which data from two or more independent samples are combined as a single dataset and analyzed as a single study (Curran & Hussong, 2009); (b) longitudinal item-response modeling, in which responses are aligned to an assumed underlying scale using item response theory (IRT) methods, and then change over age is examined (McArdle, Grimm, Hamagami, Bowles, & Meredith, 2009); and (c) meta-analysis of individual participant data (IDA, or mega-analysis), in which, like in aggregate level (AD) meta-analysis, a comprehensive review of existing studies is performed, but the raw participant data from each study, rather than standardized effect sizes, are aggregated (Cooper & Patall, 2009; Stewart & Clarke, 1995). Such techniques are challenging and often not feasible, but when possible, these integrative methods offer multiple advantages including replication across samples, increased statistical power, broader psychometric assessments of constructs, more extended periods of the lifespan, and greater emphasis on data sharing and collaboration (Hofer & Piccinin, 2010).
Finding Measure Similarity
Without some degree of measurement similarity, data cannot be integrated (Bauer & Hussong, 2009). Ideally, constructs are measured with the same measures and consistent procedures across multiple samples. Such an approach is increasingly used in biogenetic, economic, and medical research (e.g., Eldevik et al., 2010; Hallahan et al., 2011; Manichaikul et al., 2012; Olgiati et al., 2012; Salimi-Khorshidi et al., 2009; Serretti, Cusin, Rausch, Bondy, & Smeraldi, 2006). Similarly, the pharmaceutical community has established internationally accepted standards and procedures for harmonizing drug-related research, reducing duplication while promoting public health world-wide (International conference on Harmonisation, 2010). However, in archived social-behavioral studies, the same measures more often than not are unavailable. Longitudinal studies might have a handful of items that tap any particular construct. Even within a single study, measure equivalence can be a problem, as the same constructs may be measured differently across assessments, as newer measures are validated; as the underlying theories, the main goals of the study, and the investigators change over time; and as the cohort reaches different developmental milestones.
New methods have been developed to address intra- and inter-study measurement variance. Perhaps the most straightforward (though often not easy) method of creating similarity involves harmonization, in which variables are aligned or recoded to match across studies. For example, Cooper et al., (2011) harmonized physical capability measures across eight United Kingdom cohort studies, and tested age and gender interactions. Schaap et al. (2011) harmonized demographics, socioeconomic status (SES), chronic disease, anthropometry, physical performance, grip strength, pain, self-perceived health, and hospitalization items across five European cohort studies, with high quality overlap. Other variables were of questionable quality or could not be harmonized. Bath, Deeg, and Poppelaars (2010) harmonized data from two longitudinal cohort studies in the Netherlands and the United Kingdom. To align items, variables were transformed and recoded, mostly as dichotomous variables. Twenty-six variables could be harmonized. The authors noted numerous issues that necessarily must be considered in harmonization, including sample differences, the time interval from baseline to follow up, selective and non-selective attrition, and phrasing of questions and response categories.
Alternatively, statistical methods, such as latent factor analysis, item response theory, and non-linear factor analysis, now make it possible to establish equivalence and align items through latent approaches (Bauer & Hussong, 2009; Curran et al. 2008). For example, in latent factor analysis, potentially relevant items with broad distributional variance are chosen to represent this latent construct in each sample. In a multi-group analysis, factor loadings, mean values, and unique variances for the common items are then equated across the two samples to test for weak, strong, or strict invariance, respectively. In many cases, if factor loadings can be matched for at least half of the variables, partial invariance can be established (Reise, Widaman, & Pugh, 1993). As the number of invariant variables increases, the more confident one can be that the scales are measuring the same construct in the two samples. Stemming from testing in education, item response theory (IRT) was originally used with dichotomous items to establish similarities based on the difficulty and discriminating power of each item. More recent IRT formulations have incorporated generalized linear mixed effects modeling, allowing any outcome distribution within the exponential family for parametric models, as well as numerous non-parametric model variants (e.g., Rijmen, Tuerlinckx, De Boeck, & Kuppens, 2003). Non-linear factor analysis allows both categorical and continuous variables to be included and compared (Bauer & Hussong, 2009).
These techniques assume that a continuous construct exists that can be measured along a latent scale. A measure or item assessed in a particular sample is assumed to tap part of this underlying construct, and the goal is to empirically place the items along this ruler, providing a basis for comparing the samples directly. This concept of an underlying latent construct has been used extensively in educational testing services, such as the Graduate Record Exam (GRE). Computer adaptive testing (CAT) techniques create a large bank of items and align people across an underlying dimension, and then the exam exploits this supposed latent distribution so that different items across different samples are comparable.
A third sample can be used to bridge samples (Martin & Friedman, 2000, McArdle et al., 2009), essentially defining this underlying ruler. A new group of participants, matched as closely as possible on age, gender, and other key characteristics, completes the measures from each archival sample. Item invariance is established between each sample and this third sample. The third sample thus acts as a structural bridge between the two main samples. For example, Martin and Friedman (2000) had participants complete the NEO-PI-R and personality questions that were included in the original archival Terman study. Invariance across the archival and contemporary samples was established by confirming that items loaded on the same factors for both samples. In the contemporary sample, scales based on the archival items were highly correlated with four of the five NEO personality traits, thus aligning the archival items along the NEO factors and creating modern interpretable scales could be derived from these 50- to 70-year-old archival data. Such bridging of studies may be particularly important for aligning studies across the lifespan, using overlapping items and age or time periods as structural bridges.
To date, most studies in developmental psychology that consider links between personality and health have been conducted in independent samples. We suggest that more rapid progress can be made by creatively exploiting existing datasets, and such collaborative integration is needed to move the field from simple second generation personality-health models to complex third generation models (Friedman et al., this issue). Dynamic processes can be tested, providing a stronger theoretical foundation for effective interventions. Drawing on missing data designs, multi-level modeling, and Bayesian-based techniques, within-person trajectories and between-group variables can be combined, increasing power and precision (Riley, Simmons, & Look, 2007; Sutton, Kendrick, & Coupland, 2008). That is, by linking studies from different cohorts and developmental ages, using areas of overlap as structural bridges between studies, we can potentially piece together lifespan processes. Because we cannot randomly assign persons to personality traits or health conditions, some of the most informative research designs will involve longitudinal studies of widely varying samples.
In the current study, we applied this collaborative framework to two relatively large prospective studies, the Terman Life Cycle Study and the Hawaii Personality and Health Longitudinal Study, to examine the extent to which these samples could be directly integrated. We highlight both possibilities and potential pitfalls, and then suggest directions for future collaborative efforts.
The Current Study
One of the greatest benefits of attempting to integrate data across studies is the explicit testing of measure comparability. Whereas meta-analyses rely on standardized effects that are based on each investigator’s conceptualization of the constructs, integrative analyses require returning to the items themselves in order to theoretically and empirically establish equivalence. In the current study, we examined a model in which childhood personality traits predict midlife health (see Figure 1). The model we tested here was purposely simple and not meant to be a full model of personality and health. Our goal was to demonstrate the potential for and challenges to linking studies; by starting with a simple framework, we can then build more complex models, linking pieces of the personality-health puzzle together across the entire lifespan.
Figure 1.
Conceptual model. In this model, child personality traits predict adult health, partially mediated through educational attainment and health behaviors (for simplicity, we focus in this paper on alcohol abuse, but other behaviors could be included).
In health psychology, recent studies have increasingly used a five-factor model of personality traits as a framework for structuring and understanding personality-health relations (Smith & Williams, 1992). There are essentially two five-factor models, one stemming from early lexical studies (e.g., Goldberg, 1990, 1993) and the other stemming from later work by Costa and McCrae and captured by the NEO-PI-R personality questionnaire (Costa & McCrae, 1992). In the present study, we followed the lexical tradition, in which the five factors are typically labeled as (I) extraversion, (II) agreeableness, (III) conscientiousness, (IV) emotional stability, and (V) intellect/imagination.
For the purpose of this study, we defined health as a multi-dimensional construct that includes both physical and mental components. Multiple pathways link personality and health, including health behaviors, social relationships, situational selection, underlying genetic or biological differences, and physiological changes (Kern & Friedman, 2010). Characteristics of the sample may moderate personality, behavioral, and health relations. In this study, we examined health behaviors and educational attainment as mediators linking child personality and midlife health using two longitudinal samples.
The Terman Life Cycle Study was begun in 1921–22 by Lewis M. Terman. Participants were followed prospectively throughout their lives until death and were assessed on thousands of psychosocial variables, including child and adult personality traits, social relationships, health-related behaviors, and lifelong health status. The Hawaii Personality and Health Longitudinal Study was begun in 1959 by John M. Digman. Attempts to recontact participants began in 1998, and follow-up efforts continue today. Paralleling the Terman archive, there are data on child and adult personality traits, social relationships, health-related behaviors, and adult health status. Importantly, in both samples teachers assessed the child participants on personality traits, and forty years later participants completed various health and behavioral measures. The two studies have followed thousands of participants across many decades, have similar measures and structures relevant to personality and health theory, and together offer an example of the possibilities for integrating longitudinal studies. The Terman sample has complete lifespan data (childhood through mortality), whereas the Hawaii sample has more health-related measures, including medical/physiological measures at midlife.
Working within a single dataset involves a large commitment in terms of gathering the data and understanding the intricacies of the sample, which only becomes more complicated when more studies are added. Each dataset is unique and complicated, and we drew on each investigator’s unique expertise with the datasets to inform the models and analyses. Rather than a complete harmonization of variables assessed in the two samples, we aligned a minimal number of variables to specifically examine a theoretical model of personality and health across the two samples. We detail our harmonization process, testing the extent to which we can align the studies and then use unique aspects of each study (Hofer & Piccinin, 2010). We had three goals: (1) to determine whether equivalent measures of child personality traits could be established in the Terman and Hawaii samples; (2) to test a theoretical model of childhood personality, education, behavior, and adult health, using parallel analyses (in each sample individually) and integrative methods (directly pooling the data); and (3) to use similar items as a bridge and then extend analyses to unique aspects of each sample, linking child personality to mortality in the Terman sample and to physiological health measures in the Hawaii sample.
Method
Participants
Sample 1: The Terman Life Cycle Study
In the Terman sample, teachers across California were asked to identify the youngest and most intelligent students in their classes; those nominated were tested using the Stanford-Binet intelligence test and were included in the study if their IQ was 135 or greater. Other children were added through 1928, yielding a total sample of 1,528 participants (856 males [M], 672 females [F]). On average, participants were born in 1910 (SD = 3.7 years) and were 11 years old at the first assessment. Most came from a middle to upper-middle class background. Almost all were Caucasian, with about 51% English, Scottish, or Irish; 15.7% were German, 10.5% were Jewish (mostly Eastern European), and only a few Oriental, Mexican, African American, or other ancestries were represented (Terman et al., 1925). Follow-up efforts began in 1936, and by 1940 approximately 98 percent of the sample had been successfully contacted (Terman & Oden, 1947). Participants were followed prospectively throughout their lives, completing written assessments every five to ten years, with the last formal assessment in 1999. In addition, our research team has supplemented the extensive archival information by collecting death certificates, and we have refined and independently validated various psychosocial measures, including measures of child personality traits, social relationships, health-related behaviors, and health status (Friedman, Kern, & Reynolds, 2010). The Terman study is the longest longitudinal study with multiple repeated assessments that has ever been conducted.
The current study was limited to participants with data on child personality traits and midlife health information. Of the total sample, 252 participants (141 M, 111 F) were excluded because they were missing child personality ratings, and 191 participants (101 M, 90 F) were excluded because they were missing midlife health information, leaving a final sample of 1,085 individuals (614 M, 471 F). Those missing personality data were born later (t(1,526) = 10.23, p < .0001), were more likely to report illnesses at midlife (t(1,307) = 3.21, p = .001), and came from a slightly higher socioeconomic background (t(1,205) = 3.78, p = .0002). Participants missing health data completed fewer years of education than those with health data (t(1,499) = −9.88, p < .0001), had a slightly lower childhood IQ (t(1,526) = −2.33, p = .02), and were more likely to die or be lost to follow up at a younger age (t(1,526) = −23.87, p < .0001). These differences may narrow the relevant ranges somewhat and thus attenuate relations but are unlikely to introduce bias.
Sample 2: The Hawaii Personality and Health Longitudinal Study
In the Hawaii sample, elementary school teachers in grades 1, 2, 5, and 6 were recruited by John M. Digman and asked to describe the children in their classes on personality traits, which were later matched to the five-factor structure (Goldberg, 2001). Starting in 1998 (age 41–50), the now middle-aged members (age 50 to 60) are being located and follow-upped (Hampson et al., 2001). Participants complete assessments periodically, and are invited to attend a half-day session at a medical clinic where they complete a battery of physical, medical (including blood tests), personality, social, cognitive, and behavioral measures. Of the original Hawaii cohort, 2,320 were potentially available for recruitment, and 1,928 have been located (83%). Of those, only 45 refused continued participation. The sample includes a mix of socioeconomic backgrounds. The adult sample is composed of 37% Japanese Americans, 21% Native Hawaiian or part-Hawaiian Americans, 18% of European ancestry, and 24% of Filipino, Chinese, Okinawan, Korean, or other ancestry.
The initial cohort included six subsamples based on school grade and location in Hawaii (see Goldberg, 2001 for details); for the present study, the subsamples were combined into a single sample. The current study was limited to participants with child personality and midlife health data. Of the initial cohort, 2,221 participants (1,176 M, 1,045 F) had childhood personality data. Of those, 1,051 participants (559 M, 492 F) were excluded because they were missing midlife health data, leaving a final sample of 1,170 participants (617 M, 553 F). Those excluded at midlife were described by their teachers as less conscientious (t(2,219) = −4.21, p < .0001) and less intellectual/imaginative (t(2,219) = −2.95, p = .003) as children.
Measures
We selected items that fit within the model noted in Figure 1, with child personality traits as the main predictors, educational attainment and alcohol abuse as potential mediators, and indicators of adult health status, illness, and mental adjustment as the main outcomes. In addition, we included physiological measures (in the Hawaii sample) and mortality information (in the Terman sample) as outcomes unique to each sample.
Child personality
In the Terman sample, as part of the initial 1922 assessment (average age 11) parents and teachers were asked to rate their child, compared to average children of the same age, on 25 traits. Traits were assessed on a 13-point scale, ranging from very low levels of the trait to very high levels. In our original study (Friedman et al., 1993), six personality factors were created using the average of parent and teacher assessments: cheerfulness, conscientiousness, energy, motivation/self esteem, sociability, and permanency of moods. In the Hawaii sample, teachers (grades 1, 2, 5, or 6) were presented with a set of 39 to 63 traits (depending on the subsample), and they were given the names of all children in their class on pieces of cardboard. Similar to a Q-sort, teachers sorted the children into a nine-step quasi-normal distribution for each trait. A five-factor structure was recovered, creating measures of extraversion, agreeableness, conscientiousness, emotional stability, and intellect/imagination (Goldberg, 2001). In addition, 11 middle-level clusters of traits, such as activity level, sociability, and perseverance, were developed. To establish comparable measures across the two samples, we returned to the original 25 trait ratings by teachers in the Terman sample, and the 39 traits common across the subsamples in the Hawaii sample (see Appendix for traits and descriptions, and Results for harmonization procedure).
Educational attainment
In the Terman sample, at each assessment participants indicated the highest level of education achieved and any additional schooling completed during each interlude. In our prior studies, a continuous total educational attainment score was constructed in terms of years of schooling. In the Hawaii sample, at the first adult assessment (1999) participants indicated the highest grade or year of school completed on a nine-point scale. To harmonize an index of educational level, the more detailed information in the Terman sample was reduced to the Hawaii scale.1 Thus, in both samples education was coded as 1 = 8th grade or less, 2 = completed junior high school, 3 = some high school, 4 = high school graduate/GED, 5 = some technical school, 6 = technical/nursing school graduate, 7 = some college or community college, 8 = college graduate, and 9 = postgraduate or professional degree. Educational information was available for all 1,085 participants in the Terman sample and 1,136 participants (601 M, 535 F) in the Hawaii sample.
Alcohol abuse
As a marker of unhealthy behavior, we selected alcohol abuse.2 In the Terman sample, the 1950 assessment (average age 40) asked participants to indicate their typical alcohol use on a four-point scale (1 = none or rare, 2 = moderate, 3 = fairly heavy, 4 = problem drinking). In the Hawaii sample, considerably more information was available. To harmonize an index of alcohol abuse across the two samples, the more detailed information in the Hawaii sample was reduced to the Terman scale. At the first adult assessment (average age 45), participants indicated how often in the past month they drank any alcoholic beverages and how many drinks they had when they drank. Based on this information, participants were coded as 1 (none or very rarely consumed alcohol in the past month), 2 (moderate; some drinking in the past month, but no binge drinking, defined as more than two drinks at a time), or 3 (fairly heavy; periods of drinking three or more drinks at a time). In addition, several questions in the first and third assessments asked about problem drinking, with questions about whether the participant felt any concern about the amount they drank or had experienced problems stemming from alcohol use. If participants were rated as fairly heavy drinkers and reported problems or concerns, then they were classified as 4 (problem drinking).3 Alcohol data were available for all 1,085 participants in the Terman sample and 1,014 participants (540 M, 474 F) in the Hawaii sample.
Adult health
Several measures were selected from each sample to assess physical and mental health when the participants were about 50 years old.
Self-rated health
As is typically reported in most studies of health, both samples indicated on a five-point scale their general health in comparison with others of the same age and gender (1 = poor, 2 = fair, 3 = good, 4 = very good, 5 = excellent). Self-rated health data were available for all participants in both samples.
Illness
In the Terman sample, each assessment asked participants to report any major illnesses experienced since the last assessment. Reports of heart disease, stroke, cancer, breathing problems, major infections, injury, and other major illnesses through the 1960 assessment were summed (0 = none, 4 = four or more reported). In the Hawaii sample, the first adult assessment asked participants to report whether they had ever been treated for heart attacks, heart disease, stroke, cancer, diabetes, thyroid disease, high blood pressure, high cholesterol, migraines, chronic fatigue syndrome, or any other major illnesses. Conditions were summed (0 = none, 4 = four or more reported; the variable was reversed to create the composite health factor). Illness information was available for all 1,085 participants in the Terman sample and 1,166 participants (614 M, 552 F) in the Hawaii sample.
Mental adjustment
In the Terman sample, the adult questionnaires asked the Terman participants whether they had experienced any tendency toward nervousness, worry, special anxieties, or nervous breakdown in recent years, and if so, the “nature of such difficulties,” how the difficulties had been handled, and the status of their present condition. On the basis of this information and considered in the light of the total case history (i.e., all information available including personal correspondence with participants and their families), Terman and his associates classified each participant into one of three categories: (a) serious maladjustment, (b) some maladjustment, or (c) satisfactory adjustment (Terman & Oden, 1959). Individuals in the first category were those who showed marked signs of anxiety, depression, personality maladjustment, psychopathic personality problems, or suffered a nervous breakdown. The second category contained individuals who experienced feelings of inferiority or inadequacy, anxiety, or emotional conflict, but were still able to function. The third category contained those who were able to cope normally with everyday problems and who were judged to be essentially typical in terms of their emotional make-up.
To approximate a similar measure in the Hawaii sample, variables that related to mental health from the five adult questionnaires were compiled. Questions included emotional problems over the past four weeks that limited functioning, depression (based on the Center for Epidemiological Studies Depression scale), ratings on emotional health as compared to other individuals, reports of psychosomatic problems, and signs of post-traumatic stress disorder. Five trained research assistants rated each participant as 1 = seriously maladjusted, 2 = some maladjustment, or 3 = satisfactory adjustment. Inter-rater reliability (complete agreement across all five raters) was r = .96, and discrepant cases were determined through discussion. Mental adjustment data were available for 1,084 participants (613 M, 471 F) in the Terman sample and 1,169 participants (617 M, 552 F) in the Hawaii sample.
Measures unique to each sample
The Terman sample is unique in that we have now collected lifespan mortality data (death certificates) for 90% of the participants, but detailed physiological information is not available. The Hawaii sample does not include mortality information, but has collected physiological data on a growing subsample. If the two studies can be linked, we can potentially extend analyses to unique aspects of the samples, essentially allowing one study to fill in missing pieces from the other study. We examined this possibility by including health measures unique to each sample.
In our prior studies (Friedman et al., 1993; 1995), higher levels of childhood conscientiousness were related to a lower risk of dying at any given age. The conscientiousness-mortality relation was partially mediated by unhealthy behaviors. In the present study, age of death information was available for 984 participants (791 M, 572 F). The remaining 101 participants were censored at the last age that we heard from them. Some are known to still be alive.
In the Hawaii study, global health status at midlife (age 50) was evaluated by a composite variable of physiological dysregulation comprising multiple widely used biomarkers of the cardiovascular and metabolic systems indicative of allostatic load (Seeman, McEwen, Rowe, & Singer, 2001). Previously, Hampson, Goldberg, Vogt, Hillier, and Dubanoski (2009) demonstrated that this measure is related to lower self-rated health, depressive symptoms, and several health behaviors. Physiological dysregulation scores were derived from 10 clinically assessed biomarkers (systolic and diastolic blood pressure, total cholesterol, triglycerides, total-to-HDL cholesterol, urinary protein, fasting blood glucose, waist-to-hip ratio, body mass index, and use of blood pressure or cholesterol lowering medications). Using this information, we standardized within gender and summed the markers to create dysregulation scores. Higher scores indicate more physiological disregulation. Physiological data were available for 619 participants (305 M, 314 F).
Data Analyses
The primary challenge was to harmonize measures across the two studies. For child personality, we conceptually aligned items in each sample to the five-factor model, iterating between theoretical and empirical consideration. Traits were then standardized and averaged to create composite personality factors. Health, education, and alcohol variables were recoded to align response categories across the two samples, as noted under Measures. A composite health variable was created by standardizing and summing self-rated health, mental adjustment, and illness reports (reversed).
Next, we conducted parallel analyses in the two samples. Using hierarchical linear regression in SAS (version 9.2) software, the composite personality factors were regressed on the health composite variable separately in each sample, controlling for age and sex. Education and alcohol use were then added to the models as mediators. To allow for missing values on some items and to more appropriately address mediation (MacKinnon et al., 2002), the full model was estimated as a structural equation model (SEM) in R (version 2.15.0), using the lavaan package (Rosseel, 2012), with latent variables used for the personality and health variables. Using the composite harmonized variables, we then directly pooled the samples, and the SEM model was run as a multi-group analysis.4 By directly pooling data and allowing paths to differ by group, we can examine group heterogeneity directly.
Finally, using the aligned personality, education, and alcohol use variables, we extended analyses to the unique aspects of each sample, with personality predicting physiological dysregulation in the Hawaii sample using linear regression, and personality predicting mortality risk in the Terman sample, using Cox regression (survival analysis). Analyses were conducted in SAS, using the composite personality and health variables.
All analyses controlled for age and gender. To assess model fit, we relied primarily upon the RMSEA (root mean square error of approximation), a population-based index that is not affected by sample size when the sample size is at least 200 (Curran, Bollen, Chen, Paxton, & Kirby, 2003) and which includes confidence intervals (95% CI). A RMSEA of less than .05 is considered a close fit, .05 to .08 is considered acceptable, and .08 to .10 is considered mediocre (Loehlin, 2004). We also examined the Tucker Lewis Index (TLI), a population based index in which the best-fitting model is closest to 1.00. For model estimation, the lavaan package allows missing values based on a missing at random assumption using the missing = “ml” option in the fit statement. 5
Results
Establishing Comparable Child Personality Factors
The first part of the analyses aimed to establish comparable measures of child personality traits across the two cohorts. We considered the teacher assessments on 25 traits in the Terman sample and teacher assessments on 39 traits consistent across all subsamples in the Hawaii sample. Using the Big-Five model as a theoretical base, we identified traits relevant to each of the five constructs and then used factor analysis and intertrait correlations to create more pure measures of the factors. Then, in a final step we tested the measurement model in each sample to recover the five-factor structure and to confirm similar relations across the two samples.
First, the trait names and descriptions for each sample were matched across samples. Five trained raters indicated whether each of the 39 Hawaii traits were completely comparable, mostly comparable, somewhat comparable, or not at all comparable to each of the Terman traits, based on the descriptions provided (see Appendix for traits and descriptions). For example, energetic in the Hawaii sample was rated as completely comparable to amount of physical energy in the Terman sample. Twenty-one traits from the Terman sample and 26 traits from the Hawaii sample were identified as being at least somewhat comparable.
Next, working with one sample at a time, we factor analyzed the 21 and 26 traits (for the Terman and Hawaii samples respectively) using oblique rotations. Because the traits were all measured by teacher assessments, we expected a general evaluative “halo” effect, such that factors were expected to be interrelated. A five-factor solution was recovered in each sample, although in the Terman sample, only one item tapped emotional stability (permanency of moods).
Iterating between theoretical consideration of the traits and empirical item interrelations (intertrait correlations and coefficient alpha reliability estimates), we examined each empirically derived factor, first in terms of the initial placements by the trained raters, and later in terms of the scientific literature on the Big-Five factor representation (e.g., Goldberg, 1990, 2001). A few traits identified by the raters were not considered pure indicators of the factors. Specifically, in the Hawaii sample, restless and fidgets were identified by the initial raters as part of extraversion, but both personality theory and the correlations between these traits and the main factors suggest that these traits are a blend of low conscientiousness, high extraversion, and high neuroticism, rather than pure measures of extraversion. In the Terman sample, appreciation of beauty, sensitivity to approval, and freedom from vanity were identified as part of agreeableness, but because they do not fit within our usual conceptions of the factor, these traits were not included in the final factors. In addition, a few traits that past studies have suggested should be included in the factors were not initially identified by the raters. We examined the extent to which the initially excluded traits correlated with each factor. In the Hawaii sample fearful (reversed) was added to emotional stability, and seclusive (reversed) was added to extraversion.6 The final traits that were included in each factor and their reliabilities are summarized in Table 1.
Table 1.
Final child personality factors, with the traits that were included in each sample
| Factor | Terman Traits | α | Hawaii Traits | α |
|---|---|---|---|---|
| Extraversion | Cheerfulness | .78 | Assertive | .84 |
| Fondness for large groups | Energetic | |||
| Leadership | Gregarious | |||
| Physical energy | Socially confident | |||
| Popularity with other children | Lethargic (r) | |||
| Seclusive (r) | ||||
| Submissive (r) | ||||
|
| ||||
| Agreeableness | Generous | .74 | Considerate | .83 |
| Sympathetic | Self-minimizing | |||
| Rude (r) | ||||
| Spiteful (r) | ||||
|
| ||||
| Conscientiousness | Conscientiousness | .83 | Careful of personal belongings | .90 |
| Desire to excel | Careless of others’ property | |||
| Prudence/forethought | Conscientious | |||
| Truthfulness | Persevering | |||
| Will power and perseverance | Planful | |||
| Fickle (r) | ||||
| Irresponsible (r) | ||||
|
| ||||
| Emotional Stability | Permanency of moods | Single item | Concerned about acceptance (r) | .72 |
| Jealous (r) | ||||
| Touchy (r) | ||||
|
| ||||
| Intellect | Common sense | .78 | Curious | .81 |
| Intellect | Imaginative | |||
| Knowledgeable | Original | |||
| Originality | Verbal | |||
Note. See Appendix for trait descriptions. (r) indicates reverse-scored trait. α = coefficient alpha. Child personality was measured in 1922 (average age 11) in the Terman sample and between 1959 and 1967 (grades 1, 2, 5, or 6) in the Hawaii sample.
We then used confirmatory factor analysis in each sample to evaluate the final personality measurement models. A five-factor model fit best in each sample 7 (Terman sample: TLI = .843, RMSEA = .086 [95% CI = .081, .091]; Hawaii sample: TLI = .813, RMSEA = .092 [.089, .095]). Figures 2 and 3 present the final measurement models with standardized factor loadings for the Terman and Hawaii samples respectively.
Figure 2.
Final child personality (average age 10) measurement model in the Terman Sample. See Appendix for traits and descriptions. Standardized path estimates are presented. Emotional stability was evaluated as a single observed variable. Model was estimated in R (version 2.15.0, package lavaan).
Figure 3.
Final child personality (grades 1, 2, 5, or 6) measurement model in the Hawaii sample. See Appendix for traits and descriptions. * indicates reversed-scored items. Standardized path estimates are presented. Model was estimated in R (version 2.15.0, package lavaan).
Finally, we computed correlations between our new factors and the personality variables used in our prior studies (Table 2). In our prior work with each sample, the personality measures have been used elsewhere to predict numerous adult outcomes, supporting their validity. The measures used here were slightly altered: in the Terman sample, the original measures included parent reports, and here we only included teacher reports; in the Hawaii sample, slightly different traits for each factor were included to best align with the Terman items. Thus, the factors used here are similar to brief scales, and it is helpful to confirm that they correlate with our other measures to support their validity. In the Terman sample, conscientiousness and permanency of moods correlated .74 and .76 with the new conscientiousness and emotional stability factors.8 Sociability and motivation related respectively to the new extraversion and intellect factors. In the Hawaii sample, correlations with the prior five factors ranged from .70 for emotional stability to .86 for intellect.
Table 2.
Correlations between the new Big-Five personality factors and prior measures of personality traits in each sample.
| E | A | C | S | I | |
|---|---|---|---|---|---|
|
Terman Sample
| |||||
| Sociability | 0.69 | 0.26 | 0.17 | 0.14 | 0.18 |
| Cheerfulness | 0.53 | 0.31 | 0.18 | 0.27 | 0.36 |
| Conscientiousness | 0.16 | 0.47 | 0.74 | 0.33 | 0.39 |
| Permanency of mood | 0.26 | 0.25 | 0.29 | 0.76 | 0.23 |
| High motivation | 0.32 | 0.23 | 0.55 | 0.19 | 0.62 |
| Energetic | 0.42 | 0.09 | 0.06 | 0.11 | 0.10 |
|
| |||||
| Hawaii Sample | |||||
| Extraversion | 0.80 | −0.28 | −0.10 | −0.09 | 0.33 |
| Agreeableness | −0.32 | 0.74 | 0.21 | 0.53 | −0.11 |
| Conscientiousness | 0.04 | 0.43 | 0.83 | 0.16 | 0.07 |
| Emotional Stability | 0.02 | 0.26 | 0.22 | 0.70 | 0.04 |
| Intellect | 0.42 | −0.04 | 0.24 | 0.03 | 0.86 |
| Activity level | 0.81 | −0.24 | 0.10 | −0.06 | 0.52 |
| Sociability | 0.81 | −0.28 | −0.06 | −0.11 | 0.42 |
| Self assertion | 0.81 | −0.71 | −0.24 | −0.42 | 0.54 |
| Resilience | 0.54 | 0.21 | 0.49 | 0.39 | 0.61 |
| Antagonistic | 0.39 | −0.94 | −0.58 | −0.65 | 0.13 |
| Impulsivity | 0.37 | −0.73 | −0.70 | −0.59 | 0.10 |
| Mannerliness | −0.03 | 0.76 | 0.78 | 0.53 | 0.20 |
| Carefulness | −0.04 | 0.52 | 0.83 | 0.35 | 0.13 |
| Perseverance | 0.07 | 0.53 | 0.96 | 0.44 | 0.31 |
| Insecurity | 0.23 | −0.68 | −0.50 | −0.97 | 0.03 |
| Imagination | 0.58 | −0.10 | 0.29 | 0.04 | 0.97 |
Note. See Goldberg, 2001 and Friedman et al., 1993 for details on prior measures. Correlations > .50 are bolded. E = extraversion, A = agreeableness, C = conscientiousness, S = emotional stability, I = intellect. Child personality was measured in 1922 (average age 11) in the Terman sample and between 1959 and 1967 (grades 1, 2, 5, or 6) in the Hawaii sample.
In sum, using the original traits assessed by teachers in each sample, we rationally harmonized the traits and then empirically established similar factor structures. The pattern of correlations suggests that the new variables are tapping similar constructs to our prior studies, with the additional benefit of capturing a similar five-factor structure in the two samples. Through this intensive process, we can be more confident that we are comparing similar constructs across the two studies.
Integrating the Samples
The second part of the analyses aimed to: (a) test the personality-health model (see Figure 1) in each sample individually, and (b) test the personality-health model using the combined sample, pooled on the harmonized composite variables.
First, we tested the personality-health model individually in each sample, initially excluding and then including the education and alcohol abuse variables, controlling for age and sex. In the Terman sample, higher conscientiousness and lower agreeableness predicted higher educational attainment (conscientiousness: β = .22, p = .003; agreeableness: β = −.16, p = .02), and higher levels of extraversion predicted increased alcohol use (β = .22, p = .001), but personality, education, and alcohol use did not significantly predict midlife health.9 In the Hawaii sample, higher conscientiousness, lower extraversion, and high intellect predicted greater educational attainment (conscientiousness: β = .16, p =.04; extraversion: β = −.19, p = .04; intellect: β = .28, p <.001), higher extraversion and lower intellect predicted increased alcohol use (extraversion: β = .25, p = .009; intellect: β = −.15, p = .04), and education predicted better health (β = .28, p < .001).
To more directly compare model differences across samples, we directly pooled the data from the two samples on the harmonized composite items, creating a single larger dataset (N = 2255, 1231 males, 1024 females). Descriptive statistics and inter-item correlations for the Terman sample, Hawaii sample and combined Terman-Hawaii sample are summarized in Table 3. The SEM model, using the composite personality factors as observed predictors, was estimated. Allowing loadings and pathways to vary by group fit significantly better than constraining pathways and loadings across the groups (free model: RMSEA = .039 [.032, .047]; constrained model: RMSEA = .117 [.112, .123]; Δχ2(29) = 1200, p < .001). Figure 4 presents the final models by group, with the significant standardized pathways provided.
Table 3.
Descriptive statistics and inter-variable correlations in the Terman sample, the Hawaii sample, and the combined Terman-Hawaii sample for the full sample and separately by sex
| Variable | Mean (SD) | E | A | C | S | I |
|---|---|---|---|---|---|---|
| Terman Sample (N = 1,085) | ||||||
| Sex1 | 614 M, 471 F | .16*** | .09** | .15*** | −.01 | −.04 |
| Year of birth | 1910 (3.10) | .01 | −.07* | −.12*** | −.02 | −.02 |
| Males | 1910 (3.16) | −.08* | −.12*** | −.14*** | −.03 | −.05 |
| Females | 1910 (2.98) | .09* | −.02 | −.13*** | .000 | .03 |
| Health composite2 | 9.20 (2.09) | .02 | .02 | .02 | .06* | −.02 |
| Males | 9.42 (1.98) | .08* | .03 | .04 | .10* | −.02 |
| Females | 8.92 (2.19) | −.01 | .03 | .03 | .009 | −.03 |
| Self-rated health | 4.22 (0.72) | .03 | −.004 | .03 | .05 | .001 |
| Males | 4.29 (0.70) | .08* | .01 | .05 | .10* | .004 |
| Females | 4.13 (0.73) | .01 | −.004 | .03 | −.02 | −.01 |
| Illnesses | 0.89 (0.88) | .03 | .02 | .06 | .02 | .02 |
| Males | 0.80 (0.83) | −.02 | .03 | .04 | .001 | .01 |
| Females | 1.00 (0.94) | .05 | −.02 | .03 | .05 | .05 |
| Mental adjustment | 2.57 (0.64) | .04 | .06 | .07* | .10** | −.02 |
| Males | 2.59 (0.64) | .06 | .07 | .06 | .11** | −.04 |
| Females | 2.55 (0.64) | .02 | .05 | .09* | .09 | .000 |
| Education | 7.69 (1.44) | .01 | −.003 | .11*** | .06 | .07* |
| Males | 7.80 (1.47) | .05 | .05 | .12** | .05 | .06 |
| Females | 7.55 (1.40) | −.02 | −.05 | .12** | .07 | .08 |
| Alcohol abuse | 1.84 (0.61) | .08** | −.05 | −.11*** | −.02 | −.04 |
| Males | 1.95 (0.62) | .12** | .003 | −.07 | −.003 | −.03 |
| Females | 1.71 (0.58) | .10* | −.07 | −.11* | −.05 | −.06 |
|
| ||||||
| Hawaii Sample (N = 1,170) | ||||||
| Sex1 | 617 M, 553 F | −.05 | .21*** | .24*** | .01 | −.07* |
| Year of birth | 1955 (1.61) | .01 | −.01 | −.02 | .02 | −.001 |
| Males | 1955 (1.60) | .01 | −.001 | −.04 | .04 | .02 |
| Females | 1955 (1.62) | .02 | −.04 | −.01 | −.001 | −.02 |
| Health composite2 | 9.66 (2.16) | .04 | .05 | .10*** | .04 | .05 |
| Males | 9.72 (2.02) | .02 | −.003 | .07 | −.01 | .02 |
| Females | 9.59 (2.30) | .05 | .12** | .16*** | .07 | .09* |
| Self-rated health | 3.38 (0.97) | .07* | .04 | .10*** | .03 | .08** |
| Males | 3.34 (0.95) | .02 | .001 | .06 | .01 | .01 |
| Females | 3.43 (0.99) | .12** | .07 | .14** | .05 | .17*** |
| Illnesses | 0.83 (1.00) | −.02 | −.03 | −.07* | −.03 | −.01 |
| Males | 0.82 (0.98) | −.02 | .002 | −.04 | .02 | −.003 |
| Females | 0.84 (1.02) | −.02 | −.07 | −.12** | −.07 | −.02 |
| Mental adjustment | 2.73 (0.57) | −.002 | .03 | .05 | .02 | .03 |
| Males | 2.78 (0.51) | .01 | −.01 | .05 | .001 | .02 |
| Females | 2.67 (0.63) | −.02 | .11* | .10* | .05 | .02 |
| Education | 6.56 (1.89) | .04 | .13*** | .23*** | .10*** | .18*** |
| Males | 6.43 (1.93) | .05 | .13** | .23*** | .10* | .23*** |
| Females | 6.71 (1.83) | .05 | .10* | .20*** | .10* | .13** |
| Alcohol abuse | 2.18 (1.12) | .06* | −.06 | −.09** | .01 | .01 |
| Males | 2.44 (1.17) | .03 | .02 | .01 | .01 | −.03 |
| Females | 1.89 (0.98) | .08 | −.04 | −.10* | .04 | .03 |
|
| ||||||
| Combined Sample (N = 2,255) | ||||||
| Sex1 | 1231 M, 1024 F | .05* | .15*** | .20*** | .003 | −.06** |
| Year of birth | 1934 (22.81) | .001 | −.01 | −.01 | −.000 | −.002 |
| Males | 1933 (22.97) | .09** | −.07* | −.06 | −.01 | .01 |
| Females | 1935 (22.58) | .11*** | .06 | .03 | .01 | −.02 |
| Health composite2 | 9.44 (2.13) | .03 | .03 | .06** | .05* | .02 |
| Males | 9.57 (2.00) | .06 | .01 | .05 | .05 | .000 |
| Females | 9.28 (2.27) | .004 | .08** | .11*** | .05 | .03 |
| Self-rated health | 3.78 (0.95) | .05* | .02 | .06** | .03 | .04* |
| Males | 3.81 (0.96) | −.01 | .04 | .07* | .04 | −.001 |
| Females | 3.75 (0.94) | .11*** | .02 | .07* | .02 | .10** |
| Illnesses | 0.86 (0.95) | .004 | −.01 | −.01 | −.01 | .005 |
| Males | 0.81 (0.91) | −.02 | .02 | −.002 | .01 | .003 |
| Females | 0.91 (0.99) | .02 | −.05 | −.05 | −.02 | .01 |
| Mental adjustment | 2.65 (0.61) | .02 | .04* | .06** | .06** | .003 |
| Males | 2.69 (0.59) | .05 | .02 | .05 | .06* | −.01 |
| Females | 2.61 (0.63) | −.02 | .08** | .10** | .07* | .01 |
| Education | 7.12 (1.78) | .03 | .07** | .17*** | .08*** | .13*** |
| Males | 7.12 (1.85) | .01 | .11*** | .18*** | .07* | .14*** |
| Females | 7.11 (1.69) | .05 | .02 | .15*** | .08* | .11*** |
| Alcohol abuse | 2.01 (0.91) | .06** | −.05* | −.09*** | .001 | −.01 |
| Males | 2.18 (0.95) | .08** | −.01 | −.03 | −.001 | −.02 |
| Females | 1.80 (0.81) | .07* | −.04 | −.09** | .01 | −.01 |
Note. Child personality was measured in 1922 (average age 11) in the Terman sample and between 1959 and 1967 (grades 1, 2, 5, or 6) in the Hawaii sample. Adult variables were measured in the 1950 and 1960 assessments (age 40–50) in the Terman sample and age 41 to 50 (average 45) in the Hawaii sample. Pearson correlations are presented. E = extraversion, A = agreeableness, C = conscientiousness, S = emotional stability, I = intellect.
Ns given for males (M) and females (F). 0 = male, 1 = female.
The composite health variable is the sum of self-rated health, illness (reversed), and mental adjustment.
p < .05;
p < .01;
p < .001
Figure 4.
Final estimated SEM model for the combined sample (N = 2,255, nTerman = 1085, nHawaii = 1170), with the child (age 10–11) personality composite factor variables predicting midlife (~ age 50) health via education and alcohol abuse (~ age 40). Standardized estimates are given for the significant pathways, shown as solid lines (dotted lines were non-significant). The model was estimated as a multi-group model in R (version 2.15.0, package lavaan), with paths and loadings unconstrained across groups, allowing a direct comparison of sample heterogeneity.
Part 3: Extending the Model to Unique Elements
A promising possibility for integrative methods is that by carefully linking multiple datasets through common variables, we can use these variables to bridge the studies and then extend the analyses to unique aspects of each sample, effectively piecing together lifespan pathways. We used the now-parallel child personality factors to examine mortality risk in the Terman sample and markers of physiological dysfunction in the Hawaii sample. These analyses extended our prior studies with each independent sample (Friedman et al., 1993; Hampson et al., 2009) by using the newly linked personality factors.
The individual traits were standardized and summed to create composite personality factors. In the Terman sample, the five factors were used to predict mortality risk, using Cox proportional hazards regression analysis (survival analysis). Personality variables were included, and then education and alcohol abuse were added to the models. All analyses controlled for age and sex. Replicating our prior studies, the conscientiousness factor predicted lower mortality risk (hazard ratio [HR] = .90 [95% CI = .85, .95], p < .0001). When education and alcohol abuse were added to the model, the effects of conscientiousness were slightly decreased but still significant (HR = .91 [.87, .96]), education predicted lower risk (HR = .90 [.86, .94], p < .0001), and alcohol abuse predicted increased risk (HR = 1.32 [1.18, 1.49], p < .0001).
In the Hawaii sample, the five factors were used to predict physiological dysregulation, using standard multiple regression analyses. Personality variables were included, and then education and alcohol abuse were added to the models. The conscientiousness factor was related to less dysregulation (β = −.12, t(617) = −2.03, p = .04). When education and alcohol were added to the models, more education was related to less dysregulation (β = −.22, t(543) = −4.99, p < .001), alcohol use related to less dysregulation (β = −.13, t(543) = −2.87, p = .004), and the conscientiousness factor was no longer significantly related to dysregulation.
Discussion
In this study, we took advantage of integrative data analytic techniques to combine the data on personality and health from two longitudinal samples. The Terman Life Cycle Study is the longest study that has repeatedly followed individuals throughout their lives from childhood through death. The Hawaii Personality and Health Longitudinal Study includes an ethnically diverse cohort whose participants are being followed from childhood into late adulthood. Previously, individual studies with these samples found that child personality traits, especially conscientiousness, predicted measures of health later in life (self-rated health in the Hawaii sample; Hampson, Goldberg, Vogt, & Dubanoski, 2006, 2007 and longevity in the Terman sample; Friedman et al., 1993, 1995). In this new synthesis of these two studies, child conscientiousness predicted broadly construed adult health, and this effect was partially mediated by alcohol abuse and education.
The greatest difference between studies was that the education pathway was particularly salient for the Hawaii sample. This is just as it should be. The Terman study purposely included only the brightest children in California, so the sample is restricted in both intellectual level and concomitantly in educational attainment. In contrast, the Hawaii sample included every child in each classroom studied, and therefore it includes the full range of scholastic talent and concomitantly in educational attainment. Not surprisingly, intellect predicted educational attainment in the Hawaii sample but was not related to attainment in the Terman sample. Education is an enormously important factor in health given its relation to socioeconomic status and all the health benefits that flow from one’s relative position in the social hierarchy and its associated influences. Intellectually oriented individuals are more likely to pursue educational opportunities, but education itself matters, and these points jump out from an integrative, cross-study perspective.
This study suggests that combining data is both possible and will be fruitful in extending theories of development, personality, and health across the lifespan. But it also highlights some of the limitations and issues that must be considered, involving characteristics of the sample, measures, and analyses (Bauer & Hussong, 2009; Curran & Hussong, 2009; Hofer & Piccinin, 2009). Sample characteristics include (a) the representativeness of the samples; (b) birth cohort, major historical events, and the socioeconomic context of the time; (c) socio-economic status, racial, ethnic, and educational differences; and (d) attrition, mortality, and other possible selection effects. Measurement characteristics include (a) the constructs and measures used; (b) changes in constructs or measures at different periods in life; (c) how often the participants are assessed and the interval between assessments; (d) retest effects; and (e) how time is handled in the analyses. Perhaps the greatest benefit of attempting to pool data is that one must intentionally examine measure invariance. Only by establishing conceptual and metric equivalency can one really determine what differences may be due to actual processes as compared to characteristics of the sample or measures. Such differences are often glossed over and ignored, potentially leading to a science built on invalid assumptions and conclusions. By treating heterogeneity as interesting limits on generalizability rather than as problematic noise, both similarities and differences across samples become important.
Both of the samples are limited by the measures initially included. When the teachers assessed the personality traits of the school children, the present conception of the Big-Five factor model did not exist. Health status, mental adjustment, and health-related behaviors were assessed with different questions and at different time points. The present study starts to address generalizability across two samples, but the extent to which our findings generalize to other samples is unknown, especially when other social, cultural, and historical variables are relevant. The two samples are to some extent selective. Still, our process suggests that harmonization is indeed possible in longitudinal studies, and important conclusions emerged across the cohorts and the decades concerning conscientiousness and health.
By finding regions of overlap at both conceptual and measurement levels, relations that are unique to each sample can then be used to fill in missing pieces in lifespan models of personality and health. The Hawaii sample does not yet have much data concerning mortality and cause of death, but it has much more midlife physiological health information (including blood tests), whereas the Terman sample lacks detailed physiological measures, but we have gathered extensive information about length of life and cause of death. The two studies, taken alone and then together, confirm the importance of conscientiousness to later health and show the relevance of education and alcohol abuse. Thus, for the first time, we have evidence from a full lifespan analysis for the importance of child conscientiousness to adult health and longevity. Creatively using the resources that are available, we can address lifespan questions that are impossible in shorter-term studies, offering the means necessary to accumulate sound scientific knowledge about the interplay of personality and health.
Supplementary Material
Acknowledgments
This research was supported by grant AG020048 from the National Institute on Aging.
Appendix. Trait Labels and Descriptions
| Terman Sample | |
|---|---|
| Trait | Description |
| Amount of physical energy | High = Extraordinary amount of energy, pep, and animation. Dynamic & tireless. Low = Extreme physical inertia & lack of pep. Sluggish and easily fatigued. |
| Appreciation of beauty | High = Extraordinary appreciation of beautiful colors, landscapes, forms, sunsets, flowers, etc. Has natural “taste.” Low = practically no appreciation of beauty in things seen. No “taste.” |
| Cheerfulness & optimism | High = Extraordinarily cheerful and optimistic. Never sees dark side. Never worries. Low = Usually extremely depressed and pessimistic. Looks on dark side of everything. Worries constantly. |
| Common sense | High = Possesses common sense and judgment to an extraordinary degree. His advice always highly valuable. Low = extreme lack of common sense and judgment. Opinions not taken seriously by anyone. |
| Conscientiousness | High = Extraordinarily conscientious. Keen sense of duty. Does right for right’s sake. Always dependable. Low = Extreme lack of conscientiousness. No sense of duty. Does wrong for any advantage. Not dependable. |
| Desire to excel | High = Extraordinary pride in accomplishment and desire to excel. Does his upmost to stand first. Low = No pride in accomplishment. No ambition to excel. Almost never does his best. |
| Desire to know | High = Extraordinarily strong intellectual curiosity, and broad interests. Insistent on knowing. Low = Extreme lack of intellectual curiosity. Mentally inert. Few interests. Rarely asks questions. |
| Fondness for large groups | High = Extraordinary fondness for large groups. Unhappy alone. Devoted to parties, picnics, etc. Low = invariably avoids groups. Always prefers to be either alone or with one or two close chums. |
| Freedom from vanity and egotism | High = Extraordinarily free from egotism or vanity. Shrinks from praise and admiration. Low = Extremely egotistical and vain. “Fishes” for praise. Always showing off. |
| General intelligence | High = Extraordinary all-around intelligence. Low = general intelligence extremely inferior. Almost feeble-minded. |
| Generosity and unselfishness | High = Extraordinarily generous, unselfish, and fair-minded. Low = Extremely selfish. Cares only for own pleasures. Takes unfair advantage. |
| Health | High = Extraordinary good health, almost never sick, vigorous. Low = Extremely weakly and sickly. Extreme lack of vigor |
| Leadership | High = Extraordinary qualities of leadership. Gets others to do his will. Not easily influenced. Low = Always a follower. Never takes initiative. Suggestible and easily influenced. |
| Mechanical ingenuity | High = Extraordinary mechanical ingenuity. Likes and understands machinery, apparatus, etc. Clever at “fixing” things. Low = Extreme lack of mechanical ingenuity. Cares nothing for machinery. A blunderer with tools. |
| Musical appreciation | High = Extraordinary musical appreciation. Low = No musical appreciation whatever. |
| Originality | High = Extraordinary ability to think things through for self. Original, resourceful, and inventive. Excels in reasoning. Low = Extreme lack or originality and resourcefulness. Always depends on teacher or book. |
| Permanency of moods | High = Moods extraordinarily permanent. Almost never goes quickly from joy to sadness or sadness to joy. Low = Moods extremely changeable. Always alternating between extreme joy and extreme sadness. |
| Popularity with other children | High = Extraordinarily popular. Universal favorite. Is sought after and has many friends. Low = extremely unpopular. Disliked and shunned. A social outcast. |
| Prudence and forethought | High = Extraordinarily prudent. Always looks ahead. Never sacrifices future goods for present pleasure. Low = extreme lack of prudence. Never looks ahead. Lives wholly in the present. |
| Self-confidence | High = Extreme self-confidence and self-reliance. Always relies on own judgments. Courts responsibility. Low = Extreme lack of self-confidence. Distrusts own judgment. Afraid of responsibility. |
| Sense of humor | High = Extraordinarily keen sense of humor. Witty. Appreciates jokes. See the funny side in everything. Low = Extremely lacking in sense of humor. Serious and prosy. Never sees the funny side. |
| Sensitiveness to approval or disapproval | High = Extraordinary sensitiveness to approval or disapproval of other children. Can’t endure to be disliked. Low = utterly indifferent to opinion of other children. Does not care in the least to be liked. |
| Sympathy and tenderness | High = Extraordinarily tender and sympathetic. Kind on principle. Abhors cruelty. Low = Extreme lack of tenderness or sympathy. Rarely does a kind act. Tendency to cruelty. |
| Truthfulness | High = Extraordinarily truthful, honest, and frank. Never misleads or misrepresents, however great the temptation. Low = Extreme tendency to lying, deceitfulness, and evasiveness. Lies for the slightest advantage. |
| Will power and perseverance | High = Extraordinary will power. Persistent in overcoming difficulties. Extremely steadfast. Never gives up. Low = Extreme lack of will power. Easily discouraged and gives up at slightest difficulty. |
| Hawaii Sample | |
|---|---|
| Trait | Description |
| Adaptable | Copes easily and successfully with new and strange situations; bravely faces up to uncertainty. |
| Assertive | Bossy; usually attempts to direct the actions of others; is convinced his (her) way is the best way of doing things; shows others “how things should be done.” |
| Careful of personal belongings | Takes good care of things which belong to him (her); becomes concerned when possessions are missing and searches for them. Keeps his (her) own things neat, clean, and in order. |
| Careless of others’ property | Seldom or never concerned about the possibility of damage to others’ belongings; borrowed things are often returned broken or dirty; loses or forgets to return borrowed things. |
| Complains about others | Frequently complains about what others are saying about or doing to him (her); asks adult to intervene and change the behavior of others. |
| Concerned about acceptance | Expresses concern about real or imagined rebuffs and slights from others; unable to take his (her) relationship with others for granted; worries that he (she) may lose friends or that others will not like him (her). |
| Conscientious | Honest; knows what is right and generally does it even if no one is watching; tells the truth even when this is difficult; does not attempt to deceive others. |
| Considerate | Thoughtful of others; sensitive to others’ feelings; cannot do things which hurt others’ feelings; sympathetic when others are in trouble and tries to help. |
| Curious | Given to wondering and musing about the why and wherefore of things; examines things closely; asks questions. |
| Eccentric | Has interests and views which are different from those of others children; acts differently from others; not interested in wearing the same clothes or doing the same things as others are wearing and doing; goes his (her) own, rather off-beat way. |
| Energetic | Active; full of pep; vigorous; movements are quick, darting. |
| Esthetically sensitive | Notices and responds with pleasure to beauty in his (her) surroundings; enjoys art and/or music. |
| Fearful | Has many fears and worries, some of which are unreasonable; easily becomes alarmed or frightened. |
| Fickle | Changes frequently in interests, opinions, and pursuits; “flighty;” starts one thing and shifts to another. |
| Fidgets | Finds it difficult to sit still, frequently changing position; often “doodles” or manipulates objects for no reason other than to be doing something. |
| Gregarious | Likes to be with others and seeks their company; spends as much time with others as possible; dislikes being alone. |
| Happy | Joyful; has a sunny disposition; enjoys life; gives impression of contentment with the way things are going for him (her). |
| Imaginative | Has an active, vivid imagination; very fanciful; sees possibilities overlooked by others. |
| Impulsive | Behavior always seems very close “to the surface;” often acts before the appropriate moment; finds it difficult to hold back; often acts or speaks without thinking of possible consequences. |
| Irresponsible | Does not take his (her) assigned duties seriously; cannot be depended upon to carry out assigned tasks. |
| Jealous | Envies and begrudges the accomplishments of others; is disturbed when others are shown special attention or given special favors; shows disappointment or annoyance when others are praised. |
| Lethargic | Slow moving; seldom or never runs or hurries; unresponsive or slow to react; works slowly. |
| Mannerly | Has good manners; knows what to say and do when introduced to others; has a sense of “good form” and behaves accordingly; uses “Please” and “Thank you” properly. |
| Neat in appearance | Careful about clothes and appearance; dislikes being dirty or disheveled; is usually well groomed. |
| Nervous habits | Has a great variety of nervous habits (e.g., nail-biting, grimacing, ties, hair twisting, pencil chewing, etc.). |
| Original | Has remarkably novel and different ideas and/or solutions to problems; thinking and behavior are characterized by unusual approaches. |
| Outspoken | Speaks his (her) mind without reservation or hesitation; seldom or never hesitates to express views and opinions on any subject. |
| Persevering | Keeps at his work until it is completed; sees a job through despite difficulties; painstaking and thorough. |
| Planful | Behavior, including play, is purposeful; attacks work in systematic fashion. |
| Restless | Constantly or frequently moves about the room; unable to settle down after activity period or recess. |
| Rigid | Has difficulty adapting to change of to new situations; prefers to keep old ways and routines, even where these are obviously inappropriate. |
| Rude | Insolent and sassy to others; often gives impression he (she) goes out of his (her) way to be discourteous to others. |
| Seclusive | Dislikes group activities and games; prefers to be by self or in company of one or two others; dislikes being in a crowd. |
| Self-minimizing | Tends to minimize own importance; humble; never brags or shows off; seeks out or is content with less important tasks or positions. |
| Socially confident | Approaches others without hesitation; shows poise when performing before others. |
| Spiteful | Deliberately does or says things which annoy or hurt others; says hateful things about others; belittles others. |
| Submissive | Usually easily led or persuaded by others; seldom or never sticks up for own rights; gives in easily in arguments. |
| Touchy | Very sensitive to criticism; cries, pouts, or sulks when criticized; does not take well to jokes or pranks on him (her). |
| Verbally fluent | Speech seems to “pour out,” often in a torrent of words, sometimes making it difficult to understand him (her). |
Footnotes
Alternatively, the courser Hawaii data could be mapped to the more nuanced Terman variable, as there is essentially a “years of education” metric in educational attainment. That is, some high school could be coded 10, high school graduate 12, college graduate 16. The courser scaling used here most likely attenuates the education relations and correlations with intellect.
Based on the data available in each dataset, alcohol use information was the most straightforward health behavior to harmonize, as similar questions were available at the same age in both samples. Despite some evidence for modest cardiovascular protective effects of limited alcohol consumption, high alcohol use and abuse has repeatedly been documented as a significant health threat in Western societies.
Some questions indicated an allergic reaction to alcohol. As many of the participants were of Asian descent, such reactions were not coded as problem drinking.
Although the personality factors were aligned to the five factors in each sample, the factors comprised varying numbers of items, so factor loadings cannot be constrained across groups. For simplicity, we used the aligned composite factors, rather than more sophisticated multi-group comparisons with differing individual items.
SAS and R scripts are available in the online Supplemental Appendix.
One could question why items were identified by the raters and later rejected. The raters were psychology students who were trained to consider overlap in definitions and common word use, not placement within the Five-Factor Model. They were not personality experts. Our final theoretical considerations involved decisions by an expert in the five factor approach to personality (L. R. Goldberg).
For comparison, we also examined a two-factor model, with the conscientiousness, agreeableness, and emotional stability traits on factor 1 and the extraversion and intellect traits as factor 2, and a four-factor model, in which emotional stability was excluded, as only a single item was available for the Terman sample. For both samples, the five-factor model was superior.
In the original study (Friedman et al., 1993), although permanency of mood was reported as a single item, the study used the average of parent and teacher ratings, whereas the value reported here is based only on teacher assessments (for comparability with the Hawaii study).
This null relation remained when self-rated health, illness reports, and mental adjustment were considered separately as single outcomes.
Contributor Information
Margaret L. Kern, Department of Psychology, University of Pennsylvania
Sarah E. Hampson, Oregon Research Institute, Eugene, Oregon
Lewis R. Goldberg, Oregon Research Institute
Howard S. Friedman, Department of Psychology, University of California, Riverside
References
- Bath PA, Deeg D, Poppelaars J. The harmonisation of longitudinal data: a case study using data from cohort studies in The Netherlands and the United Kingdom. Ageing & Society. 2010;30:1419–1437. doi: 10.1017/S0144686X1000070X. [DOI] [Google Scholar]
- Bauer DJ, Hussong AM. Psychometric approaches for developing commensurate measure across independent studies: Traditional and new models. Psychological Methods. 2009;14:101–125. doi: 10.1037/a0015583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christensen AJ, Ehlers SL, Wiebe JS, Moran PJ, Raichle K, Ferneyhough K, Lawton WJ. Patient personality and mortality: A 4-year prospective examination of chronic renal insufficiency. Health Psychology. 2002;21:315–320. doi: 10.1037//0278-6133.21.4.315. [DOI] [PubMed] [Google Scholar]
- Cohn LD, Becker BJ. How meta-analysis increases statistical power. Psychological Methods. 2003;8:243–253. doi: 10.1037/1082-989X.8.3.243. [DOI] [PubMed] [Google Scholar]
- Cooper R, Hardy R, Sayer AA, Ben-Shlomo Y, Birnie K, Cooper C, Kuh D. Age and gender differences in physical capability levels from mid-life onwards: The harmonisation and meta-analysis of data from eight UK cohort studies. PLOS One. 2011;6(11) doi: 10.1371/journal.pone.0027899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cooper H, Patall EA. The relative benefits of meta-analysis conducted with individual participant data versus aggregated data. Psychological Methods. 2009;14:165–176. doi: 10.1037/a0015565. [DOI] [PubMed] [Google Scholar]
- Costa PT, McCrae RR. Revised NEO Personality Inventory (NEO-PI-R) and NEO Five-Factor Inventory (NEO-FFI) professional manual. Odessa, FL: Psychological Assessment Resources; 1992. [Google Scholar]
- Curran PJ, Bollen KA, Chen F, Paxton P, Kirby J. The finite sampling properties of the RMSEA: Point estimates and confidence intervals. Sociological Methods and Research. 2003;32:208–252. [Google Scholar]
- Curran PJ, Hussong AM. Integrative data analysis: The simultaneous analysis of multiple data sets. Psychological Methods. 2009;14:81–100. doi: 10.1037/a0015914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Curran PJ, Hussong AM, Cai L, Huang W, Chassin L, Sher KJ, Zucker RA. Pooling data from multiple longitudinal studies: The role of item response theory in integrative data analysis. Developmental Psychology. 2008;44:365–380. doi: 10.1037/0012-1649.44.2.365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eldevik S, Hastings RP, Hughes JC, Jahr E, Eikeseth S, Cross S. Using participant data to extend the evidence base for intensive behavioral intervention for children with autism. AJIDD: American Journal on Intellectual and Developmental Disabilities. 2010;115:381–405. doi: 10.1352/1944-7558-115.5.381. [DOI] [PubMed] [Google Scholar]
- Friedman HS, Kern ML, Hampson SE, Duckworth AL. A new lifespan approach to conscientiousness and health: Combining the pieces of the causal puzzle. Developmental Psychology. doi: 10.1037/a0030373. (this issue) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friedman HS, Kern ML, Reynolds CA. Personality and health, subjective well-being, and longevity. Journal of Personality. 2010;78:179–215. doi: 10.1111/j.1467-6494.2009.00613.x. [DOI] [PubMed] [Google Scholar]
- Friedman HS, Tucker JS, Schwartz JE, Martin LR, Tomlinson-Keasey C, Wingard D, Criqui M. Childhood conscientiousness and longevity: Health behaviors and cause of death. Journal of Personality and Social Psychology. 1995;68:696–703. doi: 10.1037//0022-3514.68.4.696. [DOI] [PubMed] [Google Scholar]
- Friedman HS, Tucker J, Tomlinson-Keasey C, Schwartz J, Wingard D, Criqui MH. Does childhood personality predict longevity? Journal of Personality and Social Psychology. 1993;65:176–185. doi: 10.1037//0022-3514.65.1.176. [DOI] [PubMed] [Google Scholar]
- Goldberg LR. An alternative “Description of personality”: The Big-Five factor structure. Journal of Personality and Social Psychology. 1990;59:1216–1229. doi: 10.1037//0022-3514.59.6.1216. [DOI] [PubMed] [Google Scholar]
- Goldberg LR. The structure of phenotypic personality traits. American Psychologist. 1993;48:26–34. doi: 10.1037//0003-066x.48.1.26. [DOI] [PubMed] [Google Scholar]
- Goldberg LR. Analysis of Digman’s child-personality data: Derivation of Big-Five factor scores from each of six samples. Journal of Personality. 2001;69:709–743. doi: 10.1111/1467-6494.695161. [DOI] [PubMed] [Google Scholar]
- Greenhoot AF, Dowsett CJ. Secondary data analysis: An important tool for addressing developmental questions. Journal of Cognition and Development. 2012;13:2–18. doi: 10.1080/15248372.2012.646613. [DOI] [Google Scholar]
- Hallahan B, Newell J, Soares JC, Brambilla P, Strakowski SM, Fleck DE, McDonald C. Structural magnetic resonance imaging in bipolar disorder: An international collaborative Mega-analysis of individual adult patient data. Biological Psychiatry. 2011;69:326–335. doi: 10.1016/j.biopsych.2010.08.029. [DOI] [PubMed] [Google Scholar]
- Hampson SE, Dubanoski JP, Hamada W, Marsella AJ, Matsukawa J, Suarez E, Goldberg LR. Where are they now? Locating former elementary-school students after nearly 40 years for a longitudinal study of personality and health. Journal of Research in Personality. 2001;35:375–387. [Google Scholar]
- Hampson SE, Goldberg LR, Vogt TM, Dubanoski JP. Forty years on: Teachers’ assessments of children’s personality traits predict self-reported health behaviors and outcomes at midlife. Health Psychology. 2006;25:57–64. doi: 10.1037/0278-6133.25.1.57. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hampson SE, Goldberg LR, Vogt TM, Dubanoski JP. Mechanisms by which childhood personality traits influence adult health status: Educational attainment and healthy behaviors. Health Psychology. 2007;26:121–125. doi: 10.1037/0278-6133.26.1.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hampson SE, Goldberg LR, Vogt TM, Hillier TA, Dubanoski JP. Using physiological dysregulation to assess global health status: Associations with self-rated health and health behaviors. Journal of Health Psychology. 2009;14:232–241. doi: 10.1177/1359105308100207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hofer SM, Piccinin AM. Integrative data analysis through coordination of measurement and analysis protocol across independent longitudinal studies. Psychological Methods. 2009;14:150–164. doi: 10.1037/a0015566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hofer SM, Piccinin AM. Toward an integrative science of life-span development and aging. Journals of Gerontology Series B: Psychological Sciences and Social Sciences. 2010;65:269–278. doi: 10.1093/geronb/gbq017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- International Conference on Harmonisation. ICH for better health: Vision. 2010 Retrieved from http://www.ich.org/about/vision.html.
- Kern ML, Friedman HS. Do conscientious individuals live longer? A quantitative review. Health Psychology. 2008;27:505–512. doi: 10.1037/0278-6133.27.5.505. [DOI] [PubMed] [Google Scholar]
- Kern ML, Friedman HS. Why do some people thrive while others succumb to disease and stagnation? Personality, social relations, and resilience. In: Fry PS, Keyes CLM, editors. Frontiers of Resilient Aging: Life-Strengths and Well-Being in Late Life. Cambridge University Press; 2010. pp. 162–185. [Google Scholar]
- Loehlin JC. Latent variable models: An introduction to factor, path, and structural equation analysis. 4. Mahwah, NJ: Lawrence Erlbaum Associates; 2004. [Google Scholar]
- MacKinnon DP, Lockwood CM, Hoffman JM, West SG, Sheets V. A comparison of methods to test mediation and other intervening variable effects. Psychological Methods. 2002;7:83–104. doi: 10.1037//1082-989X.7.1.83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Manichaikul A, Chen WM, Williams K, Wong QN, Sale MM, Pankow JS, Tsai Mychaleckyj JC. Analysis of family- and population-based samples in cohort genome-wide association studies. Human Genetics. 2012;131:275–287. doi: 10.1007/s00439-011-1071-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin LR, Friedman HS. Comparing personality scales across time: An illustrative study of validity and consistency in life-span archival data. Journal of Personality. 2000;68:85–110. doi: 10.1111/1467-6494.00092. [DOI] [PubMed] [Google Scholar]
- McArdle JJ, Grimm KJ, Hamagami F, Bowles RP, Meredith W. Modeling life-span growth curves of cognition using longitudinal data with multiple samples and changing scales of measurement. Psychological Methods. 2009;14:126–149. doi: 10.1037/a0015857. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Olgiati P, Politis A, Albani D, Rodilossi S, Polito L, Ateri E, Serretti A. Association of SORL1 alleles with late-onset Alzheimer’s Disease. Findings from the GIGAS_LOAD study and mega-analysis. Current Alzheimer Research. 2012;9:491–499. doi: 10.2174/156720512800492431. [DOI] [PubMed] [Google Scholar]
- Reise SP, Widaman KF, Pugh RH. Confirmatory factor analysis and Item Response Theory: Two approaches for exploring measurement invariance. Psychological Bulletin. 1993;114:552–566. doi: 10.1037/0033-2909.114.3.552. [DOI] [PubMed] [Google Scholar]
- Rijmen F, Tuerlinckx F, De Boeck P, Kuppens P. A nonlinear mixed model framework for item response theory. Psychological Methods. 2003;8:185–205. doi: 10.1037/1082-989x.8.2.185. [DOI] [PubMed] [Google Scholar]
- Riley RD, Simmonds MC, Look MP. Evidence synthesis combining individual patient data and aggregate data: A systematic review identified current practice and possible methods. Journal of Clinical Epidemiology. 2007;60:431–439. doi: 10.1016/j.jclinepi.2006.09.009. [DOI] [PubMed] [Google Scholar]
- Rosenthal R, DiMatteo MR. Meta-analysis: Recent developments in quantitative methods for literature reviews. Annual Review of Psychology. 2001;52:59–82. doi: 10.1146/annurev.psych.52.1.59. [DOI] [PubMed] [Google Scholar]
- Rosseel Y. lavaan: An R package for structural equation modeling. Journal of Statistical Software. 2012;48(2):1–36. [Google Scholar]
- Salimi-Khorshidi G, Smith SM, Keltner JR, Wager TD, Nichols TE. Meta-analysis of neuroimaging data: A comparison of image-based and coordinate-based pooling of studies. Neuroimage. 2009;45:810–823. doi: 10.1016/j.neuroimage.2008.12.039. [DOI] [PubMed] [Google Scholar]
- Schaap L, Peeters GMEE, Dennison EM, Zambon S, Nikolaus T, Sachez-Martinez M, Deeg DJH. European Project on Osteoarthritis (EPOSA): methodological challenges in harmonization of existing data from five European population-based cohorts on aging. BMC Musculoskeletal Disorders. 2011;12:272–284. doi: 10.1186/1471-2474-12-272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seeman TE, McEwen BS, Rowe JW, Singer BH. Allostatic load as a marker of cumulative biological risk: MacArthur studies of successful aging. Proceedings of the National Academy of Sciences of the United States of America. 2001;98:4770–4775. doi: 10.1073/pnas.081072698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Serretti A, Cusin C, Rausch JL, Bondy B, Smeraldi E. Pooling pharmacogenetic studies on the serotonin transporter: A mega-analysis. Psychiatry Research. 2006;145:61–65. doi: 10.1016/j.psychres.2005.09.020. [DOI] [PubMed] [Google Scholar]
- Smith TW, Williams PG. Personality and health: Advantages and limitations of the five-factor model. Journal of Personality. 1992;60:395–423. doi: 10.1111/j.1467-6494.1992.tb00978.x. [DOI] [PubMed] [Google Scholar]
- Stewart LA, Clarke MJ. Practical methodology of meta-analyses (overviews) using updated individual patient data. Statistics in Medicine. 1995;14:2057–2079. doi: 10.1002/sim.4780141902. [DOI] [PubMed] [Google Scholar]
- Sutton AJ, Kendrick D, Coupland CAC. Meta-analysis of individual- and aggregate-level data. Statistics in Medicine. 2008;27:651–669. doi: 10.1002/sim.2916. [DOI] [PubMed] [Google Scholar]
- Tak LM, Meijer A, Manoharan A, de Jonge P, Rosmalen JGM. More than the sum of its parts: Meta-analysis and its potential to discover sources of heterogeneity in psychosomatic medicine. Psychosomatic Medicine. 2010;72:253–265. doi: 10.1097/PSY.0b013e3181d714e1. [DOI] [PubMed] [Google Scholar]
- Terman LM, Baldwin BT, DeVoss JC, Fuller F, Goodenough FL, Kelley TL, et al. Mental and physical traits of a thousand gifted children. Vol. 1. Stanford, CA: Stanford University Press; 1925. Genetic studies of genius. [Google Scholar]
- Terman LM, Oden MH. Genetic studies of genius, IV: The gifted child grows up. Stanford, CA: Stanford University Press; 1947. [Google Scholar]
- Terman LM, Oden MH. Genetic studies of genius, V: The gifted group at mid-life. Stanford, CA: Stanford University Press; 1959. [Google Scholar]
- Weiss A, Costa PT. Domain and facet personality predictors of all-cause mortality among Medicare patients aged 65 to 100. Psychosomatic Medicine. 2005;67:724–733. doi: 10.1097/01.psy.0000181272.58103.18. [DOI] [PubMed] [Google Scholar]
- Wilson RS, Mendes de Leon CF, Bienias JL, Evans DA, Bennett DA. Personality and mortality in old age. Journal of Gerontology B: Psychological Sciences and Social Science. 2004;59:110–116. doi: 10.1093/geronb/59.3.p110. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




