Abstract
The Balloon Analogue Risk Task (BART) is a computerized decision-making task that provides a test of behavioral risk taking. The task is increasingly used in laboratory settings and in the field with young adults and adolescents. However, there are currently no published data about the test-retest characteristics of the task when it is administered on separate days. The current paper addresses this gap. Risky behavior on the BART (adjusted average pumps) showed acceptable test-retest reliability across days (r = +.77, p < .001). The data indicate that risk behavior on the BART has adequate test-retest stability and therefore performance on the task on a single occasion is likely to be representative of an individual’s performance on other occasions.
Keywords: test-retest validity, test-retest reliability, repeated measures, individual differences, risk behavior
Introduction
The Balloon Analogue Risk Task (BART) is a computerized measure of risk-taking behavior (Lejuez et al., 2002). Risk behavior on the BART correlates with real-world risk behavior such as alcohol use, cigarette and drug use, gambling, stealing, unsafe sex (Lejuez et al., 2002; 2003a,b), and trait measures of risk-taking propensity (e.g. sensation seeking and trait impulsivity, Lejuez et al., 2002; trait psychopathy, Hunt et al., 2005). These data suggest that the BART could be useful in studies that assess risk taking behavior over time, for instance, drug treatments, interventions, or events that could phasically alter risk taking on different days of study. Utility of the BART for such purposes, however, is predicated on its test-retest reliability.
Toward this end, there are thus far few reports about the test-retest reliability of risk behavior on the BART
Lejuez and colleagues assessed risk behavior on the BART in undergraduate smokers (n = 26) and nonsmokers (n = 34), by administering the test three times on a single test day (2003a). They found that risk behavior on the BART (adjusted average pumps) was correlated +.62 to +.82 across the three administrations within the single day, with small but significant increases in risk behavior between administrations (Lejuez et al., 2003a). However, these data obtained on a single day cannot speak to test-retest reliability across days. There is, in general, a lack of data on the reliability of computerized tasks of decision-making and behavioral control, such as go/no go, probability discounting, and delay discounting tasks (see Langenecker et al., 2007; Ohmura et al., 2006).
The goals of this analysis are threefold: (1) to document the stability of mean BART risk behavior on separate test days, (2) to provide preliminary data regarding the test-retest reliability of BART risk behavior, and (3) to document the stability of individual differences in BART risk behavior. The first goal is important because it provides information about the ways that mean risk behavior on the task may vary on separate study days in a given sample, which could occur through a number of mechanisms including novelty effects, learning, or habituation to the task. The second goal helps mitigate the current dearth of test-retest information for the BART. The third goal -- determining the relative stability of BART risk behavior -- is important because 1.) there are individual differences in BART risk behavior as measured on a single study day in healthy volunteers (Lejuez et al., 2002), 2.) these individual differences are correlated with personality traits which are themselves stable upon repeat assessment (see Lejeuz et al., 2002), and 3.) while it is therefore reasonable to expect individual differences in BART risk behavior to be reproducible, this has not yet been documented for this task. Relative stability (also termed “differential stability”) has been recently documented for delay discounting in alcoholics (Takahashi et al., 2007) and for delay and probability discounting in young adults (Ohmura et al., 2006), and is important to document because it speaks to the ability of a task to index individual differences in risky behavior on separate study days in the same participants, in the absence of other experimental manipulation. The present paper aims to fill these gaps, and provides preliminary data about mean behavior on the task over time, preliminary estimates of BART test-retest stability, and data about the relative stability of individual differences in BART risk behavior.
Method
Participants
Participants completed the BART risk task as a part of a larger, multi-session repeated measures study (White, Lejuez & de Wit, 2007), which involved an orientation session and three study sessions, one with a placebo capsule treatment and two with drug capsules. BART data used for the present analysis were obtained on the orientation session and up to approximately two weeks later on the placebo session. The current test-retest data have not been previously reported.
Participants were forty healthy volunteers (19 female), aged 18 to 35 who were free of major medical conditions, Axis I psychiatric disorder, and a history of drug or alcohol problems. They also had at least a high school education, were fluent in English and not working night shift (White, Lejuez and de Wit, 2007). One subject missing BART data on the orientation session was excluded from the analyses (final n = 39). Subjects were drug-free at the time of testing.
This sample is smaller than sample sizes used to investigate test-retest characteristics of self-report measures (e.g., personality trait measures of impulsivity; see Yao et al., 2007), is larger than samples used to establish external validity of the BART in nonsmokers (e.g. N=26, Lejuez et al., 2003a), and is comparable to samples used in test-retest studies of behavior and risk-taking (e.g., computerized behavioral tests, N=35, Sakong et al., 2007; go/no go, N=28, Langenecker et al., 2007; probability discounting, N=18, Ohmura et al., 2006; delay discounting, N=22, Ohmura et al., 2006; personal safety, N=29, Punakallio, 2004; gambling, N=34, Weinstock et al., 2004; empathy, N=24, Lawrence et al., 2004; self-harm, N=38, Fliege et al., 2006), several of which relate to performance on the BART (see Lejuez et al., 2002; 2003a; 2003b). The present sample size is adequate for preliminary data on test-retest characteristics of the BART.
Procedure
Before the first session subjects signed their informed consent. The studies were approved by the Institutional Review Board at The University of Chicago in accordance with the Code of Federal Regulations (Title 45, Part 46) “Protection of Human Subjects” adopted by the National Institutes of Health and the Office for Protection from Research Risks, and were conducted ethically in accordance with the Helsinki Declaration of 1964 (revised 1989).
Participants completed the task on the orientation session (Session A) and the placebo session (Session B), separated by approximately 2 weeks (mean: 11.5 days ± 9.1). This interval is similar to that used in other studies that assess test-retest stability of risk-taking or behavior, including test-retest assessment of measures of physical activity (14 days; Hardy et al., 2007; 22 days; Yore et al., 2007); compulsive exercise (14 days; Griffiths et al., 2005); sexual risk behavior (6 days; Troped et al., 2007; 17 days, Schrimshaw et al., 2006; 21 days, Hearn et al., 2003), smoking and drinking behavior (14 days; Chor et al., 2003; 2–30 days; Bell et al., 2003); HIV risk behavior (6–11 days; Fry & Lintzeris, 2003; 30 days, Sherman et al., 2003); risky drug behavior (7 days; Hubley et al., 2005); and substance use, abuse, and suicidality (8 days; Shaffer et al., 2004; 10–14 days, Flisher et al., 2004).
Behavioral Assessment: Balloon Analogue Risk Task (BART)
The BART is a computerized task on which participants have the opportunity to win or lose potential earnings, where persistent responding increases gains but also increases the risk of loss on each trial. The BART task consisted of 60 balloon trials, one third of which were Low, Medium or High payoff value (0.5 cents, 1.0 cents, and 5.0 cents per pump). Participants are presented with “balloons” on a screen, and subjects are given the opportunity to “pump” the balloon to earn monetary rewards. On each trial the computer screen displays a small balloon, a balloon pump, a reset button labeled “Collect $$$,” a box displaying the amount of money earned on the previous trial, a box displaying the total amount of money earned, and a box showing how much the balloon currently displayed would pay off (Lejuez et al., 2002). The number of points that can be earned per pump varies across trials, with 20 balloons having low value (0.5 cents per pump), 20 balloons having medium value (1.0 cents per pump) and 20 balloons having high value (5.0 cents per pump). On each trial, the number of cents earned increases with each pump until either a). the balloon “pops” and participants lose their earnings for that trial, or b). the subject collects the accumulated earnings for that trial. On each trial, each individual click on the pump inflates the balloon one degree (about .125” in all directions), and each balloon is programmed to pop between 1 and 128 pumps, with an average breakpoint of 64 pumps. Specific information regarding the balloon breakpoint determination is not provided to the participants, who are simply informed that the balloon can break anywhere from the 1st pump all the way through enough pumps to make the balloon fill the screen. At any point during each trial, the participants can stop pumping the balloon and click the “Collect $$$” button, which transfers money accumulated from that balloon to the permanent bank, updates the permanent bank amount on the display, and produces a slot machine payoff sound. In contrast, when a balloon explodes, a “pop” sound is heard, the balloon disappears, the money in the temporary bank is lost for that trial, and the next trial begins. The dependent variable is adjusted average pumps, i.e., pumps on the balloons which did not explode (Lejuez et al., 2002).
Data Analysis
Test-retest
Test-retest characteristics of the BART risk task were assessed using (1) Paired comparisons of mean risk behavior on the BART across days, to provide information about potential novelty, habituation, and learning effects on BART risk behavior across sessions; (2) Pearson correlations of BART risk behavior, to provide information about test-retest reliability across sessions; and (3) Spearman rank-order correlations, to provide information about the stability of individuals’ ranked riskiness of behavior on the task compared to the rest of the sample. Some change in mean risk behavior could be expected from novelty effects upon initial exposure to the task (Session A), or to learning, boredom or habituation upon subsequent exposure to the task (Session B). We expected test-retest reliability to be high on both sessions, and expected individual differences in participants’ relative riskiness of behavior on the task to be conserved across test days. Paired-samples t-tests, pearson and spearman correlations were also conducted separately for males and females, to assess potential gender differences in test-retest patterns for the BART in this sample.
Cumulative Exposure
Participants in the parent study completed the BART a total of 4 times (orientation, placebo session, two drug sessions). Therefore, it was important to determine whether performance on the task changed as a function of repeated administrations. Mean risk behavior and test-retest correlations were assessed for differences between two groups: (i) those who received 2 exposures of the BART by Session B (n = 14; OR-PL order; i.e., no intervening study days); and (ii) those who received more than 2 exposures of the BART by Session B (n = 25; OR-10 mg-PL order, OR-20 mg-PL order; OR-10 mg -20 mg -PL order, OR- 20 mg -10 mg -PL order; i.e., up to two intervening study days) in the larger parent study. Correlation coefficients were normalized using Fisher r to z transformation and z-scores were compared between groups. Mean risk behaviors were compared across groups using t-tests for independent samples.
Alpha was set at .05, 2-tailed for all tests.
Results
Mean Risk Behavior: Mean risk behavior on the BART (adjusted average pumps; see Lejuez et al., 2002) did not change over time (t (1, 38) = 0.85, n.s.; see Total Score, Table 1). Test-Retest Reliability: Pearson correlations for BART risk behavior between sessions were moderate to high (r = +.66 to +.78; p < .001). Correlations did not differ between the three BART reward conditions (low, medium, and high BART; Z < |1.6|, n.s.). Stability of Individual Differences: Spearman rank-order correlations for BART risk behavior between sessions were also moderate to high (rho = +.63 to +.73; p < .001). Correlations did not differ between the three BART reward conditions (low, medium, and high BART; Z < |1.2|, n.s.). Impact of Cumulative Exposure: Mean risk behavior on session B did not differ between groups receiving two or more than two exposures to the BART (means (s.e., n) = 43.6 (3.1, n=14), 41.1 (2.8, n=25), respectively; t (1, 37) = 0.59, n.s.). Test-retest correlations did not differ between groups (Z = 1.08; n.s.), though there was a nonsignificant trend toward increased test-retest reliability in subjects who had more than two exposures to the BART by the second session (r = 0.64 (n=14); versus r = 0.82 (n=25)). Gender Differences: Mean risk behavior on the BART did not change over time for either males or females (males: all t (1,20) < 0.9, n.s.; females: all t (1,17) < |1.8|, n.s.). Pearson correlations indicated that test-retest stability was moderate to high in both sexes (males / females: total score: r = + .74 / +.81; ½ cent BART: r = + .66 / +.70; 1 cent BART: r = + .73 / + .78; 5 cent BART: r = + .84 / + .72; all p ≤ .001), and the magnitude of test-retest correlations did not differ by gender (Z < |.9|, n.s.). Spearman rank-order correlations were moderate to high for both groups [males / females: total score: rho = +.67 (p ≤ .001) / +.72 (p ≤ .001); ½ cent BART: rho = +.65 (p ≤ .001) / +.53 (p < .05); 1 cent BART: rho = +.71 (p ≤ .001) / +.68 (p < .005); 5 cent BART: rho = +.78 (p ≤ .001) / +.76 (p ≤ .001)], and did not differ in magnitude (Z < |.54|, n.s.).
Table 1.
Test-Retest of BART risk behavior
| Session A | Session B | t | r | rho | |
|---|---|---|---|---|---|
| BART Risk Behavior | |||||
| Total Score | 40.80 (2.0) | 41.99 (2.0) | n.s. |
|
|
| Reward Levels: | |||||
| 1/2 cent/pump | 45.01 (2.9) | 44.80 (2.6) | n.s. | +.66*** | +.63*** |
| 1 cent/pump | 43.25 (2.3) | 43.96 (2.3) | n.s. | +.75*** | +.73*** |
| 5 cents/pump | 37.41 (2.3) | 38.71 (2.0) | n.s. | +.78*** | +.72*** |
Legend:
BART= Balloon Analogue Risk Task; Risk Behavior = Adjusted Average Pumps (per Lejuez et al., 2002). BART test-retest assessed in the same participants on separate test days, approximately 2 weeks apart (mean = 11.5 ± 9.1 days). Total Score = adjusted average pumps on 60 balloon trials per session; Reward levels = adjusted average pumps on 20 balloon trials in each reward category per session. Standard error in parentheses.
p < .001; N=39.
Discussion
There were three main findings: (i) mean risk behavior on the BART did not differ across test days; (ii) test-retest correlations across sessions were relatively high (r = +.77); and (iii) rank-order correlations across sessions were relatively high (rho = +.71). Cumulative experience with the task did not alter task performance, and test-retest characteristics did not differ by gender. The findings and their implications are discussed below.
First, risk behavior on the BART did not differ across sessions (Table 1); average behavior on the task assessed in the same participants on two separate study days did not differ despite intervening time, exposure to two low doses of amphetamine, and cumulative experience on the task. This finding is important because it indicates that the BART risk task could be relatively resilient to novelty effects, learning effects, and habituation. This finding assists in the evaluation of changes in BART risk behavior in response to interventions in within-subject designs, such as consumption of moderate doses of psychostimulants (e.g., White, Lejuez & de Wit, 2007), and acute episodes of sleep deprivation (Killgore, 2007; Acheson et al., 2007), and is similar to other measures relevant to risk-taking, such as probability and delay discounting, which also have stable mean performance when tested between 2 to 5 days (t (1,23) < |1.7|, n.s.; based on data from Appendix A, Richards et al., 1999) and 3 months apart (Ohmura et al., 2006).
Second, risk behavior on the BART showed adequate test-retest reliability (r = +.77) over a period of approximately 2 weeks. This finding extends the time window on which test-retest reliability of the BART is known (see Lejuez et al., 2003a). The magnitude of the test-retest estimate compares favorably with other tasks, such as delay discounting and probability discounting, which have test-retest correlations of +.52 to +.62 (p < .01; k-values), and +.76 to +.94 (p < .001; h-values), respectively over a shorter time period (i.e., 2–5 days; for data see Appendix A, Richards et al., 1999). The present BART estimates are also within the published range of 2-week test-retest reliability estimates for various self-report measures of risk-taking proclivity or behavior (e.g., 2-week re-test (+.75) for self-reports of risk-taking tendency, Meertens et al., 2008; 2-week re-test (+.73 to +.96) for self-reported gambling, Weinstock et al., 2004); 2-week re-test (> +.7) for self-reported sedentary activity, Hardy et al., 2007; 2-week re-test (+.61 to +.95) for self-reported smoking, Chor et al., 2003).
Third, spearman rank-order correlations were acceptably high, indicating that between-subject differences in rank-ordering of risky behavior on the task are likely to be stable over the time period studied. This finding is important because behavior on the BART correlates with scores on personality traits of impulsivity and psychopathy (e.g., Hunt et al., 2005; Lejuez et al., 2002), which are themselves stable. Other behavioral tasks relevant to risk-taking, such as probability and delay discounting, also show stability of individual differences over time periods both shorter (2–5 days; rho > +.89, p’s < .001; for data see Appendix A, Richards et al., 1999) and longer than studied here (2–3 months; Ohmura et al., 2006; Takahashi et al., 2007). The present results suggest that rank-ordered individual differences in risk behavior on the BART are likely to be both stable and reproducible, a result that has obtained for other behavioral tasks. The present, preliminary findings provide new information about the stability and test-retest reliability of the BART, which is necessary given the dearth of published test-retest information on the BART and laboratory tests of decision-making and behavioral control (for discussion, see Langenecker et al., 2007; Ohmura et al., 2006).
The present analysis has several limitations. The main limitation is the assessment of test-retest characteristics over a range of time (approximately 2 weeks; see Methods), rather than using a fixed, a-priori test-retest interval. Interval range approaches have however been used in a number of published test-retest studies of other measures of risk-taking and potentially injurious behavior (e.g., interval of 2 to 30 days for alcohol-related behavior, Bell et al., 2003; interval of 7 to 150 days for self-harm behavior, Fliege et al., 2006; interval of 6 to 11 days for HIV, HBV, and HCV risk behavior, Fry & Lintzeris, 2003; interval of 10 to 14 days for substance use, violent behavior, suicidality, and sexuality, Flisher et al., 2004). Future studies using fixed test-retest intervals, such as fixed 2-week, 3-week, 2-month, and 3-month retest intervals, would be valuable to assess the long-term stability of the behavior. The two-week interval would replicate and extend the present findings; three-week, two- and three-month intervals have been used for evaluation of test-retest reliability of other relevant tasks (e.g., 3 week interval for go/no go, Langenecker et al., 2007; 2 month interval for delay discounting, Takahashi et al., 2007; 3 month interval for delay and probability discounting, Ohmura et al., 2006), and would permit systematic comparison between findings in this small literature on test-retest characteristics of behavioral tasks. Other limitations include low statistical power to detect potential gender differences in test-retest patterns in the current sample. Despite these limitations, the present findings neverless provide necessary preliminary data about the test-retest characteristics of the BART in young adult samples.
Strengths of the current study include use of multiple, commonly-used levels of reward (1/2, 1, and 5 cents per pump), an explosion break-point of 64 pumps, and assessment of risk responses using 60 balloons total (20 balloons per sub-condition) on each study day. These features make the present BART findings highly generalizable to the other versions of the task. The BART is currently deployed in multiple versions, which vary in: (1) pump reward value; (2) number of balloons assessed; and (3) break-points for explosions. Regarding pump reward values, the present study used the most commonly-used reward value for pumps, i.e., 5 cents per pump (e.g., Killgore, 2007; Benjamin & Robbins, 2007; White, Lejeuz & de Wit, 2007; Hunt et al., 2005; Aklin et al., 2005; Lejuez et al., 2003a,b, 2005; Hopko et al., 2006; Bornovalova et al., 2005); and used several additional reward values (1/2 and 1 cent per pump) which are also in wide use (for instance, 1 cent per pump, Crowley et al., 2006; ½, 1, and 5 cents per pump, White, Lejeuz, & de Wit, 2007; Reynolds et al., 2004, 2006b; 1, 5, and 25 cents per pump, Acheson et al., 2007, 2008; Hamidovic et al., 2008), though other reward levels have also been used, which are not addressed here (i.e., 1 point per pump, redeemable for prizes, Lejuez et al., 2007; 10, 25, and 50 cents per pump, Reynolds et al., 2006a; and an escalating reward of 50 cents per pump with 2 cent increases; Fein & Chang, 2008). Regarding number of balloons, the study estimated test-retest reliability using a moderate to high number of balloons (20 balloons per condition; 60 balloons total per assessment); this exceeds the 10-balloon minimum required for the task (see Lejeuz et al., 2002; Aklin et al., 2005), and places the current study in the middle of the range of trials typically used for the task, which varies from 10 balloons total (Bornovalova et al., 2005; Hopko et al., 2006); to 10 balloons per condition (Acheson, Richards & de Wit, 2007; Acheson & de Wit, 2008; Hamidovic et al., 2008; Reynolds et al., 2004, 2006a, 2006b); 20 balloons total (Benjamin & Robbins, 2007); to 20 balloons per condition (Benjamin & Robbins, 2007; White, Lejuez & de Wit, 2007); 30 balloons total (Aklin et al., 2005; Crowley et al., 2006; Hunt et al., 2005; Killgore, 2007; Lejuez et al., 2003a,b, 2005a,b, 2007; Skeel et al., 2007); to 30 balloons per condition (Lejuez et al., 2002); and 40 balloons total (Benjamin & Robbins, 2007), 60 balloons total (Fein & Chang, 2008; White, Lejuez & de Wit, 2007); and 90 balloons total (Lejuez et al., 2002); with a recent novel variant (the 2BIT; Mitchell et al., 2008) topping out at 300 total balloons (150 balloons per condition; Mitchell et al., 2008). Thus the present data correspond favorably to the majority of studies using the BART in terms of numbers of balloon trials assessed. The present study used the most common break-point for explosions, i.e., 64 pumps out of a possible 128 (see Lejuez et al., 2002; Killgore, 2007; Lejuez et al., 2002, 2003a, 2007; White, Lejuez & de Wit, 2007; Hunt et al., 2005; Lejuez et al., 2005; Hopko et al., 2006; Bornovalova et al., 2005; Lejuez et al., 2005; though instances of lower break-points can also be found; for instance 4 & 16 pumps, Lejuez et al., 2002; 20 pumps, Fein & Chang, 2008; 63 pumps, Benjamin & Robbins, 2007). The results also provide test-retest information about the most widely used summary metric of risk behavior on the task (i.e., adjusted average pumps; Lejuez et al., 2002). Because of these features, the present study provides highly generalizable test-retest information and does so using an appropriately high number of balloons (> 20 per condition; 60 total; see Aklin et al., 2005; Lejuez et al., 2007). The present test-retest information is relevant to the large number of BART versions currently in use, due to the above overlaps between this version and the original version (Lejuez et al., 2002); EEG versions (Fein & Chang, 2008); youth versions (BART-Y; Lejuez et al., 2007); versions that assess responses to win gains or prevent losses (LBART, GBART; Benjamin & Robbins, 2007), and versions that involve multiple and alternate levels of reward (Acheson et al., 2007, 2008; Crowley et al., 2006; Hamidovic et al., 2008; Reynolds et al., 2004, 2006b; White, Lejeuz, & de Wit, 2007).
Acknowledgments
Thanks to Vandana Grover, Jen McDonald, and Justin Engasser of the Human Behavioral Pharmacology Laboratory for their assistance. This research was supported by USPHS grant DA09133 and training grant T32 DA07255.
References
- Acheson A, de Wit H. Bupropion improves attention but does not affect impulsive behavior in healthy young adults. Experimental and Clinical Psychopharmacology. 2008;16(2):113–123. doi: 10.1037/1064-1297.16.2.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Acheson A, Richards J, de Wit H. Effects of sleep deprivation on impulsive behaviors in men and women. Physiology & Behavior. 2007;91(5):579–587. doi: 10.1016/j.physbeh.2007.03.020. [DOI] [PubMed] [Google Scholar]
- Aklin W, Lejuez C, Zvolensky M, Kahler C, Gwadz M. Evaluation of behavioral measures of risk taking propensity with inner city adolescents. Behaviour Research and Therapy. 2005;43(2):215–228. doi: 10.1016/j.brat.2003.12.007. [DOI] [PubMed] [Google Scholar]
- Bell NS, Williams JO, Senier L, Strowman SR, Amoroso PJ. The reliability and validity of the self-reported drinking measures in the Army’s Health Risk Appraisal survey. Alcoholism: Clinical and Experimental Research. 2003;27:826–34. doi: 10.1097/01.ALC.0000067978.27660.73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benjamin A, Robbins S. The role of framing effects in performance on the Balloon Analogue Risk Task (BART) Personality and Individual Differences. 2007;43(2):221–230. [Google Scholar]
- Bornovalova M, Daughters S, Hernandez G, Richards J, Lejuez C. Differences in impulsivity and risk-taking propensity between primary users of crack cocaine and primary users of heroin in a residential substance-use program. Experimental and Clinical Psychopharmacology. 2005;13(4):311–318. doi: 10.1037/1064-1297.13.4.311. [DOI] [PubMed] [Google Scholar]
- Chor D, Faerstein E, Alves MG, de Souza Lopes C. How reproducible is self-reported information on exposure to smoking, drinking, and dietary patterns? Evidence among Brazilian adults in the Pró-Saúde Study. Sao Paulo Medical Journal. 2003;121:63–6. doi: 10.1590/S1516-31802003000200006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crowley T, Raymond K, Mikulich-Gilbertson S, Thompson L, Lejuez C. A risk-taking ‘set’ in a novel task among adolescents with serious conduct and substance problems. Journal of the American Academy of Child & Adolescent Psychiatry. 2006;45(2):175–183. doi: 10.1097/01.chi.0000188893.60551.31. [DOI] [PubMed] [Google Scholar]
- Fein G, Chang M. Smaller feedback ERN amplitudes during the BART are associated with a greater family history density of alcohol problems in treatment-naïve alcoholics. Drug and Alcohol Dependence. 2008;92(1):141–148. doi: 10.1016/j.drugalcdep.2007.07.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fliege H, Kocalevent RD, Walter OB, Beck S, Gratz KL, Gutierrez, et al. Three assessment tools for deliberate self-harm and suicide behavior: valuation and psychopathological correlates. Journal of Psychosomatic Research. 2006;61:113–21. doi: 10.1016/j.jpsychores.2005.10.006. [DOI] [PubMed] [Google Scholar]
- Flisher AJ, Evans J, Muller M, Lombard C. Brief report: Test-retest reliability of self-reported adolescent risk behaviour. Journal of Adolescence. 2004;27:207–12. doi: 10.1016/j.adolescence.2001.10.001. [DOI] [PubMed] [Google Scholar]
- Fry CL, Lintzeris N. Psychometric properties of the Blood-borne Virus Transmission Risk Assessment Questionnaire (BBV-TRAQ) Addiction. 2003;98:171–8. doi: 10.1046/j.1360-0443.2003.00207.x. [DOI] [PubMed] [Google Scholar]
- Griffiths MD, Szabo A, Terry A. The exercise addiction inventory: A quick and easy screening tool for health practitioners. British Journal of Sports Medicine. 2005;39:e30. doi: 10.1136/bjsm.2004.017020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hamidovic A, Kang U, de Wit H. Effects of low to moderate acute doses of pramipexole on impulsivity and cognition in healthy volunteers. Journal of Clinical Psychopharmacology. 2008;28(1):45–51. doi: 10.1097/jcp.0b013e3181602fab. [DOI] [PubMed] [Google Scholar]
- Hardy LL, Booth ML, Okely AD. The reliability of the Adolescent Sedentary Activity Questionnaire (ASAQ) Preventative Medicine. 2007;45:71–4. doi: 10.1016/j.ypmed.2007.03.014. [DOI] [PubMed] [Google Scholar]
- Hearn KD, O’Sullivan LF, Dudley CD. Assessing reliability of early adolescent girls’ reports of romantic and sexual behavior. Archives of Sexual Behavior. 2003;32:513–21. doi: 10.1023/a:1026033426547. [DOI] [PubMed] [Google Scholar]
- Hopko D, Lejuez C, Daughters S, Aklin W, Osborne A, Simmons B, et al. Construct Validity of the Balloon Analogue Risk Task (BART); Relationship with MDMA Use by Inner-City Drug Users in Residential Treatment. Journal of Psychopathology and Behavioral Assessment. 2006 Jun;28(2):95–101. [Google Scholar]
- Hubley AM, Russell LB, Palepu A. Injection Drug Use Quality of Life scale (IDUQOL): A validation study. Health Quality of Life Outcomes. 2005;3:43. doi: 10.1186/1477-7525-3-43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hunt M, Hopko D, Bare R, Lejuez C, Robinson E. Construct Validity of the Balloon Analog Risk Task (BART): Associations With Psychopathy and Impulsivity. Assessment. 2005;12(4):416–428. doi: 10.1177/1073191105278740. [DOI] [PubMed] [Google Scholar]
- Killgore W. Effects of sleep deprivation and morningness-eveningness traits on risk-taking. Psychological Reports. 2007;100(2):613–626. doi: 10.2466/pr0.100.2.613-626. [DOI] [PubMed] [Google Scholar]
- Langenecker SA, Zubieta JK, Young EA. A task to manipulate attentional load, set-shifting, and inhibitory control: Convergent validity and test-retest reliability of the Parametric Go/No-Go Test. Journal of Clinical and Experimental Neuropsychology. 2007;29:842–853. doi: 10.1080/13803390601147611. [DOI] [PubMed] [Google Scholar]
- Lawrence EJ, Shaw P, Baker D, Baron-Cohen S, David AS. Measuring empathy: reliability and validity of the Empathy Quotient. Psychological Medicine. 2004;34:911–9. doi: 10.1017/s0033291703001624. [DOI] [PubMed] [Google Scholar]
- Lejuez CW, Aklin WM, Jones HA, Richards JB, Strong DR, Kahler CW, Read JP. The Balloon Analogue Risk Task (BART) differentiates smokers and nonsmokers. Experimental and Clinical Psychopharmacology. 2003a;11:26–33. doi: 10.1037//1064-1297.11.1.26. [DOI] [PubMed] [Google Scholar]
- Lejuez CW, Aklin WM, Zvolensky MJ, Pedulla CM. Evaluation of the Balloon Analogue Risk Task (BART) as a predictor of adolescent real-world risk-taking behaviours. Journal of Adolescence. 2003b;26:475–9. doi: 10.1016/s0140-1971(03)00036-8. [DOI] [PubMed] [Google Scholar]
- Lejuez CW, Read JP, Kahler CW, Richards JB, Ramsey SE, Stuart GL, et al. Evaluation of a behavioral measure of risk taking: the Balloon Analogue Risk Task (BART) Journal of Experimental Psychology. 2002;8:75–84. doi: 10.1037//1076-898x.8.2.75. [DOI] [PubMed] [Google Scholar]
- Lejuez C, Aklin W, Bornovalova M, Moolchan E. Differences in risk-taking propensity across inner-city adolescent ever- and never-smokers. Nicotine & Tobacco Research. 2005;7(1):71–79. doi: 10.1080/14622200412331328484. [DOI] [PubMed] [Google Scholar]
- Lejuez C, Aklin W, Daughters S, Zvolensky M, Kahler C, Gwadz M. Reliability and Validity of the Youth Version of the Balloon Analogue Risk Task (BART-Y) in the Assessment of Risk-Taking Behavior Among Inner-City Adolescents. Journal of Clinical Child and Adolescent Psychology. 2007;36(1):106–111. doi: 10.1080/15374410709336573. [DOI] [PubMed] [Google Scholar]
- Meertens RM, Lion R. Measuring an individual’s tendency to take risks: The risk propensity scale. Journal of Applied Social Psychology. 2008;38:1506–1520. [Google Scholar]
- Mitchell S, Schoel C, Stevens A. Mechanisms underlying heightened risk taking in adolescents as compared with adults. Psychonomic Bulletin & Review. 2008;15(2):272–277. doi: 10.3758/pbr.15.2.272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ohmura Yu, Takahashi Taiki, Kitamura Nozomi. Three-Month Stability of Delay and Probability Discounting Measures. Experimental and Clinical Psychopharmacology. 2006;14:318–28. doi: 10.1037/1064-1297.14.3.318. [DOI] [PubMed] [Google Scholar]
- Punakallio A. Trial-to-trial reproducibility and test-retest stability of two dynamic balance tests among male firefighters. International Journal of Sports Medicine. 2004;25:163–9. doi: 10.1055/s-2003-45251. [DOI] [PubMed] [Google Scholar]
- Reynolds B, Ortengren A, Richards J, de Wit H. Dimensions of impulsive behavior: Personality and behavioral measures. Personality and Individual Differences. 2006b;40(2):305–15. [Google Scholar]
- Reynolds B, Richards J, de Wit H. Acute-alcohol effects on the Experiential Discounting Task (EDT) and a question-based measure of delay discounting. Pharmacology, Biochemistry and Behavior. 2006a;83(2):194–202. doi: 10.1016/j.pbb.2006.01.007. [DOI] [PubMed] [Google Scholar]
- Reynolds B, Richards J, Dassinger M, de Wit H. Therapeutic doses of diazepam do not alter impulsive behavior in humans. Pharmacology, Biochemistry and Behavior. 2004;79(1):17–24. doi: 10.1016/j.pbb.2004.06.011. [DOI] [PubMed] [Google Scholar]
- Richards JB, Zhang L, Mitchell SH, de Wit H. Delay or probability discounting in a model of impulsive behavior: Effect of alcohol. Journal of Experimental Analysis of Behavior. 1999;71:121–143. doi: 10.1901/jeab.1999.71-121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sakong J, Kang PS, Kim CY, Hwang TY, Jeon MJ, Park SY, et al. Evaluation of reliability of traditional and computerized neurobehavioral tests. Neurotoxicology. 2007;28:235–9. doi: 10.1016/j.neuro.2006.03.004. [DOI] [PubMed] [Google Scholar]
- Schrimshaw EW, Rosario M, Meyer-Bahlburg HF, Scharf-Matlick AA. Test-retest reliability of self-reported sexual behavior, sexual orientation, and psychosexual milestones among gay, lesbian, and bisexual youths. Archives of Sexual Behavior. 2006;35:225–34. doi: 10.1007/s10508-005-9006-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shaffer D, Scott M, Wilcox H, Maslow C, Hicks R, Lucas CP, et al. The Columbia Suicide Screen: Validity and reliability of a screen for youth suicide and depression. Journal of the American Academy of Child & Adolescent Psychiatry. 2004;43:71–9. doi: 10.1097/00004583-200401000-00016. [DOI] [PubMed] [Google Scholar]
- Sherman SG, Celentano DA, Mcgrath JW, Chard SE, Gangakhedkar RR, Joglekar N, et al. The reliability and validity of the Modified Condom Outcome Expectancy Scale (MCOES) among an international sample of HIV-negative partners of people living with HIV/AIDS. AIDS Care. 2003;15:359–66. doi: 10.1080/0954012031000105414. [DOI] [PubMed] [Google Scholar]
- Skeel R, Neudecker J, Pilarski C, Pytlak K. The utility of personality variables and behaviorally-based measures in the prediction of risk-taking behavior. Personality and Individual Differences. 2007;43(1):203–214. [Google Scholar]
- Takahashi T, Furukawa A, Miyakawa T, Maesato H, Higuchi S. Two-month stability of hyperbolic discount rates for delayed monetary gains in abstinent inpatient alcoholics. Neuro Endocrinol Letters. 2007;28:131–6. [PubMed] [Google Scholar]
- Troped PJ, Wiecha JL, Fragala MS, Matthews CE, Finkelstein DM, Kim J, Peterson KE. Reliability and validity of YRBS physical activity items among middle school students. Medicine Science Sports Exercise. 2007;39:416–25. doi: 10.1249/mss.0b013e31802d97af. [DOI] [PubMed] [Google Scholar]
- Weinstock J, Whelan JP, Meyers AW. Behavioral assessment of gambling: An application of the timeline followback method. Psychological Assessment. 2004;16:72–80. doi: 10.1037/1040-3590.16.1.72. [DOI] [PubMed] [Google Scholar]
- White T, Lejuez C, de Wit H. Personality and gender differences in effects of d-amphetamine on risk taking. Experimental and Clinical Psychopharmacology. 2007;15(6):599–609. doi: 10.1037/1064-1297.15.6.599. [DOI] [PubMed] [Google Scholar]
- Yao S, Yang H, Zhu X, Auerbach RP, Abela JR, Pulleyblank RW, Tong X. An examination of the psychometric properties of the Chinese version of the Barratt Impulsiveness Scale, 11th version in a sample of Chinese adolescents. Percept Mot Skills. 2007;104:1169–82. doi: 10.2466/pms.104.4.1169-1182. [DOI] [PubMed] [Google Scholar]
- Yore MM, Ham SA, Ainsworth BE, Kruger J, Reis JP, Kohl HW, 3rd, Macera CA. Reliability and validity of the instrument used in BRFSS to assess physical activity. Medicine Science Sports Exercise. 2007;39:1267–74. doi: 10.1249/mss.0b013e3180618bbe. [DOI] [PubMed] [Google Scholar]
