Abstract
Increasing evidence supports the hypothesis that impulsive decision-making is a heritable risk factor for an alcohol use disorder (AUD). Clearly identifying a link between impulsivity and AUD risk, however, is complicated by the fact that both AUDs and impulsivity are heterogeneous constructs. Understanding the link between the two requires identifying the underlying cognitive factors that lead to impulsive choices. Rodent models have established that a family history of excessive drinking can lead to the expression of a transgenerational impulsive phenotype, suggesting heritable alterations in the decision-making process. The current study explored the cognitive processes underlying impulsive choice in a validated selectively bred rodent model of excessive drinking – the alcohol preferring (‘P’) rat. Impulsivity was measured via delay discounting (DD) and P rats exhibited an impulsive phenotype compared to their outbred foundation strain - Wistar rats. Steeper discounting in P rats was associated with a lack of a prospective behavioral strategy, which was observed in Wistar rats and was directly related to DD. To further explore the underlying cognitive factors mediating these observations, a drift diffusion model of DD was constructed. These simulations supported the hypothesis that prospective memory of the delayed reward guided choice decisions, slowed discounting, and optimized the fit of the model to the experimental data. Collectively, these data suggest that a deficit in forming or maintaining a prospective behavioral plan is a critical intermediary to delay reward, and by extension, may underlie the inability to delay reward in those with increased AUD risk.
Keywords: Delay Discounting, Alcoholism, Alcohol Preferring Rat, Computational Model, Prospective Memory, Drift Diffusion Model
Introduction
Impulsivity is broadly defined as the predisposition to act prematurely without foresight (Dalley et al., 2011). The propensity to make impulsive choices is a risk factor for substance abuse disorders including alcoholism; however, the strength of this relationship is unknown (Verdejo-Garcia et al., 2008, de Wit, 2009, Dick et al., 2010, Lejuez et al., 2010, MacKillop, 2013, Dougherty et al., 2014, Jentsch et al., 2014, Jentsch and Pennington, 2014). One challenge in accurately quantifying this relationship is that both impulsivity and alcohol use disorders (AUDs) are heterogeneous constructs, influenced by a number of factors like sensation seeking, positive/negative urgency, or a lack of planning (de Wit, 2009, Lejuez et al., 2010, Rogers et al., 2010, Dalley et al., 2011, Coskunpinar et al., 2013, Jentsch et al., 2014). The broad goal of the present study was to explore the underlying factors contributing to the relationship between impulsivity and risk for excessive drinking.
Delay discounting (DD) is a behavioral measure of cognitive impulsivity and a relationship between DD and AUD has been reported (Kollins, 2003, MacKillop et al., 2011, Mitchell, 2011, MacKillop, 2013, Bickel et al., 2014a, Bickel et al., 2014b, Jentsch et al., 2014). Both alcoholism risk (Johnson et al., 1998, Enoch and Goldman, 2001, Dick and Bierut, 2006) and DD (Anokhin et al., 2014) are heritable, and there is increasing evidence that these traits are also genetically correlated (Wilhelm and Mitchell, 2008, Oberlin and Grahame, 2009, Acheson et al., 2011, Dougherty et al., 2014, Perkel et al., 2015). It is still unclear, however, to what extent excessive DD increases risk of developing an AUD, and which underlying cognitive processes might mediate this risk.
Alcohol Preferring “P” rats have been selectively bred for excessive alcohol consumption and have proven extremely useful for exploring the heritable factors and neural systems leading to this phenotype (McBride et al., 2014). Evidence is emerging that, like humans with a family history of alcoholism, rodents selectively bred for excessive drinking also exhibit impaired measures of cognition - including impulsivity (Oberlin and Grahame, 2009, Wenger and Hall, 2010, Walker et al., 2011, Beckwith and Czachowski, 2016). Moreover, alcohol naïve P rats discount delayed rewards more robustly than rodent populations that lack increased genetic risk (Beckwith and Czachowski, 2014, Perkel et al., 2015). These data suggest that the impulsive phenotype is not a consequence of excessive drinking, but rather that these two traits are regulated by similar genes and neural systems. The specific goal of the current study was to further characterize how altered cognitive processing in P rats predisposes them towards impulsive decision-making.
Drift diffusion models provide a powerful tool to explore the cognitive processes guiding fast, two-choice decisions (Ratcliff and McKoon, 2008). These models have been used extensively to examine cognitive processes required to withhold a prepotent response (Verbruggen and Logan, 2009), in addition to how environmental stimuli guide perceptual-based decisions (Shadlen et al., 1996, Romo and Salinas, 2001). Two-choice perceptual decision-making modeled as a diffusion process assumes that noisy sensory information is integrated to guide decision-making, and can account for the shape of response time distributions, the speed-accuracy tradeoff, and changes in neural firing during intertemporal choices (Ratcliff and McKoon, 2008, Kim and Lee, 2011). A recent study using a drift diffusion model supported the hypothesis that assigning value to a delayed reward requires that it be mapped to the context in which it will be received (Kurth-Nelson et al., 2012). These simulations concluded that the mapping process requires prospective memory and is critical for delaying reward (Kurth-Nelson et al., 2012). The drift diffusion model presented herein also explores how future, delayed rewards, are mapped and maintained during cognitive search, and was used to further test the hypothesis that steep delay discounting in P rats is due in part to alterations in the use of a prospective strategy. This was motivated by the fact that of the candidate cognitive processes that might mediate DD to increase AUD risk, the ability to, or use of, envisioning the future (i.e. prospection) has been shown recently to be negatively related to the severity of an individuals drinking problems (Griffiths 2012) or alcohol dependence (Heffernan, 2008). Using a modified DD task in rodent populations with varying risk of excessive alcohol consumption, and a drift diffusion model of these data, these studies tested the hypothesis that the ability to plan goal-directed behaviors involves a cognitive search process which is influenced by genetic risk and mediates impulsivity.
Materials and Methods
Animals
Sixteen male Indiana University Alcohol Preferring (P) rats and 16 non-genetically predisposed (heterogeneous) Wistar rats were provided by the Indiana University Alcohol Research Center animal production core or purchased from Harlan (Indianapolis, IN), respectively. Upon arrival animals were acclimated to single housing and a 12 hour reverse light/dark cycle with lights off at 7:00 AM for at least a week prior to testing. All animals were at least 70 days of age prior to testing and had ad lib access to food and water prior to food restriction/habituation. Over the course of approximately 2 weeks, animals were food restricted to 85% of their starting free-feeding weight and maintained under this condition throughout all experiments. All procedures were approved by the Purdue School of Science Animal Care and Use Committee and conformed to the Guidelines for the Care and Use of Mammals in Neuroscience and Behavioral Research (National Academic Press, 2003).
Operant Apparatus
Standard one compartment operant boxes (20.3 cm × 15.9 cm × 21.3 cm; Med Associates, St Albans, VT) were used for all DD procedures. Each box contained left and right retractable levers on one wall, left and right stimulus lights positioned immediately above each lever, and an easily accessible pellet hopper positioned between these left and right positioned devices. The opposite wall contained a house light and a tone generator (2,900 Hz) on the top most position.
Habituation/Shaping
Ten sucrose pellets were placed in the animals’ home cage on the final day of the 2 week induction/maintenance of food deprivation. Animals were then habituated to the operant apparatus and trained to press levers for the delivery of sucrose pellets. The first day served to habituate the animals to the operant box and familiarize them with the location of the pellet hopper where sucrose pellets were delivered. Animals were placed in the operant box for 30 minutes where 10 sucrose pellets were already available in the hopper. Animals were then progressively shaped to press the levers for delivery of sucrose pellets and habituated to the stimuli and stimuli contingencies over approximately 8 consecutive days. On day 1 of shaping, single sucrose pellets were delivered into the food hopper with increasing time intervals over a 20 minute period (30 total trials/pellets); no stimuli were used other than the sucrose pellet dispenser. On day 2 of shaping, illumination of the house light signaled the start of a trial. After 10 seconds, it was extinguished and a single sucrose pellet was dispensed. The inter trial interval (ITI) on these sessions were 15, 25, or 35 seconds, chosen at random (30 trials/pellets total). The levers and the stimulus lights above the levers were introduced on days 3 and 4, and animals were trained (i.e. ‘hand-shaped’) to press the left and the right bars for sucrose pellet delivery using successive approximation. Animals were randomly assigned to one of two levers on the first day and the opposite lever was used the following day. Some animals required one additional day of training/shaping before moving on to the next training/shaping phase. A minimum of 30 lever presses was required before moving on. On days 5 and 6, illumination of the house light signaled the start of a trial. After 10 seconds, it was extinguished and a single lever and corresponding stimulus light were extended and illuminated respectively. A response on the lever resulted in the simultaneous sounding of a 100 millisecond tone (marking the reinforced response) and delivery of a single sucrose pellet. One lever/light side was used per day, and side assignments matched those on days 3 and 4. Each session terminated following 30 minutes or 30 sucrose pellets earned, whichever came first. Days 7 and 8 included all of the stimuli. Illumination of the house light signaled the start of a trial. After 10 seconds, the house light was extinguished and both levers were extended into the chamber. The animals were required to initiate the start of the trial by pressing any one of the two levers. If a lever was not pressed after 10 seconds, the levers were retracted and the house light was presented again. If a trial was initiated, the levers retracted for 1 second, and then were extended back into the chamber as choice levers with their corresponding stimulus lights illuminated. A response on either lever resulted in the simultaneous sounding of a 100 millisecond tone (marking the reinforced response) and delivery of a single sucrose pellet. Additionally, the stimulus light above the non-chosen bar was turned off while the light above the chosen lever remained on until the end of the trial. Trials were always 35 seconds in total duration. These sessions lasted 35 minutes or were terminated following 30 sucrose pellets earned. The preferred choice lever was determined for each animal from these final 2 sessions.
Delay Discounting (DD)
Within-session adjusting amount DD procedures were adapted from Oberlin et al., (2009), and are illustrated in Table 1. The presentation of stimuli was identical to the last 2 days of shaping, except that lever pressing now resulted in the outcomes detailed here. A ‘delay lever’ was assigned for each animal to be the lever corresponding to the animal’s non-preferred side. Pressing the delay lever always resulted in the delivery of 6 sucrose pellets following a delay. The ‘immediate lever’ was the opposite lever. Pressing the immediate lever resulted in the delivery of 0–6 pellets immediately. The number of pellets delivered following a response on the immediate lever (i.e. the ‘value’ of the immediate lever) always started at 3 on a given day. On ‘choice trials’, the value of the immediate lever was increased by 1 pellet after each delay lever press, and decreased by 1 pellet after each immediate lever press. ‘Forced trials’ were incorporated to expose the animals to both the delayed and immediate reinforcers. Forced trials resulted from a response on the same choice lever for 2 consecutive choice trials, and were presented on the following trial. Failure to respond on a forced trial resulted in the repetition of an identical trial. Forced trial responses did not affect the immediate lever value. Each session terminated following 35 minutes or 30 choice trials completed. Four to 6 sessions were given for each delay in ascending, fixed delay intervals (0, 1, 2, 4, 8, and 16 seconds). Animals were given 6 daily sessions at the 0 second delay, and four daily sessions at every other delay. However, only animals who met our inclusion criteria of 70% of the maximum reward value of 6 (4.2 pellets) at the 0 second delay were moved on to the other delays. The average value of the immediate lever over the last 10 choice trials was determined for the last 3 days of each delay and used to determine the indifference point of each animal. Mean indifference points for each delay were determined for all animals and evaluated as detailed below.
Table 1.
Inter-lever Response Interval (IRI) Manipulation
To better characterize choice lever consistency (see results), a subset of the original Wistar rats were used for an additional follow-up experiment (n=8). These studies were identical to the DD procedures detailed above, except that the latency with which the levers were re-extended following trial initiation presses was manipulated (i.e. the Inter-lever Response Interval or IRI). Animals were run for 6 consecutive days at each of three delays (0, 4 and 16 seconds). However, the IRI was increased over daily sessions (within each reward delay condition) from 1 second to 16 seconds. Animals first had 2 days at the 1 second (normal) IRI delay for each of the 3 reward delay conditions; one day to re-acclimate them to the delay and the other for evaluating baseline behavior. For each subsequent day within a reward delay, IRI increased daily with delays of 2, 4, 8, and 16 seconds. In total, animals had 6 consecutive days within 3 reward delays (18 days). Our hypothesis was that by increasing IRI, Wistar rats would shift away from using a consistent lever press strategy.
Behavioral Statistics
Delay discounting data was evaluated using methods adapted from Oberlin and Grahame (2009). We opted to use this model vs. a generalized hyperbola model (Green and Myerson, 2004) since it is simpler and reduces the number of variables necessary for the model. Mean indifference point across the final 3 days of each delay were evaluated using mixed factor ANOVAs, with rat population as the between-groups factor and delay as the within-groups factor. The rate of discounting was determined using the hyperbolic fitting function: v=a/(1+kd) (Mazur, 1987). v Is the subjective value of the reward, a is the fixed value of the delay reward (6 pellets), d is the length of the delay (0–16 seconds), and k is the value fitted by the hyperbolic function. With this formula, impulsivity (k) is measured by the steepness of the rate of discounting. Thus, the larger the k value, the more impulsive the individual. Correlation and regression analyses were conducted to determine the strength and significance of particular variables of interest. Bonferroni corrected post-hoc tests were performed based on the number of comparisons when appropriate. Data were considered significant at p<0.05.
Model of Delay Discounting
To assess the possible cognitive processes that guide decision making in the within-session adjusting amount discounting task, a one dimensional drift diffusion model was used to simulate this task. The model was constructed such that a random walk accumulated towards two possible “decision-boundaries” (Figure 5A+B). The model was parameterized so one of the decision boundaries was found on each walk (i.e. similar to the behavioral data, there were very few model omissions (<1%); average experimental daily omissions P=0.5177±0.04; Wistar=0.42±0.08). A trial was defined as a series of walks with the location of the lower boundary corresponding to delay choices and the location of the upper boundary corresponding to immediate choices. The distance of the delay boundary from the origin was fixed at a distance that was proportional to the duration of the delay. The upper/immediate boundary was adjusted on each walk (moved closer to or farther from the origin), with distance from the origin inversely proportional to the subjective value (v) of the reward. The immediate boundary was adjusted depending on which boundary was found on the previous walk (i.e. the preceding “decision”), such that if a delay boundary was found on the previous walk, the immediate boundary moved closer to the origin, and vice versa. Thus, conceptually, the immediate boundary distance represented the value of the immediate lever from behavioral experiments. One trial in the model consisted of 500 walks, which was meant to simulate one day of discounting at a fixed delay as in the behavioral experiments. One-hundred trials (i.e. bootstraps) were run at each delay, which can be thought of as multiple observations (i.e. animals) over the delays. The step size and direction of the random walk y was determined by
where n was the duration of the random walk (5000 steps for all simulations), d was a diffusion coefficient which adjusted the size of each step on the walk and was 0.04 for all simulations. xi was randomly selected for each step i from N(0,1), unless noted otherwise. The trajectory of the walk accumulated from the origin (x1=0), (unless otherwise noted) towards a decision boundary located at either 1/v or −1/v, where v is the subjective value of the reward defined by the discounting function described in the behavioral experiments:
where a = actual value of the reward (in food pellets, [1…6]), and d = temporal delay (in seconds, [0, 1, 2, 4, 8, 16]). The DD parameter was defined as k〈e〉 (‘e’ denoting the mean from the experimental data; see results). In this way, the decision boundaries (±1/v) representing higher value “rewards” were closer to the origin. Our rationale for this was based on the observation that high value rewards have a higher probability of being chosen, and thus they should be easier to locate in “cognitive search-space” (Howard et al., 2005, Ratcliff and McKoon, 2008, Kurth-Nelson et al., 2012)
The lower decision boundary was constructed as a proxy for a delayed choice, a = 6 and was fixed so that −1/v was a fixed distance from the origin on each trial for a delay (d). The upper decision boundary was designed as a proxy for the immediate choice (Figure 5A+B), if the random walk crossed the upper decision boundary first (i.e. immediate choice, Figure 5B, blue line), the boundary was moved away from the origin on the next walk:
However, if the trajectory crossed the lower decision boundary first (i.e. delay choice, Figure 5B, red line) it was moved towards the origin:
To extract km from each simulation (‘m’ denoting modeled data) the mean of the immediate decision boundary was computed from each trial and for each bootstrap, which resulted in a bootstrap x delay matrix of values and km was then computed for each row resulting in 100 values per simulation (see Figure 5E, inset). The mean of these values is denoted as k〈m〉.
The origin of the random walk was adjusted in a subset of experiments to include a “memory” of which decision boundary was found by the walk on the previous trial (Figure 6A1). The rationale for this feature was arrived at following initial simulations and was based on the premise that the accumulation of information about past choices, and how they might relate to the current choice, is a necessary process in DD by which indifference points develop and differ as a function of delay length. If one boundary was found repeatedly, this would bias the walk towards this boundary. The origin bias function adjusted the origin of the walk along a sigmoidal function (Leibowitz et al., 2010) towards the previously found decision boundary via:
Where vw is the decision boundary found on walk w and o is a value that accumulates each consecutive time the boundary is found (o =[0…40]) and used to update the origin of the random walk via:
where originScale sets the maximum the origin could be moved towards the decision boundary. The origin of the walk (x1) on the next walk (w+1) was then set to the product of originBias(1/v) if the upper decision boundary was found or originBias(−1/v) for the lower. For example, if originScale = 0.25 then the random walk could be maximally initiated 25% closer to the decision boundary.
An additional term in the random walk was developed following initial simulations and also introduced in a subset of experiments in order to model how the prospective representation of the delayed choice, and its decay over time, might influence which decision boundary was found by the random walk (see Figure 6B1+2). The ‘drift bias’ decayed exponentially via modifying the distribution of the random walk N(di,1) where
and n, as previously, was the duration of the walk (5000 steps for all simulations). The assumption of exponential memory decay is common in a number of models (Murdock, 1997, Mensink and Raaijmakers, 1998).
To quantify the fit of the model to the experimental data prediction error was calculated as:
Where Y = the mean value of the immediate lever from the experimental data at delay d, and Ŷ is the mean value of the immediate reward boundary recovered from the model at delay d.
Results
Delay Discounting
One Wistar rat did not learn to lever press, and 1 Wistar and 1 P rat did not meet our inclusion criterion for magnitude discrimination (70% of the maximum reward value of 6 = 4.2 pellets) and were therefore excluded. No rat population differences were observed in the number of completed trials at any delay (main effect rat pop [F (1, 27) = 1.145, p = 0.2940]; delay*rat pop. interaction [F (5, 135) = 0.5886, p = 0.7087; Figure 1A). A clear preference for the larger-magnitude delay lever was observed at the zero second delay in both rat populations thus indicating that each rat population discriminated the magnitude of the reward (Figure 1B). Additionally, no rat population differences in indifference point were observed at the zero second delay indicating that the larger, delayed reward was valued similarly (Figure 1C). A clear shift in the preference for the delayed lever was observed across delays (choice type*delay interactions [P rats: F (5, 130) = 151.2, p<.0001]; [Wistar rats: F (5, 140) = 72.97, p<0.0001]) indicating that the animals understood the relationship between reinforcer size and the delay (Figure 1B). The pattern of forced choice trials inversely mirrored that of choice trials in the expected direction - there were a greater number of forced choices to the delay lever across delay in both groups (due to discounting/choosing more consecutive immediate rewards), with P rats displaying more forced choices to the delay lever than Wistar over delays (data not shown). Choice responding reached a stable asymptote by the final choice trial indicating that animals titred their choices based on the delay within a trial (Figure 5D1).
The results of analysis of DD across delays can be seen in Figure 1C. There were significant main effects of delay [F(5, 135)=97.60, p<0.0001], rat population [F(1, 27)=4.284, p<0.05], and a delay*rat population interaction [F(5, 135)=4.21 p<0.01]. Post-hoc testing indicated that P and Wistar rats differed significantly at the 4 (p<0.05), 8 (p<0.01), and 16 (p<0.01) second delays, with P rats displaying lower indifference points. Analysis of area under the curve (AUC) derived from 1) indifference points at each delay [t(27) = 3.46, p<0.01; see Figure 1C inset] and 2) the best-fit lines from nonlinear hyperbolic fitting [t(27) = 2.91, p<0.01] both also indicated significant group differences, with lower values in P rats. To evaluate differences in the rate of discounting (i.e. the steepness of the best-fit hyperbolic curve), k values were evaluated. Data was not normally distributed (Shapiro-Wilk; W=0.83, p<0.01), thus non-parametric Mann-Whitney tests were performed and indicated significant differences between P and Wistar rats [U(27) = 50, p<0.05]. K values were then natural-log transformed to allow for analysis with parametric statistics (and comparison with other normally distributed variables). The results of this analysis supported the results of non-parametric rat population differences in k [t(27) = 2.76, p=0.01], with steeper slopes in P rats. Thus, rat populations differed in DD with P rats displaying increased impulsivity compared to Wistar.
Lever Consistency
To further explore the cognitive processes that underlie DD, the extent to which the same lever was pressed to initiate a trial and subsequently make a choice response was quantified (i.e. initiation/choice ‘consistency’). This was based on the premise for which our study was designed; that consistent trials would reflect choice decision intent (i.e. a prospective decision strategy). The probability that trials were consistent depended on the frequency of delay vs. immediate lever initiations as well as the frequency of delay vs. immediate lever choices. The chance probability of a consistent trial for each animal/subject (s) at each delay (d) was calculated as:
Where D and I represent responding on the delayed or immediate lever, respectively and T is the total number of choices. The subscripts i and c represent choices during the initiation or choice phase, respectively. Thus, each animal had its own chance level probability of making consistent responses, which was derived from the observed pattern of initiations and choices. An analysis of chance probabilities (shaded lines in Figure 2A+B) indicated that there were no differences between rat populations. However, there was a significant main effect of delay [F(5, 135)=6.57, p<0.0001], with the chance probability of consistent choices decreasing with increasing delay length. Analysis of observed consistency revealed a significant main effect of delay [F(5, 135)=4.10, p<0.01] driven almost entirely by P rats, a main effect of rat population [F(1, 27)=15.26, p<0.001], and a significant delay*rat population interaction [F(5, 135)=2.64, p<0.05]. Bonferroni-corrected post-hoc testing detected differences between the observed consistency between rat populations at the 4 (p<0.01), 8 (p<0.001), and 16 second (p<0.001) delays, with Wistar rats displaying higher consistency than P rats at each of these delays. An analysis of the chance versus the observed consistency found a main effect in Wistar rats [F(1, 28)=21.04, p<0.0001] driven by higher consistency than chance at all delays (p’s<0.05; Figure 2A). In P rats there were no differences in chance probability versus the observed consistency (p’s>0.30; Figure 2B). Thus, Wistar rats displayed levels of consistency greater than expected by chance, whereas P’s did not. This indicates that a behavior plan guiding the choice at the initiation phase of the task was maintained through the choice phase in Wistars.
Response latencies
Separate analyses were conducted on latency to make choices on each of the two possible levers. In these analyses the median response time latency was taken for each animal over the last three days at each delay and the mean of this value was used for statistics. For immediate choice latency, there was a significant main effect of delay only [F(5, 135)=10.37, p<0.0001; Figure 3B], with generally longer latencies at the 1 and 2 second delays than other delays. However, it is important to note that latencies at the 1 and 2 second delays were not significantly greater than at other delays. For delay choice latency, there was a significant delay*rat population interaction [F(5, 135)=5.43, p=0.0001; Figure 3C], which post hoc testing indicated was driven by significantly slower responding in P rats at the 8 and 16 second delays (p’s<.05).
Relationships between behavioral measures
To evaluate the relationship between measures of impulsivity, lever consistency, and latency to make choices, correlation and a linear regression analyses were conducted. Initiation/choice consistency was significantly negatively associated with natural log transformed k values (i.e. impulsivity) in Wistar rats (r=−0.57, p<.05) but not in P rats (p=.24; Figure 4A). Initiation/choice consistency was also significantly negatively associated with choice latency in Wistar rats (r=−0.54, p<.05), but not in P rats (p=.70; Figure 4B). However, k values were significantly positively associated with choice latency in P rats (r=0.66, p<.05) but not Wistar rats (p=.17; Figure 4C). Thus, for Wistar rats, the more consistent the less impulsive, and the faster the choice decision, and for P rats, the slower the choice the more impulsive.
Drift Diffusion Model
The model was parameterized (see below) so that values could be interpreted similarly as the experimental data. For example, the decision boundaries were set as 1/v so that the immediate decision boundary could be roughly interpreted as number of pellets. In addition, the diffusion constant was fit so that 1 step corresponded to ~10 milliseconds in the experimental data (the sampling rate of behavioral hardware), resulting in similar response time distributions [two-sample KS test: D(100)=0.0875, p>0.05] between the experimental data (Figure 5C1) and the model (Figure 5C2).
The experimental data were acquired via within session adjusting amount DD procedure. This DD procedure is attractive due to the fact that an indifference point can be calculated which provides an accurate within-session measurement of the subjective value of the delayed reward. In both the experimental data and the model the value of the immediate lever was dynamic on early walks/trials and eventually reached asymptote based upon the distance of the delayed boundary from the origin (Figure 5D2) or the duration of the delay (Figure 5D1), thus the model (even prior to addition of bias features) recreated this important feature of the DD procedure.
The value of the actual reward v could be manipulated by changing the discounting function k. Manipulating v to reflect changes in the value of the reinforcer created a hyperbolic discounting function, where the probability of finding the delayed boundary decreased as a function of delay. To explore if simply modifying k was sufficient to recreate the observed discounting functions from the experimental data, the experimentally derived k values (ke) were initially used in the simulations. Upon providing the model with the values observed in the experimental data (k〈e,w〉 = 0.1963 to simulate Wistars and k〈e,p〉 = 0.4054 to simulate P rats) clear discounting was observed in each condition and discounting was slowed in simulated Wistars relative to simulated P’s (Figure 5E). However, the k values recovered from these simulations (km) were much higher than those observed in experimental data. In the case of the Wistars, the experimental k〈e,w〉 = 0.1963, which was −5.1 standard deviations lower than those observed in the simulations((km,w) = 0.3327 ± 0.027, one sample z-test, p<0.0001). In simulated P rats the experimental k〈e,p〉 = 0.4054, which was −3.08 standard deviations lower than those observed in the simulations ((km,w) = 0.5219 ± 0.038, one sample z-test, p=0.0021) (Figure 5E, inset). Thus, by only changing k, the discounting functions created by the model did not adequately recreate discounting observed in either population suggesting that additional parameters should be considered to better describe discounting in each rat population. Furthermore, the discounting parameter, k, in its pure form, provides only a mathematical description of how the perception of time interval influences perceived reward magnitude, which could each be influenced by a number of cognitive processes.
A number of observations in the behavioral data shed light on the differences in decision making that lead to altered DD between rat populations. First, Wistars discount slower (Figure 1C) and slower discounting is correlated with a propensity to respond on to-be-selected choice lever at initiation (Figure 4A). These data suggest that, in Wistars, a relationship exists in the lever selected to initiate a trial and the lever selected at the choice. There were no differences between P and Wistar in the number of initiation decisions on either the delay or immediate lever [main effect of rat population: F(1, 27)=0.13, p>0.7; rat population*delay interaction: F(5, 135)=0.92, p>0.4], but there was a slight bias in favor of initiating trials on the delay lever in both groups (≈66% delay initiations in P, and ≈64% delay initiations in Wistar). Since P and Wistar rats tend to initiate a trial on the delay lever with equal probability, this rules out an initiation bias as the cause for population differences in discounting. Rather, this suggests that, in Wistars, the initiation choice reflects a prospective strategy that may guide the decision about which lever is chosen during the choice phase. This is supported by the observation that the Wistars that were the most consistent also exhibited faster response latencies (Figure 4B). In addition, at the 8 and 16 second delay Wistars were faster to choose the delayed lever compared with P rats whereas there were no differences between populations for immediate choices (Figure 3C). Consistency between the initiation and the delay lever may reflect a process where a bias for the preferred lever accumulates over trials, reducing the need for deliberation as the preferred choice is clear. Alternatively, the consistent choices may reflect prospective process whereby the animal forms and maintains a trace of the to-be-selected lever, which then guides their decision (i.e. facilitates the mapping of the delayed reward to context). These possibilities were each modeled separately as additional parameters to explore potential differences in discounting in P and Wistar rats. For all subsequent simulations the k〈e〉 value used in the model was 0.3009 (the mean of Wistar and P rats, Figure 5E, black symbols).
To explore how an accumulating bias based on past choices might affect decision making, the origin of the random walk was updated based on which boundary was found on the previous trial. This could reflect a number of cognitive constructs such as value judgments about past choices (was it worth waiting for the delayed reward), integrating the temporal information about the delay (how long was the delay), building an association between the temporal delay and the number of pellets available immediately, or the decision process becoming increasingly easier. As such, the accumulation of information about past choices, and how they might relate to the current choice, was modeled as a sigmoidal function - the origin bias (see OriginBias equation). Modifying scaleOriginBias alone had a modest effect on the k values recovered from the simulations (Figure 6A). While it was possible to recover k values from the model similar to those observed in the P rats by modifying scaleOriginBias alone, modifying this variable did not yield k values similar to those observed in the Wistar rats. Modifying scaleOriginBias, resulted in discounting functions with an increased probability the random walk found the delay boundary at short delays (0, 1 second) and the immediate boundary at long delays (8,16 second; Figure 6A2).
Prospective memories are conceptualized as transient representations of future actions (Crystal and Wilson, 2015). To explore if a prospective process could account for alterations in discounting, random walks were transiently biased towards the delay boundary using the scaleDriftBias term. The walk was biased towards the delay boundary, and not the immediate, for the following reasons: First, there is evidence that the ability to withhold an impulsive response requires that a representation of the delayed reward must be formed and maintained (Buckner and Carroll, 2007, Kurth-Nelson et al., 2012). Second, each rodent population exhibited an initiation bias for the delay lever indicating the higher incentive motivational value of this lever. Additionally, an immediate decision does not likely require the level of planning and maintenance of information about the to-be-made-choice compared with a delay choice. This is supported by the observation that the Wistars that choose the delay lever most often (e.g. slowest discounters) are the ones that are the most consistent (Figure 4A). Further support for this idea is observed as an increase in latency to choose on immediate choices versus delay choices [main effect of choice type, F (1, 28) = 8.57, p = 0.0067] and a strong relationship between choice latency and consistency in Wistars (Figure 4B).
We hypothesized that if a memory of the incentive motivational value of this lever can be maintained up to the choice, then this can bias the choice process. Indeed the inclusion of the scaleDriftBias term dramatically slowed discounting and was capable of yielding k〈m〉 values in the range observed in the experimental data for both Wistar and P rats (Figure 6B1). An example where scaleDriftBias = 0.20, and scaleOriginBias = 0 is shown in Figure 6B3. After including this variable, the k〈m〉 values recovered from the simulations were 0.27335 ± 0.024 (μ ± σ), which slowed discounting relative to when scaleDriftBias = 0 (k〈m〉 = 0.4294 ± 0.035).
While k〈e,p〉 values of the P rat experimental data could be recreated by changing the scaleOriginBias or scaleDriftBias (grey circles, Figure 7A1), the fit of the model to the P rat experimental data was best (e.g. prediction error was minimized) when the scaleOriginBias=0.36 and scaleDriftBias=0 (Figure 7A3), indicating that changing the origin bias more adequately models the P rat experimental data.
A number of combinations of the origin and drift bias existed that yielded values close to those observed in the Wistar experimental data (i.e. several parameter combinations generated low prediction error; see black circles in Figure 7A1–2). However, the parameters that best minimized the prediction error (Figure 7) were scaleOriginBias=0.2 and scaleDriftBias=0.26. The improvement in the fit of the model to the Wistar data was small when the scaleOriginBias=0, indicating that while this parameter made improvements in the fit, the drift bias was far more important in creating an optimal fit to the experimental data in Wistar rats.
If Wistar rats formed a plan of their choice at the initiation of the task and maintained this in memory, one would expect a faster response time at the choice. In the experimental data, faster response latencies for delayed rewards were observed in Wistar rats at 8 and 16 second delays (Figure 3C). To explore this finding in the model we used the parameters for the origin and drift bias that optimally recreated the k〈m〉 values and minimized prediction error for each rat population (see above). Similar to the experimental data, a rat population*choice (immediate vs. delay) interaction [F(1,499987)=76614.43, p<0.0001] was observed for the natural log transformed response times recovered from the model. In addition, the distributions for the response times for an immediate versus a delay response were more similar in modeled Wistars (Kullback-Leibler Divergence=1.89) than the P’s (Kullback-Leibler divergence=5.53; Figure 7C1+2). Collectively, these data indicate that, similar to the experimental data, optimizing the model parameters based on the Wistar data decreased the latency of the random walk to find the delay boundary. Thus the inclusion of the origin bias optimized the fit of the model to the P rat experimental data and the Wistar experimental data was optimally fit by including the drift bias. Taken together, these data highlight potential differences in strategy employed by each rat population used to guide decision-making and further suggest that the ability to effectively plan a decision is key to mitigating an impulsive choice.
Inter-lever Response Interval (IRI)
We next designed a set of experiments to evaluate the influence of inserting a delay between the initiation lever and access to the choice lever in the subset of Wistar rats. The primary goal of these experiments was to determine if increasing the temporal delay between trial initiation and choice availability would interfere with lever consistency. We hypothesized that increasing IRI delay would interfere with consistency and increase choice latency.
At all 3 reward delays there were significant IRI*consistency interactions [0sec: [F(4, 64)=7.02, p<0.0001; 4sec: F(4, 64)=3.95, p<0.01; 16sec: F(4, 64)=3.95, p<0.01]. Post-hoc testing indicated above chance differences in consistency at the 1 (p’s<.001) and 2 (p’s<.001) second IRIs at all 3 reward delays, and above chance differences at the 4 (p’s<.01) and 8 (p’s<.05) second IRIs at the 4 and 16 second reward delays (Figure 8). At all 3 reward delays the 16 second IRI decreased consistency to chance levels. Thus, longer IRI intervals decreased consistent lever responding.
IRI length increased choice latency at all reward delays (main effects of IRI). Interestingly, significant increases in choice latency were only observed at IRI lengths where consistency was no greater than chance. These results support the observation that the consistent lever press strategy in Wistar rats leads to decreases in latency to choose a reward. IRI had no impact on indifference points as a function of IRI delay (no main effects or interactions of IRI).
Discussion
These results support recent work demonstrating that alcohol preferring “P” rats display increased impulsivity relative to other alcohol-preferring, non-alcohol-preferring, and heterogeneous rat strains (Beckwith and Czachowski, 2014, Perkel et al., 2015), as well as reports of greater impulsivity in other populations of selectively bred alcohol preferring rodents (Wilhelm and Mitchell, 2008, Oberlin and Grahame, 2009, Mitchell, 2011). These data are also consistent with human studies that find a relationship between impulsivity, measured via DD, and those FH+ for an AUD (Mitchell, 2011). While these data clearly indicate a relationship between impulsivity and FH of excessive drinking, the relationship between impulsivity and FH does not appear to extend to all FH+ subjects or rodents selectively bred for excessive drinking (Wilhelm et al., 2007, Wilhelm and Mitchell, 2012, Beckwith and Czachowski, 2014). These studies highlight the need to further refine our understanding of the various cognitive functions that influence impulsivity and the specific modalities of alcohol drinking each is related to.
In the experimental data used to fit the model a clear preference for the delay lever was observed at the zero second delay (Figure 1B), increasing delays also reduced the probability of choosing the delay lever (Figure 1B), and a stable choice responding was observed at the end of the trial (Figure 5D1). Collectively these data indicate that both P and Wistar rats demonstrated mastery of the behavioral task. A previous study, also employing a computational model, found no evidence that adjusting the delay influenced decision-making in a within-session adjusting delay task (Cardinal et al., 2002). However, a number of experimental and procedural differences prevent the comparison of these data to the model presented herein.
It is possible that the light cue remaining on following choice decisions exacerbated the difference in DD between the 2 different rat populations given that cues marking delay choices have been shown to increase DD in groups of highly impulsive animals and decrease DD in groups of animals with low impulsivity (Zeeb et al., 2010). That this cue was present regardless of which reward lever was chosen in our studies vs. only the delay choice in the study referenced above, and given the fact that it remained on up to many seconds after reward delivery until trial end (24 seconds – choice latency) makes this less likely in our view, but still a possibility.
In the current study a clear relationship was observed in the behavioral strategy used (e.g. consistency) and DD - but only in Wistar rats (Figures 1+2+4). No detectable strategy was observed in P rats (Figure 2B+4A), who discounted steeply (Figure 1). The effect of an animals ‘family history’ on consistency was robust and reproducible across all reward delay conditions tested (Figure 2A+B). This observation was key to better understand why P rats discounted more steeply than Wistar rats, and to further explore this a drift diffusion model of DD was employed.
Drift diffusion models are attractive because by modifying a few, well-defined parameters a number of phenomena related to decision-making can be modeled. As in previous studies, a random walk was used to model the cognitive search process (Kurth-Nelson et al., 2012), whereby information about the reward and delay are integrated to guide decision-making. The origin of the walk and the mean of the distribution from which the random walk was constructed, were each initially set to zero, which lead to a search process that was only influenced by reward magnitude and delay (i.e. the location of the decision boundaries). The decision boundaries, however, reflect an environmental variable that is external to the cognitive search process. In this regard, the cognitive search process was initiated de novo on each walk. However, as the animal progressed through the task, it was likely that information acquired from previous trials influenced the search process. In line with this view, it has been shown that information gleaned from previous trials, such as value judgments, can shape future decisions (Steiner and Redish, 2014).
Sequential sampling algorithms are attractive given their ability to accurately model response time distributions (Brown and Heathcote, 2008, Ratcliff and McKoon, 2008, Rodriguez et al., 2014). Response time distributions in two choice decision making procedures are typically positively skewed and in the current study, this phenomena was also observed in both the experimental and modeled data (Figure 3A+5C). As choices become more difficult or in situations where accuracy is stressed over speed, response times increase (Bogacz et al., 2010). Temporally discounted rewards have also been shown to result in longer response times for choice selection (Roesch et al., 2006, Oberlin and Grahame, 2009, Beckwith and Czachowski, 2014, Rodriguez et al., 2014). The geometry of the model presented herein lead to an increase in response time with increasing delays, which is consistent with previous experimental and modeling data.
Increases in reaction times with increasing delays may be related to the fact that as a reward becomes more temporally distant, it takes longer for the cognitive search process to map the reward to context (Kurth-Nelson et al., 2012). However, a linear increase in response times across delays was not observed in our experimental data, suggesting the search process was altered in our task compared with those previously reported (Figure 3). For delayed choices, response times tended to decrease in Wistar rats across delays, whereas they increased in P rats at the 8 and 16 second delays (Figure 3). The observed delay choice response times suggest the search process was facilitated somehow in Wistar rats and, at short delays, in P rats. For immediate choices, response times across delays were complex. In both rat populations, reaction times increased up to the 2 second delay and then dropped dramatically at the 4–16 second delays. This observation suggests that decision-making is a dynamic process that is influenced by genetic background, evolves across delays, and possibly influenced by cognitive strategy.
When the experimentally derived k〈e〉 values were used in the model, the resulting fit of the discounting functions was poor in terms of recovered k〈m〉 values and the mean immediate boundary values observed at each delay. The recovered k〈m〉 values from the model were considerably higher than those observed experimentally (Figure 5E) and the function was “compressed” compared to the experimental data (i.e. immediate lever values tended to be too small at short delays and too large at long delays). This suggested that parameters other than k value should be considered, and motivated the decision to explore how biasing the random walk changed the fit of the model to the experimental data.
The origin bias was introduced to model a decision-making strategy where the current choice was biased by information gleaned from previous choices. In other words, finding one boundary on previous walks increased the probability of finding it in the future, which could reflect the decision process becoming easier (i.e. little deliberation is required). The inclusion of this parameter improved the fit of the k〈m〉 values and minimized prediction error of the model to the experimental data from Wistar or P rats. In addition, this parameter increased the mean immediate boundary value at the 0 second delay indicating that the walk was increasingly biased towards the delayed boundary. Thus the model data suggest that, when no delay is present, making the decision-making process easier was necessary to optimally model the experimental data. However, while the inclusion of the origin bias optimized the fit of the model to the P rat data, this was not sufficient to model the Wistar data.
Wistar, but not P rats, consistently initiated a trial on the lever that was ultimately chosen in the choice epoch. This suggested that a behavioral plan was formed at, or close to, the initiation of the trial and was used to guide responding in the choice epoch. This was interpreted as Wistar rats forming a prospective memory of their choice thus reflecting a ‘proactive’ mechanism of behavioral control (Braver, 2012). This bias in the behavioral plan for the delayed lever was conceptualized in the model as a transient prospective memory of the delayed reward that decayed over time and modeled as the drift bias. The inclusion of this factor, slowed discounting, and provided an optimal fit to the Wistar experimental data. In addition, the inclusion of the drift bias reduced the number of steps required by the walk to find the delayed boundary, bringing it closer to the number of steps required to find the immediate boundary (Figure 6). This better modeled the experimental data as the choice latencies in Wistar rats were similar for immediate and delay choices at the 8 and 16 second delay, whereas delay choices were longer in P rats. The ability of the drift bias to slow discounting and optimize the fit of the model to Wistar data suggests that a prospective memory, or behavioral plan, will facilitate the decision-making process. These data were further supported by follow-up experiments in which increasing IRI length simultaneously decreased consistency and increased choice latency in Wistar rats (Figure 8). Collectively these data underscore the importance of the ability to create a behavioral plan and hold this in prospective memory to delay reward and mitigate impulsive choices.
Addiction is characterized by an inability to delay immediate reward in light of future outcomes (de Wit, 2009). In this way, the alcoholic is constantly faced with a choice; do I drink excessively to feel good now, or shall I abstain to feel good in the future? This choice requires that future consequences be conceptualized and integrated into a decision to drink or abstain. As discussed, conceptualizing future rewards requires their value be mapped to context in order to form a prospective memory, which may be a critical intermediary to delay reward. Lack of planning, which involves prospective memory, has been described as one of five types of impulsivity (Whiteside and Lynam, 2003) and a recent meta-analysis found that lack of planning was one of the strongest predictors of alcohol dependence (Coskunpinar et al., 2013). In line with this view, non-planning impulsiveness in recovering alcoholics has been shown to be associated with alterations in the activation and functional connectivity of brain regions thought to be critical for cognitive control such as the dorsal anterior cingulate cortex (Jung et al., 2014).
The coordinated activity of multiple brain regions is necessary for tasks involving inter-temporal decision-making such as those described here. Among these regions, alterations in neural processing in the hippocampus and frontal cortex are prime candidates for the differences observed between P and Wistar in the current studies (Peters and Buchel, 2010, 2011). The hippocampus participates in the regulation of retrospective and prospective forms of episodic memory (Ferbinteanu and Shapiro, 2003), with the former necessary for taking into account the consequences of previous reward choices, and the later necessary for forming plans based on the consequences of these previous choices. Given that lesions of the hippocampus in rats have been shown to increase impulsive choices (Rawlins et al., 1985, Cheung and Cardinal, 2005, Mariano et al., 2009), and alterations have been reported in mRNA, protein, monoamines, etc., in this brain region in P rats (McBride et al., 1990, Stewart and Li, 1997, McBride and Li, 1998, Witzmann et al., 2003, Edenberg et al., 2005), functional differences between P and Wistar in the hippocampus may account in part for differences behavioral strategy and DD we observed in these studies. Regions of the frontal cortex are also known to be involved in the discounting of rewards (Cardinal, 2006) as well as regulating other cognitive and executive functions (Dalley et al., 2004). Interestingly, P rats display lower basal dopamine (DA) concentrations in the Prefrontal Cortex (PFC) compared to Wistar rats (Engleman et al., 2006). PFC dopamine has been found to facilitate the modification of choice biases during cost/benefit decision-making (St Onge et al., 2012) suggesting that hypodompaminergia of the PFC in P rats may also be an important neurobiological mediator of DD in this rat population.
Collectively these data identify heritable alterations in the decision-making process that lead to an impulsive phenotype and a heightened risk for excessive drinking. Moreover, these data highlight the need for further study on the neural processes that mediate impulsivity, and on the cognitive control mechanisms by which these processes are altered in individuals FH+ for an AUD.
Acknowledgments
This work was supported by NIAAA grant #’s AA022821 (CCL), AA023786 (CCL), AA007611 (CCL), and AA022268 (DNL). This research was supported in part by Lilly Endowment, Inc., through its support for the Indiana University Pervasive Technology Institute, and in part by the Indiana METACyt Initiative. The Indiana METACyt Initiative at IU is also supported in part by Lilly Endowment, Inc.
References
- Acheson A, Richard DM, Mathias CW, Dougherty DM. Adults with a family history of alcohol related problems are more impulsive on measures of response initiation and response inhibition. Drug Alcohol Depend. 2011;117:198–203. doi: 10.1016/j.drugalcdep.2011.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Anokhin AP, Grant JD, Mulligan RC, Heath AC. The Genetics of Impulsivity: Evidence for the Heritability of Delay Discounting. Biol Psychiatry. 2014 doi: 10.1016/j.biopsych.2014.10.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beckwith SW, Czachowski CL. Increased delay discounting tracks with a high ethanol-seeking phenotype and subsequent ethanol seeking but not consumption. Alcohol Clin Exp Res. 2014;38:2607–2614. doi: 10.1111/acer.12523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beckwith SW, Czachowski CL. Alcohol-Preferring P Rats Exhibit Elevated Motor Impulsivity Concomitant with Operant Responding and Self-Administration of Alcohol. Alcohol Clin Exp Res. 2016;40:1100–1110. doi: 10.1111/acer.13044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bickel WK, Johnson MW, Koffarnus MN, MacKillop J, Murphy JG. The behavioral economics of substance use disorders: reinforcement pathologies and their repair. Annual review of clinical psychology. 2014a;10:641–677. doi: 10.1146/annurev-clinpsy-032813-153724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bickel WK, Koffarnus MN, Moody L, Wilson AG. The behavioral- and neuro-economic process of temporal discounting: A candidate behavioral marker of addiction. Neuropharmacology. 2014b;76(Pt B):518–527. doi: 10.1016/j.neuropharm.2013.06.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bogacz R, Wagenmakers EJ, Forstmann BU, Nieuwenhuis S. The neural basis of the speed-accuracy tradeoff. Trends Neurosci. 2010;33:10–16. doi: 10.1016/j.tins.2009.09.002. [DOI] [PubMed] [Google Scholar]
- Braver TS. The variable nature of cognitive control: a dual mechanisms framework. Trends in cognitive sciences. 2012;16:106–113. doi: 10.1016/j.tics.2011.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown SD, Heathcote A. The simplest complete model of choice response time: linear ballistic accumulation. Cognitive psychology. 2008;57:153–178. doi: 10.1016/j.cogpsych.2007.12.002. [DOI] [PubMed] [Google Scholar]
- Buckner RL, Carroll DC. Self-projection and the brain. Trends in cognitive sciences. 2007;11:49–57. doi: 10.1016/j.tics.2006.11.004. [DOI] [PubMed] [Google Scholar]
- Cardinal RN, Daw N, Robbins TW, Everitt BJ. Local analysis of behaviour in the adjusting-delay task for assessing choice of delayed reinforcement. Neural Netw. 2002;15:617–34. doi: 10.1016/s0893-6080(02)00053-9. [DOI] [PubMed] [Google Scholar]
- Cardinal RN. Neural systems implicated in delayed and probabilistic reinforcement. Neural networks: the official journal of the International Neural Network Society. 2006;19:1277–1301. doi: 10.1016/j.neunet.2006.03.004. [DOI] [PubMed] [Google Scholar]
- Cheung TH, Cardinal RN. Hippocampal lesions facilitate instrumental learning with delayed reinforcement but induce impulsive choice in rats. BMC neuroscience. 2005;6:36. doi: 10.1186/1471-2202-6-36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coskunpinar A, Dir AL, Cyders MA. Multidimensionality in impulsivity and alcohol use: a meta-analysis using the UPPS model of impulsivity. Alcohol Clin Exp Res. 2013;37:1441–1450. doi: 10.1111/acer.12131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crystal JD, Wilson AG. Prospective memory: a comparative perspective. Behavioural processes. 2015;112:88–99. doi: 10.1016/j.beproc.2014.07.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dalley JW, Cardinal RN, Robbins TW. Prefrontal executive and cognitive functions in rodents: neural and neurochemical substrates. Neurosci Biobehav Rev. 2004;28:771–784. doi: 10.1016/j.neubiorev.2004.09.006. [DOI] [PubMed] [Google Scholar]
- Dalley JW, Everitt BJ, Robbins TW. Impulsivity, compulsivity, and top-down cognitive control. Neuron. 2011;69:680–694. doi: 10.1016/j.neuron.2011.01.020. [DOI] [PubMed] [Google Scholar]
- de Wit H. Impulsivity as a determinant and consequence of drug use: a review of underlying processes. Addict Biol. 2009;14:22–31. doi: 10.1111/j.1369-1600.2008.00129.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dick DM, Bierut LJ. The genetics of alcohol dependence. Curr Psychiatry Rep. 2006;8:151–157. doi: 10.1007/s11920-006-0015-1. [DOI] [PubMed] [Google Scholar]
- Dick DM, Smith G, Olausson P, Mitchell SH, Leeman RF, O’Malley SS, Sher K. Understanding the construct of impulsivity and its relationship to alcohol use disorders. Addict Biol. 2010;15:217–226. doi: 10.1111/j.1369-1600.2009.00190.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dougherty DM, Charles NE, Mathias CW, Ryan SR, Olvera RL, Liang Y, Acheson A. Delay discounting differentiates pre-adolescents at high and low risk for substance use disorders based on family history. Drug Alcohol Depend. 2014;143:105–111. doi: 10.1016/j.drugalcdep.2014.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edenberg HJ, Strother WN, McClintick JN, Tian H, Stephens M, Jerome RE, Lumeng L, Li TK, McBride WJ. Gene expression in the hippocampus of inbred alcohol-preferring and -nonpreferring rats. Genes Brain Behav. 2005;4:20–30. doi: 10.1111/j.1601-183X.2004.00091.x. [DOI] [PubMed] [Google Scholar]
- Engleman EA, Ingraham CM, McBride WJ, Lumeng L, Murphy JM. Extracellular dopamine levels are lower in the medial prefrontal cortex of alcohol-preferring rats compared to Wistar rats. Alcohol. 2006;38:5–12. doi: 10.1016/j.alcohol.2006.03.001. [DOI] [PubMed] [Google Scholar]
- Enoch MA, Goldman D. The genetics of alcoholism and alcohol abuse. Curr Psychiatry Rep. 2001;3:144–151. doi: 10.1007/s11920-001-0012-3. [DOI] [PubMed] [Google Scholar]
- Ferbinteanu J, Shapiro ML. Prospective and retrospective memory coding in the hippocampus. Neuron. 2003;40:1227–1239. doi: 10.1016/s0896-6273(03)00752-9. [DOI] [PubMed] [Google Scholar]
- Guidelines for the Care and Use of Mammals in Neuroscience and Behavioral Research. Washington DC: National Academy of Sciences; 2003. [PubMed] [Google Scholar]
- Green L, Myerson J. A discounting framework for choice with delayed and probabilistic rewards. Psychol Bull. 2004;130:769–792. doi: 10.1037/0033-2909.130.5.769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Howard MW, Fotedar MS, Datey AV, Hasselmo ME. The temporal context model in spatial navigation and relational learning: toward a common explanation of medial temporal lobe function across domains. Psychol Rev. 2005;112:75–116. doi: 10.1037/0033-295X.112.1.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jentsch JD, Ashenhurst JR, Cervantes MC, Groman SM, James AS, Pennington ZT. Dissecting impulsivity and its relationships to drug addictions. Annals of the New York Academy of Sciences. 2014;1327:1–26. doi: 10.1111/nyas.12388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jentsch JD, Pennington ZT. Reward, interrupted: Inhibitory control and its relevance to addictions. Neuropharmacology. 2014;76(Pt B):479–486. doi: 10.1016/j.neuropharm.2013.05.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson EO, van den Bree MB, Gupman AE, Pickens RW. Extension of a typology of alcohol dependence based on relative genetic and environmental loading. Alcohol Clin Exp Res. 1998;22:1421–1429. doi: 10.1111/j.1530-0277.1998.tb03930.x. [DOI] [PubMed] [Google Scholar]
- Jung YC, Schulte T, Muller-Oehring EM, Namkoong K, Pfefferbaum A, Sullivan EV. Compromised frontocerebellar circuitry contributes to nonplanning impulsivity in recovering alcoholics. Psychopharmacology (Berl) 2014;231:4443–4453. doi: 10.1007/s00213-014-3594-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim S, Lee D. Prefrontal cortex and impulsive decision making. Biol Psychiatry. 2011;69:1140–1146. doi: 10.1016/j.biopsych.2010.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kollins SH. Delay discounting is associated with substance use in college students. Addictive behaviors. 2003;28:1167–1173. doi: 10.1016/s0306-4603(02)00220-4. [DOI] [PubMed] [Google Scholar]
- Kurth-Nelson Z, Bickel W, Redish AD. A theoretical account of cognitive effects in delay discounting. Eur J Neurosci. 2012;35:1052–1064. doi: 10.1111/j.1460-9568.2012.08058.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lejuez CW, Magidson JF, Mitchell SH, Sinha R, Stevens MC, de Wit H. Behavioral and biological indicators of impulsivity in the development of alcohol use, problems, and disorders. Alcohol Clin Exp Res. 2010;34:1334–1345. doi: 10.1111/j.1530-0277.2010.01217.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKillop J. Integrating behavioral economics and behavioral genetics: delayed reward discounting as an endophenotype for addictive disorders. Journal of the experimental analysis of behavior. 2013;99:14–31. doi: 10.1002/jeab.4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKillop J, Amlung MT, Few LR, Ray LA, Sweet LH, Munafo MR. Delayed reward discounting and addictive behavior: a meta-analysis. Psychopharmacology (Berl) 2011;216:305–321. doi: 10.1007/s00213-011-2229-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mariano TY, Bannerman DM, McHugh SB, Preston TJ, Rudebeck PH, Rudebeck SR, Rawlins JN, Walton ME, Rushworth MF, Baxter MG, Campbell TG. Impulsive choice in hippocampal but not orbitofrontal cortex-lesioned rats on a nonspatial decision-making maze task. Eur J Neurosci. 2009;30:472–484. doi: 10.1111/j.1460-9568.2009.06837.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mazur J. An adjusting procedure for studying delayed reinforcement. In: Commons ML, Mazur JE, Nevin JA, Rachlin H, editors. The Effect of Delay and of Intervening Events on Reinforcement Value. Lawrence Erlbaum; Hillsdale, N.J: 1987. pp. 5pp. 55–73. Quantitative analysis of behavior series. [Google Scholar]
- McBride WJ, Li TK. Animal models of alcoholism: neurobiology of high alcohol-drinking behavior in rodents. Crit Rev Neurobiol. 1998;12:339–369. doi: 10.1615/critrevneurobiol.v12.i4.40. [DOI] [PubMed] [Google Scholar]
- McBride WJ, Murphy JM, Lumeng L, Li TK. Serotonin, dopamine and GABA involvement in alcohol drinking of selectively bred rats. Alcohol. 1990;7:199–205. doi: 10.1016/0741-8329(90)90005-w. [DOI] [PubMed] [Google Scholar]
- McBride WJ, Rodd ZA, Bell RL, Lumeng L, Li TK. The alcohol-preferring (P) and high-alcohol-drinking (HAD) rats--animal models of alcoholism. Alcohol. 2014;48:209–215. doi: 10.1016/j.alcohol.2013.09.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mensink G-JM, Raaijmakers JGW. A model for interference and forgetting. Psychological Review. 1988;95:434–455. [Google Scholar]
- Mitchell SH. The genetic basis of delay discounting and its genetic relationship to alcohol dependence. Behavioural processes. 2011;87:10–17. doi: 10.1016/j.beproc.2011.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murdock BB. Context and mediators in a theory of distributed associative memory (TODAM2) Psychological Review. 1997;104:839–862. [Google Scholar]
- Oberlin BG, Grahame NJ. High-alcohol preferring mice are more impulsive than low-alcohol preferring mice as measured in the delay discounting task. Alcohol Clin Exp Res. 2009;33:1294–1303. doi: 10.1111/j.1530-0277.2009.00955.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perkel JK, Bentzley BS, Andrzejewski ME, Martinetti MP. Delay discounting for sucrose in alcohol-preferring and nonpreferring rats using a sipper tube within-sessions task. Alcohol Clin Exp Res. 2015;39:232–238. doi: 10.1111/acer.12632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peters J, Buchel C. Episodic future thinking reduces reward delay discounting through an enhancement of prefrontal-mediotemporal interactions. Neuron. 2010;66:138–148. doi: 10.1016/j.neuron.2010.03.026. [DOI] [PubMed] [Google Scholar]
- Peters J, Buchel C. The neural mechanisms of inter-temporal decision-making: understanding variability. Trends in cognitive sciences. 2011;15:227–239. doi: 10.1016/j.tics.2011.03.002. [DOI] [PubMed] [Google Scholar]
- Ratcliff R, McKoon G. The diffusion decision model: theory and data for two-choice decision tasks. Neural computation. 2008;20:873–922. doi: 10.1162/neco.2008.12-06-420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rawlins JN, Feldon J, Butt S. The effects of delaying reward on choice preference in rats with hippocampal or selective septal lesions. Behav Brain Res. 1985;15:191–203. doi: 10.1016/0166-4328(85)90174-3. [DOI] [PubMed] [Google Scholar]
- Rodriguez CA, Turner BM, McClure SM. Intertemporal choice as discounted value accumulation. PloS one. 2014;9:e90138. doi: 10.1371/journal.pone.0090138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roesch MR, Taylor AR, Schoenbaum G. Encoding of time-discounted rewards in orbitofrontal cortex is independent of value representation. Neuron. 2006;51:509–520. doi: 10.1016/j.neuron.2006.06.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rogers RD, Moeller FG, Swann AC, Clark L. Recent research on impulsivity in individuals with drug use and mental health disorders: implications for alcoholism. Alcohol Clin Exp Res. 2010;34:1319–1333. doi: 10.1111/j.1530-0277.2010.01216.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Romo R, Salinas E. Touch and go: decision-making mechanisms in somatosensation. Annual review of neuroscience. 2001;24:107–137. doi: 10.1146/annurev.neuro.24.1.107. [DOI] [PubMed] [Google Scholar]
- Shadlen MN, Britten KH, Newsome WT, Movshon JA. A computational analysis of the relationship between neuronal and behavioral responses to visual motion. J Neurosci. 1996;16:1486–1510. doi: 10.1523/JNEUROSCI.16-04-01486.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steiner AP, Redish AD. Behavioral and neurophysiological correlates of regret in rat decision-making on a neuroeconomic task. Nat Neurosci. 2014;17:995–1002. doi: 10.1038/nn.3740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- St Onge JR, Ahn S, Phillips AG, Floresco SB. Dynamic fluctuations in dopamine efflux in the prefrontal cortex and nucleus accumbens during risk-based decision making. J Neurosci. 2012;32:16880–16891. doi: 10.1523/JNEUROSCI.3807-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stewart RB, Li TK. The neurobiology of alcoholism in genetically selected rat models. Alcohol health and research world. 1997;21:169–176. [PMC free article] [PubMed] [Google Scholar]
- Verbruggen F, Logan GD. Models of response inhibition in the stop-signal and stop-change paradigms. Neurosci Biobehav Rev. 2009;33:647–661. doi: 10.1016/j.neubiorev.2008.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Verdejo-Garcia A, Lawrence AJ, Clark L. Impulsivity as a vulnerability marker for substance-use disorders: review of findings from high-risk research, problem gamblers and genetic association studies. Neurosci Biobehav Rev. 2008;32:777–810. doi: 10.1016/j.neubiorev.2007.11.003. [DOI] [PubMed] [Google Scholar]
- Walker SE, Pena-Oliver Y, Stephens DN. Learning not to be impulsive: disruption by experience of alcohol withdrawal. Psychopharmacology (Berl) 2011;217:433–442. doi: 10.1007/s00213-011-2298-0. [DOI] [PubMed] [Google Scholar]
- Wenger GR, Hall CJ. Rats selectively bred for ethanol preference or nonpreference have altered working memory. J Pharmacol Exp Ther. 2010;333:430–436. doi: 10.1124/jpet.109.159350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Whiteside SP, Lynam DR. Understanding the role of impulsivity and externalizing psychopathology in alcohol abuse: application of the UPPS impulsive behavior scale. Exp Clin Psychopharmacol. 2003;11:210–217. doi: 10.1037/1064-1297.11.3.210. [DOI] [PubMed] [Google Scholar]
- Wilhelm CJ, Mitchell SH. Rats bred for high alcohol drinking are more sensitive to delayed and probabilistic outcomes. Genes Brain Behav. 2008;7:705–713. doi: 10.1111/j.1601-183X.2008.00406.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilhelm CJ, Mitchell SH. Acute ethanol does not always affect delay discounting in rats selected to prefer or avoid ethanol. Alcohol Alcohol. 2012;47:518–524. doi: 10.1093/alcalc/ags059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilhelm CJ, Reeves JM, Phillips TJ, Mitchell SH. Mouse lines selected for alcohol consumption differ on certain measures of impulsivity. Alcohol Clin Exp Res. 2007;31:1839–1845. doi: 10.1111/j.1530-0277.2007.00508.x. [DOI] [PubMed] [Google Scholar]
- Witzmann FA, Li J, Strother WN, McBride WJ, Hunter L, Crabb DW, Lumeng L, Li TK. Innate differences in protein expression in the nucleus accumbens and hippocampus of inbred alcohol-preferring and -nonpreferring rats. Proteomics. 2003;3:1335–1344. doi: 10.1002/pmic.200300453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zeeb FD, Floresco SB, Winstanley CA. Contributions of the orbitofrontal cortex to impulsive choice: interactions with basal levels of impulsivity, dopamine signalling, and reward-related cues. Psychopharmacology (Berl) 2010;211:87–98. doi: 10.1007/s00213-010-1871-2. [DOI] [PubMed] [Google Scholar]