Abstract
The putamen (Put) is necessary for habitual actions, while the nucleus caudate (Cd) is critical for goal-directed actions. However, compared with the natural reward (such as sucrose)-seeking habit, how drug-related dysfunction or imbalance between the Put and Cd is involved in cocaine-seeking habit, which is not easy to bias behavior to goal-directed actions, is absent. Therefore, in our present study, in comparison with sucrose-habitual behavior, we evaluated the distinctive changes of the two subtypes of dopamine (DA) receptors (D1R and D2R) in cocaine-seeking habitual behavior animals. Moreover, the adaptive changes of Cav1.2 and Cav1.3, as prime downstream targets of D1R and D2R respectively, were also assessed. Our results showed that a similar percentage of the animals exhibited habitual seeking behavior after cocaine or sucrose variable-interval self-administration (SA) training in tree shrews. In addition, compared with animals with non-habitual behavior, animals with cocaine habitual behavior showed higher D1Rs and Cav1.2 expression in the Put accompanied with lower D2Rs and Cav1.3 expression in the Cd. However, after sucrose SA training, animals with habitual behavior only showed lower membrane expression of D2R in the Put than animals with non-habitual behavior. These results suggested that the upregulation of D1Rs-Cav1.2 signaling may lead to hyper-excitability of the Put, and the inactivation of D2Rs-Cav1.3 signaling may result in depressed activity in the Cd. This imbalance function between the Put and Cd, which causes an inability to shift between habits and goal-directed actions, may underlie the compulsive addiction habit.
Keywords: cocaine-seeking habit, putamen, DA receptors, LTCCs, tree shrew
1. Introduction
Although behaviors typically are dominated by the explicit representation of desired outcomes, some behaviors, called habits, seem to struggle against these conscious bonds. Intuitively, habits, an efficient mode of information processing, serve an obvious adaptive purpose. They are usually triggered by certain stimulus automatically, which are associated with the completion of the previous behavior and the presentation of the outcome (stimulus–response association, S-R). However, habits also tend to be inflexible in some circumstances even when the environment changes. This tendency is amplified in substance abuse, which has been taken as one of the most important underlying factors for drug craving and relapse. Many studies have found that, not like habits in life, drug-seeking habits are inclined to be compulsive [1]. Indeed, in humans, when alcohol-dependent people conducted a task that can distinguish between goal-directed (A-O) or S-R strategy, they tended to overuse the S–R association [2]. In rodents, after long-term noncontingent exposure to addictive drugs, animals also tended to overly depend on the habit system despite adverse consequence [3,4]. However, how drug-related dysfunction or imbalance between the A-O strategy and S-R strategy involved in cocaine-seeking habit is absent.
It has been deeply understood that there are functional alterations from the nucleus caudate (Cd) to the putamen (Put) during the development of substance abuse. The Put mainly participates in regulating habitual behaviors, while the Cd plays a role in mediating goal-directed actions [5]. Various evidence pointed to the functional unbalance between Cd and Put may underlie compulsive drug-seeking habits. Indeed, fMRI studies have found that the Put represented overactive states specific in alcohol-dependent participants [6], suggesting that the hyperactive Put might be the main factor causing compulsive elements of drug-seeking habits.
Within the striatum, the predominant cell types are GABAergic medium spine neurons (MSNs), which are typically segregated into dopamine 1 receptor (D1R) or dopamine 2 receptor (D2R) containing [7]. These cells have different projection targets and serve distinctive functions in the reward processes. This functional discrepancy is partly because D1Rs and D2Rs have different biological characteristics [8]. They have different downstream molecular targets, dopamine affinity, and represent functional antagonists, modulating neural activity differently [9]. For instance, D1Rs and D2Rs signaling participate in plasticity at glutamatergic synapses respectively: long-term potentiation (LTP) in the striatum depends on the action of D1Rs, whereas long-term depression (LTD) in the striatum relies upon that of D2Rs [8,10]. More importantly, the different changes of these subtype receptors may underlie the compulsive elements of cocaine-seeking habitual behavior via mediating neural activity or even plasticity. Indeed, studies have shown that during cocaine habitual behavior under a second-order schedule of reinforcement, dopamine overflow increased in the Put, and infusion of a non-selective dopamine receptor antagonist into the Put reduced cocaine-seeking habits [11,12]. Although many studies have focused on dopamine signaling in habitual behaviors, few know how D1Rs and D2Rs work differently [13]. Therefore, we hypothesized that plasticity changes in the dorsal striatum subregions, which are modulated by D1Rs and D2Rs, which probably lead to the hyperactivity of Put, could be the key mechanism of compulsive elements of cocaine-seeking habits.
As one of the downstream targets of DA receptors, L-type calcium channels (LTCCs) are essential for the neuronal in the striatum [14,15]. Notably, as downstream agents of DA receptors, Cav1.2 and Cav1.3, the two most prominent LTCCs subtypes distributed in the brain, are regulated by D1Rs and D2Rs, respectively [16]. In comparison with Cav1.2, Cav1.3 channels are activated more rapidly and at more negative membrane potentials [16]. Additionally, the activation of Cav1.2 via D1Rs is the key phrase during the formation of LTP, whereas the activation of Cav1.3 via D2Rs is the main form of LTD in the striatopallidal neurons [10]. Based on the above evidence, we hypothesized that the D1Rs-Cav1.2 signaling and the D2Rs-Cav1.3 signaling might participate distinctively in sucrose-seeking and cocaine-seeking habits.
Tree shrews are increasingly being used as a new and promising animal model in neurobiological studies [17,18]. Compared with rodents, they are genetically closer to the primate [19], and, notably, have a clearer anatomical structure in the striatum to distinguish between Cd and Put [20]. Moreover, in our lab, tree shrews have been proved also suitable to establish addiction models [21]. Therefore, in the present study, using tree shrews, we explored the differential expression of D1Rs and D2Rs in the Put and Cd in habitual behavior established on natural rewards and cocaine, respectively, which is necessary for understanding the compulsive characteristics of drug addiction.
2. Methods and Materials
2.1. Animals
Adult male tree shrews (Tupaia belangeri chinensis; 130–160 g) were used (the Animal House Center of the Kunming Institute of Zoology). All animals were individually housed in rearing cages (395 × 300 × 595 mm), each of which was attached to a nest box (246 × 158 × 147 mm) that can provide sleeping quarters and functioned as a transfer box when the animal was moved from its home cage to the training apparatus. The tree shrews were kept in an air-conditioned room in which the temperature (22–25 °C) and the humidity (40%–70%) were controlled on a 12 h/12 h dark/light cycle (lights on at 8:00) for at least two weeks prior to experiments, which ensured the tree shrews adapted to the new environment. Water and food (purchase from Keaoxieli Co., Beijing, China) were available ad libitum. All procedures were conducted according to the National Institutes of Health Guide for the Care and Use of Laboratory Animals and were approved by the Research Ethics Review Board of the Institute of Psychology, Chinese Academy of Sciences (A22039).
2.2. Drug Administration
Cocaine hydrochloride (Qinghai Pharmaceutical, Qinghai, China) was dissolved in 0.9% sterile physiological saline to the final concentrations.
2.3. SA Apparatus
The tree shrews were trained and tested for self-administration (SA) in standard operant chambers for rats (Med Associates, Inc., St. Albans, VT, USA), which were placed in a sound insulation cubicle. Each chamber was installed with two cue lights in two nose-poking holes (ENV-114M, Med Associates) situated 2 cm above the floor which were equipped with horizontal bars and a house light was located on the opposite wall. The drug solution was delivered through polyethylene tubing, protected by a leash assembly (PHM-120, Med Associates), and the polyethylene tubing connected with a fluid rotary joint (PHM-115, Med Associates), which was poised through the ceiling of the chamber. The drug solution was delivered by a 10 mL syringe in an infusion pump (PHM-100, Med Associates). Lenovo computer with MED PC Software IV (Med Associates) controlled infusions and light presentations and recorded the number of nose pokes.
2.4. Surgery
In the cocaine group, tree shrews were anesthetized with pentobarbital sodium (100 mg/kg, i.p.) to implant an intravenous jugular catheter (AniLab, Ningbo, China). A silicone tube was inserted 35 mm into the right jugular vein and firmly anchored to the vein with a silk suture. The other end of the catheter passed subcutaneously to connect with a 22-gauge connector (Plastics One, Roanoke, VA, USA) mounted on the back and was plugged with a solid pin when it was not being used for drug infusions.
2.5. Western Blot Analysis
For the western blot experiments, tree shrews were decapitated at once after the devaluation test and the brains were removed and frozen immediately in N-hexane (−70 °C) for approximately 30 s. Bilateral tissue punches of the Put and the Cd were obtained by using a 16-gauge needle in cryostat (Leica). The details of the extraction of protein were described previously [16]. Briefly, an equal amount of protein (30 ug for total protein and 20 ug for membrane protein) for each sample was resolved on 8% sodium dodecyl sulphate–polyacrylamide gel electrophoresis (SDS-PAGE) gels and transferred to polyvinylidene difluoride (PVDF) membranes (Millipore, Burlington, MA, USA). Then, membranes were incubated in Tris-buffered saline (TBS) (50 mM Tris-HCl, pH 7.4, 150 mM NaCl, and 0.05% Tween 20) with 5% nonfat dry milk for 2 h at room temperature, and incubated with the following primary antibodies overnight at 4 °C: anti-Cav1.2 (Alomone Lab, Jerusalem, Israel), anti-Cav1.3 (Alomone Lab, Jerusalem, Israel), anti-D1R (Santa Cruz Biotechnology, Santa Cruz, CA, USA), anti-D2R (Abcam, Cambridge, MA, USA), and anti-β-actin Sigma, St. Louis, MO, USA). Membranes were washed in TBS/0.1% Tween 20 three times for 5 min and were then incubated in an anti-rabbit or anti-mouse secondary antibody conjugated to horseradish peroxidase (Zhongshan Biotechnology, Zhongshan City, Guangdong, China). Finally, the membranes were developed using West Dura chemiluminescent substrate (Pierce Laboratories, Waltham, MA, USA). The bands were quantified with Quantity One Analysis Software (Bio-Rad, Hercules, CA, USA). The optical density of each band was normalized to the relative optical density of β-actin protein expression to control for inconsistencies between the loaded samples.
2.6. Behavioral Procedures
Cocaine SA training sessions began 7 days following surgery. Cocaine (0.175 mg/0.05 mL per infusion in 2.5 s, intravenous injection) or 10% sucrose solution (0.2 mL per drop in 0.2 s, oral administration) was available under a fixed-ratio (FR) schedule and variable-interval (VI) schedule. The concentrations of cocaine and sucrose referred to the previous studies in our lab (please insert the references list in the comment). Three stages of both cocaine SA training and sucrose SA training were included in our study as previously described (Furlong et al., 2014), including FR1 training, VI training, and devaluation test. The details are described as follows.
FR1 training: Poking in the active hole resulted in an infusion and initiated a 20 s time-out. At the same time, the active hole-specific cue light was lighted as a conditioned stimulus (CS). The house light was turned off during the time-out. Poking in the inactive hole was recorded but had no consequence. The maximum infusions in cocaine SA training or sucrose SA training were 60 per session. Each of the FR1 sessions lasted 120 min or until the tree shrew reached maximum infusions. In the cocaine group, the animals received 8 sessions of FR1 training, while the sucrose group received 5 sessions of FR1 training. If animals could not meet the criterion that the variation of the number of active nose pokes within the last 3 sessions fell below 20%, they were excluded. Inactive and active nose poking assignments were counter balanced.
VI training: The VI training was introduced following FR1 training, in which the reward (cocaine or sucrose) would be delivered in variable intervals after poking the active hole. In the cocaine group, animals received VI training for 6 days, in which the average of the interval was from 5 s to 40 s. In the sucrose SA training, tree shrews received VI30s training for 5 continuous days. Each session lasted for two hours.
Devaluation test: All animals that finished the above two stages were able to receive a devaluation test. The devaluation test occurred in the same conditions as for FR1 sessions but without rewards. The test lasted for 40 min in the cocaine group and 60 min in the sucrose group. The number of nose pokes was recorded every 10 min. Through this test, tree shrews could be divided into habitual and non-habitual groups. The habitual group maintained the number of nose pokes during the last three ten-minute intervals, while the non-habitual group rapidly decreased the number of nose pokes.
2.7. Statistical Analysis
For behavioral experiment, data were analyzed using two-way ANOVA, followed by the Bonferroni post hoc tests. For the western blot analysis, normalized optical density values were used to calculate the percentage fold change for each treatment group compared with the naïve tree shrews (set to 1), and these data were analyzed with t-test. All data were shown as the mean ± SEM and were processed in Graph Pad Prism 7.0.
3. Results
The establishment of habitual cocaine-seeking behaviors in tree shrews.
In past research, paradigms to examine the isolated goal-directed and habitual actions have been developed and outcome devaluation procedures are commonly used to detect whether a behavior is controlled by a goal or a habit [22,23]. The devaluation test was conducted after the VI training phase in our research (Figure 1A). After the entire training in our study, we found that tree shrews demonstrated two opposite trends in seeking behaviors (Two-way ANOVA, group × time effect: F(3, 30) = 4.657; p = 0.009) (Figure 1B). The number of valid nose pokes in some tree shrews remained at a stable level in the devaluation test (One-way ANOVA, F(3, 16) = 0.3263, p = 0.8063; n = 5), which means that the number of the valid nose pokes during the last three ten-minute intervals did not exceed 20% of the decreases from the first 10 min [24], indicating that this group of tree shrews performed habitual behaviors that were insensitive to devaluation; however, the number of valid nose pokes in other tree shrews decreased significantly (One-way ANOVA, F(3, 24) = 5.625, p = 0.0046, n = 7), indicating that they performed goal-directed behaviors. However, no difference was observed in the received dose of cocaine between the two groups (t-test, t(10) = 0.475, p = 0.645) (Figure 1C), and no differences were observed in the number of valid nose pokes during the entire FR1 (Two-way ANOVA, group effect: F(1, 10) = 0.0229, p = 0.883; session effect: F(3.089, 30.89) = 2.494, p = 0.0768; interaction effect: F(7, 70) = 0.837, p = 0.561) and VI training process (Two-way ANOVA, group effect: F(1, 10) = 0.321, p = 0.583; session effect: F(2.457, 24.57) = 0.849, p = 0.461; interaction effect: F(11, 110) = 0.837, p = 0.603) (Figure 1D,E).
These results showed that some tree shrews established cocaine-seeking habits after FR1 and VI trainings. Moreover, these results are independent of the consumption of cocaine and the magnitude of trainings.
The protein levels of D1R and Cav1.2 increased, whereas the protein levels of D2R and Cav1.3 decreased in the Put of cocaine-habitual tree shrews.
After the devaluation test, all tree shrews were sacrificed and the protein levels of D1Rs, D2Rs, Cav1.2, and Cav1.3 in the Put were determined in both the habit and non-habit behavior groups of animals (Figure 2).
The statistical data indicated that both the total and membrane protein levels of D1Rs and Cav1.2 in the Put were significantly higher in the habit tree shrews than in the non-habit animals (t-test, D1Rs total: t = 4.731, p = 0.0026; D1Rs membrane: t = 2.780, p = 0.0195; Cav1.2 total: t = 7.626, p = 0.0003; Cav1.2 membrane: t = 4.142, p = 0.0072) (Figure 2A–D), while both the total and membrane protein levels of D2Rs and Cav1.3 in the Put were lower in the habit tree shrews than in the non-habit animals (t-test, D2Rs total: t = 2.938, p = 0.0212; D2Rs membrane: t = 3.556, p = 0.0118; Cav1.3 total: t = 2.214, p = 0.0456; Cav1.3 membrane: t = 2.025, p = 0.0494;) (Figure 2E–H).
Our results showed that the protein levels of D1Rs and Cav1.2 increased, whereas the protein levels of D2Rs and Cav1.3 decreased in the Put of the well-established cocaine habit tree shrews compared with the non-habit animals.
The protein levels of D1R, D2R, Cav1.2, and Cav1.3 had no difference in the Cd of cocaine-habitual tree shrews.
Meanwhile, we also detected the total and membrane protein levels of Cav1.2, Cav1.3, D1Rs, and D2Rs in the Cd in the habit group and non-habit group with the method of western blot. We observed no difference between the habit tree shrews and the non-habit animals in both total and membrane protein levels of these molecules in the Cd (Figure 3). Our results showed that the protein levels of Cav1.2, Cav1.3, D1Rs, and D2Rs in the Cd were no different between these two groups.
The establishment of habitual sucrose-seeking behavior in tree shrews .
The same as the cocaine SA training, after the entire sucrose SA training, tree shrews demonstrated two opposite directions in seeking behaviors (Figure 4A). The number of nose pokes in some tree shrews remained at a stable level (One-way ANOVA, n = 4; F (3) = 0.758, p = 0.592), indicating that this group of tree shrews performed habitual behaviors that were insensitive to devaluation, while the number of nose pokes in other tree shrews decreased (One-way ANOVA followed by LSD post hoc test, n = 4, p < 0.001, 0.001, 0.001, 0.001, 0.001; 10–20, 20–30, 30–40, 40–50, 50–60 min vs. 0–10 min), indicating that they performed goal-directed behaviors. Moreover, two-way ANOVA revealed a significant difference within the group effect (F(1, 36) = 14.175, p = 0.009) but no time effect (F(5, 36) = 0.468, p = 0.797), nor interaction effect (F(5, 36) = 1.480, p = 0.226). t-test showed that the number of nose pokes in the non-habit group at 10–20, 30–40, 40–50, and 50–60 min was significantly fewer than the habit group (t =3.797, p = 0.009; t = 3.151, p = 0.02; t = 2.997, p = 0.024; t = 3.205, p = 0.049).
In addition, no difference was observed in the received rewards of sucrose between the two groups (t-test, n = 4, t = 2.282, p = 0.063) (Figure 4C), and no differences were observed in the number of nose pokes during the entire FR1 training (Two-way ANOVA, group effect: F(1, 27) = 0.212, p = 0.669; session effect: F(4, 27) = 0.327, p = 0.856; interaction effect: F(4, 27) = 0.393, p = 0.811) (Figure 4D). Interestingly, during the VI training, two-way ANOVA revealed a significant difference within the group effect (F(4, 28) = 17.1963, p = 0.008) and the number of valid pokes in the non-habit group was fewer than that in the habit group on the 1st, 3rd, and 4th sessions of training (t-test; t = 2.726, p = 0.034; t =3.753, p = 0.009; t = 6.310). These results showed that some tree shrews developed sucrose-seeking habits after VI trainings.
The membrane protein levels of D2R protein levels decreased in the Put of sucrose-habitual tree shrews.
To further evaluate the changes of these molecules between the well-established sucrose habitual behavior tree shrews and the non-habitual animals, we observed no differences between the habitual group and the non-habitual group in the total protein levels of Cav1.2, Cav1.3, D1Rs or D2Rs in the Put (Figure 5A,C,E,G). Then, the statistical data showed no difference between the habit tree shrews and the non-habit tree shrews in the membrane protein levels of Cav1.2, Cav1.3, or D1Rs in the Put (Figure 5B,D,H), but the membrane protein levels of D2Rs in the Put were significantly lower in the well-established habitual sucrose-seeking tree shrews than in the non-habit tree shrews (t-test, t = 2.512, p = 0.046) (Figure 5F). Our results showed that the membrane protein levels of D2R of the Put decreased in the habit tree shrews compared with the non-habit animals.
The protein levels of D1R, D2R, Cav1.2 and Cav1.3 were no different in the Cd of sucrose-habitual tree shrews.
We detected the total and membrane protein levels of Cav1.2, Cav1.3, D1Rs, and D2Rs in the Cd in the habit group and non-habit group with the method of western blot. We observed no differences between the habit tree shrews and the non-habit animals in both total and membrane protein levels of these molecules in the Cd (Figure 6).
The data were expressed as the means ± SEM and analyzed with the t-test, habit group n = 4, non-habit group n = 4.
4. Discussion
Our current study showed that both food and cocaine-seeking habits displayed insensitivity to the devaluation tests. Only forty percent of tree shrews exhibited habitual cocaine-seeking behavior, whereas the rate increased to fifty percent in the sucrose group. However, as the sample size was relatively small, this observation needs to be further validated. Moreover, we found differential alterations of dopamine receptors and L-type calcium channel subtypes between cocaine-seeking habit and sucrose-seeking habit tree shrews compared with non-habit tree shrews. Burgeoning evidence points to a maladaptive habit system underlying the behavioral manifestation of addiction. For instance, non-contingent exposure to cocaine or amphetamine expedites the formation of habitual behavior reinforced by sucrose [25,26]. In humans, addicts also represent over-reliance upon the habit system [2]. The above findings highlight the importance of comparative studies between “normal” habits and maladaptive habits in addiction.
The Put plays a key role in mediating habitual behavior. Indeed, pharmacological blockade of the Put impaired the expression of habitual behavior, but did not influence the acquisition. One of the main regulators of the Put activity is DA receptor system. There are two subtypes of DA receptors: D1-like subtype couples to the G protein Gs, which activates adenylyl cyclase (AC) and recruits Cav1.2-dependent signaling, further enhancing the neural activity; while the D2-like subfamily instead inhibits AC, recruits Cav1.3 signaling, and subsides neuronal excitation. Based on that, we tested the protein levels of DA receptor subtypes and LTCCs subtypes between the habit and non-habit groups after cocaine and sucrose SA training, respectively. We found that, compared with the non-habitual cocaine-seeking group, there was higher expression of both D1Rs and Cav1.2, and lower expression of both D2Rs and Cav1.3 in the Put in the habitual cocaine-seeking group. However, conversely, we only found decreased D2Rs in the Put in the habitual sucrose-seeking group. These results suggest that the up-regulation of D1Rs-Cav1.2 signaling and the down-regulation of D2Rs-Cav1.3 signaling may cause maladaptive hyperactivity of the Put, which likely underlies maladaptive elements of cocaine-seeking habit.
Furthermore, in the striatum, two fundamental neural circuits are constituted by specified medium-sized spiny neurons (MSNs), each expressing a distinct type of DA receptor. One circuit is the direct pathway, predominantly expressing dopamine D1 receptors (D1Rs). The other is the indirect pathway, primarily expressing dopamine D2 receptors (D2Rs). Therefore, our results also suggest that the imbalanced activity between direct MSNs (dMSNs) and indirect MSNs (iMSNs), which was evaluated by D1Rs and D2Rs, causes it to be difficult to shift from goal-directed to habitual behavior when the environment or reward value changes.
The method that we used to establish cocaine-seeking habit was two days of FR1 followed by six days of VI schedules. Similar to the rats [27,28], forty percent of tree shrews showed habitual cocaine-seeking. No significant difference in the number of valid nose pokes was observed in the last three sessions of FR1 between the habit and non-habit groups. These results indicated that both groups performed the well-established SA training and showed no difference in learning ability. Moreover, from the beginning of the VI training, the habit animals were more insensitive to the changeable delayed reward time than those in the non-habit group. In addition, the total doses of cocaine were not significantly different between these two groups. Therefore, the causes for the establishment of habitual drug-seeking behavior included not only the cocaine itself but also the vulnerability of these tree shrews. For instance, studies have shown that animals with lower D2R expression itself are more impulsive [29,30], which might act as an intrinsic character of vulnerability to habitual drug-seeking.
Evidence has demonstrated that the expression of habitual drug-seeking behavior depends on the activation of the dorsal striatum (DS) as regulated by dopaminergic input from the substantia nigra (SN) [31]. Indeed, dopaminergic nuclei are becoming a target for potential treatment [32], and recent studies have further found that blocking DA receptors in the striatum inhibits well-established drug-seeking behavior [21]. Our results found that the protein level of D1R was higher, whereas the protein level of D2Rs was lower in the Put in habitual cocaine-seeking tree shrews compared with non-habitual animals, indicating that the Put might be highly active via upregulating D1Rs signaling and downregulating D2Rs signaling. Furthermore, the increase of D1Rs and the decrease of D2Rs in our present results also imply the activation of dMSNs and inactivation of iMSNs, and it has been shown that these two types of neurons usually compete for action control. Based on these results, the enhanced S-R action may be induced by D1Rs increase and the activation of dMSNs, resulting in the habitual drug-seeking.
More specifically, activation of iMSNs mainly supports the A-O action strategies [11,33,34], while dMSNs activation supports the S-R action strategies [35,36,37]. In addition, studies have shown that the activation of D1Rs in the dMSNs enhances neural excitability via protein kinase A (PKA) signaling and is essential for the expression of long-term plasticity in the dorsal striatum. In contrast, the activation of D2Rs worked in the opposite way, and is necessary for expressing the long-term depression [8]. Therefore, it is possible that the upregulated D1Rs and downregulated D2Rs make dMSNs and iMSNs more readily activated by dopamine (DA), leading to habitual drug-seeking behavior. It was consistent with other results in that both pathways participated in reward-seeking behavior in the contingency degradation (CD) session, while iMSNs were activated earlier than dMSNs [5,38]. Furthermore, in contrast to cocaine, habitual sucrose-seeking behavior was only accompanied with the decrease of D2Rs level, and there was no change in D1R level related to the habitual sucrose seeking. This result was similar to other studies in that the downregulation of D2Rs was found in natural reward (such as delicious food) habitual behavior [39,40]. In addition, both the food and cocaine groups showed lower D2R expression in habitual animals than non-habitual animals, but only the habitual cocaine-seeking animals showed higher D1R expression in the Put. These results suggested the upregulation of D1R was specific to the habitual drug-seeking behavior. It raised the possibility that an abnormal increase of D1Rs in the Put might be necessary for habitual drug-seeking and difficulty to switch actions and return to goal-directed strategies even when faced with serious negative consequences.
In our present study, we also evaluated the changes of DA receptors in the Cd of tree shrews, which is the same as the dorsomedial striatum (DMS) in the rodents. Different from the Put, there was no adaptive change of D1Rs and D2Rs in the Cd after the expression of habitual drug- or sucrose-seeking. It was consistent with studies showing that blockade of DA receptors in the DMS only impaired habitual drug-seeking at the early stage but had no effect on the established habitual behavior [1,41]. Lesions of the posterior DMS abolished the sensitivity of rats’ instrumental performance to outcome devaluation, implying that DMS played an essential role in the goal-directed action [40]. Moreover, intracranial self-stimulation of dMSNs in the DMS leads to reinforcement of actions, while the same manipulation in the iMSNs leads to avoidance of actions [42]. Therefore, the activation of DMS might be necessary for the formation or development of instrumental lever-pressing associated learning through guiding the action according to goal-directed strategies. Nevertheless, it also raised another possibility that the DMS might decrease activity during the expression of habitual drug-seeking, which was reported in our previous study in rats [13]. However, in the Cd of tree shrews, we did not detect changes in DA receptor signaling after cue exposure in the well-established habitual cocaine or sucrose-seeking animals. Some factors, such as species of animals, location of the focus brain regions, or the training procedures, can explain these differences. For example, for tree shrews, the division of function related to the execution of behavior strategies might be more specialized, equal to saying that the Put activation might be enough to express the well-established habitual behavior.
Our results showed that the protein level of Cav1.2 in the Put exhibited the same dynamic tendency as D1Rs, and the protein level of Cav1.3 in the Put showed the same tendency as D2Rs. D1Rs influence Cav1.2 by activating the PKA pathway [8,43], while D2Rs can regulate Cav1.3 through a calcineurin-dependent mechanism [8], indicating that DA receptor system works closely with Cav1.2 and Cav1.3. Moreover, D1Rs and D2Rs can exert their effects on the same process of drug addiction by modulating Cav1.2 and Cav1.3, respectively [44]. In the expression of cocaine sensitization, Cav1.2 works as one of the critical modulators, regulated by the activation of D1Rs, phosphorylates GluA1 of a-amino-3-hydroxy-5-methyl- 4-isoxazole-propionic acid receptor (AMPAR) at the Ser831 site, leading to the LTP [14,45]. Conversely, the decreased D2Rs stimulated Cav1.3 in the DS to suppress its downstream activation, and reduced phosphorylation at GluA1 at the Ser845 site, resulting in the LTD [14,45]. This evidence implies that the upregulation of D1Rs signaling by moderating the activation of Cav1.2 might cause enhanced synaptic efficacy in striatonigral neurons, which supports S-R action. Meanwhile, the downregulation of D2Rs pathway by mediating the activation of Cav1.3 might induce depressed synaptic efficacy in striatopallidal neurons, which supports goal-directed action, finally leading to the establishment and expression of habitual cocaine-seeking behavior. Furthermore, we also evaluated the protein level of LTCCs subtypes in the sucrose group, and there were no changes of Cav1.2 and Cav1.3 in either the Put or the Cd between habit and non-habit groups. These results indicated that the variation of LTCCs, monitored by the DA system, might be the specific molecular mechanism involved in drug-related habitual behavior.
5. Conclusions
Using the VI training schedule, we successfully established a tree shrews model of habitual cocaine-seeking behavior, and also established habitual sucrose-seeking behavior to investigate the distinct molecular mechanisms between addictive drugs and the nature reward. Furthermore, we found that the protein expression of both D1Rs and Cav1.2 were higher and that the protein expression of D2Rs and Cav1.3 were lower in the Put in habitual cocaine-seeking tree shrews than in the non-habitual group. In contrast, habitual sucrose-seeking animals were only related to the decrease of D2Rs in the Put. It implied that an abnormal increase of D1Rs in the Put might be necessary for habitual drug-seeking behavior.
Author Contributions
The conception or design of the study: F.S. and Y.D. The acquisition, analysis, and interpretation of data for work: Y.D., S.J., W.D. and L.J. Drafting the work: F.S., Y.D. and L.J. Y.D. and L.J. contributed equally to this paper. Revising the work critically for important intellectual content: W.D., Y.M., S.J., Y.L., J.Z., J.L. and N.S. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
All procedures were conducted according to the National Institutes of Health Guide for the Care and Use of Laboratory Animals and were approved by the Research Ethics Review Board of the Institute of Psychology, Chinese Academy of Sciences (A22039).
Informed Consent Statement
Not applicable.
Data Availability Statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This work was supported by the Ministry of Science and Technology of the People’s Republic of China (2021ZD0203800), National Natural Science Foundation of China (Grant No. 3197070674) and the National Natural Science Foundation of Beijing (Grant No. 7202128), CAS-VPST Silk Road Science Fund 2021 (GJHZ202129) and CAS Key Laboratory of Mental Health, Institute of Psychology.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Zapata A., Minney V.L., Shippenberg T.S. Shift from Goal-Directed to Habitual Cocaine Seeking after Prolonged Experience in Rats. J. Neurosci. 2010;30:15457–15463. doi: 10.1523/JNEUROSCI.4072-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Sjoerds Z., De Wit S., Brink W.V.D., Robbins T., Beekman A.T.F., Penninx B.W.J.H., Veltman D.J. Behavioral and neuroimaging evidence for overreliance on habit learning in alcohol-dependent patients. Transl. Psychiatry. 2013;3:e337. doi: 10.1038/tp.2013.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Everitt B.J., Robbins T.W. Drug Addiction: Updating Actions to Habits to Compulsions Ten Years On. Annu. Rev. Psychol. 2016;67:23–50. doi: 10.1146/annurev-psych-122414-033457. [DOI] [PubMed] [Google Scholar]
- 4.Everitt B.J., Robbins T.W. From the ventral to the dorsal striatum: Devolving views of their roles in drug addiction. Neurosci. Biobehav. Rev. 2013;37:1946–1954. doi: 10.1016/j.neubiorev.2013.02.010. [DOI] [PubMed] [Google Scholar]
- 5.Vicente A.M., Galvão-Ferreira P., Tecuapetla F., Costa R.M. Direct and indirect dorsolateral striatum pathways reinforce different action strategies. Curr. Biol. 2016;26:R267–R269. doi: 10.1016/j.cub.2016.02.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Sjoerds Z., van den Brink W., Beekman A.T., Penninx B.W., Veltman D.J. Cue Reactivity Is Associated with Duration and Severity of Alcohol Dependence: An fMRI Study. PLoS ONE. 2014;9:e84560. doi: 10.1371/journal.pone.0084560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gerfen C.R., Engber T.M., Mahan L.C., Susel Z.V.I., Chase T.N., Monsma F.J., Jr., Sibley D.R. D1 and D2 dopamine receptor-regulated gene expression of striatonigral and striatopallidal neurons. Science. 1990;250:1429–1432. doi: 10.1126/science.2147780. [DOI] [PubMed] [Google Scholar]
- 8.Surmeier D.J., Ding J., Day M., Wang Z., Shen W. D1 and D2 dopamine-receptor modulation of striatal glutamatergic signaling in striatal medium spiny neurons. Trends Neurosci. 2007;30:228–235. doi: 10.1016/j.tins.2007.03.008. [DOI] [PubMed] [Google Scholar]
- 9.Park K., Volkow N.D., Pan Y., Du C. Chronic cocaine dampens dopamine signaling during cocaine intoxication and unbalances D1 over D2 receptor signaling. J. Neurosci. 2013;33:15827–15836. doi: 10.1523/JNEUROSCI.1935-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lovinger D.M. Neurotransmitter roles in synaptic modulation, plasticity and learning in the dorsal striatum. Neuropharmacology. 2010;58:951–961. doi: 10.1016/j.neuropharm.2010.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Murray J.E., Belin D., Everitt B. Double Dissociation of the Dorsomedial and Dorsolateral Striatal Control Over the Acquisition and Performance of Cocaine Seeking. Neuropsychopharmacology. 2012;37:2456–2466. doi: 10.1038/npp.2012.104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Vanderschuren L.J., Di Ciano P., Everitt B.J. Involvement of the Dorsal Striatum in Cue-Controlled Cocaine Seeking. J. Neurosci. 2005;25:8665–8670. doi: 10.1523/JNEUROSCI.0925-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Shen F., Jin S., Duan Y., Liang J., Zhang M., Jiang F., Sui N. Distinctive Changes of L-Type Calcium Channels and Dopamine Receptors in the Dorsomedial and Dorsolateral Striatum after the Expression of Habitual Cocaine-Seeking Behavior in Rats. Neuroscience. 2018;370:139–147. doi: 10.1016/j.neuroscience.2017.07.049. [DOI] [PubMed] [Google Scholar]
- 14.Schierberl K., Hao J., Tropea T.F., Ra S., Giordano T.P., Xu Q., Garraway S.M., Hofmann F., Moosmang S., Striessnig J., et al. Cav1.2 L-Type Ca2+ Channels Mediate Cocaine-Induced GluA1 Trafficking in the Nucleus Accumbens, a Long-Term Adaptation Dependent on Ventral Tegmental Area Cav1.3 Channels. J. Neurosci. 2011;31:13562–13575. doi: 10.1523/JNEUROSCI.2315-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Surmeier D.J., Bargas J., Hemmings H.C., Jr., Nairn A.C., Greengard P. Modulation of calcium currents by a D1 dopaminergic protein kinase/phosphatase cascade in rat neostriatal neurons. Neuron. 1995;14:385–397. doi: 10.1016/0896-6273(95)90294-5. [DOI] [PubMed] [Google Scholar]
- 16.Striessnig J., Ortner N.J., Pinggera A. Pharmacology of L-type calcium channels: Novel drugs for old targets? Curr. Mol. Pharmacol. 2015;8:110–122. doi: 10.2174/1874467208666150507105845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wang J., Chai A., Zhou Q., Lv L., Wang L., Yang Y., Xu L. Chronic Clomipramine Treatment Reverses Core Symptom of Depression in Subordinate Tree Shrews. PLoS ONE. 2013;8:e80980. doi: 10.1371/journal.pone.0080980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Parésys L., Hoffmann K., Froger N., Bianchi M., Villey I., Baulieu E.E., Fuchs E. Effects of the Synthetic Neurosteroid 3 beta-Methoxypregnenolone (MAP4343) on Behavioral and Physiological Alterations Provoked by Chronic Psychosocial Stress in Tree Shrews. Int. J. Neuropsychopharmacol. 2016;19:pyv119. doi: 10.1093/ijnp/pyv119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Song S., Liu L., Edwards S.V., Wu S. Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model. Proc. Natl. Acad. Sci. USA. 2012;109:14942–14947. doi: 10.1073/pnas.1211733109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Rice M.W., Roberts R.C., Melendez-Ferro M., Perez-Costas E. Neurochemical Characterization of the Tree Shrew Dorsal Striatum. Front. Neuroanat. 2011;5:53. doi: 10.3389/fnana.2011.00053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Duan Y., Meng Y., Du W., Li M., Zhang J., Liang J., Li Y., Sui N., Shen F. Increased cocaine motivation in tree shrews is modulated by striatal dopamine D1 receptor-mediated upregulation of Cav1. 2. Addict. Biol. 2021;26:e13053. doi: 10.1111/adb.13053. [DOI] [PubMed] [Google Scholar]
- 22.Balleine B.W., O’Doherty J.P. Human and Rodent Homologies in Action Control: Corticostriatal Determinants of Goal-Directed and Habitual Action. Neuropsychopharmacol. Off. Publ. Am. Coll. Neuropsychopharmacol. 2009;35:48–69. doi: 10.1038/npp.2009.131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Liljeholm M., O’Doherty J.P. Contributions of the striatum to learning, motivation, and performance: An associative account. Trends Cogn. Sci. 2012;16:467–475. doi: 10.1016/j.tics.2012.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wang W.-S., Chen Z.-G., Liu W.-T., Chi Z.-Q., He L., Liu J.-G. Dorsal hippocampal NMDA receptor blockade impairs extinction of naloxone-precipitated conditioned place aversion in acute morphine-treated rats by suppressing ERK and CREB phosphorylation in the basolateral amygdala. Br. J. Pharmacol. 2014;172:482–491. doi: 10.1111/bph.12671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Corbit L.H., Nie H., Janak P. Habitual Alcohol Seeking: Time Course and the Contribution of Subregions of the Dorsal Striatum. Biol. Psychiatry. 2012;72:389–395. doi: 10.1016/j.biopsych.2012.02.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Nelson A., Killcross S. Amphetamine exposure enhances habit formation. J. Neurosci. 2006;26:3805–3812. doi: 10.1523/JNEUROSCI.4305-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hou Y.-Y., Liu Y., Kang S., Yu C., Chi Z.-Q., Liu J.-G. Glutamate receptors in the dorsal hippocampus mediate the acquisition, but not the expression, of conditioned place aversion induced by acute morphine withdrawal in rats. Acta Pharmacol. Sin. 2009;30:1385–1391. doi: 10.1038/aps.2009.130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Everitt B.J., Robbins T.W. Neural systems of reinforcement for drug addiction: From actions to habits to compulsion. Nat. Neurosci. 2005;8:1481–1489. doi: 10.1038/nn1579. [DOI] [PubMed] [Google Scholar]
- 29.Hammerslag L.R., Belagodu A.P., Arogundade O.A.A., Karountzos A.G., Guo Q., Galvez R., Roberts B.W., Gulley J.M. Adolescent impulsivity as a sex-dependent and subtype-dependent predictor of impulsivity, alcohol drinking and dopamine D2 receptor expression in adult rats. Addict. Biol. 2019;24:193–205. doi: 10.1111/adb.12586. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Besson M., Pelloux Y., Dilleen R., Theobald D.E., Lyon A., Belin-Rauscent A., Robbins T.W., Dalley J.W., Everitt B.J., Belin D. Cocaine Modulation of Frontostriatal Expression of Zif268, D2, and 5-HT2c Receptors in High and Low Impulsive Rats. Neuropsychopharmacology. 2013;38:1963–1973. doi: 10.1038/npp.2013.95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ito R., Dalley J.W., Robbins T.W., Everitt B.J. Dopamine release in the dorsal striatum during cocaine-seeking behavior under the control of a drug-associated cue. J. Neurosci. 2002;22:6247–6253. doi: 10.1523/JNEUROSCI.22-14-06247.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Sobstyl M., Kupryjaniuk A., Mierzejewski P. Nucleus accumbens as a stereotactic target for the treatment of addictions in humans: A literature review. Neurol. Neurochir. Pol. 2021;55:440–449. doi: 10.5603/PJNNS.a2021.0065. [DOI] [PubMed] [Google Scholar]
- 33.Hellard E.R., Binette A., Zhuang X., Lu J., Ma T., Jones B., Williams E., Jayavelu S., Wang J. Optogenetic control of alcohol-seeking behavior via the dorsomedial striatal circuit. Neuropharmacology. 2019;155:89–97. doi: 10.1016/j.neuropharm.2019.05.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Yin H.H., Knowlton B.J., Balleine B.W. Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning. Eur. J. Neurosci. 2005;22:505–512. doi: 10.1111/j.1460-9568.2005.04219.x. [DOI] [PubMed] [Google Scholar]
- 35.Balleine B., Dickinson A. Goal-directed instrumental action: Contingency and incentive learning and their cortical substrates. Neuropharmacology. 1998;37:407–419. doi: 10.1016/S0028-3908(98)00033-1. [DOI] [PubMed] [Google Scholar]
- 36.Graybiel A.M. Habits, rituals, and the evaluative brain. Annu. Rev. Neurosci. 2008;31:359–387. doi: 10.1146/annurev.neuro.29.051605.112851. [DOI] [PubMed] [Google Scholar]
- 37.Yin H.H., Knowlton B.J. Contributions of Striatal Subregions to Place and Response Learning. Learn. Mem. 2004;11:459–463. doi: 10.1101/lm.81004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Cui G., Jun S.B., Jin X., Pham M.D., Vogel S., Lovinger D.M., Costa R. Concurrent activation of striatal direct and indirect pathways during action initiation. Nature. 2013;494:238–242. doi: 10.1038/nature11846. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Patrono E., Di Segni M., Patella L., Andolina D., Valzania A., Latagliata E.C., Felsani A., Pompili A., Gasbarri A., Puglisi-Allegra S., et al. When Chocolate Seeking Becomes Compulsion: Gene-Environment Interplay. PLoS ONE. 2015;10:e0120191. doi: 10.1371/journal.pone.0120191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Yin H.H., Ostlund S.B., Knowlton B.J., Balleine B.W. The role of the dorsomedial striatum in instrumental conditioning. Eur. J. Neurosci. 2005;22:513–523. doi: 10.1111/j.1460-9568.2005.04218.x. [DOI] [PubMed] [Google Scholar]
- 41.Thorn C.A., Atallah H., Howe M., Graybiel A.M. Differential Dynamics of Activity Changes in Dorsolateral and Dorsomedial Striatal Loops during Learning. Neuron. 2010;66:781–795. doi: 10.1016/j.neuron.2010.04.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kravitz A.V., Tye L.D., Kreitzer A.C. Distinct roles for direct and indirect pathway striatal neurons in reinforcement. Nat. Neurosci. 2012;15:816–818. doi: 10.1038/nn.3100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hall D.D., Davare M.A., Shi M., Allen M.L., Weisenhaus M., McKnight G.S., Hell J.W. Critical role of cAMP-dependent protein kinase anchoring to the L-type calcium channel Cav1. 2 via A-kinase anchor protein 150 in neurons. Biochemistry. 2007;46:1635–1646. doi: 10.1021/bi062217x. [DOI] [PubMed] [Google Scholar]
- 44.Jin S., Shen F., Duan Y., Li M., Sui N. The mechanism of intracerebral L-type voltage dependent calcium channels in drug addiction. Chin. Sci. Bull. 2016;61:1173–1180. [Google Scholar]
- 45.Giordano T.P., Tropea T.F., Satpute S.S., Sinnegger-Brauns M.J., Striessnig J., Kosofsky B.E., Rajadhyaksha A.M. Molecular switch from L-type Cav1. 3 to Cav1. 2 Ca2+ channel signaling underlies long-term psychostimulant-induced behavioral and molecular plasticity. J. Neurosci. 2010;30:17051–17062. doi: 10.1523/JNEUROSCI.2255-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.