Elsevier

Behavioural Brain Research

Volume 154, Issue 1, 23 September 2004, Pages 19-30
Behavioural Brain Research

Research report
Dissociation of the role of nucleus accumbens dopamine in responding to reward-predictive cues and waiting for reward

https://doi.org/10.1016/j.bbr.2004.01.013Get rights and content

Abstract

The choice among behavioral options is influenced by the anticipated cost of working for the reward relative to the anticipated reward magnitude. Dopamine release in the nucleus accumbens (NAc) has been suggested to play an important role in cost/benefit computation. From a behavioral perspective, work involves two elements: the caloric expenditure of energy and the time required to complete the task. In many studies of the contribution of NAc dopamine to cost/benefit decisions, measures of work have conflated these separate elements. Here we describe a novel cued progressive delay task, an analog of the progressive ratio task that minimizes energy expenditure. In this task, rats obtain sucrose reward by entering a reward receptacle in response to a cue and remaining in the receptacle for a required wait that is increased after each successful trial. In an experiment in which the magnitude of reward was varied across sessions, the animals’ fail point (maximum wait achieved) was correlated with the amount of reward delivered. Microinjection of D1 or D2 dopamine receptor antagonists into the NAc did not affect the length of time animals were willing to wait for reward (fail point), but did reduce the proportion of cues to which the animal responded. These results suggest that waiting for reward without increased caloric energy expenditure does not require NAc dopamine.

Introduction

Many behavioral studies implicate NAc dopamine release in the appetitive phase of goal-directed behavior: NAc dopamine is often required for animals to exert effort to obtain reward, but not to consume it [4], [5], [22], [23]. An elegant series of experiments by Salamone and coworkers suggests that the release of dopamine in the NAc facilitates sustained goal-directed motor behavior (reviewed in [34], [35], [37]). For instance, depletion of NAc dopamine with 6-hydroxy-dopamine (6-OHDA) or injection of dopamine antagonists into the NAc causes animals to choose a smaller or less preferred reward that requires less effort to obtain than a larger (and, in control animals, normally chosen) reward that requires more effort [11], [12], [24], [29], [36], [38], [40]. Similarly, NAc 6-OHDA lesions reduce operant responding on high fixed-ratio (FR) schedules (e.g., FR64) but not schedules with lower ratios, such as FR1 [1]. Furthermore, NAc dopamine depletion strongly impairs performance of a progressive ratio task, in which the number of lever presses required to obtain reward increases with each reward earned, by causing a reduction in the rate of lever pressing and hence the number of rewards earned by the animal in a fixed period of time [2], [18].

Several different hypotheses could explain these results. One idea is that animals that have sustained NAc dopamine depletion or dopamine receptor antagonism suffer from deficits in motivation or motor ability. This can be ruled out by consistent findings that consumption of freely available food is not affected, nor is the ability to earn reward when every operant response is followed by a reward, as in FR1 [34]. Another possibility is that release of dopamine in the NAc increases the rate of work emitted per unit of reward. Delivering larger reward (e.g., 6 pellets after 300 presses instead of 1 pellet after 300 presses) increases normal animals’ ability to complete very high ratio requirements. However, increasing the reward in high fixed-ratio tasks, so that the work:reward ratio is constant (e.g., delivering 6 pellets after 300 lever presses instead of 1 pellet after 50 presses), does not rescue performance deficits caused by NAc 6-OHDA lesions [39]. Indeed, the effects of NAc 6-OHDA lesions were larger for the 300 presses:6 pellets task than for 50 presses:1 pellet. Thus, the contribution of NAc dopamine is apparently greater for larger work requirements than smaller requirements, even if the work:reward ratio is constant. This is consistent with the idea that the computation that is affected by the lesion is that of the amount of work required, not the amount of reward to be obtained or the ratio between the two.

An alternate interpretation is that NAc dopamine facilitates sustained work through time, even if the caloric energy expenditure is not large. This possibility arises from the fact that work in operant tasks has two components: caloric expenditure and time between work onset and delivery of reward. These two components covary in ratio tasks; for example, performing 300 lever presses requires more energy than performing 50 lever presses and also takes more time. The longer the interval between work onset and reward, the more the value of the predicted reward is discounted [19], [25], and the more likely the animal will interrupt its effort to obtain the reward. NAc dopamine could influence the discounting function by attenuating the fall-off in value as the delay increases, thereby facilitating both sustained effort and willingness to wait for larger reward. This idea is supported by studies showing that systemic dopamine antagonists reduce the choice of larger delayed rewards in favor of smaller immediate rewards [8], [43], and the indirect dopamine agonist amphetamine increases willingness to wait for larger reward [8], [14], [32]. Furthermore, lesions of the NAc core cause rats to choose smaller immediate reward over larger delayed reward [7]. Taken together, these studies provide indirect evidence that the release of NAc dopamine flattens the delay discounting function.

One interpretation of the studies suggesting a role for dopamine and the NAc in delay discounting [7], [8], [32], [43] is that NAc dopamine may be required to facilitate the response to cues that predict a large reward available after a long delay (or a higher energy cost). These studies employed discrete-trials tasks in which animals had to choose between levers that resulted in different outcomes. For instance, in the studies by Cardinal and coworkers [7], [8], the choice was between a lever (the “immediate lever”) that, if selected, resulted in small immediate reward, and a “delay lever” that resulted in larger reward after a delay. (Both levers were retracted after a response was made.) If indeed NAc dopamine facilitates the choice of larger delayed reward, it may do so either (1) directly, by promoting the response on the delay lever, perhaps because the dopamine is released in response to the reward-predictive stimulus of the delay lever but not the immediate lever; or (2) indirectly, by allowing the animal to maintain the association between the delay lever and the large reward throughout the delay to reward, thereby increasing the likelihood that the delay lever will be chosen on subsequent trials. The difference between these two hypotheses is critical for understanding the role of NAc dopamine in goal-directed behavior that involves a delay to reward: either the dopamine has its effects throughout the delay to enhance the ability of the animal to wait (or work) for reward, or its effects are much more transient, involving only the choice of behavior in response to discrete cues. A third hypothesis, suggested by the work of Salamone and coworkers [10], [33], is that NAc dopamine release is required during the expenditure of large (but not small) amounts of caloric energy in order to facilitate the expenditure over a given period of time.

To test among these possibilities, we adapted the progressive ratio task to eliminate the requirement for emitting increasing caloric energy, while maintaining the requirement that animals wait through an increasing delay for each successive reward. This novel progressive delay task measures the amount of time an animal is willing to wait for reward. In the task, a cue is presented to the animal on a VI 30-s schedule. The animal must respond to the cue by entering its head into a reward receptacle and remaining until reward is delivered. The delay between receptacle entry and reward delivery is increased with each reward earned; reward is not delivered if the animal withdraws early from the receptacle (“fail point”). Fail point is, therefore, analogous to break point on the progressive ratio schedule. We tested how NAc dopamine contributes to the performance of this task by injecting D1 or D2 antagonists into the NAc prior to daily behavioral sessions. The inclusion of an explicit cue to signal the animal to initiate a trial allowed us to dissociate the role of NAc dopamine in the initiation of behavioral responses to reward-predictive cues [13] from its potential subsequent role in waiting for reward.

Section snippets

Subjects

Sixteen male Long–Evans rats (Harlan, IN), weighing approximately 350 g were used in this study. Animals were individually housed on a 12 h/12 h light/dark cycle, with all experiments conducted during the light phase. Upon receipt, rats were allowed at least 1 week of ad libitum food and water, followed by food and water restriction to maintain 85–90% of their original body weight (295–320 g during restriction). Throughout the experiment, food intake was limited to 13–15 g food (Bio-Serv, NJ) and 25 

Baseline performance during the progressive delay task

The progressive delay task is diagrammed in Fig. 1A. A cue was presented on a VI 30-s schedule. Earning a successful reward by entering the reward receptacle during cue presentation resulted, on the next trial, in an increase of 1.75 s in the required wait to reward delivery. Early withdrawal from the receptacle before reward delivery resulted in a 90-s penalty period during which no cues were presented. Animals typically earned four to five rewards before failing to earn reward due to early

Discussion

The major finding of this work is that the ability of animals to wait for reward in a novel progressive delay task was unaffected by injection of dopamine D1 or D2 receptor antagonists into the NAc. In contrast, in the same task, the antagonists substantially reduced the probability of a receptacle entry in response to the cue signaling that a trial could be initiated. The reduction in cue response ratio is similar to the effects of NAc dopamine antagonism on the autoshaped response on a lever

Acknowledgements

This work was supported by funds provided by the State of California for medical research on alcohol and substance abuse through the University of California, San Francisco; by the Wheeler Center for the Neurobiology of Addiction; by the Ernest Gallo Clinic and Research Center; and by NIH Grant DA01949 to H.L.F. We thank Dr. V. Kharazia for help with the histology, and Drs. S. Taha, G. Hjelmstad, J. Mitchell, T. Robinson, and A. Wakabayashi for helpful discussions.

References (46)

  • K.L. Nowend et al.

    D1 or D2 antagonism in nucleus accumbens core or dorsomedial shell suppresses lever pressing for food but leads to compensatory increases in chow consumption

    Pharmacol. Biochem. Behav.

    (2001)
  • J.D. Salamone et al.

    Motivational views of reinforcement: implications for understanding the behavioral functions of nucleus accumbens dopamine

    Behav. Brain Res.

    (2002)
  • J.D. Salamone et al.

    Anhedonia or anergia? Effects of haloperidol and nucleus accumbens dopamine depletion on instrumental response selection in a T-maze cost/benefit procedure

    Behav. Brain Res.

    (1994)
  • J.D. Salamone et al.

    Behavioral functions of nucleus accumbens dopamine: empirical and conceptual problems with the anhedonia hypothesis

    Neurosci. Biobehav. Rev.

    (1997)
  • J.D. Salamone et al.

    Nucleus accumbens dopamine depletions make animals highly sensitive to high fixed ratio requirements but do not impair primary food reinforcement

    Neuroscience

    (2001)
  • J.D. Sokolowski et al.

    The role of accumbens dopamine in lever pressing and response allocation: effects of 6-OHDA injected into core and dorsomedial shell

    Pharmacol. Biochem. Behav.

    (1998)
  • M. Arroyo et al.

    Acquisition, maintenance and reinstatement of intravenous cocaine self-administration under a second-order schedule of reinforcement in rats: effects of conditioned cues and continuous access to cocaine

    Psychopharmacology (Berl)

    (1998)
  • R.N. Cardinal et al.

    Impulsive choice induced in rats by lesions of the nucleus accumbens core

    Science

    (2001)
  • R.N. Cardinal et al.

    The effects of d-amphetamine, chlordiazepoxide, alpha-flupenthixol and behavioural manipulations on choice of signalled and unsignalled delayed reinforcement in rats

    Psychopharmacology (Berl)

    (2000)
  • Cardinal RN, Robbins TW, Everitt BJ. Choosing delayed rewards: perspectives from learning theory, neurochemistry, and...
  • P. Di Ciano et al.

    Differential involvement of NMDA, AMPA/kainate, and dopamine receptors in the nucleus accumbens core in the acquisition and performance of pavlovian approach behavior

    J. Neurosci.

    (2001)
  • J.L. Evenden et al.

    The pharmacology of impulsive behaviour in rats: the effects of drugs on response choice with varying delays of reinforcement

    Psychopharmacology (Berl)

    (1996)
  • U.E. Ghitza et al.

    Persistent cue-evoked activity of accumbens neurons after prolonged abstinence from self-administered cocaine

    J. Neurosci.

    (2003)
  • Cited by (83)

    • The Arousal-motor Hypothesis of Dopamine Function: Evidence that Dopamine Facilitates Reward Seeking in Part by Maintaining Arousal

      2022, Neuroscience
      Citation Excerpt :

      For this reason, the arousal-motor hypothesis has important implications for the interpretation of studies in which dopamine function is disrupted. For example, we and others have observed that dopamine antagonists reduce animals’ responding to reward-predictive cues in various behavioral paradigms (Ogren and Fuxe, 1988; Di Ciano et al., 2001; Wakabayashi et al., 2004; Yun et al., 2004; Lex and Hauber, 2008; Nicola, 2010; Saunders and Robinson, 2012; Wenzel et al., 2018). These effects may have been amplified by reduced arousal, especially when operant sessions and ITIs were long.

    • Neurobiology of reward-related learning

      2021, Neuroscience and Biobehavioral Reviews
      Citation Excerpt :

      Excitotoxic lesions of NAc impair conditioned approach responding elicited by a food-associated CS (Parkinson et al., 1999) or self-administration maintained by presentation of drug-associated cues (Ito et al., 2004). Intra-NAc infusions of DA D1 or D2 receptor antagonists reduce responding maintained by food or drug-related CSs (Di Ciano et al., 2001; Nicola et al., 2005; Wakabayashi et al., 2004; Yun et al., 2004). Thus, CSs can cause the conditioned activation of VTA DA neurons and this neural activity appears necessary for CSs to function as such.

    View all citing articles on Scopus
    View full text