The dopamine signal in decision-making tasks with stimulus and timing uncertainty
BMC Neuroscience volume 15, Article number: P71 (2014)
It was long suggested that the phasic activity of midbrain dopamine (DA) neurons codes the subject’s error in the prediction of reward . Most of the experimental work in this field was done in the context of classical and instrumental conditioning. Recently, there has been an increasing interest to study the activity of DA neurons while the subject performs a decision-making task with the goal of obtaining reward at the end of the trial [2, 3]. In an experiment in which monkeys have to decide about the presence or absence of a somatosensory stimulus, recordings of DA neurons have shown that the activity of these cells is modulated according to the trial type. The averaged activity in hit, miss, false alarm or correct rejection trials each presents a distinct temporal profile . In particular, the neurons’ response to the go cue is correlated with the subject’s uncertainty about his choice.
The signal of dopamine neurons in classical and instrumental conditioning has been explained in terms of the temporal-difference (TD) algorithm. However it is not clear whether and, if so, how reinforcement learning can account for the dopamine signals in complex decision-making tasks with noisy sensory information and temporal uncertainty of the relevant task events, as is the case in the detection task mentioned above . We have developed an actor-critic model which deals with both these aspects of the problem. While an internal temporal representation keeps track of past relevant events, partial observability is accounted for by means of a Bayesian approach.
The dopamine phasic activity predicted by the model matches the experimental data and the prediction of the psychometric curve is consistent with the animal performance. Furthermore, the model provides an interpretation of the condition-dependent dopamine response to the go cue instruction in terms of reward prediction error. Using Bayesian inference the model constructs an internal belief about the presence of the somatosensory stimulus. This belief reflects the confidence about the sensory perception and thus the value assigned to this perceptual judgment. The large belief in stimulus-present decisions represents a high degree of confidence in the sensory perception and a great expectation for future rewards. On the contrary, stimulus-absent choices reflect a small belief and consequently a larger uncertainty about the decision and the future reward. This computational description of belief agrees with the previous interpretation of the data  and describes well other experimental observations such as the dependence of DA neurons' signals on the stimulus amplitude. The model also predicts a decrease in dopamine activity before the go cue instruction, which is also observed in the data. We explain this decreasing tonic activity as an effect of the timing uncertainty. This is partly due to the task structure and partly generated from the limited temporal resolution of the stimulus representation, which creates subjective variability in the timing of events.
Schultz W, Dayan P, Montague P: A neural substrate of prediction and reward. Science. 1997, 275: 1593-1599. 10.1126/science.275.5306.1593.
Nomoto K, Schultz W, Watanabe T, Sakagami M: Temporally extended dopamine responses to perceptually demanding reward-predictive stimuli. J. Neurosci. 2010, 30: 10692-10702. 10.1523/JNEUROSCI.4828-09.2010.
de Lafuente V, Romo R: Dopamine neurons code subjective sensory experience and uncertainty of perceptual decisions. Proc. Natl. Acad. Sci. U.S.A. 2011, 108: 19767-19771. 10.1073/pnas.1117636108.
Carnevale F, de Lafuente V, Romo R, Parga N: Internal signal correlates neural populations and biases perceptual decision reports. Proc. Natl. Acad. Sci. U.S.A. 2012, 109: 18938-18943. 10.1073/pnas.1216799109.
About this article
Cite this article
Sarno, S., de Lafuente, V., Romo, R. et al. The dopamine signal in decision-making tasks with stimulus and timing uncertainty. BMC Neurosci 15 (Suppl 1), P71 (2014). https://doi.org/10.1186/1471-2202-15-S1-P71
- Instrumental Conditioning
- Future Reward
- Dopamine Signal
- Timing Uncertainty
- Somatosensory Stimulus