Cortico-striatal plasticity for action-outcome learning using spike timing dependent eligibility

Gurney, Kevin N; Humphries, Mark D; Redgrave, Peter

doi:10.1186/1471-2202-10-S1-P135

Volume 10 Supplement 1

Eighteenth Annual Computational Neuroscience Meeting: CNS*2009

Poster presentation
Open access
Published: 13 July 2009

Cortico-striatal plasticity for action-outcome learning using spike timing dependent eligibility

Kevin N Gurney¹,
Mark D Humphries¹ &
Peter Redgrave¹

BMC Neuroscience volume 10, Article number: P135 (2009) Cite this article

1466 Accesses
2 Citations
Metrics details

Introduction

We recently proposed that short-latency, sensory-evoked dopamine release is critical for learning action-outcome causality [1]. If an action causes an unexpected outcome associated with a phasic visual event, there will be a phasic burst of dopamine in the striatum. Subsequent reinforcement of the striatal response to the cortical representation of the action then makes the selection of the action (and its outcome) more likely; i.e. there is "repetition biasing" of action selection. This, in turn, facilitates associative learning of the action-outcome pairing elsewhere in the brain. Here, we present a model of cortico-striatal plasticity in medium spiny neurons (MSNs) that could form the basis for a quantitative account of action-outcome learning in basal ganglia.

Methods

We used an Izhikevich-style spiking model MSN with 200 synapses. We constructed new cortico-striatal learning rules based on a recent in vitro study by Shen et al [2]. This study provided, for the first time, comprehensive data on MSN plasticity in the D1 and D2 receptor-dominated MSN subpopulations. For each population (D1/D2) we ascribed STDP-like kernel "templates" to very high and low levels of dopamine in a manner consistent with the data. At intermediate levels of dopamine, the kernels were formed from a linear superposition of these templates. These kernels then gave rise to an eligibility trace for learning that induced plasticity in the presence of subsequent delivery of dopamine [3]. We refer to this mechanism as spike-timing dependent eligibility: STDE. We then mimicked the cortical and dopaminergic signals that an MSN might see during action-outcome learning. Each cortical input comprised 50 highly active afferents with others at a background rate. The selection of the active afferents was fixed for the causal action, and chosen randomly for other actions at each trial (see Figure 1a).

Results

When phasic dopamine is elicited by the causal action, it induced a rapid increase in MSN response that would be the foundation for inducing repetition bias of action selection. Further, the MSN has become receptive to the action request through synaptic pattern matching (Fig 1b top panel). Subsequent trials induce more selective synaptic patterning (reduced response at trial 1200). Dopamine dips (caused by assigning a noxious value to the outcome) induce a reduction in response. We conclude that the recently discovered complex dopamine-receptor dependent forms of STDP [2] can lead to cortico-striatal plasticity that can support action-outcome learning.

References

Redgrave P, Gurney KN: The short-latency dopamine signal: a role in discovering novel actions?. Nat Rev Neurosci. 2006, 7: 967-975. 10.1038/nrn2022.
Article CAS PubMed Google Scholar
Shen W, Flajolet M, Greengard P, Surmeier DJ: Dichotomous dopaminergic control of striatal synaptic plasticity. Science. 2008, 321: 848-851. 10.1126/science.1160575.
Article CAS PubMed Central PubMed Google Scholar
Izhikevich EM: Solving the distal reward problem through linkage of STDP and dopamine signaling. Cereb Cortex. 2007, 17: 2443-2452. 10.1093/cercor/bhl152.
Article PubMed Google Scholar

Download references

Acknowledgements

This work was part funded by EPSRC grant EP/C516303/1

Author information

Authors and Affiliations

Adaptive Behaviour Research Group, Department of Psychology, University of Sheffield, Sheffield, S10 2TP, UK
Kevin N Gurney, Mark D Humphries & Peter Redgrave

Authors

Kevin N Gurney
View author publications
You can also search for this author in PubMed Google Scholar
Mark D Humphries
View author publications
You can also search for this author in PubMed Google Scholar
Peter Redgrave
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kevin N Gurney.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Gurney, K.N., Humphries, M.D. & Redgrave, P. Cortico-striatal plasticity for action-outcome learning using spike timing dependent eligibility. BMC Neurosci 10 (Suppl 1), P135 (2009). https://doi.org/10.1186/1471-2202-10-S1-P135

Download citation

Published: 13 July 2009
DOI: https://doi.org/10.1186/1471-2202-10-S1-P135

Eighteenth Annual Computational Neuroscience Meeting: CNS*2009

Cortico-striatal plasticity for action-outcome learning using spike timing dependent eligibility

Introduction

Methods

Results

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Neuroscience

Contact us

Eighteenth Annual Computational Neuroscience Meeting: CNS*2009

Cortico-striatal plasticity for action-outcome learning using spike timing dependent eligibility

Introduction

Methods

Results

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Neuroscience

Contact us