Properties of synaptic plasticity rules implementing actor-critic temporal-difference learning

Potjans, Wiebke; Morrison, Abigail; Diesmann, Markus

doi:10.1186/1471-2202-9-S1-P69

Volume 9 Supplement 1

Seventeenth Annual Computational Neuroscience Meeting: CNS*2008

Poster presentation
Open access
Published: 11 July 2008

Properties of synaptic plasticity rules implementing actor-critic temporal-difference learning

Wiebke Potjans¹,
Abigail Morrison¹ &
Markus Diesmann^1,2

BMC Neuroscience volume 9, Article number: P69 (2008) Cite this article

1594 Accesses
Metrics details

There is considerable interest in establishing a link between system-level learning and synaptic plasticity [1–3]. In a previous study [4] we presented a specific set of biologically plausible synaptic plasticity rules implementing temporal-difference (TD) learning in a spiking neuronal network inspired by the actor-critic architecture [5]. We showed the equivalence between the plasticity rules and the traditional discrete-time TD(0) algorithm and demonstrated that the network learns a complex task with a similar speed to its discrete time counterpart and attains the same equilibrium performance. However, the set of learning rules represents only one possible way in which actor-critic TD learning could be implemented in the brain, and so the model has only limited predictive power for experimental work.

Here, we extract properties of synaptic plasticity rules that suffice to implement actor-critic TD(0) learning, under the assumption that states are represented by elevated rates in disjunct sets of neurons. On this basis we define generalized classes of continuous time synaptic plasticity rules that implement value function and policy updates. The main property is that the amount and sign of the weight update depends on a characteristic change in the activity of the critic module combined with a global reward signal. We present concrete examples belonging to the defined class and demonstrate that they are able to solve a non-trivial task. We further analyze to what extent the defined class of plasticity rules are compatible with experimental findings of synaptic plasticity [6, 7].

References

Izhikevich EM: Solving the distal reward problem through linkage of STDP and dopamine signaling. Cerebral Cortex. 2007, 17 (10): 2443-2452. 10.1093/cercor/bhl152.
Article PubMed Google Scholar
Baras D, Meir R: Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule. Neural Computation. 2007, 19: 2245-2279. 10.1162/neco.2007.19.8.2245.
Article PubMed Google Scholar
Florian RV: Reinforcement learning Through Modulation of Spike-Timing – Dependent Synaptic Plasticity. Neural Computation. 2007, 19: 1468-1502. 10.1162/neco.2007.19.6.1468.
Article PubMed Google Scholar
Potjans W, Morrison A, Diesmann M: A spiking neural network model for the actor-critic temporal-difference learning algorithm. 342.6. 37th SFN meeting, San Diego, USA.
Sutton RS, Barto AG: Reinforcement learning, An Introduction. 1998, The MIT press
Google Scholar
Kirkwood A, Rioult MG, Bear MF: Experience-dependent modification of synaptic plasticity in visual cortex. Nature. 1996, 381: 526-528. 10.1038/381526a0.
Article CAS PubMed Google Scholar
Reynolds JNJ, Wickens JR: Dopamine-dependent plasticity of corticostriatal synapses. Neural Networks. 2002, 15: 507-521. 10.1016/S0893-6080(02)00045-X.
Article PubMed Google Scholar

Download references

Acknowledgements

Partially funded by DIP F1.2, BMBF Grant 01GQ0420 to the Bernstein Center for Computational Neuroscience Freiburg, and EU Grant 15879 (FACETS).

Author information

Authors and Affiliations

Computational Neuroscience Group, RIKEN Brain Science Institute, Wako-shi, Saitama, 351-0198, Japan
Wiebke Potjans, Abigail Morrison & Markus Diesmann
Bernstein Center for Computational Neuroscience, Albert-Ludwigs-University, 79104, Freiburg, Germany
Markus Diesmann

Authors

Wiebke Potjans
View author publications
You can also search for this author in PubMed Google Scholar
Abigail Morrison
View author publications
You can also search for this author in PubMed Google Scholar
Markus Diesmann
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wiebke Potjans.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Potjans, W., Morrison, A. & Diesmann, M. Properties of synaptic plasticity rules implementing actor-critic temporal-difference learning. BMC Neurosci 9 (Suppl 1), P69 (2008). https://doi.org/10.1186/1471-2202-9-S1-P69

Download citation

Published: 11 July 2008
DOI: https://doi.org/10.1186/1471-2202-9-S1-P69

Seventeenth Annual Computational Neuroscience Meeting: CNS*2008

Properties of synaptic plasticity rules implementing actor-critic temporal-difference learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Neuroscience

Contact us

Seventeenth Annual Computational Neuroscience Meeting: CNS*2008

Properties of synaptic plasticity rules implementing actor-critic temporal-difference learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Neuroscience

Contact us