A model of cell specialization using a Hebbian policy-gradient approach with

Daucé, Emmanuel

doi:10.1186/1471-2202-10-S1-P136

Volume 10 Supplement 1

Eighteenth Annual Computational Neuroscience Meeting: CNS*2009

Poster presentation
Open access
Published: 13 July 2009

A model of cell specialization using a Hebbian policy-gradient approach with "slow" noise

Emmanuel Daucé^1,2

BMC Neuroscience volume 10, Article number: P136 (2009) Cite this article

995 Accesses
Metrics details

We study a model of neuronal specialization using a policy gradient reinforcement approach. (1) The neurons stochastically fire according to their synaptic input plus a noise term; (2) The environment is a closed-loop system composed of a rotating eye and a visual punctual target; (3) The network is composed of a foveated retina directly connected to a motoneuron layer; (4) The reward depends on the distance between the subjective target position and the fovea and (5) the weight update depends on the Hebb-like product r(t)Z_ij(t) where r(t) is the reward and Z_ij(t) is a Hebbian trace updated according to the product [S_i(t)-F_i(t)] e_j(t), where S_i(t) is the post-synaptic spike, F_i(t) is the firing probability and e_j(t) is the pre-synaptic activity [1, 2].

Several temporal scales are to be considered when modeling such neuromimetic controller systems. First, the typical integration time of the neurons is of the order of few milliseconds. Second, the motor commands have a duration on the order of 100 ms. In the design of an adaptive controller, this temporal mismatch must be taken into account.

For that, we consider that the firing probability is monitored by a "pink noise" term whose autocorrelation is of the order of 100 ms, so that the firing probability is overestimated (or underestimated) for about100 ms periods. The rewards occurring meanwhile assess the "quality" of those elementary shifts, and modify the firing probability accordingly.

Every motoneuron being associated to a particular angular direction, we test at the end of the learning process the preferred output of the visual cells. We find that accordingly with the observed final behavior, the visual cells preferentially excite the motoneurons heading in the opposite angular direction (see Figures 1 and 2).

References

Bartlett P, Baxter J: Synaptic modifications in spiking neurons that learn. 1999, Technical report, Australian National University
Google Scholar
Florian R: A reinforcement learning algorithm for spiking neural networks. Proc of Seventh International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC'05). 2005, 299-306.
Google Scholar

Download references

Acknowledgements

The author thanks the INRIA Lille-Nord europe for 1-year delegation in the SEQUEL team.

This work is supported by the french ANR MAPS (ANR-07-BLAN-0335-02).

Author information

Authors and Affiliations

Institute of Movement Sciences University of the Mediterranean, Marseille, France
Emmanuel Daucé
INRIA Lille Nord-Europe, Villeneuve d'Ascq, France
Emmanuel Daucé

Authors

Emmanuel Daucé
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Emmanuel Daucé.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Daucé, E. A model of cell specialization using a Hebbian policy-gradient approach with "slow" noise. BMC Neurosci 10 (Suppl 1), P136 (2009). https://doi.org/10.1186/1471-2202-10-S1-P136

Download citation

Published: 13 July 2009
DOI: https://doi.org/10.1186/1471-2202-10-S1-P136

Eighteenth Annual Computational Neuroscience Meeting: CNS*2009

A model of cell specialization using a Hebbian policy-gradient approach with "slow" noise

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Neuroscience

Contact us

Eighteenth Annual Computational Neuroscience Meeting: CNS*2009

A model of cell specialization using a Hebbian policy-gradient approach with "slow" noise

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Neuroscience

Contact us