Cooperation/supervision of a habit by a cognitive strategy in a goal-directed navigational paradigm

Hanoune, Souheïl; Banquet, Jean-Paul; Gaussier, Philippe; Quoy, Mathias

doi:10.1186/1471-2202-16-S1-P200

Volume 16 Supplement 1

24th Annual Computational Neuroscience Meeting: CNS*2015

Poster presentation
Open access
Published: 04 December 2015

Cooperation/supervision of a habit by a cognitive strategy in a goal-directed navigational paradigm

Souheïl Hanoune¹,
Jean-Paul Banquet¹,
Philippe Gaussier¹ &
…
Mathias Quoy¹

BMC Neuroscience volume 16, Article number: P200 (2015) Cite this article

717 Accesses
Metrics details

The Stimulus-Response (S-R) theory and Tolman's Cognitive Theory of behavior control both issued from behaviorism in the early 20th century still provide a relevant general framework to account for animal reward-based adaptive behavior. In this paper, we propose a new paradigm for representing and implementing both the cognitive strategy and the S-R habit strategy within a unitary coding frame. Based on a parallel learning of both strategies, the model explains how the fast learning cognitive strategy can supervise and accelerate the slow learning S-R habit strategy; and also how. In late learning stages, the habit strategy can overcome the cognitive. This parallel representation is inspired by the cortico-basal functional loops [1] and the cooperation between the cognitive associative loop, including the dorso-medial striatum and the mPF; and the sensory-motor loop, associated to the sensory motor cortex in relation with the dorso-lateral striatum.

The implementation of S-R habit strategy is based on a neural modified version of the classical Q-learning and is based on the model of [2], emulating the functioning of the sensory-motor loop. The states of the model are represented by hippocampal transitions, representing associations between two consecutive place-cells during the exploration of the environment, learned in the CA1-CA3 regions of the hippocampus. The cognitive strategy is based on a map representation of the environment namely the cognitive map [3]. Based on the association between learned transitions, the cognitive map allows the back-propagation of a reward within a tree, allowing the selection of the shortest path to the goal. While the cognitive map is quickly learned, the Q-values associated with the Q-learning are slower to acquire. On the other hand, the Q-learning tends to be more accurate than the cognitive map when fully learned.

The model exploits this speed difference in its parallel learning. The fast acquisition of the cognitive map allows the robot to quickly choose correct paths to the goal, and thus the time convergence of the Q-learning algorithm is optimized. The cooperation is based on the biasing of the selected transition by the cognitive map and the Q-learning in parallel (see Figure 1). In its early learning stage, the Q-learning biasing is too weak, and the cognitive map is dominant (Figure 2. VS Figure 2b), inducing the supervision of the S-R habit by the cognitive strategy. In the later learning stages, the Q-learning is stronger and more precise. Cooperation of the cognitive strategy and S-R habit enables a faster S-R learning; as shown in Figure 2. The lesion studies (Figure 2c, Figure 2d) show that the system maintains a coherent behavior event after the lesion of either of the structures supporting the two strategies. Also, the time responses highlight the superiority of the habit strategy after over-training (Figure 2.c VS Figure 2.d).

References

DeLong MR, Strick PL: Parallel organization of functionally segregated circuits linking basal ganglia and cortex. Annual Rev of Neurosci. 1986, 9 (1): 357-381.
Article Google Scholar
Hirel J, Gaussier P, Quoy M, Banquet J-P, Save E, Poucet B: The Hippocampo-cortical Loop: Spatio-Temporal Learning and Goal-oriented Planning in Navigation. Neural Networks. 2013, 43: 8-21.
Article PubMed CAS Google Scholar
O'Keefe J, Nadel L: The hippocampus as a cognitive map. Oxford University. 1978
Google Scholar

Download references

Acknowledgements

This work was supported by the ANR-NEUROBOT project (ANR-BLAN-SIMI2-LS-100617-13-01).

Author information

Authors and Affiliations

EIS Lab, University of Cergy-Pontoise, ENSEA - CNRS, Paris, France
Souheïl Hanoune, Jean-Paul Banquet, Philippe Gaussier & Mathias Quoy

Authors

Souheïl Hanoune
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Paul Banquet
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Gaussier
View author publications
You can also search for this author in PubMed Google Scholar
Mathias Quoy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Souheïl Hanoune.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Hanoune, S., Banquet, JP., Gaussier, P. et al. Cooperation/supervision of a habit by a cognitive strategy in a goal-directed navigational paradigm. BMC Neurosci 16 (Suppl 1), P200 (2015). https://doi.org/10.1186/1471-2202-16-S1-P200

Download citation

Published: 04 December 2015
DOI: https://doi.org/10.1186/1471-2202-16-S1-P200

24th Annual Computational Neuroscience Meeting: CNS*2015

Cooperation/supervision of a habit by a cognitive strategy in a goal-directed navigational paradigm

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Neuroscience

Contact us

24th Annual Computational Neuroscience Meeting: CNS*2015

Cooperation/supervision of a habit by a cognitive strategy in a goal-directed navigational paradigm

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Neuroscience

Contact us