Volume 16 Supplement 1

24th Annual Computational Neuroscience Meeting: CNS*2015

Open Access

The formation of habits in the neocortex under the implicit supervision of the basal ganglia

BMC Neuroscience201516(Suppl 1):P212

https://doi.org/10.1186/1471-2202-16-S1-P212

Published: 18 December 2015

If basal ganglia are widely accepted to participate in the high-level cognitive function of decision-making, their role is less clear regarding the formation of habits [1, 2]. One of the biggest problem is to understand how goal-directed actions are transformed into habitual responses, or, said differently, how an animal can shift from an action-outcome (A-O) system to a stimulus-response (S-R) one, while maintaining a consistent behavior. We introduce a computational model (basal ganglia, thalamus and cortex) that can solve a simple two arm-bandit task using reinforcement learning and explicit valuation of the outcome [3]. Hebbian learning has been implemented at the cortical level such that the model learns each time a move is issued, rewarded or not. Then, by inhibiting the output nuclei of the model (GPi), we show how learning has been transferred from the basal ganglia to the cortex, simply as a consequence of the statistics of the different choices (see Figure 1). Because best (in the sense of most rewarded) actions are chosen more often, this directly impacts the amount of Hebbian learning and lead to the formation of habits within the cortex. These results have been confirmed in monkeys doing the exact same task where the BG has been inactivated using muscimol. This tends to show that the basal ganglia implicitly teach the cortex in order for it to learn the values of new options. In the end, the cortex is able to solve the task perfectly, even if it exhibits slower reaction times.
Figure 1

Left. In habitual condition (HC), performances are optimal, with or without GPi. In novel condition (NC), only the intact model (with GPi) is able to learn the new stimuli while lesioned model performances stay at the level f chance. Right. Analysis of the data shows that reaction time is higher in normal condition as compared to habitual condition with active and inactive GPi. The later increases significantly the reaction time in both conditions. All data have been averaged over 250 simulations.

Authors’ Affiliations

(1)
INRIA Bordeaux Sud-Ouest
(2)
Université de Bordeaux
(3)
LaBRI, Université de Bordeaux, IPB, CNRS
(4)
Laboratoire Franco-Israélien de Neurosciences, CNRS Bordeaux

References

  1. Yin HH, Knowlton BJ: The role of the basal ganglia in habit formation. Nature Reviews Neuroscience. 2006, 7 (6): 464-476.PubMedView ArticleGoogle Scholar
  2. Seger CA, Spiering BJ: A critical review of habit learning and the basal ganglia. Frontiers in Systems Neuroscience. 2011, 5: 66-PubMedPubMed CentralView ArticleGoogle Scholar
  3. Guthrie M, Leblois A, Garenne A, Boraud T: Interaction between cognitive and motor cortico-basal ganglia loops during decision making: a computational study. Journal of Neurophysiology. 2013, 109 (12): 3025-3040.PubMedView ArticleGoogle Scholar

Copyright

© Topalidou et al. 2015

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Advertisement