  • Poster presentation
  • Open access

A reservoir network model for sensory-guided probabilistic decision making

Animals often must respond appropriately to novel stimuli on the basis of previous sensory experience. To investigate the neural mechanism underlying this sensory-guided decision making, we conducted multiunit recordings from rats performing a two-alternative choice task. The rats were trained to make a LEFT or a RIGHT choice in response to two auditory cues, and their behavior was then tested for these familiar cues and for other, novel cues. Their choice probabilities generally varied in a graded manner: the probability of choosing an option changed gradually with the frequency difference between familiar and novel cues. However, we also observed large variability in choice behavior across rats: choice probabilities for novel cues were near chance level in some rats, whereas they changed systematically with tone frequency in others. We explored the mechanism underlying such decision behavior, and the possible origin of the individual behavioral differences, in a neural network model (Fig. 1A). Our model can be viewed as a reservoir network [1] and learns to associate two familiar cues with two alternative choices through reinforcement learning with an eligibility trace [2].

Figure 1

Our model successfully replicated the graded choice behavior observed in our experiment. We further analyzed how neural dynamics in the reservoir network determine the choice probability for familiar and novel cues. We found that a familiar stimulus sequentially activates a relatively small fraction of reservoir neurons, and that reinforcement learning trains the output connections from these neurons such that only the appropriate output neuron is activated by the neural trajectory evoked in the reservoir. We further found that choice responses to novel cues become graded because of trial-by-trial overlaps between the familiar trajectories and the trajectories evoked by novel cues.
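The mechanism described above can be illustrated with a minimal rate-based sketch. It is not the model used in this work: the network sizes, the normalised tone axis, the Gaussian input tuning, the `tanh` rate units and the simple reward-modulated update are all assumptions chosen for brevity, and the update rule only stands in for the eligibility-trace learning of [2]. All function and parameter names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: input neurons, reservoir neurons, time steps per trial.
N_IN, N_RES, T = 20, 100, 20
# Preferred tones of the input neurons on a normalised frequency axis (assumption).
PREFERRED = np.linspace(0.0, 1.0, N_IN)

def encode(tone, width=0.15):
    """Gaussian frequency tuning of the input neurons (assumed form)."""
    return np.exp(-0.5 * ((tone - PREFERRED) / width) ** 2)

# Fixed random input and recurrent weights; only the readout is plastic.
W_in = rng.normal(0.0, 1.0, (N_RES, N_IN))
W = rng.normal(0.0, 1.0, (N_RES, N_RES))
W *= 0.9 / np.max(np.abs(np.linalg.eigvals(W)))  # rescale to spectral radius 0.9

def run_trial(tone):
    """Return the reservoir trajectory (T x N_RES) evoked by a constant tone."""
    x = np.zeros(N_RES)
    states = []
    for _ in range(T):
        x = np.tanh(W @ x + W_in @ encode(tone))
        states.append(x)
    return np.array(states)

def p_left(W_out, states):
    """Probability of choosing LEFT from the time-averaged readout activity."""
    evidence = W_out @ states.mean(axis=0)  # two values: LEFT, RIGHT
    return 1.0 / (1.0 + np.exp(-(evidence[0] - evidence[1])))

def train(cues, n_trials=400, eta=1e-3, decay=0.9):
    """Reward-modulated readout learning with a decaying eligibility trace."""
    W_out = np.zeros((2, N_RES))
    for _ in range(n_trials):
        tone, correct = cues[rng.integers(len(cues))]
        states = run_trial(tone)
        # The trace is a leaky sum of reservoir activity over the trial.
        trace = np.zeros(N_RES)
        for x in states:
            trace = decay * trace + x
        # Stochastic choice, then reward delivered at the end of the trial.
        choice = 0 if rng.random() < p_left(W_out, states) else 1
        reward = 1.0 if choice == correct else 0.0
        # Only the weights onto the chosen output neuron are updated.
        W_out[choice] += eta * (reward - 0.5) * trace
    return W_out

# Two familiar cues: tone 0.2 -> LEFT (0), tone 0.8 -> RIGHT (1).
W_out = train([(0.2, 0), (0.8, 1)])
for tone in (0.2, 0.35, 0.5, 0.65, 0.8):
    print(f"tone={tone:.2f}  P(LEFT)={p_left(W_out, run_trial(tone)):.2f}")
```

In this sketch the choice probability for an untrained tone depends on how strongly its reservoir trajectory overlaps with the two trained trajectories, which is the intuition behind the graded responses to novel cues.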

Interestingly, our model also exhibited variability in choice responses similar to that observed across individual rats. We found that if input neurons are less sensitive to external stimuli, that is, if their frequency tuning curves are broad, the model network tends to show graded choice behavior for novel stimuli. In contrast, the model with more sensitive input neurons (with narrower tuning curves) tends to generate near-random choice behavior, with choice probability depending only weakly on the novel stimulus. We compared choice behavior between the models and the rats using quantitative measures and found that the behavioral tendency of our model is consistent with that of the rats (Fig. 1B).
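The effect of tuning width on the input representation can be checked with a small numerical sketch. It assumes Gaussian tuning curves on a normalised tone axis and measures the cosine similarity between the input population responses to a familiar and a novel tone; the tone values, widths and function names are hypothetical and are not taken from the model in this work.

```python
import numpy as np

# Hypothetical preferred tones of 50 input neurons on a normalised axis.
PREFERRED = np.linspace(0.0, 1.0, 50)

def population_response(tone, width):
    """Response of every input neuron to a tone, assuming Gaussian tuning."""
    return np.exp(-0.5 * ((tone - PREFERRED) / width) ** 2)

def overlap(tone_a, tone_b, width):
    """Cosine similarity between the population responses to two tones."""
    a = population_response(tone_a, width)
    b = population_response(tone_b, width)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

familiar, novel = 0.3, 0.5
for width in (0.03, 0.1, 0.3):
    print(f"width={width:.2f}  overlap={overlap(familiar, novel, width):.3f}")
```

With broad tuning the novel tone drives much the same input neurons as the familiar tone, so its input pattern stays close to a trained one; with narrow tuning the two patterns are nearly orthogonal, leaving the trained readout with little to generalise from.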

These results suggest that some individual differences in decision-making behavior emerge from neural population dynamics rather than from differences in higher-level behavioral strategies.

References


  1. Maass W, Natschläger T, Markram H: Real-Time Computing Without Stable States: A New Framework for Neural Computation Based on Perturbations. Neural Comput. 2002, 14: 2531-2560.


  2. Izhikevich EM: Solving the distal reward problem through linkage of STDP and dopamine signaling. Cereb Cortex. 2007, 17: 2443-2452.



This work was partially supported by KAKEN-HI No. 255413, Grants-in-Aid for Scientific Research (no. 22115013) from MEXT.

Author information


Corresponding author

Correspondence to Tomoki Kurikawa.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated.


About this article


Cite this article

Kurikawa, T., Handa, T. & Fukai, T. A reservoir network model for sensory-guided probabilistic decision making. BMC Neurosci 16 (Suppl 1), P74 (2015).
