Surprise minimization as a learning strategy in neural networks

Faraji, Mohammad Javad; Preuschoff, Kerstin; Gerstner, Wulfram

doi:10.1186/1471-2202-16-S1-P77

Volume 16 Supplement 1

24th Annual Computational Neuroscience Meeting: CNS*2015

Poster presentation
Open access
Published: 04 December 2015

Surprise minimization as a learning strategy in neural networks

Mohammad Javad Faraji¹,
Kerstin Preuschoff² &
Wulfram Gerstner¹

BMC Neuroscience volume 16, Article number: P77 (2015) Cite this article

1209 Accesses
Metrics details

Surprise is informative because it drives attention and modifies learning. Not only has it been described at different stages of neural processing [1], but it is a central concept in higher levels of abstraction such as learning and memory formation [2]. Several methods, including Bayesian and information theoretical approaches, have been used to quantify surprise. In Bayesian surprise, only data observations which substantially affect the observer's beliefs yield surprise [3, 4]. In Shannon surprise, however, observations that are rare or less likely to happen are considered surprising [5]. Although each of the existing measures partly incorporates conceptual aspects of surprise, they still suffer from some drawbacks including implausibility from the view point of neural implementation.

We first review the two probability-based surprise measures above, and discuss their pros. We then propose a novel measure for calculating surprise which benefits from the advantages of both measures. Importantly, the proposed measure benefits from calculating surprise during learning phase (e.g., inference about parameters in Bayesian framework). This is in contrast to Bayesian surprise where the surprise calculation is not prior to the inference step. Our proposed method can also be neurally implemented in a feed-forward neural network.

Furthermore, we propose a principle of (future) surprise minimization as a learning strategy; that is if something unexpected (surprising) happens, the subjective internal model of the external world should be modified such that the same observation becomes less surprising if it happens again in the not so distant future. We mathematically describe a class of learning rules which obey that principle. We show that standard Bayesian updating and the likelihood maximization technique both belong to such class. It accredits usage of well-known inference techniques in frequentist and Bayesian frameworks from a novel perspective. As a consequence, we propose a modified Bayesian method for updating beliefs about the world. This learning rule also obeys the principle of surprise minimization. In this method, the influence of the likelihood term on the posterior belief can be controlled by a subjective parameter. We apply this technique to learning within changing environments. Modified Bayesian updating helps the learning agent to actively control the influence of new information on learning environments. As a result, the agent quickly adapts to the changing environments.

References

Fairhall AL, Lewen GD, Bialek W, van Steveninck RRR: Efficiency and ambiguity in an adaptive neural code. Nature. 2001, 412 (6849): 787-792.
Article PubMed CAS Google Scholar
Ranganath C, Rainer G: Neural mechanisms for detecting and remembering novel events. Nature Reviews Neuroscience. 2003, 4 (3): 193-202.
Article PubMed CAS Google Scholar
Baldi P, Itti L: Of bits and wows: a Bayesian theory of surprise with applications to attention. Neural Networks. 2010, 23 (5): 649-666.
Article PubMed PubMed Central Google Scholar
Itti L, Baldi P: Bayesian surprise attracts human attention. Advances in neural information processing systems. 2005, 547-554.
Google Scholar
Shannon CE: A mathematical theory of communication. ACM SIGMOBILE Mobile Computing and Communications Review. 2001, 5 (1): 3-55.
Article Google Scholar

Download references

Acknowledgements

This research was supported by the European Research Council (grant agreement no. 268 689).

Author information

Authors and Affiliations

School of Life Sciences, Brain Mind Institute and School of Computer and Communication Sciences, Ecole Polytechnique Federal de Lausanne (EPFL), CH-1015, Lausanne, Switzerland
Mohammad Javad Faraji & Wulfram Gerstner
Geneva Finance Research Institute, University of Geneva, CH-1211, Geneva, Switzerland
Kerstin Preuschoff

Authors

Mohammad Javad Faraji
View author publications
You can also search for this author in PubMed Google Scholar
Kerstin Preuschoff
View author publications
You can also search for this author in PubMed Google Scholar
Wulfram Gerstner
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohammad Javad Faraji.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Faraji, M.J., Preuschoff, K. & Gerstner, W. Surprise minimization as a learning strategy in neural networks. BMC Neurosci 16 (Suppl 1), P77 (2015). https://doi.org/10.1186/1471-2202-16-S1-P77

Download citation

Published: 04 December 2015
DOI: https://doi.org/10.1186/1471-2202-16-S1-P77

24th Annual Computational Neuroscience Meeting: CNS*2015

Surprise minimization as a learning strategy in neural networks

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Neuroscience

Contact us

24th Annual Computational Neuroscience Meeting: CNS*2015

Surprise minimization as a learning strategy in neural networks

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Neuroscience

Contact us