Task and timing in visual processing

Rothenstein, Albert L; Tsotsos, John K

doi:10.1186/1471-2202-8-S2-P148

Volume 8 Supplement 2

Sixteenth Annual Computational Neuroscience Meeting: CNS*2007

Poster presentation
Open access
Published: 06 July 2007

Task and timing in visual processing

Albert L Rothenstein¹ &
John K Tsotsos¹

BMC Neuroscience volume 8, Article number: P148 (2007) Cite this article

1019 Accesses
Metrics details

The study of visual perception abounds with examples of surprising results, and perhaps none of these has generated more controversy than the speed of object recognition. Some complex objects can be recognized with amazing speed even while attention is engaged on a different task. Some simple objects need lengthy attentional scrutiny, and performance breaks down in dual-task experiments [1]. These results are fundamental to our understanding of the visual cortex, as they clearly show the interplay of the representation of information in the brain, attentional mechanisms, binding and consciousness.

We argue that the lack of a common terminology is a significant contributor to this controversy, and define several different levels of tasks as: Detection – is a particular item present in the stimulus, yes or no?; Localization – detection plus accurate location; Recognition – localization plus detailed description of stimulus; Understanding – recognition plus role of stimulus in the context of the scene.

It is clear from performance results that detection is not possible for all stimuli, and the difference must be in the internal representation of the different stimuli. For detection to be possible, the fast, feed-forward activation of a neuron (or pool of neurons) must represent the detected stimulus, which is consistent with the experimental finding that only highly over-learned and biologically relevant stimuli or broad stimulus categories can be detected. In detection tasks localization is poor or absent [2], so location needs to be recovered based on this initial representation. Given that detailed location and extent information is only available in the early processing areas, this must be accomplished by the ubiquitous feedback connections in the visual cortex. Once the location of a stimulus has been recovered and distracters inhibited, one or more subsequent feed-forward passes through the system can create a detailed representation of the selected stimulus.

Here we present a computational demonstration of how attention forms the glue between the sparse, fast, and parallel initial representation that supports object detection and the slow, serial, and detailed representations needed for full recognition. The Selective Tuning (ST) model of (object based) visual attention [3] can be used to recover the spatial location and extent of the visual information that has contributed to a categorical decision. This allows for the selective detailed processing of this information at the expense of other stimuli present in the image. The feedback and selective processing create the detailed population code corresponding to the attended stimulus. We suggest and demonstrate a possible binding mechanism by which this is accomplished in the context of ST, and show how this solution can account for existing experimental results.

References

Koch C, Tsuchiya N: Attention and consciousness: Two distinct brain processes. Trends in Cognitive Sciences. 2007, 11: 16-22. 10.1016/j.tics.2006.10.012.
Article PubMed Google Scholar
Evans KK, Treisman A: Perception of objects in natural scenes: Is it really attention free?. Journal of Experimental Psychology: Human Perception and Performance. 2005, 31 (6): 1476-1492. 10.1037/0096-1523.31.6.1476.
PubMed Google Scholar
Tsotsos JK, Culhane SM, Wai WYK, Lai YH, Davis N, Nuflo F: Modeling visual attention via selective tuning. Artif Intell. 1995, 78 (1–2): 507-545. 10.1016/0004-3702(95)00025-9.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science & Engineering and Centre for Vision Research, York University, Toronto, Canada
Albert L Rothenstein & John K Tsotsos

Authors

Albert L Rothenstein
View author publications
You can also search for this author in PubMed Google Scholar
John K Tsotsos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Albert L Rothenstein.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Rothenstein, A.L., Tsotsos, J.K. Task and timing in visual processing. BMC Neurosci 8 (Suppl 2), P148 (2007). https://doi.org/10.1186/1471-2202-8-S2-P148

Download citation

Published: 06 July 2007
DOI: https://doi.org/10.1186/1471-2202-8-S2-P148

Sixteenth Annual Computational Neuroscience Meeting: CNS*2007

Task and timing in visual processing

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Neuroscience

Contact us

Sixteenth Annual Computational Neuroscience Meeting: CNS*2007

Task and timing in visual processing

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Neuroscience

Contact us