Interocular induction of illusory size perception
© Song et al; licensee BioMed Central Ltd. 2011
Received: 2 August 2010
Accepted: 11 March 2011
Published: 11 March 2011
The perceived size of objects not only depends on their physical size but also on the surroundings in which they appear. For example, an object surrounded by small items looks larger than a physically identical object surrounded by big items (Ebbinghaus illusion), and a physically identical but distant object looks larger than an object that appears closer in space (Ponzo illusion). Activity in human primary visual cortex (V1) reflects the perceived rather than the physical size of objects, indicating an involvement of V1 in illusory size perception. Here we investigate the role of eye-specific signals in two common size illusions in order to provide further information about the mechanisms underlying illusory size perception.
We devised stimuli so that an object and its spatial context associated with illusory size perception could be presented together to one eye or separately to two eyes. We found that the Ponzo illusion had an equivalent magnitude whether the objects and contexts were presented to the same or different eyes, indicating that it may be largely mediated by binocular neurons. In contrast, the Ebbinghaus illusion became much weaker when objects and their contexts were presented to different eyes, indicating important contributions to the illusion from monocular neurons early in the visual pathway.
Our findings show that two well-known size illusions - the Ponzo illusion and the Ebbinghaus illusion - are mediated by different neuronal populations, and suggest that the underlying neural mechanisms associated with illusory size perception differ and can be dependent on monocular channels in the early visual pathway.
Although the visual system has been intensively studied and a lot is known about the processing of simple object features such as orientation, luminance, colour, motion, and shape, surprisingly little is known about the neural processes underlying size perception. The spatial extent of neural activity in human primary visual cortex (V1) reflects the perceived size rather than the physical size of an object in a Ponzo-like illusion [4, 5], suggesting a possible role for V1 in illusory size perception. The geniculostriate visual pathway is segregated into monocular pathways from the left and right eyes. The first stage at which information from the two eyes converges is V1, but it nonetheless contains populations of monocular neurons, which respond, with varying degree of exclusivity, to direct stimulation from only one of the eyes [6–8]. Therefore, it is unclear whether binocular or monocular neuronal populations of V1 are involved in the perception of the Ponzo illusion. Moreover, it remains unknown whether other forms of illusory size perception such as the Ebbinghaus illusion are mediated by the same or different neuronal mechanisms. The Ponzo and the Ebbinghaus illusions are induced by spatial contexts suggesting a role for different types of contextual information. While the contexts in the Ponzo illusion (Figure 1A and 1B) contain monocular depth clues that globally affect both objects, the contexts in the Ebbinghaus illusion (Figure 1C) are simple geometric patterns that locally affect the adjacent object but not the other object. Such discrepancy between the two illusions may be associated with the involvement of different neuronal populations.
Here we studied the extent to which monocular and binocular neurons in the human visual system were involved in two different size illusions: the Ponzo illusion and the Ebbinghaus illusion. We took advantage of the well-described functional organization of the visual system to infer the cortical stage at which the illusory size perception occurred. In visual cortices beyond V1 almost all neurons are binocular, whereas in subcortical visual areas (such as the lateral geniculate nucleus in the thalamus) and V1 a large proportion of neurons are monocular. Interocular transfer paradigms can reveal the degree of binocularity in illusory size perception and allow us to make inferences concerning the neuronal populations or neural stages involved [9, 10]. If a spatial context in one eye exerts an influence on the perceived size of an object presented to the other eye, this suggests that the illusory size perception is mediated by binocular neurons at V1 or higher visual areas. Conversely, if the process is substantially reduced under dichoptic presentation (i.e., presenting the objects and their spatial contexts to different eyes), this implicates the involvement of monocular neurons early in the visual system such as at lateral geniculate nucleus (LGN) or V1.
We therefore devised stimuli in which an object and its spatial context associated with a size illusion could be presented together to one eye (monocular presentation) or separately to the two different eyes (dichoptic presentation). In separate experiments, we quantified the magnitude of illusory size perception induced by the two different (Ponzo vs. Ebbinghaus) illusions, and examined how the magnitude of the illusion was affected (if at all) by these interocular manipulations. We found that the Ponzo illusion showed equivalent magnitudes in dichoptic and monocular presentations, but the Ebbinghaus illusion in contrast was much weaker in dichoptic compared to monocular presentation.
We studied the two illusions under monocular, dichoptic, and binocular conditions: the spatial contexts were presented to only one eye, and the objects were presented to either the same eye (monocular), or the opposite eye (dichoptic), or to both eyes simultaneously (binocular). Contrasting the monocular and dichoptic conditions allowed us to evaluate the degree of interocular transfer in illusory size perception, whereas contrasting the binocular condition with the monocular and dichoptic conditions allowed us to infer the linearity vs. nonlinearity of the interocular transfer effect. For a better comparison of the Ebbinghaus and the Ponzo illusions, we matched the spatial configuration of the two illusion stimuli by using a horizontal configuration for the Ponzo illusion (Figure 1A, ). We also studied the vertical configuration of the Ponzo illusion (Figure 1B, ) to generalize the findings.
Similar to the Ponzo illusion, the Ebbinghaus illusion persisted regardless of the type of interocular manipulation (Figure 2B; monocular, t(5) = 7.4, p < 0.001; dichoptic, t(5) = 4.3, p < 0.01; binocular, t(5) = 5.6, p < 0.01; right tailed t-test), but changed in magnitude across different manipulations (F(2,10) = 16.0, p < 0.001, one-way repeated measures ANOVA). Interestingly, the change in the magnitude of the Ebbinghaus illusion showed a very different pattern from that of the Ponzo illusion. Presenting the objects and their spatial contexts to different eyes greatly weakened the Ebbinghaus illusion (dichoptic vs. monocular, t(5) = 4.3, p < 0.01; paired t-test). Further, when the objects were presented to both eyes simultaneously and the contexts to only one eye, the Ebbinghaus illusion had an intermediate magnitude (binocular vs. monocular, t(5) = 4.1, p < 0.01; binocular vs. dichoptic, t(5) = 2.9, p < 0.05; paired t-test), and the regression model suggested a linear interaction between the monocular and dichoptic conditions (β1 = 0.44 ± 0.15, β2 = 0.04 ± 0.12, 95% confidence interval; R2 = 0.9333).
Our study suggests that two well-known size illusions - the Ebbinghaus illusion and the Ponzo illusion - arise from different neuronal mechanisms. The Ponzo illusion showed complete interocular transfer; that is, the strength of the illusion was equivalent regardless of whether the inducing spatial context was presented to the same or different eye as the objects whose perceived size participants were asked to judge. Conversely, in a similar configuration there was significantly reduced interocular transfer for the Ebbinghaus illusion. The complete interocular transfer observed in the Ponzo illusion indicates that it is mediated by binocular neuronal populations where visual inputs from the two eyes are combined and the information about the eye of origin is lost. The induction of the Ebbinghaus illusion magnitude from monocular to dichoptic presentation indicates that it is in large part mediated by monocular neuronal populations. Thus, the neural mechanism underlying illusory size perception is likely to depend on the type of spatial contexts in which the objects appear.
In the early visual system, inputs from the left and right eye are clearly segregated, up until V1 where there exist neuronal populations with varying degrees of binocularity [6, 7]. A dichoptically presented stimulus will drive responses in neurons exclusively responsive to the same eye (i.e., monocular neurons), as well as binocular neurons but with a decreasing degree depending on their ocular dominance. The weak interocular transfer that we observed for the Ebbinghaus illusion is therefore an indication that this illusion could be mediated by V1 or even earlier in the geniculostriate pathway (e.g., LGN) where the majority of the neurons are monocular. In contrast, the full interocular transfer that we observed with the Ponzo illusion provides strong evidence that this form of illusory size perception reflects activity in visual areas at least as high as V1.
The conclusion that the two illusions have different underlying neural mechanisms is further supported by the intra-individual comparisons that showed a lack of correlation between the magnitudes of each illusion across individuals. Moreover, when a regression model was used to fit individual illusion magnitudes under different interocular manipulations, this suggested a linear interocular interaction in the Ebbinghaus illusion but a nonlinear interaction in the Ponzo illusion. The Ebbinghaus illusion magnitude under binocular conditions reflected a linear combination of that under monocular and dichoptic conditions, whereas in the Ponzo illusion a nonlinear combination was present. Such differences are also indicative of a monocular component in the Ebbinghaus illusion and a binocular component in the Ponzo illusion. Since monocular signals from the two eyes are subjective to nonlinear neural processing in binocular summation [12, 13], the nonlinearity in the Ponzo illusion suggests that the illusory size perception here takes place at or after the stage of binocular summation. At this stage, the illusion inducers, as inputs from a single eye, are weaker than the binocular inputs of objects, leading to the induction of illusion magnitude from monocular/dichoptic to binocular condition. In contrast, the linearity in the Ebbinghaus illusion indicates that the illusion takes effect separately at different monocular channels before binocular summation.
The discrepancy between the Ponzo and the Ebbinghaus illusion may be largely due to the difference in the spatial contexts that induce the illusory perception. While the contexts in the Ponzo illusion suggest three-dimensional depth/distance information, the contexts in the Ebbinghaus illusion are simple geometric forms/contours. Although the slant contexts in the Ponzo illusion are monocular rather than binocular depth clues, it is plausible that the processing of monocular depth clues also requires the engagement of binocular neurons  as does the processing of binocular disparity clues . The simple geometric contexts in the Ebbinghaus illusion, on the other hand, are likely to be processed already in V1 where a large proportion of neurons are monocular . Notably, the contexts in the Ponzo illusion globally affect both objects, whereas the contexts in the Ebbinghaus illusion locally affect the adjacent object but not the other object. As the receptive fields of neurons increase significantly in size along the visual pathway , the processing of global contexts are likely to be mediated by higher visual areas where the neuronal receptive fields are large, and that of local contexts may be mediated by neurons with small receptive fields in lower visual areas. Thus, the different cognitive natures of the two illusions may be the underlying cause of the different neuronal involvements.
Interestingly, similarly to our findings in the Ebbinghaus illusion, other contextual effects such as collinear facilitation  and the tilt illusion  also exhibit partial interocular transfer. In collinear facilitation, the threshold for detecting a low contrast Gabor patch is enhanced when it is flanked by two higher contrast patches sharing the same orientation [20, 21]. In the tilt illusion, the perceived orientation of a central grating is biased by the orientation of the surrounding grating . Both of these illusions are dependent on the local contexts of stimulus orientation, a feature known to be processed by V1 neurons [6, 7]. The lateral intrinsic connections in V1 [23–25] which link neurons with similar receptive field properties  are hypothesized to modulate such contextual influences on visual perception . It is conceivable that the Ebbinghaus illusion also depends on these lateral connections.
Our study shows that monocular and binocular processing is involved in different ways in two phenomenally similar size illusions. While the Ponzo illusion was completely binocular and may largely involve higher visual areas, the Ebbinghaus illusion was primarily mediated by monocular pathways and therefore probably arises in primary visual cortex or subcortical visual areas. These findings suggest that the extent to which different neuronal populations are involved in illusory size perception is dependent on the spatial contexts in which the objects appear. This raises the possibility that different types of size illusions will affect our visually guided actions and interact with other visual pathways (e.g., color processing) in different ways. Such topics would be of interest for future study.
Six healthy, right-handed participants (4 females, 2 males, aged 22 to 32) with normal or corrected-to-normal vision participated in this study. All gave written informed consent, and the study was approved by the local ethics committee. Apart from two of the authors (CS and DSS), the other participants were naive to the aims of the experiments and received payment for participation. Five of the six participants took part in all three experiments, and one participant took part in experiment 1 and experiment 2.
Visual stimuli were programmed in Matlab Psychtoolbox  and were presented on a calibrated CRT monitor (size 22", spatial resolution 1024 × 768 pixels, refresh rate 100 Hz). The experiments were conducted in a darkened room with the monitor providing the only significant source of light. The left-eye and the right-eye stimuli were presented on the left and the right half of the monitor, respectively, and the participants viewed the stimuli at a distance of 67 cm through a mirror stereoscope with a chin and forehead rest. The participants indicated their responses by pressing the assigned keys on a keyboard.
The objects were two circles separated by a horizontal distance of 4 deg. The illusion inducers were seven big circles (1 deg in diameter) surrounding one object at a distance of 1.25 deg and nineteen small circles (0.1 deg in diameter) surrounding the other object at a distance of 0.4 deg (Figure 1C). The object surrounded by nineteen small circles was chosen as the reference object, and the object surrounded by seven big circles was chosen as the test object. Whether the small or big circles (and corresponding reference and test objects) appeared in left or right visual fields was randomized across trials. The diameter of the reference object was kept at 0.5 deg, and that of the test object was varied around 0.5 deg on a per-participant basis (see Procedures for details). These stimulus parameters were selected through preliminary experiments to maximize the magnitude of the illusion, while ensuring at the same time that there was adequate binocular fusion and no binocular suppression under dichoptic presentation.
To aid binocular convergence, a fixation cross (0.25 deg × 0.25 deg) was centered in the illusion stimuli, and a textured rectangle frame (inner edges: 8 deg × 4 deg; outer edges: 10 deg × 5 deg) was placed around the illusion stimuli. The circles and the fixation cross were at maximum luminance (white, 105 cd/m2), the background was at minimum luminance (black, 0.69 cd/m2), and the rectangle frame was inlaid with a stone texture for better binocular convergence.
Before the start of the experiments, nonius lines/illusory stimuli were dichoptically presented on the monitor, and each participant adjusted the stereoscope as well as the stimulus location, till the left-eye and the right-eye stimuli were well fused. During the experiments, the participants were instructed to press a special key if the left-eye and the right-eye stimuli were not well fused (e.g., the objects and the illusion inducers were misaligned), and the same trial was then repeated. For all six participants, less than 5% of the trials were reported not well fused.
The experiment consisted of two parts. In the first part, the participants adjusted the size of the test object to perceptually match the size of the reference object. Four adjustments were made for each presentation condition (monocular, dichoptic, binocular). In two adjustments, the test object surrounded by seven big circles was presented to the left of the fixation cross, and the reference object surrounded by nineteen small circles was presented to the right of the fixation cross. In the other two adjustments, the spatial configurations of the two objects (i.e., left or right of the fixation cross) were reversed.
After the first part, nine size values, which spread around the average result of the twelve adjustments (three presentation conditions times four adjustments), were chosen. The nine values were -0.1 deg, -0.075 deg, -0.05 deg, -0.025 deg, 0 deg, 0.025 deg, 0.05 deg, 0.075 deg, 0.1 deg away from the average adjustment value, and they were used as the size of the test object for the second part. In the second part, after 500ms presentation of the illusion stimuli, participants judged which of the two objects (the one to the left or the one to the right of the fixation cross) was the larger one. Twenty trials were tested for each combination of presentation condition and object size (three presentation conditions times nine object sizes resulting in a total of twenty-seven combinations). The test object was presented to the left of the fixation cross for ten trials and to the right for the other ten. The sequence of the trials was randomized and counterbalanced for the presentation condition (monocular, dichoptic, binocular), the test object size, and the test object location (i.e., to the left or right of fixation). After each trial, a high-contrast, dynamical, coloured noise stimulus was presented for 1000ms to prevent any potential interference between trials.
In experiment 2, we studied the Ponzo illusion under monocular, dichoptic, and binocular presentation of objects. For a better comparison of the results from experiment 1 and experiment 2, we matched the spatial configuration of the Ponzo illusion stimulus to that of the Ebbinghaus illusion stimulus. The two objects in the Ponzo illusion were aligned horizontally  instead of vertically , and similarly, the illusion inducers were two lines converged horizontally instead of vertically (Figure 1A). The two converging lines (length: 4.8 deg; width: 0.06 deg; converged at 25 deg) provided the depth impression that the two objects were located at different distances from the observer. Again, whether the lines converged from left to right or right to left (and the corresponding position of test and reference object) was randomized from trial to trial. The circle that should be perceived as farther away from the observer was chosen as the reference object, and the circle at the front side was chosen as the test object. We also matched the spatial distance between the objects and their surrounds in the two illusions; the converging lines in the Ponzo illusion surrounded the test object at a spatial distance of 1.25 deg and the reference object at 0.4 deg. Except for the above-mentioned changes in the stimulus parameters, experiment 2 was identical to experiment 1.
In experiment 3, we studied the Ponzo illusion with conventional vertical inducers (Figure 1B)  under monocular, dichoptic, and binocular presentation of objects. The parameters and procedures of experiment 3 were identical to those of experiment 2, except that the objects and the illusion inducers were in the vertical instead of the horizontal configuration.
R2 of data fitting with logistic function
Exp2. Horizontal Ponzo
Exp3. Vertical Ponzo
We would like to thank Randolph Blake for helpful discussions. This work was supported by the Wellcome Trust.
- Fisher GH: Detection of visual stimuli located within angles. Nature. 1967, 215: 553-554. 10.1038/215553a0.View ArticlePubMedGoogle Scholar
- Massaro DW, Anderson NH: Judgmental model of the Ebbinghaus illusion. Journal of Experimental Psychology. 1971, 89: 147-151. 10.1037/h0031158.View ArticlePubMedGoogle Scholar
- Franz VH, Scharnowski F, Gegenfurtner KR: Illusion Effects on Grasping Are Temporally Constant Not Dynamic. Journal of Experimental Psychology: Human Perception and Performance. 2005, 31: 1359-1378. 10.1037/0096-15188.8.131.529.PubMedGoogle Scholar
- Murray SO, Boyaci H, Kersten D: The representation of perceived angular size in human primary visual cortex. Nature Neuroscience. 2006, 9: 429-434. 10.1038/nn1641.View ArticlePubMedGoogle Scholar
- Fang F, Boyaci H, Kersten D, Murray SO: Attention-Dependent Representation of a Size Illusion in Human V1. Current Biology. 2008, 18: 1707-1712. 10.1016/j.cub.2008.09.025.PubMed CentralView ArticlePubMedGoogle Scholar
- Hubel DH, Wiesel TN: Receptive fields, binocular interaction and functional architecture in the cat's visual cortex. Journal of Physiology. 1962, 160: 106-154.PubMed CentralView ArticlePubMedGoogle Scholar
- Hubel DH, Wiesel TN: Receptive fields and functional architecture of monkey striate cortex. Journal of Physiology. 1968, 195: 215-243.PubMed CentralView ArticlePubMedGoogle Scholar
- Adams DL, Sincich LC, Horton JC: Complete pattern of ocular dominance columns in human primary visual cortex. Journal of Neuroscience. 2007, 27: 10391-10403. 10.1523/JNEUROSCI.2923-07.2007.View ArticlePubMedGoogle Scholar
- Steiner V, Blake R, Rose D: Interocular transfer of expansion, rotation, and translation motion after effects. Perception. 1994, 23: 1197-1202. 10.1068/p231197.View ArticlePubMedGoogle Scholar
- Maffei L, Berardi N, Bisti S: Interocular transfer of adaptation after effect in neurons of area 17 and 18 of split chiasm cats. Journal of Neurophysiology. 1986, 55: 966-976.PubMedGoogle Scholar
- Leibowitz HW, Judisch JM: The relation between age and the magnitude of the Ponzo illusion. American Journal of Psychology. 1967, 80: 105-109. 10.2307/1420548.View ArticlePubMedGoogle Scholar
- Campbell F, Green DG: Monocular versus binocular visual acuity. Nature. 1965, 208: 191-192. 10.1038/208191a0.View ArticlePubMedGoogle Scholar
- Blake R, Fox R: The psychophysical inquiry into binocular summation. Perception and Psychophysics. 1973, 14: 161-185. 10.3758/BF03198631.View ArticleGoogle Scholar
- Tsutsui KI, Sakata H Mand Taira: Neural mechanisms of three-dimensional vision. Neuroscience Research. 2005, 51: 221-229. 10.1016/j.neures.2004.11.006.View ArticlePubMedGoogle Scholar
- Roe AW, Parker AJ, Born RT, DeAngelis GC: Disparity channels in early vision. Journal of Neuroscience. 2007, 27: 11820-11831. 10.1523/JNEUROSCI.4164-07.2007.PubMed CentralView ArticlePubMedGoogle Scholar
- Li Z: A neural model of contour integration in the primary visual cortex. Neural Computation. 1998, 10: 903-940. 10.1162/089976698300017557.View ArticlePubMedGoogle Scholar
- Dumoulin SO, Wandell BA: Population receptive field estimates in human visual cortex. NeuroImage. 2008, 39: 647-660. 10.1016/j.neuroimage.2007.09.034.PubMed CentralView ArticlePubMedGoogle Scholar
- Huang PC, Hess RF, Dakin SC: Flank facilitation and contour integration: different sites. Vision Research. 2006, 46: 3699-3706. 10.1016/j.visres.2006.04.025.View ArticlePubMedGoogle Scholar
- Wade NJ: The influence of colour and contour rivalry on the magnitude of the tit illusion. Vision Research. 1980, 20: 229-233. 10.1016/0042-6989(80)90107-8.View ArticlePubMedGoogle Scholar
- Polat U, Sagi D: Lateral interactions between spatial channels: suppression and facilitation revealed by lateral masking experiments. Vision Research. 1993, 33: 993-997. 10.1016/0042-6989(93)90081-7.View ArticlePubMedGoogle Scholar
- Polat U, Sagi D: The architecture of perceptual spatial interactions. Vision Research. 1994, 34: 73-78. 10.1016/0042-6989(94)90258-5.View ArticlePubMedGoogle Scholar
- Schwartz O, Sejnowski TJ, Dayan P: Perceptual organization in the tilt illusion. Journal of Vision. 2009, 9: 1-20. 10.1167/9.4.19.View ArticlePubMedGoogle Scholar
- Hirsch JA, Gilbert CD: Synaptic physiology of horizontal connections in the cat's visual cortex. Journal of Neuroscience. 1991, 11: 1800-1809.PubMedGoogle Scholar
- Ts'o DY, Gilbert CD, Wiesel TN: Relationships between horizontal interactions and functional architecture in cat striate cortex as revealed by cross-correlation analysis. Journal of Neuroscience. 1986, 6: 1160-1170.PubMedGoogle Scholar
- Weliky M, Kandler K, Fitzpatrick D, Katz LC: Patterns of excitation and inhibition evoked by horizontal connections in visual cortex share a common relationship to orientation columns. Neuron. 1995, 15: 541-552. 10.1016/0896-6273(95)90143-4.View ArticlePubMedGoogle Scholar
- Bosking WH, Zhang Y, Schofield B, Fitzpatrick D: Orientation selectivity and the arrangement of horizontal connections in tree shrew striate cortex. Journal of Neuroscience. 1997, 17: 2112-2127.PubMedGoogle Scholar
- Chisum HJ, Mooser F, Fitzpatrick D: Emergent properties of layer 2/3 neurons reflect the collinear arrangement of horizontal connections in tree shrew visual cortex. Journal of Neuroscience. 2003, 23: 2947-2960.PubMedGoogle Scholar
- Brainard DH: The Psychophysics Toolbox. Spatial Vision. 1997, 10: 433-436. 10.1163/156856897X00357.View ArticlePubMedGoogle Scholar