Skip to content

Advertisement

  • Poster presentation
  • Open Access

A unifying theory of ITD-based sound azimuth localization at the behavioral and neural levels

  • 1, 2Email author,
  • 1, 2,
  • 3 and
  • 1, 2
BMC Neuroscience201314 (Suppl 1) :P39

https://doi.org/10.1186/1471-2202-14-S1-P39

  • Published:

Keywords

  • Sound Source
  • Interaural Time Difference
  • Neural Data
  • Sound Source Localization
  • Head Related Transfer Function

In many species, azimuthal sound source localization relies on the processing of fine temporal differences between the incoming signals at both ears (interaural time differences, ITDs). There exists no consensual theory of ITD-based localization that explains the behavioral and neural data alike. The classical view of a place code for localization [1] is questioned by electrophysiological data [2], while its alternative is functionally inefficient [3]. We propose as a functional principle that the system performs a maximum-likelihood estimation of the position of the source given the cues in the stimulus. This Bayesian approach implies that the behavioral and neural data are constrained by natural distributions of binaural cues, as observed in acoustical recordings of head related transfer functions (HRTFs). We first record and analyze HRTFs in humans and cats. Then we discuss the implications of our hypothesis on psychoacoustical data in humans and electrophysiological data in the cat.

In a maximum-likelihood approach, the current observed cue is compared to the a priori distribution of cues (marginal prior normalization). It is thus fundamental to uncover what the cues are and how they are distributed across the spectrum. We recorded HRTFs in different species, and performed simulations of natural environments to quantify the robustness of ITD cues. We find that ITD is a frequency-dependent quantity that decreases by about 30% across the spectrum, and that such variations occur within the bandwidth of a cochlear filter. We also show how the distributions of cues vary across frequencies, in relation with various features of the environment such as reflections.

Because the ITD as a constant delay is an insufficient cue, azimuth should be extracted by the system based on a frequency-dependent representation of ITD. We test this prediction in a psychoacoustical setup. Using a matching paradigm, subjects are asked to adjust the lateralization of two noises with different frequency contents, by varying the ITD of one of the stimuli. The HRTF data allows us to predict that the higher frequency sounds should be matched with a lower ITD than the lower frequency sound. We show how this prediction is met both qualitatively and quantitatively in our experiment.

We give a model of the function of binaural cells in the cat brainstem. We predict the responses of those neurons to binaural beats at different frequencies from the cat HRTFs. We show how this simple model can explain already observed features of the electrophysiological literature [4], namely the presence of cells sensitive to frequency-dependent interaural delays.

Finally, we propose a spiking neuron implementation of this maximum-likelihood principle. Cells are tuned to the frequency-dependent cues of their best position by means of both cochlear mismatches and axonal delays [5]. The Bayesian marginal prior normalization is implemented through the use of inhibition. Probing the model with various input sources, in a simulated virtual environment, we show that the network accurately localizes sound sources, comparably with an optimal Bayesian observer. Moreover, this model predicts qualitative differences in those observations for mammals of different sizes such as the cat and gerbil.

Authors’ Affiliations

(1)
Equipe Audition, Département d'Etudes Cognitives, Ecole Normale Supérieure, Paris, 75005, France
(2)
Laboratoire Psychologie de la Perception, CNRS and Université Paris Descartes, Paris, 75006, France
(3)
Dominick P. Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, New York, USA

References

  1. Jeffress LA: A place theory of sound localization. J Comp Physiol Psychol. 1948, 41: 35-9.View ArticlePubMedGoogle Scholar
  2. Grothe B, Pecka M, McAlpine D: Mechanisms of Sound Localization in Mammals. Physiol Rev. 2010, 90: 983-1012. 10.1152/physrev.00026.2009.View ArticlePubMedGoogle Scholar
  3. Brette R: On the interpretation of sensitivity analyses of neural responses. J Acoust Soc Am. 2010, 128: 2965-2972. 10.1121/1.3488311.View ArticlePubMedGoogle Scholar
  4. Kuwada S, Yin TC: Binaural interaction in low-frequency neurons in inferior colliculus of the cat. I. Effects of long interaural delays, intensity, and repetition rate on interaural delay function. J Neurophysiol. 1983, 50: 981-999.PubMedGoogle Scholar
  5. Day ML, Semple MN: Frequency-dependent interaural delays in the medial superior olive: implications for interaural cochlear delays. J Neurophysiol. 2011Google Scholar

Copyright

© Benichoux et al; licensee BioMed Central Ltd. 2013

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Advertisement