Evaluation and comparison of different machine learning approaches to auditory spectro-temporal receptive field estimation
BMC Neuroscience volume 12, Article number: P4 (2011)
The linear spectro-temporal receptive field (STRF) is a well-known approach to describe which features are encoded by auditory cortical neurons . It is defined as the linear filter that, when convolved with the envelope of a stimulus, gives a linear estimate of the spike rate evoked by that stimulus. A common STRF estimation method is reverse correlation, also known as spike-triggered average (STA), where the stimulus parts preceding the spikes are averaged in a specific time window. Linear regression approaches estimate an STRF based on the ensemble-averaged response spike rate. In  we have shown that the linear STRF model can be reformulated in terms of linear classification and a novel method using a support vector machine (SVM) classifier has been presented. Hence, given a set of stimuli with evoked response ensembles measured for a neuron the STRF of that specific neuron can be estimated using any of these methods.
Reverse correlation, linear regression and the SVM-based approach are evaluated using neural recordings from the primary auditory cortex of mongoelan gerbils  and synthetic data created using an inhomogeneous Linear-nonlinear Poisson (LNP) model with refractory period . In the LNP model, STRFs with different characteristics are used in the linear stage of the model and the average spike rate and the number of trials are varied. Complexes of frequency modulated (FM) sweeps as described in  are used as stimuli for both types of data. All methods are tested and evaluated using 5-fold cross-validation (CV). Model complexity in terms of Principal Components (PC) for linear regression and the SVM classifier is determined by using the least complex model within one standard deviation of the maximum mean coherence between estimated and predicted spike rate. The resulting STRFs are evaluated using mean coherence, the area under the receiver operating curve (AUC) for single spike classification and STRF variability for the different CV folds.
As shown in Figure 1, linear regression and SVM classification produce STRFs that are better predictors for responses of cortical neurons than traditional reverse correlation. The classification-based approach yields the best response field characterization for single-trial data as well as for data with multiple trials. In general, linear regression and SVM classification produce STRFs with very similar structure and high predictive power for rather linear neurons validating the reformulation of the STRF model.
Theunissen FE, Sen K, Doupe AJ: Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds. J Neurosci. 2000, 20: 2315-2331.
Happel MFK, Müller SG, Anemüller J, Ohl FW: Predictability of STRFs in auditory cortex neurons depends on stimulus class. Interspeech. 2008, 670.
Meyer AF, Happel MFK, Ohl FW, Anemüller J: Estimation of spectro-temporal receptive fields based on linear support vector machine classification. BMC Neuroscience. 2009, 10: P147-10.1186/1471-2202-10-S1-P147.
Schwartz O, Pillow JW, Rust NC, Simoncelli EP: Spike-triggered neural characterization. J Vis. 2006, 6: 484-507. 10.1167/6.4.13.
About this article
Cite this article
Meyer, A.F., Diepenbrock, J.P., Ohl, F.W. et al. Evaluation and comparison of different machine learning approaches to auditory spectro-temporal receptive field estimation. BMC Neurosci 12 (Suppl 1), P4 (2011). https://doi.org/10.1186/1471-2202-12-S1-P4