On the matching of top-down knowledge with sensory input in the perception of ambiguous speech
© Eulitz and Hannemann; licensee BioMed Central Ltd. 2010
Received: 7 November 2008
Accepted: 2 June 2010
Published: 2 June 2010
How does the brain repair obliterated speech and cope with acoustically ambivalent situations? A widely discussed possibility is to use top-down information for solving the ambiguity problem. In the case of speech, this may lead to a match of bottom-up sensory input with lexical expectations resulting in resonant states which are reflected in the induced gamma-band activity (GBA).
In the present EEG study, we compared subjects' pre-attentive GBA responses to obliterated speech segments presented after a series of correct words. The words formed a minimal pair in German and differed with respect to the degree of specificity of segmental phonological information.
The induced GBA was larger when the expected lexical information was phonologically fully specified compared to the underspecified condition. Thus, the degree of specificity of phonological information in the mental lexicon correlates with the intensity of the matching process of bottom-up sensory input with lexical information.
These results together with those of a behavioural control experiment support the notion of multi-level mechanisms involved in the repair of deficient speech. The delineated alignment of pre-existing knowledge with sensory input is in accordance with recent ideas about the role of internal forward models in speech perception.
At the level of speech, most conversations are considerably unclear. How does the brain cope with partly obliterated speech information, and how does pre-existing knowledge support these coping processes? It has been suggested that lexical information can be restored by using top-down lexical knowledge. Here we use the phonemic restoration illusion, in which listeners hear spoken words as intact even though parts of them have been replaced by an extraneous sound, to study these repair processes in detail.
Given the top-down lexical influences on phonemic processing [2, 3], the phonemic restoration illusion can be described as a match of bottom-up sensory input with lexical expectations resulting in resonant neural dynamics [4, 5]. Similar resonant states were first described in studies of feature binding in animals. In humans, such correlates can be measured as an enhancement of gamma-band activity (GBA), which is discussed, among other things, as a signature of object recognition [7, 8] and in relation to several mnemonic processes [9–11]. In language processing, a modulation of GBA was observed for the differentiation between words and pseudowords [12–14] as well as a correlate of merging expected lexical information with degraded speech input. The predominance of these effects over left anterior regions of the brain illustrates the involvement of language-competent brain areas.
The present experiment was designed to examine, by means of GBA, the "filling-in" of phonemic information in the course of phonemic restoration at the pre-attentive level. Whereas the role of top-down repair processes in phonemic restoration has been shown in previous ERP experiments for attentive listening to sentences [16, 17], a generalization to the pre-attentive level of processing is still missing. Evidence for a top-down influence on phonemic restoration at the pre-attentive level would, however, support the important role of top-down processing for speech perception in general.
We used a roving-standard passive oddball paradigm. The points of interest were the detection of a minimal change in the auditory object and the dependency of repair processes for ill-formed speech on the information structure of the preceding auditory object. The preceding object was one of two nouns forming a minimal pair in German (Falte and Falke). The process of merging expected lexical information with the sensory input is expected to result in resonant states, which are reflected in the induced GBA. The intensity of repair, and thus of the GBA, was expected to correlate with the amount of information that has to be aligned.
The assumption that the lexical representations of the two words used here contain different amounts of information was based on recent mental lexicon models, which assume abstract and sparse representations of language in the mental lexicon [19, 20]. To handle the huge variability of the speech signal during speech perception, these models propose that nondistinctive and predictable phonological information is not stored in declarative memory [19, 21]. Instead of storing all variance in phoneme realizations [22, 23], the abstract models assume the underspecification of certain phonemic features in the mental lexicon. Of relevance for the present study is the place of articulation information, which is sparser for the "t" in Falte (underspecified for the [coronal] place of articulation [19, 20]) than for the "k" in Falke, which is assumed to be phonologically fully specified.
To test whether the specification of phonological details modulates the restoration of phonemes, the induced GBA to the noise-replaced stimuli was investigated. We expected the induced GBA to differ between lexically specified and underspecified information in the precursor. In particular, the induced GBA was expected to be larger in the case of a fully specified anticipated phoneme, because more information from the predecessor has to be aligned and merged with the ambiguous auditory input. If this pattern is found in the present study, the GBA could be interpreted as a correlate of "filling in" the expected and lexically specified information to form a perceivable auditory object out of an ill-formed speech signal.
Alternatively, if the process of phonemic restoration does not differ according to the specification of the phoneme to be restored, or if the claim of underspecification in the mental lexicon [19, 20] does not hold, no differential modulation of induced GBA should be observable. Likewise, if there is no immediate top-down influence on phonemic restoration [24, 25], no differential modulation of induced GBA should be observable. To substantiate the induced brain activity as a correlate of merging lexical top-down expectancies with obliterated speech input, it should also be dissociable from evoked brain activity.
Nineteen healthy right-handed monolingual German-speaking volunteers without otolaryngological or neurological diseases participated in this study. Owing to a poor signal-to-noise ratio, three subjects had to be excluded, and all further analyses were performed for 16 subjects (eight female; mean age = 23.8 years, standard deviation [SD] = 3.1 years). All participants gave their written consent and received class credits or a small financial bonus. The study was conducted in compliance with the Declaration of Helsinki and approved by the ethics committee of the University of Konstanz.
Participants were seated in an electrically shielded and sound-attenuated room. During the experiment the subjects were instructed to ignore all stimuli while they watched a silent movie. Before the three blocks of passive listening, the subjects had to identify the three stimulus classes by pressing a key corresponding to the subjectively heard phoneme at the beginning of the second syllable.
To analyze the induced spectral changes in gamma-band activity (GBA; the principal approach followed a previously published procedure) in the artefact-free epochs from -400 ms to 1000 ms of the disconfirming and confirming items, a wavelet analysis using Morlet wavelets with an m-factor of 7 was performed. Offering a good compromise between frequency and time resolution, this method provides a time-varying magnitude of the signal in each frequency band, leading to time-frequency representations of the signal. The time-by-frequency energy is then averaged across single trials, which allows non-phase-locked frequency components to be analyzed. This method is described in detail elsewhere. To achieve a good time and frequency resolution, wavelets from 10 to 100 Hz were computed in 2 Hz steps. Next, the raw wavelet data were normalized by computing the relative power change for every time-by-frequency bin compared to the median of the corresponding baseline, which was defined as the latency range from -200 to -100 ms relative to stimulus onset.
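The pipeline described above (Morlet convolution, averaging of single-trial power, normalization to the baseline median) can be sketched in a few lines. This is only an illustrative sketch on synthetic data, not the authors' code: it assumes that the m-factor denotes the ratio of centre frequency to spectral width, and the names `morlet`, `wavelet_power`, `mean_single_trial_power` and `relative_change`, the sampling rate of 500 Hz and the simulated 40 Hz burst are all hypothetical.

```python
import cmath
import math
import random
from statistics import median


def morlet(freq_hz, sfreq, m=7.0, n_sigmas=3.0):
    """Unit-energy complex Morlet wavelet; m = freq / sigma_f is an assumed reading of the m-factor."""
    sigma_t = m / (2.0 * math.pi * freq_hz)        # temporal standard deviation in seconds
    half = int(round(n_sigmas * sigma_t * sfreq))  # half-width of the wavelet in samples
    taps = []
    for k in range(-half, half + 1):
        t = k / sfreq
        envelope = math.exp(-t * t / (2.0 * sigma_t ** 2))
        taps.append(envelope * cmath.exp(2j * math.pi * freq_hz * t))
    norm = math.sqrt(sum(abs(w) ** 2 for w in taps))
    return [w / norm for w in taps]


def wavelet_power(signal, wavelet):
    """Squared magnitude of the convolution of one trial with the wavelet (edges zero-padded)."""
    half = len(wavelet) // 2
    n = len(signal)
    power = []
    for i in range(n):
        acc = 0j
        for j, w in enumerate(wavelet):
            idx = i + half - j
            if 0 <= idx < n:
                acc += signal[idx] * w
        power.append(abs(acc) ** 2)
    return power


def mean_single_trial_power(trials, wavelet):
    """Average power over trials; this keeps non-phase-locked activity."""
    per_trial = [wavelet_power(trial, wavelet) for trial in trials]
    return [sum(column) / len(per_trial) for column in zip(*per_trial)]


def relative_change(power, times, baseline=(-0.2, -0.1)):
    """Percent change of each bin relative to the median of the baseline window."""
    ref = median(p for p, t in zip(power, times) if baseline[0] <= t <= baseline[1])
    return [100.0 * (p - ref) / ref for p in power]


# Synthetic demo (hypothetical data): weak noise plus a 40 Hz burst with random phase.
rng = random.Random(1)
sfreq = 500.0
times = [(i - 200) / sfreq for i in range(701)]    # -0.4 s to 1.0 s
trials = []
for _ in range(10):
    phase = rng.uniform(0.0, 2.0 * math.pi)
    trials.append([
        rng.gauss(0.0, 0.1)
        + (math.sin(2.0 * math.pi * 40.0 * t + phase) if 0.40 <= t <= 0.52 else 0.0)
        for t in times
    ])

change = relative_change(mean_single_trial_power(trials, morlet(40.0, sfreq)), times)
peak = max(c for c, t in zip(change, times) if 0.43 <= t <= 0.49)
```

Repeating the same computation for each centre frequency in `range(10, 101, 2)` would give the 46 frequency rows of a time-frequency map. A real analysis would normally rely on an optimized FFT-based implementation rather than this direct convolution.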
To capture a wide range of cortical sources while maintaining a good signal-to-noise ratio, the mean spectral power of all disconfirming and confirming events was averaged over 6 electrode arrays with 6 electrodes each (Figure 3). Given the lack of exact a priori knowledge of the latencies and frequencies at which the process of phonemic restoration might be reflected in the gamma band, an approach similar to that of Hannemann et al., using permutation tests, was pursued: the difference in spectral power between disconfirming #-items with a K-item as predecessor and the associated confirming items was compared with the corresponding difference for a T-item as predecessor. In the present study these tests were applied to each time-frequency bin from 280 to 1000 ms post stimulus onset for frequencies between 30 and 60 Hz. To reduce the risk that time-frequency bins passed our criteria by chance, only bins that showed p < 0.01 (uncorrected) contiguously for at least 30 ms per frequency band were considered further.
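The two-stage screening described above can be illustrated as follows. This is a minimal sketch under stated assumptions: it uses a sign-flip permutation test on the per-subject double difference, which is one common paired-design variant and not necessarily the exact procedure used in the study, and the function names, the number of permutations and the 10 ms bin width are hypothetical.

```python
import random


def sign_flip_p(diffs, n_perm=2000, rng=None):
    """Two-sided permutation p-value for the mean of paired differences.

    Each element of `diffs` is one subject's double difference, e.g.
    (#1(k) - #2(k)) - (#1(t) - #2(t)) in a single time-frequency bin.
    """
    rng = rng or random.Random(0)
    observed = abs(sum(diffs))
    hits = 0
    for _ in range(n_perm):
        permuted = sum(d if rng.random() < 0.5 else -d for d in diffs)
        if abs(permuted) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


def contiguous_runs(pvals, step_ms, alpha=0.01, min_ms=30.0):
    """Return (start, stop) index pairs of runs with p < alpha lasting at least min_ms."""
    runs, start = [], None
    for i, p in enumerate(list(pvals) + [1.0]):   # the sentinel closes a trailing run
        if p < alpha and start is None:
            start = i
        elif p >= alpha and start is not None:
            if (i - start) * step_ms >= min_ms:
                runs.append((start, i))
            start = None
    return runs


# Hypothetical values: 16 subjects with a consistent effect, and a balanced null case.
p_effect = sign_flip_p([1.0 + 0.1 * i for i in range(16)])
p_null = sign_flip_p([1.0, -1.0] * 8)

# Hypothetical p-values along the time axis of one frequency band, 10 ms per bin.
runs = contiguous_runs([0.5, 0.001, 0.002, 0.003, 0.5, 0.001, 0.5], step_ms=10.0)
```

In this toy series only the three-bin run survives, because the isolated significant bin lasts 10 ms and falls short of the 30 ms criterion.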
Finally, a four-way repeated-measures ANOVA with the factors Predecessor (K-item vs. T-item) x Expectation (disconfirming vs. confirming) x Hemisphere (left vs. right) x Position (anterior, medial, posterior) was performed on the time-frequency clusters surviving the initial permutation tests to substantiate our findings. For all analyses involving the factor Position, we checked for violations of the sphericity assumption using Mauchly's criterion and, in case of violations, report multivariate tests (using Wilks' lambda) instead. Post-hoc tests were only applied to time-frequency spots that passed the initial permutation tests. These statistical analyses principally comprised a two-way repeated-measures ANOVA Predecessor x Expectation and the associated t-tests to identify the direction of the predicted modulation in induced GBA.
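For the two-way post-hoc analysis, it may help to recall that in a fully within-subject 2 x 2 design the interaction F-value equals the squared one-sample t-value of each subject's double difference, with 1 and n - 1 degrees of freedom. The sketch below shows only this identity; the function name and the toy numbers are hypothetical and are not data from the study.

```python
import math
from statistics import mean, stdev


def interaction_f(k_disconf, k_conf, t_disconf, t_conf):
    """Predecessor x Expectation interaction for a 2 x 2 within-subject design.

    Each argument holds one value per subject. The interaction contrast is the
    double difference (K disconfirming - K confirming) - (T disconfirming - T confirming).
    """
    contrast = [
        (kd - kc) - (td - tc)
        for kd, kc, td, tc in zip(k_disconf, k_conf, t_disconf, t_conf)
    ]
    n = len(contrast)
    t_value = mean(contrast) / (stdev(contrast) / math.sqrt(n))
    return t_value ** 2, (1, n - 1)


# Toy example with three hypothetical subjects.
f_value, dof = interaction_f([1.0, 2.0, 3.0], [0.0] * 3, [0.0] * 3, [0.0] * 3)
```

With 16 subjects the same computation gives 1 and 15 degrees of freedom, matching the F(1,15) values reported in the results.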
To dissociate the "filling in" of expected lexical information from a pure phonological conflict that depends on the specificity of phoneme representations between the particular predecessor and the pivotal disconfirming noise-replaced item, the induced GBA was compared with the mismatch negativity (MMN) response [33, 34], which is sensitive to phonological conflicts in passive oddball paradigms [35, 36]. Thus, to ensure that the hypothesized induced GBA is not a mere by-product of an MMN elicited by deviant items (= disconfirming items) interrupting a sequence of repeated standard items, we analyzed the evoked potentials (re-referenced to linked mastoids) recorded at Fz with a prestimulus baseline of 100 ms. Again, we examined the mean amplitude in the latency range identified by the permutation tests for the induced GBA using the factorial design described above.
Further, to differentiate the induced brain activity from evoked brain activity in the gamma-band range, we also calculated mean amplitudes of the evoked GBA in the same time-by-frequency windows as those for the induced GBA and analyzed them using the same factorial design. Finally, to rule out possible confounds of EMG artefacts, we also analyzed the induced GBA in a higher frequency range (76-86 Hz), which is known to reflect electromyographic (EMG) activity of facial and head muscles and in which the peak of the spectral density function of muscular contamination could be expected.
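The distinction between evoked and induced activity can be made concrete with a small sketch. Evoked power is computed from the trial average, so only phase-locked activity survives, whereas the average of single-trial power, the quantity analyzed as induced GBA in this study, also retains activity whose phase varies from trial to trial. The code below is an illustrative sketch on synthetic data; the function names, the sampling rate and the burst parameters are hypothetical.

```python
import cmath
import math
import random


def morlet_coefficient(signal, sfreq, freq_hz, center_s, m=7.0):
    """Complex Morlet coefficient of one trial at a single time point and frequency."""
    sigma_t = m / (2.0 * math.pi * freq_hz)
    acc = 0j
    for i, x in enumerate(signal):
        t = i / sfreq - center_s
        acc += x * math.exp(-t * t / (2.0 * sigma_t ** 2)) * cmath.exp(-2j * math.pi * freq_hz * t)
    return acc


def evoked_and_single_trial_power(trials, sfreq, freq_hz, center_s):
    """Return (power of the trial average, average of the single-trial power)."""
    coeffs = [morlet_coefficient(trial, sfreq, freq_hz, center_s) for trial in trials]
    # The transform is linear, so averaging coefficients equals transforming the average.
    evoked = abs(sum(coeffs) / len(coeffs)) ** 2
    single_trial = sum(abs(c) ** 2 for c in coeffs) / len(coeffs)
    return evoked, single_trial


def burst(phase, sfreq=500.0, n_samples=250, freq_hz=40.0):
    """Hypothetical 40 Hz oscillation with a given starting phase."""
    return [math.sin(2.0 * math.pi * freq_hz * i / sfreq + phase) for i in range(n_samples)]


rng = random.Random(2)
locked = [burst(0.3) for _ in range(40)]                               # identical phase
jittered = [burst(rng.uniform(0.0, 2.0 * math.pi)) for _ in range(40)]  # random phase

ev_locked, st_locked = evoked_and_single_trial_power(locked, 500.0, 40.0, 0.25)
ev_jit, st_jit = evoked_and_single_trial_power(jittered, 500.0, 40.0, 0.25)
```

For the phase-locked trials both measures agree, while for the phase-jittered trials the evoked power nearly vanishes although the single-trial power is unchanged. This is the pattern that allows induced effects to be dissociated from evoked ones.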
In addition to the EEG study, two behavioural identification experiments were conducted to gain knowledge about the attentive processing of the noise-replaced items. Twelve subjects (seven female; mean age = 24.5 years, standard deviation [SD] = 3.5 years) participated in each of the experiments. They fulfilled the same criteria as the subjects of the EEG study. In the first experiment, which was carried out for exploratory purposes, the subjects had to identify the stimulus class for all noise-overlaid and noise-replaced items. Because the [coronal] place information can be inferred from redundant information, larger projection rates were expected for the #-items in favour of a /t/ percept than in favour of a /k/ percept, which has a fully specified representation in the mental lexicon. Each item was presented six times using the same equipment as for the EEG study. The subjects had to judge as exactly as possible which phoneme they had perceived and to respond by pressing the corresponding key on a standard PC keyboard with their right hand.
The experimental design of the second behavioural experiment was chosen to mimic possible context effects that played a role in the EEG study. Therefore, one experimental trial contained 4 stimuli, with the first three items belonging to one item class followed by a fourth item (= target) which could belong to the same or one of the other two item classes. Each of the nine stimulus combinations was presented 24 times, which resulted in 216 trials overall. The within-trial ISI was 500 ms, as in the EEG experiment. After the presentation of the fourth item, the subjects had to indicate by button press, as fast and accurately as possible, whether they perceived a K-item or a T-item at the fourth position. We hypothesized that the reaction times would differ between /k/ and /t/ depending on the specificity of the place of articulation information. Further, reaction times reflecting a successful integration of anticipated and actual sensory input should be longer than those reflecting an unsuccessful integration. To analyze the processing of the noise-replaced items, the reaction times (RT) were analyzed by means of a mixed-model ANOVA after trimming the lower and upper 10% of the distribution.
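The trial structure and the trimming step can be sketched as follows. The sketch makes assumptions that the text does not specify: the class labels, the function names, and trimming by rank within a single list of reaction times are hypothetical choices, and whether trimming was applied per subject, per condition or overall is left open here.

```python
from itertools import product

ITEM_CLASSES = ("K", "T", "#")   # hypothetical labels for the three item classes


def build_trials(repetitions=24):
    """Three context items of one class followed by a target of any class."""
    return [
        [context] * 3 + [target]
        for context, target in product(ITEM_CLASSES, repeat=2)
        for _ in range(repetitions)
    ]


def trim_reaction_times(rts, percent=10):
    """Drop the lowest and highest `percent` percent of the sorted reaction times."""
    ordered = sorted(rts)
    k = len(ordered) * percent // 100
    return ordered[k:len(ordered) - k]


trials = build_trials()
trimmed = trim_reaction_times(range(1, 21))   # toy reaction times 1..20
```

Nine context-target combinations with 24 repetitions each give the 216 trials mentioned above, and trimming 20 toy values removes the two smallest and the two largest.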
Induced brain responses
Table: Mean spectral power for the 430 - 490 ms/38 - 44 Hz range, averaged across six electrode sites, and standard error of the mean (SEM), in % change, for the noise-replaced #-items, listed by the expectation created by the predecessor.
As Figure 5a indicates, the modulation in 38 - 44 Hz spectral power might last longer than the initial permutation test suggested. For the latency range from 350 to 490 ms the Predecessor x Expectation interaction was also significant (F(1,15) = 12.25, p < 0.001), with post-hoc t-tests showing significantly larger values in spectral power for #1(k) compared to #1(t) items (t(15) = 2.96, p < 0.01) and significant differences between #1(k) and #2(k) items (t(15) = 2.86, p < 0.05). All other post-hoc analyses revealed no significant differences (p > 0.2) for this latency range.
Although our predictions concerning the induced spectral changes were specific to the "filling in" of expected lexical information as a processing step in building up the percept of a phoneme (a step expected to occur only after the onset of the noise replacement), the time course of the induced 38 - 44 Hz changes in Figure 5a points to another modulation around the onset of the #-items (-50 to 100 ms). However, the corresponding ANOVA showed neither a significant Predecessor x Position interaction (F(1,15) = 1.87, p > 0.1) nor any main effect (all F < 1.5, p > 0.2) for the 38 - 44 Hz range and therefore reinforces the null result of the permutation tests for this latency range.
Analyses of evoked gamma band responses and control for possible EMG confounds
To ascertain that our results indeed reflect modulations of induced brain activity, we analyzed post hoc the evoked brain activity for the same time and frequency range. Figure 5 contrasts the time course of the induced and evoked spectral changes in the 38 - 44 Hz range for left anterior medial temporal electrode sites. As exemplified there, only the induced spectral changes showed a modulation for the disconfirming #-items, with larger values for the #1(k) items compared to #1(t) items, whereas the evoked spectral changes showed no comparable modulation pattern. Statistical analyses analogous to those for the induced brain activity revealed neither a four-way interaction (F(1,15) < 1, p > 0.4) nor any main effect or interaction for the left anterior temporal electrode sites (all F < 1).
Finally, to test for possible EMG artefacts that might be correlated with the induced result in the 38 - 44 Hz range, we ran a four-way ANOVA on the 76 - 86 Hz range. It yielded no comparable results for the latency range identified by the permutation tests, in particular no Predecessor x Expectation x Hemisphere x Position interaction (F(1,15) = 1.84, p > 0.1).
Differentiation of induced brain responses from ERP results at Fz
Table: Mean identification rates for the K, T and #-items across 12 subjects of the first behavioural experiment. Columns: rate of keystroke [%] for the K key and the T key.
Table: Reaction times (RT) and projection rates (PR) of #-items onto /k/ and /t/ percepts in different contexts for the second behavioural experiment. Columns: RT [ms] and PR [%] for the K percept, and RT [ms] and PR [%] for the T percept.
To gain a better understanding of how the brain copes with acoustically ambivalent situations, the present study set out to shed light on the brain mechanisms underlying the repair of fragmentary speech information. In particular, the study investigated the role of the lexical specification of phonological details in the mental lexicon and its impact on the phonemic restoration illusion. To prevent influences of attention or decision-making processes on phonemic restoration, the illusion was investigated by means of a passive oddball paradigm in which the subjects were instructed to ignore the auditory stimuli. In doing so, the study goes beyond previous EEG studies with active tasks [16, 17]. To monitor the processing of ambivalent sensory input under the influence of differential top-down mediated expectations of phonemic features, we examined the induced GBA. We hypothesized that, if the fine structure of phonological information in the mental lexicon plays a significant role in phonemic restoration, the induced GBA should be modulated differentially depending on the specificity of the place of articulation of the phoneme to be restored. Our results for the left anterior electrode sites clearly support this assumption. In the latency range of the to-be-expected phoneme for the disconfirming #-items, we observed larger values of induced GBA if the expected phoneme was specified for the feature place of articulation (K-item) compared to the underspecified expectation (T-item). These larger values of induced GBA were most pronounced between 430 and 490 ms in the 38 - 44 Hz range. Importantly, there was no differential modulation in induced GBA for the confirming #-items. As Figure 5 illustrates, the evoked GBA showed no comparable effects, neither for the disconfirming nor for the confirming #-items.
The topography of the effect is similar to that reported by Hannemann et al. As there, the modulation of induced GBA over left anterior temporal electrode sites can be interpreted as a correlate of a match of bottom-up sensory input with lexical expectations resulting in resonant neural dynamics [4, 5].
The present results, which show a differential modulation of induced GBA in the restoration illusion, are also interesting from another point of view. The difference was predicted based on a speech perception model which assumes underspecified mental representations of certain features of the sound structure. According to the featurally underspecified lexicon theory [19, 20], the critical phoneme in the K-items possesses a full featural specification for the [dorsal] place of articulation, while the [coronal] place of articulation for T-items is underspecified. Thus, while the repeatedly presented K-items establish an expectation of a specified place of articulation in the critical phoneme, the T-items cannot build up such specific expectations based on specified featural information in the mental lexicon. This difference was reflected in our GBA results and would not have been predicted by other speech perception models [22, 23]. Moreover, the pre-attentive modulation of induced GBA is further evidence for an immediate lexical top-down support in phonemic restoration and, more generally, in the perception of speech in difficult auditory environments. Thus, the present results suggest a more extensive and immediate top-down influence on repair processes in speech perception than claimed by more autonomous views on speech perception [24, 25].
It is well established that signatures in GBA can differentiate between words and pseudowords [12, 13]. As all #-items were acoustically identical (and, in a strict sense, all pseudowords), this known difference should at most lead to a main effect of Expectation and therefore does not explain the present results. Thus, models favoring strictly bottom-up processes in speech perception cannot account for the observed differential modulation in induced GBA, especially because all #-items were physically identical and there was no post-perceptual decision-making process that might have influenced the GBA.
According to Pulvermuller et al., activity in higher frequency bands contains information about semantic features of words; for example, it shows differential topographies for verbs and nouns in a lexical decision task. Recently, an intracerebral EEG study observed modulations in evoked GBA in a visual semantic decision task. Following this argumentation, it might be possible that the observed modulation in induced GBA in the present study is caused by different semantic rather than phonological expectations. As both words that create the expectation for the disconfirming #-items are nouns and were matched for frequency, and as the observed effects cover different frequency bands, this interpretation seems rather unlikely. Nevertheless, it cannot be ruled out entirely that the larger value of induced GBA for #1(k)-items compared to #1(t)-items is at least partly due to a differential semantic expectation.
According to the findings of Eulitz et al., interrupting the repeated presentation of fully specified items with underspecified items leads to larger phonological conflicts, reflected in differential MMNs, than vice versa. Thus, if the present modulation in induced GBA in favour of the #1(k)-items were due to that kind of phonological conflict, the MMN should also show a differentiating pattern between the disconfirmed expectations of specified and underspecified items. As we found only a general difference between disconfirming and confirming #-items that was independent of the predecessor context, the present results cannot be explained by a variable strength of phonological conflicts, at least in the present latency range of 430 - 490 ms.
The results of the behavioural experiments support and extend our interpretation of the observed gamma-band modulation during the processing of the #-items. When subjects attended to the stimuli, the pattern of results differed from that for the pre-attentive processing of #-items. Without context, as in the first behavioural experiment, the subjects showed a preference towards perceiving a /t/ over a /k/ and all other possible phonemes. The same pattern of results was obtained in the pre-experimental exploration. This identification bias toward /t/ was replicated for the projection rates in the second behavioural experiment. Owing to the lack of alternatives in this choice task, the bias was even more pronounced. It can be interpreted in two ways: (i) The [coronal] place of articulation is regarded as the default place of articulation by phonologists. In the absence of any information indicating a specification of the place of articulation in the mental lexicon, the subjects therefore showed a preference towards perceiving a /t/ for the ambiguous acoustics in the #-items. (ii) It might also be due to the spectral characteristics of the noise replacing the critical consonant, which is spectrally slightly more similar to a /t/ than to a /k/. Interestingly, the reaction time data of the second behavioural experiment indicated context effects. When subjects decided that the actual #-item was the same as its predecessors, the reaction times to these #-items were significantly longer than for the opposite response. The longer RT seems to indicate a more complex decision and evaluation process, which is required to align the anticipated phonemic information with the sensory input. Under attentive processing conditions, this RT effect is independent of the specification of featural information in the mental lexicon.
Accordingly, the modulation in induced GBA in favour of the #1(k)-items and the prolonged RT illuminate different aspects of the phonemic restoration illusion. Both describe the matching of deficient sensory input with anticipated phonemic information. But whereas behavioural data are generally influenced by external factors such as task formulation and attention, the pre-attentive EEG data are free of such influences and thus yield additional insights into the influence of the fine structure of the mental lexicon on this matching process. However, the present results are only a first step towards a comprehensive understanding of the influence of the specificity of phonemes in the mental lexicon on repair processes in speech perception. Further studies investigating other features in German and other languages are crucial to allow for a general comprehension of speech perception under natural and noisy conditions.
In sum, the current study provides, for the first time, a direct correlate of a top-down modulated "filling in" in the phonemic restoration illusion without relying on redundant sentential information. The present induced brain responses again reveal clear evidence for a left-lateralized functional network that matches expected lexical information with sketchy sensory input to form a coherent auditory object. Further, they demonstrate the influence of the fine structure of the mental lexicon on top-down modulated speech perception processes and are in line with current cortical models of auditory word recognition. Moreover, the delineated alignment of lexical expectancies with sensory input is in accordance with recent ideas that speech perception is facilitated by internal forward models. Thus, it serves as a prerequisite for speech perception and, more generally, for conscious object perception. Finally, the current results show experimentally that the human ability to comprehend speech even pre-attentively and under much compromised conditions (i.e. restoring missing phonemes) relies on the immediate interaction of lexical expectancies (i.e. top-down information) and the acoustic input. These interactions can be examined by means of induced GBA.
The current research was funded by grants from the German Science Foundation to C.E. (sub-project D7 of the SFB 471). We also wish to thank K. Preller, C. Massau and O. Bobrov for helping us with the EEG data acquisition.
- Warren RM: Perceptual restoration of missing speech sounds. Science. 1970, 167: 392-393. 10.1126/science.167.3917.392.View ArticlePubMedGoogle Scholar
- Samuel AG: Lexical activation produces potent phonemic percepts. Cognit Psychol. 1997, 32: 97-127. 10.1006/cogp.1997.0646.View ArticlePubMedGoogle Scholar
- Samuel AG: Knowing a word affects the fundamental perception of the sounds within it. Psychol Sci. 2001, 12: 348-351. 10.1111/1467-9280.00364.View ArticlePubMedGoogle Scholar
- Grossberg S: The link between brain learning, attention, and consciousness. Conscious Cogn. 1999, 8: 1-44. 10.1006/ccog.1998.0372.View ArticlePubMedGoogle Scholar
- Grossberg S: Resonant neural dynamics of speech perception. J Phonetics. 2003, 31: 423-445. 10.1016/S0095-4470(03)00051-2.View ArticleGoogle Scholar
- Gray CM, Konig P, Engel AK, Singer W: Oscillatory responses in cat visual cortex exhibit inter-columnar synchronization which reflects global stimulus properties. Nature. 1989, 338: 334-337. 10.1038/338334a0.View ArticlePubMedGoogle Scholar
- Tallon-Baudry C, Bertrand O: Oscillatory gamma activity in humans and its role in object representation. Trends Cogn Sci. 1999, 3: 151-162. 10.1016/S1364-6613(99)01299-1.View ArticlePubMedGoogle Scholar
- Gruber T, Trujillo-Barreto NJ, Giabbiconi CM, Valdes-Sosa PA, Muller MM: Brain electrical tomography (BET) analysis of induced gamma band responses during a simple object recognition task. Neuroimage. 2006, 29: 888-900. 10.1016/j.neuroimage.2005.09.004.View ArticlePubMedGoogle Scholar
- Leiberg S, Kaiser J, Lutzenberger W: Gamma-band activity dissociates between matching and nonmatching stimulus pairs in an auditory delayed matching-to-sample task. Neuroimage. 2006, 30: 1357-1364. 10.1016/j.neuroimage.2005.11.010.View ArticlePubMedGoogle Scholar
- Lenz D, Jeschke M, Schadow J, Naue N, Ohl FW, Herrmann CS: Human EEG very high frequency oscillations reflect the number of matches with a template in auditory short-term memory. Brain Res. 2008, 1220: 81-92. 10.1016/j.brainres.2007.10.053.View ArticlePubMedGoogle Scholar
- Gruber T, Tsivilis D, Giabbiconi CM, Muller MM: Induced electroencephalogram oscillations during source memory: familiarity is reflected in the gamma band, recollection in the theta band. J Cogn Neurosci. 2008, 20: 1043-1053. 10.1162/jocn.2008.20068.View ArticlePubMedGoogle Scholar
- Lutzenberger W, Pulvermuller F, Birbaumer N: Words and pseudowords elicit distinct patterns of 30-Hz EEG responses in humans. Neurosci Lett. 1994, 176: 115-118. 10.1016/0304-3940(94)90884-2.View ArticlePubMedGoogle Scholar
- Eulitz C, Maess B, Pantev C, Friederici AD, Feige B, Elbert T: Oscillatory neuromagnetic activity induced by language and non-language stimuli. Brain Res Cogn Brain Res. 1996, 4: 121-132. 10.1016/0926-6410(96)00026-2.View ArticlePubMedGoogle Scholar
- Pulvermuller F, Birbaumer N, Lutzenberger W, Mohr B: High-frequency brain activity: its possible role in attention, perception and language processing. Prog Neurobiol. 1997, 52: 427-445. 10.1016/S0301-0082(97)00023-3.View ArticlePubMedGoogle Scholar
- Hannemann R, Obleser J, Eulitz C: Top-down knowledge supports the retrieval of lexical information from degraded speech. Brain Res. 2007, 1153: 134-143. 10.1016/j.brainres.2007.03.069.View ArticlePubMedGoogle Scholar
- Sivonen P, Maess B, Friederici AD: Semantic retrieval of spoken words with an obliterated initial phoneme in a sentence context. Neurosci Lett. 2006, 408: 220-225. 10.1016/j.neulet.2006.09.001.View ArticlePubMedGoogle Scholar
- Sivonen P, Maess B, Lattner S, Friederici AD: Phonemic restoration in a sentence context: evidence from early and late ERP effects. Brain Res. 2006, 1121: 177-189. 10.1016/j.brainres.2006.08.123.View ArticlePubMedGoogle Scholar
- Griffiths TD, Warren JD: What is an auditory object?. Nat Rev Neurosci. 2004, 5: 887-892. 10.1038/nrn1538.View ArticlePubMedGoogle Scholar
- Lahiri A, Reetz H: Underspecified Recognition. 2002, Berlin: Mouton.: C. Gussenhoven & N. Warner, 637-676. 7Google Scholar
- Wheeldon L, Waksler R: Phonological underspecification and mapping mechanisms in the speech recognition lexicon. Brain & Language. 2004, 90: 401-412.
- Lahiri A, Marslen-Wilson W: The mental representation of lexical form: a phonological approach to the recognition lexicon. Cognition. 1991, 38: 245-294. 10.1016/0010-0277(91)90008-R.
- McClelland JL, Elman JL: The TRACE model of speech perception. Cognit Psychol. 1986, 18: 1-86. 10.1016/0010-0285(86)90015-0.
- Bybee J: Phonology and language use. 2001, Cambridge: Cambridge University Press.
- Norris D, McQueen JM, Cutler A: Perceptual learning in speech. Cognit Psychol. 2003, 47: 204-238. 10.1016/S0010-0285(03)00006-9.
- Norris D, McQueen JM: Shortlist B: a Bayesian model of continuous speech recognition. Psychol Rev. 2008, 115: 357-395. 10.1037/0033-295X.115.2.357.
- Schroeder MR: Reference signal for signal quality studies. J Acoust Soc Am. 1968, 44: 1735-1736. 10.1121/1.1911323.
- Cowan N, Winkler I, Teder W, Naatanen R: Memory prerequisites of mismatch negativity in the auditory event-related potential (ERP). J Exp Psychol Learn Mem Cogn. 1993, 19: 909-921. 10.1037/0278-7393.19.4.909.
- Baldeweg T, Klugman A, Gruzelier J, Hirsch SR: Mismatch negativity potentials and cognitive impairment in schizophrenia. Schizophr Res. 2004, 69: 203-217. 10.1016/j.schres.2003.09.009.
- Berg P, Scherg M: A multiple source approach to the correction of eye artifacts. Electroencephalogr Clin Neurophysiol. 1994, 90: 229-241. 10.1016/0013-4694(94)90094-9.
- Sinkkonen J, Tiitinen H, Naatanen R: Gabor filters: an informative way for analysing event-related brain activity. J Neurosci Methods. 1995, 56: 99-104. 10.1016/0165-0270(94)00111-S.
- Tallon-Baudry C, Bertrand O, Delpuech C, Permier J: Oscillatory gamma-band (30-70 Hz) activity induced by a visual search task in humans. J Neurosci. 1997, 17: 722-734.
- Blair RC, Karniski W: An alternative method for significance testing of waveform difference potentials. Psychophysiology. 1993, 30: 518-524. 10.1111/j.1469-8986.1993.tb02075.x.
- Naatanen R: The perception of speech sounds by the human brain as reflected by the mismatch negativity (MMN) and its magnetic equivalent (MMNm). Psychophysiology. 2001, 38: 1-21. 10.1111/1469-8986.3810001.
- Pulvermuller F, Shtyrov Y: Language outside the focus of attention: the mismatch negativity as a tool for studying higher cognitive processes. Prog Neurobiol. 2006, 79: 49-71. 10.1016/j.pneurobio.2006.04.004.
- Naatanen R, Lehtokoski A, Lennes M, Cheour M, Huotilainen M, Iivonen A, Vainio M, Alku P, Ilmoniemi RJ, Luuk A, Allik J, Sinkkonen J, Alho K: Language-specific phoneme representations revealed by electric and magnetic brain responses. Nature. 1997, 385: 432-434. 10.1038/385432a0.
- Eulitz C, Lahiri A: Neurobiological evidence for abstract phonological representations in the mental lexicon during speech recognition. J Cogn Neurosci. 2004, 16: 577-583. 10.1162/089892904323057308.
- Cacioppo JT, Tassinary LG, Fridlund AJ: The skeletomotor system. Principles of psychophysiology: physical, social, and inferential elements. Edited by: Cacioppo JT, Tassinary LG. 1990, Cambridge, MA: Cambridge UP, 324-385.
- Norris D, McQueen JM, Cutler A: Merging information in speech recognition: feedback is never necessary. Behav Brain Sci. 2000, 23: 299-325. 10.1017/S0140525X00003241.
- Pulvermuller F, Preissl H, Lutzenberger W, Birbaumer N: Brain rhythms of language: nouns versus verbs. Eur J Neurosci. 1996, 8: 937-941. 10.1111/j.1460-9568.1996.tb01580.x.
- Mainy N, Jung J, Baciu M, Kahane P, Schoendorff B, Minotti L, Hoffmann D, Bertrand O, Lachaux JP: Cortical dynamics of word recognition. Hum Brain Mapp. 2008, 29 (11): 1215-1230. 10.1002/hbm.20457.
- Paradis C, Prunet JF: Phonetics and phonology: Vol. 2. The special status of coronals. 1991, San Diego, CA: Academic Press.
- Liberman AM, Delattre P, Cooper FS: The role of selected stimulus-variables in the perception of the unvoiced stop consonants. Am J Psychol. 1952, 65: 497-516. 10.2307/1418032.
- Scott SK, Johnsrude IS: The neuroanatomical and functional organization of speech perception. Trends Neurosci. 2003, 26: 100-107. 10.1016/S0166-2236(02)00037-1.
- Poeppel D, Idsardi WJ, van Wassenhove V: Speech perception at the interface of neurobiology and linguistics. Philos Trans R Soc Lond B Biol Sci. 2008, 363: 1071-1086. 10.1098/rstb.2007.2160.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.