13.4 Evidence for ‘top-down’ effects of word knowledge on perception

Catherine Anderson; Bronwyn Bjorkman; Derek Denis; Julianne Doner; Margaret Grant; Nathan Sanders; Ai Taniguchi

[In progress] Chapter 13: Psycholinguistics and Neurolinguistics

13.4 Evidence for ‘top-down’ effects of word knowledge on perception

The preceding sections provided evidence for how a speaker’s knowledge of the phonology of their language can influence the way they perceive speech. In this section, we will see that a speaker’s knowledge of the words of their language also has an influence on the way language is perceived. In particular, we will focus here on ways in which a speaker’s word knowledge has an influence on the speech sounds that they hear.

The first example comes from an influential experiment by Ganong in 1980. Remember from Section 13.2 that the difference between a voiced stop like [d] and a voiceless stop like [t] is the stop’s Voice Onset Time, or in other words the time between the release of the stop closure and the beginning of the following vowel. Many years of research have shown that although Voice Onset Time is a continuous variable – it can take on any millisecond value – listeners perceive a sharp boundary between speech sounds like [d] and [t]. So in an experiment that presents participants with different values of Voice Onset Time and asks them to indicate whether they heard [d] or [t], you would get nearly all [d] responses below a threshold and nearly all [t] responses above that threshold, with only a small area of uncertainty in the middle. This figure shows a typical pattern found by experiments that vary Voice Onset Time in increments. Below the boundary between [d] and [t], nearly all participants would identify the sound as [da]. Above the boundary, the responses would be all or nearly all [ta]. There is only a short window around the boundary where responses are not all one way or the other.

Graph of theoretical results of a phoneme identification task where Voice Onset Time is varied on a continuum. At short Voice Onset Times, nearly all responses identify a sound as [d]. At high Voice Onset Times, nearly all responses identify the sound as [t]. There is a short window of mixed responses around the [d]-[t] boundary, creating a sinusoidal (s-shaped) curve. — Graph showing a typical pattern found in phoneme identification tasks. This particular graph does not show real data, but rather an idealized pattern.

This pattern is called categorical perception and holds for a variety of phonetic continua other than Voice Onset Time as well. Categorical perception was initially thought to reflect phonological knowledge, but subsequent studies provided evidence against that interpretation. For example, categorical perception is found for contrasts that are not a part of the phonology of a listener’s language, for babies who have not had sufficient time to tune their mental grammar to the phonology of their language, and even for non-human animals, who we presume do not have human-like phonology (see Kuhl and Miller, 1975, for a study on chinchillas). This is why, in this section, we use the square bracket notation for [d] and [t].

Building on the robust findings of categorical perception, Ganong examined whether presenting sounds on a continuum from [d] to [t] as a part of words would influence participants’ perception of the sounds. In particular, words were chosen such that at one end of the continuum, the word was a real word of English (e.g., dash), while at the other end, the word was pronounceable but not a real word of English (e.g., tash). The critical hypothesis was that listeners might be more likely to categorize a sound that was intermediate between [d] and [t] as the sound that would create a real word. This is what Ganong found. When a sound near the threshold was presented as part of Xash, where X represents the critical stop consonant, it was more likely to be perceived as dash; (the real word of English) rather than tash (which is pronounceable but not an English word). When the same sound is presented with Xask, it would be more likely to be perceived as task (a real word of English) rather than dask. So people’s knowledge of words of English influenced which speech sound they heard, even when the acoustic signal is the same. This finding, which now gets called a Ganong effect, is an example of a top-down influence on perceptual processing. We call information that flows to the brain from the outside world through sensory surfaces, like the vibration of the inner ear or an image on the retina, bottom-up information. It is no surprise that bottom-up information has an influence on processing, as it couldn’t really be any other way. What we call top-down influences are cases where our knowledge, for example our knowledge of words, has an effect on perceptual processing. In this chapter we have seen several ways in which our knowledge of language has an influence on the way we process linguistic input from quite early stages of perception.

References

Ganong, W. F. (1980). Phonetic categorization in auditory word perception. Journal of Experimental Psychology: Human Perception and Performance, 6(1), 110–125. https://doi.org/10.1037/0096-1523.6.1.110

Kuhl, P. K., & Miller, J. D. (1975). Speech Perception by the Chinchilla: Voiced-Voiceless Distinction in Alveolar Plosive Consonants. Science, 190(4209), 69–72. http://www.jstor.org/stable/1740887

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Essentials of Linguistics, 2nd edition Copyright © 2022 by Catherine Anderson; Bronwyn Bjorkman; Derek Denis; Julianne Doner; Margaret Grant; Nathan Sanders; and Ai Taniguchi is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.

References

License

Share This Book