A new corpus of 'slips of the ear' in English

February 17, 2017

Listening in quiet conditions is actually quite rare. Most of the time some kind of noise is present, whether traffic, machinery, or simply other conversations. As native speakers with rich experience of the language and of the contexts in which speech occurs, we have a great capacity to reconstruct the parts of a message obscured by noise. However, errors still occur at times. A group involving Dr García Lecumberri and Ikerbasque Research Professor Martin Cooke, along with researchers Dr Jon Barker and Dr Ricard Marxer of the University of Sheffield (UK), has identified 3207 "consistent" confusions. The confusions are said to be consistent because, in every case, a significant number of listeners agree on what they heard. This type of confusion is extremely valuable in the construction of models of speech perception, since any model capable of making the same error is very likely to be applying the same processes as human listeners.

The research study involved more than 300,000 individual stimulus presentations to 212 listeners in a range of different noise conditions. The resulting corpus is the only one of its kind for the English language and is available online. For each confusion, the corpus contains the waveforms of both the speech and the noise, a record of what a cohort of listeners heard, and phonemic transcriptions. Distinct types of confusion appear with some frequency in the corpus. In the simplest cases, the noise clearly masks some parts of the word, forcing listeners to suggest a word that best fits the audible fragments (e.g., "wooden" -> "wood"; "pánico" -> "pan") or to substitute one sound for another ("ten" -> "pen"; "valla" -> "falla"). In other cases, listeners appear to incorporate elements from the noise itself ("purse" -> "permitted"; "ciervo" -> "invierno"). Finally, the researchers find odd cases where there is little or no relation between the word produced and the confusion reported ("modern" -> "suggest"; "guardan" -> "pozo"). In these cases the way that the speech and noise signals interact is complex, and therefore interesting.

Dr García Lecumberri argues that "these studies help to reveal the mechanisms underlying speech perception, and the better we understand these processes, the more we can help at a technical and clinical level those listeners who suffer hearing and speech comprehension problems". The group has also elicited a similar corpus for the Spanish language that can be accessed from the same web page. "There are similarities and differences between Spanish and English confusions: Spanish is a highly inflected language, leading to more confusions in word-final position; English has a larger number of monosyllabic words and a richer set of word-final consonants, leading to more substitution-type errors in this position," she adds. However, both languages show a similar pattern of confusion types in noise, with some sounds surviving better than others.

Additional information

Dr María Luisa García Lecumberri is Senior Lecturer in English Phonetics in the Faculty of Letters at the University of the Basque Country (Vitoria) and member of the Language and Speech research group, to which Ikerbasque Research Professor Dr Martin Cooke also belongs. Dr Jon Barker is Reader in Computer Science in the Speech and Hearing research group at the University of Sheffield, where Dr Ricard Marxer works as a research fellow. Corpus collection was funded by the EU Framework 7 Marie Curie project PEOPLE-2011-290000 "Inspire: Investigating Speech Processing in Realistic Environments".


Ricard Marxer, Jon Barker, Martin Cooke, and Maria Luisa García Lecumberri (December 2016). A corpus of noise-induced word misperceptions for English. The Journal of the Acoustical Society of America, Volume 140, Issue 5. DOI: 10.1121/1.4967185.

University of the Basque Country
