Nav: Home

Speech recognition technology is not a solution for poor readers

May 13, 2019

Even today about one in five humans is considered to be low literate or illiterate; they cannot read or write simple statements about everyday life. Low literacy can be due to no or little reading practice or reading impairments such as dyslexia. For developing countries with low literacy rates, voice recognition has been hailed as a solution by companies such as Google, calling it 'the next big leap in technology'. But is speech technology really the solution for low literacy?

Falk Huettig and Martin Pickering argue that it is not. In an opinion article in Trends in Cognitive Sciences, the psycholinguists suggest that relying on speech technology might be counterproductive, as literacy has crucial benefits even beyond reading. "It is very relevant and timely to look at the advantages of reading on speech, especially as people tend to read less and in different ways than they used to", says Falk Huettig. "Contemporary social media writing and reading habits for example are quite different from traditional print media. Information that people used to get from written sources such as novels, newspapers, public notices, or even recipe books they get more and more from YouTube videos, podcasts, or audiobooks".

This is not necessarily a bad thing, as some of the general benefits of reading can also be obtained from listening to audiobooks. As audio books also provide 'book language', listening to them will confer some similar advantages--such as a larger vocabulary, increased knowledge of the world and a larger short-term ('working') memory, which is important to keep track of information and multiple entities over several sentences, paragraphs, or often even pages.

But according to Huettig and Pickering, reading itself--the actual physical act of reading--is crucially important for developing the skill of predicting upcoming words, which transfers from reading to understanding spoken language. Reading trains the language prediction system although even very young children--who cannot read yet--can predict where a sentence is going. When 2-year-olds hear "the boy eats a big cake", they already look at something edible (i.e. a cake) after hearing "eats", but before hearing "cake". Predicting upcoming information is useful, as it reduces processing load and frees up limited brain resources. And crucially, skilled readers get much better at predicting.

Children who are among the most avid readers encounter over 4 million words a year, while children who rarely read encounter only about 50,000 words. As a result, good readers get a deeper understanding of the meaning of words and build large networks of words with strong associations between them--which helps them to predict upcoming words. As poor readers have smaller vocabularies and weaker representations of words in their mind (i.e. the recollection of the sound and meaning of a word), the predictive relationships between words are also weaker (e.g. the prediction that 'read the ... ' is often followed by 'book').

The literate mind

As reading is self-paced, there is a strong incentive to predict upcoming words, as this speeds up reading, which is typically much faster than listening. Skilled readers tend to take in whole words at one glance (gazing with their eyes at multiple letters at the same time) and time their eye movements to optimise the reading process. Printed texts (even given the occasional changes in fonts and word capitalizations) are much more regular than conversational speech, which is full of disfluencies, incomplete word pronunciations, and speech errors. This regularity of written texts helps readers to form the predictive relationships between words that then, by extension, can also be used to better predict words when listening to speech.

It is hard to imagine for someone who learned to read a long time ago but even the concept of a 'word' is an invention of the literate mind; it is very hard to grasp if you are an illiterate who only ever hears a stream of speech sounds. For example, when illiterates or children who haven't learned to read yet are asked to repeat the last word of a spoken sentence, they tend to repeat the whole sentence. In contrast, words clearly stand out in written language, typically being separated by white space. Written forms make words more salient and precise: readers become more aware that words are stable units in language. Storing the written form of words in memory also helps to make spoken word forms more salient, to be accessed faster when predicting upcoming speech. And, again, it is prediction of upcoming language that makes language understanding become really fast and proficient.

"Our arguments provide one more reason why more efforts should be undertaken to teach the hundreds of millions of illiterates in developing countries and functional illiterates across the world how to read (or to read better) and why a focus on artificial intelligence voice recognition and voice assistants to overcome literacy-related problems has its dangers", the authors argue.

"Writing is an ancient human technology that we shouldn't give up easily. Teaching how to read and how to read better remains very important even in a modern technological world", concludes Huettig.
-end-
Falk Huettig & Martin J. Pickering (2019). Literacy advantages beyond reading: Prediction of spoken language. Trends in Cognitive Sciences.

Questions? Contact:

Falk Huettig
Phone: +31 24 3521374
Email: falk.huettig@mpi.nl

Marjolein Scherphuis (press officer)
Phone: +31 24 3521947
Email: marjolein.scherphuis@mpi.nl

Max Planck Institute for Psycholinguistics

Related Language Articles:

Why the language-ready brain is so complex
In a review article published in Science, Peter Hagoort, professor of Cognitive Neuroscience at Radboud University and director of the Max Planck Institute for Psycholinguistics, argues for a new model of language, involving the interaction of multiple brain networks.
Do as i say: Translating language into movement
Researchers at Carnegie Mellon University have developed a computer model that can translate text describing physical movements directly into simple computer-generated animations, a first step toward someday generating movies directly from scripts.
Learning language
When it comes to learning a language, the left side of the brain has traditionally been considered the hub of language processing.
Learning a second alphabet for a first language
A part of the brain that maps letters to sounds can acquire a second, visually distinct alphabet for the same language, according to a study of English speakers published in eNeuro.
Sign language reveals the hidden logical structure, and limitations, of spoken language
Sign languages can help reveal hidden aspects of the logical structure of spoken language, but they also highlight its limitations because speech lacks the rich iconic resources that sign language uses on top of its sophisticated grammar.
More Language News and Language Current Events

Best Science Podcasts 2019

We have hand picked the best science podcasts for 2019. Sit back and enjoy new science podcasts updated daily from your favorite science news services and scientists.
Now Playing: TED Radio Hour

Erasing The Stigma
Many of us either cope with mental illness or know someone who does. But we still have a hard time talking about it. This hour, TED speakers explore ways to push past — and even erase — the stigma. Guests include musician and comedian Jordan Raskopoulos, neuroscientist and psychiatrist Thomas Insel, psychiatrist Dixon Chibanda, anxiety and depression researcher Olivia Remes, and entrepreneur Sangu Delle.
Now Playing: Science for the People

#537 Science Journalism, Hold the Hype
Everyone's seen a piece of science getting over-exaggerated in the media. Most people would be quick to blame journalists and big media for getting in wrong. In many cases, you'd be right. But there's other sources of hype in science journalism. and one of them can be found in the humble, and little-known press release. We're talking with Chris Chambers about doing science about science journalism, and where the hype creeps in. Related links: The association between exaggeration in health related science news and academic press releases: retrospective observational study Claims of causality in health news: a randomised trial This...