Nav: Home

Researchers use Twitter to track the flu in real time

May 09, 2017

"This flu is horrendous. Can't breathe, can't sleep or eat. Muscles ache, fever 102. Should have gotten the shot. Time for a movie marathon."

The above tweet looks like 140 characters of misery. But in the hands of Northeastern's Alessandro Vespignani and his colleagues, it is so much more.

An international team led by Vespignani has developed a unique computational model to project the spread of the seasonal flu in real time. It uses posts on Twitter in combination with key parameters of each season's epidemic, including the incubation period of the disease, the immunization rate, how many people an individual with the virus can infect, and the viral strains present.

Tested against official influenza surveillance systems, the model has been shown to accurately forecast the disease's evolution up to six weeks in advance--significantly earlier than other models. It will enable public health agencies to plan ahead in allocating medical resources and launching campaigns that encourage individuals to take preventative measures such as vaccination and increased hand washing.

"In the past, we had no knowledge of initial conditions for the flu," says Vespignani, who is also director of the Network Science Institute at Northeastern. The initial conditions--which show where and when an epidemic began as well as the extent of infection--function as a launching pad for forecasting the spread of any disease.

To ascertain those conditions, the researchers incorporated Twitter into their parameter-driven model. "This kind of integration has never been done before," says Vespignani. "We were not looking for the number of people who were sick because Twitter will not tell you that. What we wanted to know was: Do we have more flu at this point in time in Texas or in New Jersey, in Seattle or in San Francisco? Twitter, which includes GPS locations, is a proxy for that. By looking at how many people were tweeting about their symptoms or how miserable they were because of the flu, we were able to get a relative weight in each of those areas of the U.S."

The paper on the novel model received a coveted Best Paper Honorable Mention award at the 2017 International World Wide Web Conference last month following its presentation. It was one of only four papers out of more than 400 presented to be selected for an award.

A work in progress

The researchers' work began when the Centers for Disease Control and Prevention announced the "Predict the Influenza Season Challenge" in November 2013, an invitation to external researchers to advance the science of forecasting infectious diseases. Vespignani and his team have been participating ever since, with the new paper covering their projections for the 2014-15 and 2015-16 flu seasons in the U.S., Italy, and Spain.

Over those time periods, they applied forecasting and other algorithms week by week to the key parameters informed by the Twitter data. "This gave us a large number of possible ways the disease might evolve," says Vespignani. They then matched the resulting simulations with the surveillance data generated by the CDC and clinical and personal reports of influenza-like illnesses from the three countries. "The surveillance data tells us the ground truth for the past four weeks, but it is always delayed by about one week because you need to get the report from the doctor," he says. By analyzing the evolving dynamics revealed in the past data, they were able to select the model that would most likely forecast the future.

The explicit modeling of the disease's parameters--information about the dynamics of the disease itself--set Vespignani's model apart from others in the challenge. For example, they could identify the week when the epidemic would reach its peak and the magnitude of that peak with an accuracy of 70 to 90 percent six weeks in advance of the event.

"By capturing the key parameters, we could track how serious the flu was each year compared with every other year and see what was driving the spread," says first author Qian Zhang, PhD'14, associate research scientist at Northeastern. "That is what the public health agencies and the epidemiologists really care about. We are not just playing a game of numbers, which is what straightforward statistical models do."

While the paper reports results using Twitter data, the researchers note that the model can work with data from many other digital sources, too, as well as online surveys of individuals such as influenzanet, which is very popular in Europe.

"Our model is a work in progress," emphasizes Vespignani. "We plan to add new parameters, for example, school and workplace structure. This is not a challenge in the sense that you want to win. This is a science challenge in which you want to learn--to see that there is not a single model but a portfolio of models that will tell us new things."

Northeastern University

Related Twitter Articles:

Researchers use Twitter to track the flu in real time
An international team led by Northeastern University professor Alessandro Vespignani has developed a unique computational model to project the spread of the seasonal flu in real time.
As scientists take to Twitter, study shows power of 'visual abstract' graphics
When it comes to sharing new research findings with the world, Twitter has emerged as a key tool for scientists -- and for the journals where they publish their findings.
Twitter could have predicted the outcome of the Brexit vote
Leave campaigners were not only victorious in the June 2016 Brexit vote but also in the battle of the twittersphere, a new study in the British Journal of Politics and International Relations reports.
Spreading rumors on Twitter and mistaking retweets for truth
A new study of the believability of information received via Twitter and the intention to pass on a tweet -- whether news or rumor -- is influenced by the number of times the information has already been retweeted.
Twitter researchers offer clues as to why Trump won
University of Rochester computer scientists were able to shed light on how Trump won the presidential election by using data science to track the millions of Twitter followers who liked or 'unliked' the candidates throughout the campaign.
Finding credibility clues on Twitter
By scanning 66 million tweets linked to nearly 1,400 real-world events, Georgia Institute of Technology researchers have built a language model that identifies words and phrases that lead to strong or weak perceived levels of credibility on Twitter.
New public tool uses Twitter posts to gain insights about marijuana use
Journalists, researchers, policymakers, and the public looking to gain new insights about the use of marijuana can now turn to CannabisConvo.
More are positive about HPV vaccine on Twitter than not, Drexel study finds
A Drexel University study into sentiments toward the HPV vaccine on Twitter found that significantly more tweets post positive sentiments toward vaccines, such as the value of prevention and protection, than not.
Tweeting #plasticsurgery -- plastic surgeons urged to engage and educate on twitter
Twitter has become an important resource for people seeking information about plastic surgery.
What Twitter behavior accompanies mental health crises?
A new study published today in the Journal of the American Medical Informatics Association indicates that there were two specific types of heightened Twitter discussions in 2014 related to mental health: expected increases in response to planned behavioral health events and unexpected increases in response to unanticipated events.

Related Twitter Reading:

Best Science Podcasts 2019

We have hand picked the best science podcasts for 2019. Sit back and enjoy new science podcasts updated daily from your favorite science news services and scientists.
Now Playing: TED Radio Hour

Do animals grieve? Do they have language or consciousness? For a long time, scientists resisted the urge to look for human qualities in animals. This hour, TED speakers explore how that is changing. Guests include biological anthropologist Barbara King, dolphin researcher Denise Herzing, primatologist Frans de Waal, and ecologist Carl Safina.
Now Playing: Science for the People

#SB2 2019 Science Birthday Minisode: Mary Golda Ross
Our second annual Science Birthday is here, and this year we celebrate the wonderful Mary Golda Ross, born 9 August 1908. She died in 2008 at age 99, but left a lasting mark on the science of rocketry and space exploration as an early woman in engineering, and one of the first Native Americans in engineering. Join Rachelle and Bethany for this very special birthday minisode celebrating Mary and her achievements. Thanks to our Patreons who make this show possible! Read more about Mary G. Ross: Interview with Mary Ross on Lash Publications International, by Laurel Sheppard Meet Mary Golda...