Nav: Home

Regenstrief, IU study finds machine learning as good as humans' in cancer surveillance

April 21, 2016

INDIANAPOLIS -- Machine learning has come of age in public health reporting according to researchers from the Regenstrief Institute and Indiana University School of Informatics and Computing at Indiana University-Purdue University Indianapolis. They have found that existing algorithms and open source machine learning tools were as good as, or better than, human reviewers in detecting cancer cases using data from free-text pathology reports. The computerized approach was also faster and less resource intensive in comparison to human counterparts.

Every state in the United States requires cancer cases to be reported to statewide cancer registries for disease tracking, identification of at-risk populations, and recognition of unusual trends or clusters. Typically, however, busy health care providers submit cancer reports to equally busy public health departments months into the course of a patient's treatment rather than at the time of initial diagnosis.

This information can be difficult for health officials to interpret, which can further delay health department action, when action is needed. The Regenstrief Institute and IU researchers have demonstrated that machine learning can greatly facilitate the process, by automatically and quickly extracting crucial meaning from plaintext, also known as free-text, pathology reports, and using them for decision-making.

"Towards Better Public Health Reporting Using Existing Off the Shelf Approaches: A Comparison of Alternative Cancer Detection Approaches Using Plaintext Medical Data and Non-dictionary Based Feature Selection" is published in the April 2016 issue of the Journal of Biomedical Informatics.

"We think that its no longer necessary for humans to spend time reviewing text reports to determine if cancer is present or not," said study senior author Shaun Grannis, M.D., M.S., interim director of the Regenstrief Center of Biomedical Informatics. "We have come to the point in time that technology can handle this. A human's time is better spent helping other humans by providing them with better clinical care."

"A lot of the work that we will be doing in informatics in the next few years will be focused on how we can benefit from machine learning and artificial intelligence. Everything -- physician practices, health care systems, health information exchanges, insurers, as well as public health departments -- are awash in oceans of data. How can we hope to make sense of this deluge of data? Humans can't do it -- but computers can."

Dr. Grannis, a Regenstrief Institute investigator and an associate professor of family medicine at the IU School of Medicine, is the architect of the Regenstrief syndromic surveillance detector for communicable diseases and led the technical implementation of Indiana's Public Health Emergency Surveillance System - one of the nation's largest. Studies over the past decade have shown that this system detects outbreaks of communicable diseases seven to nine days earlier and finds four times as many cases as human reporting while providing more complete data.

"What's also interesting is that our efforts show significant potential for use in underserved nations, where a majority of clinical data is collected in the form of unstructured free text," said study first author Suranga N. Kasthurirathne, a doctoral student at School of Informatics and Computing at IUPUI. "Also, in addition to cancer detection, our approach can be adopted for a wide range of other conditions as well."

The researchers sampled 7,000 free-text pathology reports from over 30 hospitals that participate in the Indiana Health Information Exchange and used open source tools, classification algorithms, and varying feature selection approaches to predict if a report was positive or negative for cancer. The results indicated that a fully automated review yielded results similar or better than those of trained human reviewers, saving both time and money.

"Machine learning can now support ideas and concepts that we have been aware of for decades, such as a basic understanding of medical terms," said Dr. Grannis. "We found that artificial intelligence was as least as accurate as humans in identifying cancer cases from free-text clinical data. For example the computer 'learned' that the word 'sheet' or 'sheets' signified cancer as 'sheet' or 'sheets of cells' are used in pathology reports to indicate malignancy.

"This is not an advance in ideas, it's a major infrastructure advance -- we have the technology, we have the data, we have the software from which we saw accurate, rapid review of vast amounts of data without human oversight or supervision."
-end-
The study was conducted with support from the Centers for Disease Control and Prevention.

In addition to Dr. Grannis and Mr. Kasthurirathne, co-authors of the study are Regenstrief Institute investigator Brian E. Dixon, MPA, Ph.D. and Huiping Xu, Ph.D., of the IU Fairbanks School of Public Health; former Regenstrief fellow Judy Gichoya, M.D. and Regenstrief investigator Burke Mamlin, M.D. of the IU School of Medicine and Yuni Xia, Ph.D. of the School of Science at IUPUI.

Indiana University

Related Cancer Articles:

Radiotherapy for invasive breast cancer increases the risk of second primary lung cancer
East Asian female breast cancer patients receiving radiotherapy have a higher risk of developing second primary lung cancer.
Cancer genomics continued: Triple negative breast cancer and cancer immunotherapy
Continuing PLOS Medicine's special issue on cancer genomics, Christos Hatzis of Yale University, New Haven, Conn., USA and colleagues describe a new subtype of triple negative breast cancer that may be more amenable to treatment than other cases of this difficult-to-treat disease.
Metabolite that promotes cancer cell transformation and colorectal cancer spread identified
Osaka University researchers revealed that the metabolite D-2-hydroxyglurate (D-2HG) promotes epithelial-mesenchymal transition of colorectal cancer cells, leading them to develop features of lower adherence to neighboring cells, increased invasiveness, and greater likelihood of metastatic spread.
UH Cancer Center researcher finds new driver of an aggressive form of brain cancer
University of Hawai'i Cancer Center researchers have identified an essential driver of tumor cell invasion in glioblastoma, the most aggressive form of brain cancer that can occur at any age.
UH Cancer Center researchers develop algorithm to find precise cancer treatments
University of Hawai'i Cancer Center researchers developed a computational algorithm to analyze 'Big Data' obtained from tumor samples to better understand and treat cancer.
New analytical technology to quantify anti-cancer drugs inside cancer cells
University of Oklahoma researchers will apply a new analytical technology that could ultimately provide a powerful tool for improved treatment of cancer patients in Oklahoma and beyond.
Radiotherapy for lung cancer patients is linked to increased risk of non-cancer deaths
Researchers have found that treating patients who have early stage non-small cell lung cancer with a type of radiotherapy called stereotactic body radiation therapy is associated with a small but increased risk of death from causes other than cancer.
Cancer expert says public health and prevention measures are key to defeating cancer
Is investment in research to develop new treatments the best approach to controlling cancer?
UI Cancer Center, Governors State to address cancer disparities in south suburbs
The University of Illinois Cancer Center and Governors State University have received a joint four-year, $1.5 million grant from the National Cancer Institute to help both institutions conduct community-based research to reduce cancer-related health disparities in Chicago's south suburbs.
Leading cancer research organizations to host international cancer immunotherapy conference
The Cancer Research Institute, the Association for Cancer Immunotherapy, the European Academy of Tumor Immunology, and the American Association for Cancer Research will join forces to sponsor the first International Cancer Immunotherapy Conference at the Sheraton New York Times Square Hotel in New York, Sept.

Related Cancer Reading:

Best Science Podcasts 2019

We have hand picked the best science podcasts for 2019. Sit back and enjoy new science podcasts updated daily from your favorite science news services and scientists.
Now Playing: TED Radio Hour

Changing The World
What does it take to change the world for the better? This hour, TED speakers explore ideas on activism—what motivates it, why it matters, and how each of us can make a difference. Guests include civil rights activist Ruby Sales, labor leader and civil rights activist Dolores Huerta, author Jeremy Heimans, "craftivist" Sarah Corbett, and designer and futurist Angela Oguntala.
Now Playing: Science for the People

#521 The Curious Life of Krill
Krill may be one of the most abundant forms of life on our planet... but it turns out we don't know that much about them. For a create that underpins a massive ocean ecosystem and lives in our oceans in massive numbers, they're surprisingly difficult to study. We sit down and shine some light on these underappreciated crustaceans with Stephen Nicol, Adjunct Professor at the University of Tasmania, Scientific Advisor to the Association of Responsible Krill Harvesting Companies, and author of the book "The Curious Life of Krill: A Conservation Story from the Bottom of the World".