Tool reveals molecular causes of disease, including infant cancer

March 01, 2019

Princeton University researchers are gaining new insights into the causes and characteristics of diseases by harnessing machine learning to analyze molecular patterns across hundreds of diseases simultaneously. Demonstrating a new tool now available to researchers worldwide, the team of computer scientists and biologists has already uncovered and experimentally confirmed previously unknown contributions of four genes to a rare form of cancer that primarily affects babies and young children.

The team, which includes collaborators at Michigan State University and the University of Oslo, introduced the system and demonstrated its abilities in a paper published in the Feb. 23 issue of the journal Cell Systems.

While previous approaches focused on genes associated with specific diseases or types of cancer, the new technique uses machine learning to find unique patterns of gene activity by looking at more than 300 different diseases simultaneously, including cancers, heart disease, metabolic disorders and many others. In doing so, it reveals distinctions between diseases and tissue types, including fine-tuned differences between related diseases that were not possible to discern with other techniques.

The researchers believe that, with further development, the tool will be useful to clinicians in diagnosing disease, tailoring and tracking the effectiveness of therapies, and finding new treatment approaches.

The system, called Unveiling RNA Sample Annotation for Human Diseases, or URSA(HD), incorporates information about the activity of genes from publicly available records of about 8,000 biopsies taken from healthy and diseased tissues of thousands of patients. Going forward, researchers may submit new samples to the tool, via a
web interface, and receive an analysis of possible associations with diseases and tissue types.

"The real innovation is comparing all samples to every other sample," said Chandra Theesfeld, one of the lead researchers along with Young-Suk Lee, who earned his Ph.D. at Princeton in 2016.

Theesfeld likened the idea to humans' ability to recognize nuanced differences between behaviors based on having seen a wide variety of examples. Watching soccer players, for example, might reveal the characteristics of a kicking action, but watching soccer players and ballet dancers at the same time reveals details and context for a similar action with a very different style and purpose.

"Studying them together provides a way to distinguish unique aspects," said Theesfeld, a research scientist in the lab of Olga Troyanskaya of Princeton and the Simons Foundation, who led the team. That viewpoint provides an unbiased way "to learn new things about disease that aren't possible to find with the one-disease-at-a-time approach -- and potentially identify new targets for therapies or even discover new aspects of disease that weren't appreciated."

In making its comparisons, the algorithm gives more weight to differences in gene activity that uniquely define the distinct tissues and diseases. It de-emphasizes information about gene activity common to related diseases, much of which already is well studied. In the soccer-dancing analogy, it's like setting aside the large-scale action of lifting a leg in a kick and finding many details, such the angle of a foot, that taken together constitute a signature set of characteristics that reliably identifies one action or the other.

"Our method is driven by the disease information in the patient sample, so it's not biased toward the popular disease genes that always get studied," Theesfeld said. "We can track patterns of changes in data without knowing exactly what each change means."

Theesfeld noted that 90 percent of studies of genes look at just 10 percent of human genes. URSA(HD) looks at the entire human genome and creates a genome-wide model or signature for each disease.

This approach could be particularly powerful for rare diseases, for which the researchers can now create a model with just a few samples. In the case of neuroblastoma, the pediatric cancer, the researchers found four genes that particularly contributed to the disease and for which there was no previous information in the scientific literature. To confirm the findings, Theesfeld performed laboratory tests on human cells, manipulating the gene activity and observing their effects on cancer-related processes in the cells.

Rather than looking at DNA itself, URSA(HD) looks at RNA, the product that cells create as they transcribe the information in DNA into working molecules that build and run cells and transmit signals from cell to cell. In this way, the system looks beyond mutations (scrambling in the genes themselves) and instead focuses on the downstream transcription products, which can be dysregulated in ways that cause problems even if the original gene is normal.

The research is part of longstanding work in Troyanskaya's lab to integrate massive collections of dissimilar datasets to extract information necessary to make precise biological predictions and to direct laboratory experiments to accelerate discovery. A wide range of data science at Princeton brings together computing and biology to develop foundational tools and insights with potential to have a broad impact for health and humanity.

"Interdisciplinary approaches that merge sophisticated data science with deep knowledge of biology are key to deciphering biomedical puzzles necessary to realize the promise of precision medicine," said Troyanskaya.
In addition to Lee, Theesfeld and Troyanskaya, who is a professor of computer science and Lewis-Sigler Institute for Integrative Genomics and deputy director for genomics at the Flatiron Institute in New York, the researchers include: Rose Oughtred, Jennifer Rust, Christie S. Chang, Joseph Ryu and Kara Dolinski of Princeton; Arjun Krishnan of Michigan State University, and Vessela N. Kristensen of the University of Oslo. Support for the project was provided in part by the National Institutes of Health.

Princeton University, Engineering School

Related Cancer Articles from Brightsurf:

New blood cancer treatment works by selectively interfering with cancer cell signalling
University of Alberta scientists have identified the mechanism of action behind a new type of precision cancer drug for blood cancers that is set for human trials, according to research published in Nature Communications.

UCI researchers uncover cancer cell vulnerabilities; may lead to better cancer therapies
A new University of California, Irvine-led study reveals a protein responsible for genetic changes resulting in a variety of cancers, may also be the key to more effective, targeted cancer therapy.

Breast cancer treatment costs highest among young women with metastic cancer
In a fight for their lives, young women, age 18-44, spend double the amount of older women to survive metastatic breast cancer, according to a large statewide study by the University of North Carolina at Chapel Hill.

Cancer mortality continues steady decline, driven by progress against lung cancer
The cancer death rate declined by 29% from 1991 to 2017, including a 2.2% drop from 2016 to 2017, the largest single-year drop in cancer mortality ever reported.

Stress in cervical cancer patients associated with higher risk of cancer-specific mortality
Psychological stress was associated with a higher risk of cancer-specific mortality in women diagnosed with cervical cancer.

Cancer-sniffing dogs 97% accurate in identifying lung cancer, according to study in JAOA
The next step will be to further fractionate the samples based on chemical and physical properties, presenting them back to the dogs until the specific biomarkers for each cancer are identified.

Moffitt Cancer Center researchers identify one way T cell function may fail in cancer
Moffitt Cancer Center researchers have discovered a mechanism by which one type of immune cell, CD8+ T cells, can become dysfunctional, impeding its ability to seek and kill cancer cells.

More cancer survivors, fewer cancer specialists point to challenge in meeting care needs
An aging population, a growing number of cancer survivors, and a projected shortage of cancer care providers will result in a challenge in delivering the care for cancer survivors in the United States if systemic changes are not made.

New cancer vaccine platform a potential tool for efficacious targeted cancer therapy
Researchers at the University of Helsinki have discovered a solution in the form of a cancer vaccine platform for improving the efficacy of oncolytic viruses used in cancer treatment.

American Cancer Society outlines blueprint for cancer control in the 21st century
The American Cancer Society is outlining its vision for cancer control in the decades ahead in a series of articles that forms the basis of a national cancer control plan.

Read More: Cancer News and Cancer Current Events is a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for sites to earn advertising fees by advertising and linking to