The ties that bind: WPI researchers search for the hidden genetic code across species

October 22, 2015

Worcester, Mass. - If a human being, a worm, a broccoli plant, and a yeast cell share common genetic elements, those snippets of DNA, having remained unchanged over millions of years of evolutions, are likely to perform fundamental biological functions.

The National Science Foundation (NSF) has awarded Worcester Polytechnic Institute (WPI) a $768,000 research grant to identify such elements across all known genomes of plants, animals, fungi, and other complex organisms to gain insight into the roles they play in our cells. Dmitry Korkin, PhD, associate professor of computer science and principal investigator for the new project, will use mathematical algorithms and advanced computing technology to analyze vast amounts of genomic data to identify common genetic elements.

"We call these sequences long identical multispecies elements, or LIMEs," said Korkin. "To be conserved across species that diverged hundreds of millions years ago, these elements must carry out some very basic and vital functions in the cells."

Korkin is a member of WPI's Bioinformatics and Computational Biology Program, which uses advanced mathematics and computer science to shed light on basic biology. In the new project, Korkin's team will analyze all the available genomes of eukaryotes, which are organisms whose genetic material is contained within a nucleus. (Bacteria and other simple single-celled organisms do not have nuclei and are called prokaryotes.) Currently, the genomes of some 925 eukaryotic species are sufficiently sequenced for Korkin's analysis; they include many plants and animals, as well as the human genome.

"Just a few years ago, we could not even approach this question, because there was too much data to deal with," Korkin said. "With the technology we had then, the algorithms would have to run, literally, for a thousand years to get a result."

Korkin and his team have made technical leaps, developing new "cache-oblivious" algorithms that are designed not only to answer genetic questions, but also to maximize the efficiency of available computer processing power. "You have to understand the hardware you're running on to optimize the algorithms," Korkin said. "What we're seeing in early results is a thousand-fold improvement. What we were doing on big servers that took weeks, we can now do on a laptop in a couple of hours."

A genome is the complete set of DNA molecules that carry the genetic information needed for development and function of an organism. Famously dubbed "the double-helix", a DNA molecule looks like a twisted ladder with two side rails linked by pairs of only four nucleotides: adenine (A), cytosine (C), guanine (G), and thymine (T). Those four letters are the entire genetic alphabet. The microscopic worm C. elegans has about 100 million base pairs of A, C, G, and T in its genome, while the human genome runs to 3 billion base pairs.

Genes are large sequences of base pairs that provide specific instructions for production of proteins in cells. Genes that code for proteins, however, account for less than two percent of the DNA in human cells. For many years, the remaining 98 percent was called "junk DNA" and thought to be inactive leftovers built up from millennia of evolution. "We now know that it's really not junk at all," Korkin said. "Those non-coding regions of the genome are emerging as very important for basic development and regulatory functions."

Over the next three years, Korkin's team will work to identify identical (or nearly identical) patterns of base pairs that exist across species and develop some understanding of the evolutionary history of those genetic elements and their roles in normal development or the onset of disease. Korkin expects most of the LIMEs will fall in non-coding regions, given that those areas dominate the genome, but the project may also identify some common genes.
About Worcester Polytechnic Institute

Founded in 1865 in Worcester, Mass., WPI is one of the nation's first engineering and technology universities. Its 14 academic departments offer more than 50 undergraduate and graduate degree programs in science, engineering, technology, business, the social sciences, and the humanities and arts, leading to bachelor's, master's and doctoral degrees. WPI's talented faculty work with students on interdisciplinary research that seeks solutions to important and socially relevant problems in fields as diverse as the life sciences and bioengineering, energy, information security, materials processing, and robotics. Students also have the opportunity to make a difference to communities and organizations around the world through the university's innovative Global Perspective Program. There are now more than 45 WPI project centers in the Americas, Africa, Asia-Pacific, and Europe.

Worcester Polytechnic Institute

Related DNA Articles from Brightsurf:

A new twist on DNA origami
A team* of scientists from ASU and Shanghai Jiao Tong University (SJTU) led by Hao Yan, ASU's Milton Glick Professor in the School of Molecular Sciences, and director of the ASU Biodesign Institute's Center for Molecular Design and Biomimetics, has just announced the creation of a new type of meta-DNA structures that will open up the fields of optoelectronics (including information storage and encryption) as well as synthetic biology.

Solving a DNA mystery
''A watched pot never boils,'' as the saying goes, but that was not the case for UC Santa Barbara researchers watching a ''pot'' of liquids formed from DNA.

Junk DNA might be really, really useful for biocomputing
When you don't understand how things work, it's not unusual to think of them as just plain old junk.

Designing DNA from scratch: Engineering the functions of micrometer-sized DNA droplets
Scientists at Tokyo Institute of Technology (Tokyo Tech) have constructed ''DNA droplets'' comprising designed DNA nanostructures.

Does DNA in the water tell us how many fish are there?
Researchers have developed a new non-invasive method to count individual fish by measuring the concentration of environmental DNA in the water, which could be applied for quantitative monitoring of aquatic ecosystems.

Zigzag DNA
How the cell organizes DNA into tightly packed chromosomes. Nature publication by Delft University of Technology and EMBL Heidelberg.

Scientists now know what DNA's chaperone looks like
Researchers have discovered the structure of the FACT protein -- a mysterious protein central to the functioning of DNA.

DNA is like everything else: it's not what you have, but how you use it
A new paradigm for reading out genetic information in DNA is described by Dr.

A new spin on DNA
For decades, researchers have chased ways to study biological machines.

From face to DNA: New method aims to improve match between DNA sample and face database
Predicting what someone's face looks like based on a DNA sample remains a hard nut to crack for science.

Read More: DNA News and DNA Current Events is a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for sites to earn advertising fees by advertising and linking to