Nav: Home

A chicken-egg question: Where do baby genes come from?

April 26, 2017

New genes are more likely to appear on the stage of evolution in full-fledged form rather than gradually take shape through successive stages of "proto genes" that become more and more refined over generations. This is the surprising upshot from research led by Benjamin Wilson and Joanna Masel at the University of Arizona, published as an Advance Online Publication by the scientific journal Nature Ecology & Evolution on April 24.

Evolutionary biologists have long pored over the question of where new genes come from, which poses something of a chicken-and-egg problem. Conventional wisdom has it that new genes -- DNA sequences that code for a protein molecule -- evolve from existing genes through duplication and divergence. This happens when DNA copying mechanisms accidentally leave behind an extra copy of a particular gene. Naturally occurring mutations subsequently introduce changes that alter the DNA sequence such that the new gene assumes a function previously not found in the organism's lineage.

Previous studies by other researchers suggested that new genes also emerge from non-coding DNA sequences, via primitive "proto-genes" that become refined over generations, resulting in an "adult," fully functional gene.

Masel and her team found the opposite to be more likely, based on the fact that non-coding DNA sequences are likely to give rise to highly ordered proteins. Proteins, which consist of amino acids chained together into so-called polypeptides, tend to fold into three-dimensional structures that range from simple to mindbogglingly complicated. And while "ordered" may sound like a good thing, Masel is quick to point out that a healthy dose of disorder is key to success when it comes to evolution coming up with new genes that serve as blueprints for new proteins.

For the study, the researchers compiled data on full-genome DNA sequences downloaded from yeast and mouse databases.

"We take all the known mouse genes and yeast genes and query them against everything that's ever been sequenced and see what they're related to," explains Masel, a professor in the Department of Ecology and Evolution and a member of the UA's BIO5 Institute, "and based on that, we assign each gene an age that tells us when it was born."

In the next step, the team used statistical analyses to create a model revealing the average degree of order that would be present in each gene's product.

"We found that the youngest genes are the least ordered of all, which is what you would expect to get if you birthed a gene," Masel says.

The key to a protein that can contribute a useful function for its organism while not harming it is a healthy mix between regions that are soluble because they consist of hydrophilic, or "water-loving," amino acids and stretches that are insoluble because of their hydrophobic, or "water-repelling," amino acids.

If a protein consists of too many water-loving amino acids, it will remain largely unfolded, floating around inside the cell as an unorganized chain incapable of performing biological tasks. If too much of its length is water-repelling, the amino acids will clump together, rendering the protein unusable, and even dangerous, because when such misfolded proteins bump into each other, they tend to stick to each other and accumulate.

"Now think about the most highly ordered proteins we know -- amyloids," Masel says, referring to the infamous piles of proteins found in the brain of Alzheimer's patients. "Because of this, the first order of business for any prospective gene is: 'Do no harm. Do not misfold.'"

This has profound implications for the evolution of new genes from non-coding DNA sequences. Because such sequences are likely to give rise to highly ordered proteins, they are likely to be deleterious to the organism. In this scenario, any prospective new gene must start out as some kind of "super gene," in contrast to a "proto gene." Rather than making its debut in the gene pool as an unrefined gene that still bears many similarities to the non-coding DNA sequences it came from, the protein it encodes must start with a higher-than average degree of disorder to prove itself before evolution would allow it becoming a permanent member of the gene pool.

"Instead of gradually working up to having more hydrophilic regions, young genes work their way down from being more hydrophilic and disordered, to more hydrophobic regions," Masel says. "In other words, when it comes to structural disorder, a polypeptide has the highest chance of being born if it is 'extra gene-like,' rather than 'sort of gene-like.'"

The probability that a gene could arise from a random, non-coding sequence -- also known as "junk DNA," on the other hand, used to be considered negligible, based on the premise that in the vast majority of cases, a random sequence does more harm than good. This may not be so, argues a second paper in the same issue by Rafik Neme, one of the co-authors of the study discussed here. Neme, currently a postdoctoral researcher at Columbia University Medical Center in New York, found the first experimental evidence that non-coding, "silent" stretches of DNA are anything but that.

"Until now, nobody knew whether a randomly sequence could immediately have any effect that would result in a function, or whether function was slowly acquired over time," Neme says. "It's similar to the idea of having a monkey typewriting at random, and expecting it to produce meaningful work."

Neme's experiments show that many sequences exhibit relevant activities immediately, some good and some bad. This, in turn, suggests a discrete transition between non-genes and genes and would favor certain kind of sequences and functions over others.

Based on their findings, Neme and Masel point out, the pool from which genes are born might be more conducive to birthing new genes than one might expect.

"In our scenario, a gene precursor would be a transcript that happened to be translated into a protein sometimes but has no function," she says. "These things come up in evolution all the time, and mutation will quickly destroy it unless that polypeptide provides the organism with some advantage. There either is an advantage that natural selection can act on, or there isn't, so we don't think the would-be genes stick around for very long."

This in turn suggests that gene birth is a sudden transition, rather than a gradual process involving many intermediate steps.
In addition to Wilson, Neme and Masel, the paper was co-authored by Scott Foy, currently at St. Jude Children's Research Hospital in Memphis, Tennessee. Funding was provided by the John Templeton Foundation, the National Institutes of Health and the European Research Council.

University of Arizona

Related Dna Articles:

A new twist on DNA origami
A team* of scientists from ASU and Shanghai Jiao Tong University (SJTU) led by Hao Yan, ASU's Milton Glick Professor in the School of Molecular Sciences, and director of the ASU Biodesign Institute's Center for Molecular Design and Biomimetics, has just announced the creation of a new type of meta-DNA structures that will open up the fields of optoelectronics (including information storage and encryption) as well as synthetic biology.
Solving a DNA mystery
''A watched pot never boils,'' as the saying goes, but that was not the case for UC Santa Barbara researchers watching a ''pot'' of liquids formed from DNA.
Junk DNA might be really, really useful for biocomputing
When you don't understand how things work, it's not unusual to think of them as just plain old junk.
Designing DNA from scratch: Engineering the functions of micrometer-sized DNA droplets
Scientists at Tokyo Institute of Technology (Tokyo Tech) have constructed ''DNA droplets'' comprising designed DNA nanostructures.
Does DNA in the water tell us how many fish are there?
Researchers have developed a new non-invasive method to count individual fish by measuring the concentration of environmental DNA in the water, which could be applied for quantitative monitoring of aquatic ecosystems.
Zigzag DNA
How the cell organizes DNA into tightly packed chromosomes. Nature publication by Delft University of Technology and EMBL Heidelberg.
Scientists now know what DNA's chaperone looks like
Researchers have discovered the structure of the FACT protein -- a mysterious protein central to the functioning of DNA.
DNA is like everything else: it's not what you have, but how you use it
A new paradigm for reading out genetic information in DNA is described by Dr.
A new spin on DNA
For decades, researchers have chased ways to study biological machines.
From face to DNA: New method aims to improve match between DNA sample and face database
Predicting what someone's face looks like based on a DNA sample remains a hard nut to crack for science.
More DNA News and DNA Current Events

Trending Science News

Current Coronavirus (COVID-19) News

Top Science Podcasts

We have hand picked the top science podcasts of 2020.
Now Playing: TED Radio Hour

Listen Again: The Power Of Spaces
How do spaces shape the human experience? In what ways do our rooms, homes, and buildings give us meaning and purpose? This hour, TED speakers explore the power of the spaces we make and inhabit. Guests include architect Michael Murphy, musician David Byrne, artist Es Devlin, and architect Siamak Hariri.
Now Playing: Science for the People

#576 Science Communication in Creative Places
When you think of science communication, you might think of TED talks or museum talks or video talks, or... people giving lectures. It's a lot of people talking. But there's more to sci comm than that. This week host Bethany Brookshire talks to three people who have looked at science communication in places you might not expect it. We'll speak with Mauna Dasari, a graduate student at Notre Dame, about making mammals into a March Madness match. We'll talk with Sarah Garner, director of the Pathologists Assistant Program at Tulane University School of Medicine, who takes pathology instruction out of...
Now Playing: Radiolab

What If?
There's plenty of speculation about what Donald Trump might do in the wake of the election. Would he dispute the results if he loses? Would he simply refuse to leave office, or even try to use the military to maintain control? Last summer, Rosa Brooks got together a team of experts and political operatives from both sides of the aisle to ask a slightly different question. Rather than arguing about whether he'd do those things, they dug into what exactly would happen if he did. Part war game part choose your own adventure, Rosa's Transition Integrity Project doesn't give us any predictions, and it isn't a referendum on Trump. Instead, it's a deeply illuminating stress test on our laws, our institutions, and on the commitment to democracy written into the constitution. This episode was reported by Bethel Habte, with help from Tracie Hunte, and produced by Bethel Habte. Jeremy Bloom provided original music. Support Radiolab by becoming a member today at     You can read The Transition Integrity Project's report here.