Researchers explain signals of CpG 'traffic lights' in DNA

April 09, 2019

A research team featuring bioinformaticians from the Moscow Institute of Physics and Technology (MIPT) has identified reliable markers of gene activity. The discovery has potential for future applications in clinical practice. The findings are reported in BMC Genomics.

In terms of macromolecular chemistry, DNA is a polymer, or polynucleotide, composed of four kinds of repeated units known as nucleotides. What makes the four types different are the associated nitrogenous bases: adenine (A), thymine (T), guanine (G), and cytosine (C). A DNA region with a C followed by G, connected via a phosphate (p), is known as a CpG dinucleotide (figure 1).

Genes are DNA regions carrying primary information about the RNA and proteins produced by a cell. The DNA sequence is usually the same across all cells in an organism, but the "working" genes actually involved in RNA synthesis are different for every cell type. This is enabled by regulatory mechanisms functioning as switches that launch or halt the production of specific RNAs. Once a certain level of organism complexity is reached in evolution, the number of genes does not increase any longer. Instead, more elaborate regulatory programs evolve to make sure the right genes are activated at the right time.

As of now, two major types of mechanisms regulating gene transcription (RNA synthesis) have been studied in detail: those involving so-called transcription factors and epigenetic regulators. Transcription factors are regulatory proteins capable of binding to DNA by recognizing certain nucleotide sequences in the regulatory regions of genes. Once a transcription factor has bound to DNA, it engages special cell machinery to initiate RNA synthesis from that particular gene. A large number of such factors is known (over 1,500 in humans). Their combinations regulate how active transcription should be and whether it should happen at all.

Epigenetic regulation involves mechanisms that control how active a gene is without affecting the primary DNA structure. One of such mechanisms is DNA methylation. It is mostly achieved by attaching a methyl group (CH?) to the cytosine in a CpG dinucleotide (figure 2). Once attached, the methyl group serves as a marker signaling which genes are active or repressed, ultimately determining the cell type. It is no surprise that DNA methylation is associated with numerous biological processes, both normal and pathological. Abnormal DNA methylation is observed in cancer, metabolic disorders, cardiovascular, neurodegenerative, and other diseases.

By binding to cytosine nucleotides in distinct functional regions of the DNA, a methyl group can have distinct effects. For example, methylation of regions close to transcription start sites tends to suppress gene activity. Conversely, methylation of cytosines inside the gene usually serves to activate it (figure 3).

"In our previous papers, we showed that methylation of certain CpG dinucleotides was strongly associated with gene activity. We called such dinucleotides CpG traffic lights. Now we have demonstrated that the methylation of CpG traffic lights is a better indicator of gene activity than promoter or gene body methylation. In addition, we've shown that enhancers -- DNA regions located away from genes but regulating their activity -- are enriched in CpG traffic lights," explained Yulia Medvedeva, the senior author of the paper and associate professor of bioinformatics and systems biology at MIPT, who leads the regulatory transcriptomics and epigenomics group at the Research Center of Biotechnology of the Russian Academy of Sciences.

"We noted that CpG traffic lights are conserved in the course of evolution. That is, these positions are relatively less susceptible to mutations, supporting their functional significance," added Anna Lioznova, the first author of the study and a doctoral student at the Research Center of Biotechnology, RAS.

"We used to think that the main role of CpG traffic lights was to switch the regions of transcription factor binding from the active to the passive state," said study co-author Ivan Kulakovskiy, a researcher at Engelhardt Institute of Molecular Biology and the Institute of Mathematical Problems of Biology, RAS. "Surprisingly, that previously explained and described mechanism proved to account for a relatively small share in the overall number of traffic lights. We suppose that the operation of traffic lights is tied to what's known as an activity 'map' of DNA regions [aka. chromatin states], but the specific mechanisms are still to be discovered."

CpG traffic lights are CpG dinucleotides whose methylation reflects the activation or repression of a gene encoded nearby. In other words, such dinucleotides signal whether RNA is to be synthesized from this gene, and ultimately whether the associated protein will be produced. By studying CpG traffic lights, the researchers hope to understand the mechanisms of gene regulation. In clinical practice, determining the status of cytosine methylation produces more reliable results, compared with direct measurements of gene activity. This opens up prospects for the clinical use of CpG traffic lights as effective indicators of gene activity.
The study was conducted by researchers from MIPT, Lomonosov Moscow State University, King Abdullah University of Science and Technology in Saudi Arabia, and the following institutions of the Russian Academy of Sciences: Research Center of Biotechnology, Vavilov Institute of General Genetics, the Institute of Mathematical Problems of Biology, Engelhardt Institute of Molecular Biology, and the Institute for Information Transmission Problems.

This research was supported by the Russian Foundation for Basic Research and the Russian Science Foundation.

Moscow Institute of Physics and Technology

Related DNA Articles from Brightsurf:

A new twist on DNA origami
A team* of scientists from ASU and Shanghai Jiao Tong University (SJTU) led by Hao Yan, ASU's Milton Glick Professor in the School of Molecular Sciences, and director of the ASU Biodesign Institute's Center for Molecular Design and Biomimetics, has just announced the creation of a new type of meta-DNA structures that will open up the fields of optoelectronics (including information storage and encryption) as well as synthetic biology.

Solving a DNA mystery
''A watched pot never boils,'' as the saying goes, but that was not the case for UC Santa Barbara researchers watching a ''pot'' of liquids formed from DNA.

Junk DNA might be really, really useful for biocomputing
When you don't understand how things work, it's not unusual to think of them as just plain old junk.

Designing DNA from scratch: Engineering the functions of micrometer-sized DNA droplets
Scientists at Tokyo Institute of Technology (Tokyo Tech) have constructed ''DNA droplets'' comprising designed DNA nanostructures.

Does DNA in the water tell us how many fish are there?
Researchers have developed a new non-invasive method to count individual fish by measuring the concentration of environmental DNA in the water, which could be applied for quantitative monitoring of aquatic ecosystems.

Zigzag DNA
How the cell organizes DNA into tightly packed chromosomes. Nature publication by Delft University of Technology and EMBL Heidelberg.

Scientists now know what DNA's chaperone looks like
Researchers have discovered the structure of the FACT protein -- a mysterious protein central to the functioning of DNA.

DNA is like everything else: it's not what you have, but how you use it
A new paradigm for reading out genetic information in DNA is described by Dr.

A new spin on DNA
For decades, researchers have chased ways to study biological machines.

From face to DNA: New method aims to improve match between DNA sample and face database
Predicting what someone's face looks like based on a DNA sample remains a hard nut to crack for science.

Read More: DNA News and DNA Current Events is a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for sites to earn advertising fees by advertising and linking to