DNA repeats -- the genome's dark matter

November 22, 2019

Expansions of DNA repeats are very hard to analyze. A method developed by researchers at the Max Planck Institute for Molecular Genetics in Berlin allows for a detailed look at these previously inaccessible regions of the genome. It combines nanopore sequencing, stem cell, and CRISPR-Cas technologies. The method could improve the diagnosis of various congenital diseases and cancers in the future.

Large parts of the genome consist of monotonous regions where short sections of the genome repeat hundreds or thousands of times. But expansions of these "DNA repeats" in the wrong places can have dramatic consequences, like in patients with Fragile X syndrome, one of the most commonly identifiable hereditary causes of cognitive disability in humans. However, these repetitive regions are still regarded as an unknown territory that cannot be examined appropriately, even with modern methods.

A research team led by Franz-Josef Müller at the Max Planck Institute for Molecular Genetics in Berlin and the University Hospital of Schleswig-Holstein in Kiel recently shed light on this inaccessible region of the genome. Müller's team was the first to successfully determine the length of genomic tandem repeats in patient-derived stem cell cultures. The researchers additionally obtained data on the epigenetic state of the repeats by scanning individual DNA molecules. The method, which is based on nanopore sequencing and CRISPR-Cas technologies, opens the door for research into repetitive genomic regions, and the rapid and accurate diagnosis of a range of diseases.

A gene defect on the X chromosome

In Fragile X syndrome, a repeat sequence has expanded in a gene called FMR1 on the X chromosome. "The cell recognizes the repetitive region and switches it off by attaching methyl groups to the DNA," says Müller. These small chemical changes have an epigenetic effect because they leave the underlying genetic information intact. "Unfortunately, the epigenetic marks spread over to the entire gene, which is then completely shut down," explains Müller. The gene is known to be essential for normal brain development. He states: "Without the FMR1 gene, we see severe delays in development leading to varying degrees of intellectual disability or autism."

Female individuals are, in most cases, less affected by the disease, since the repeat region is usually located on only one of the two X chromosomes. Since the unchanged second copy of the gene is not epigenetically altered, it is able to compensate for the genetic defect. In contrast, males have only one X chromosome and one copy of the affected gene and display the full range of clinical symptoms. The syndrome is one of about 30 diseases that are caused by expanding short tandem repeats.

First precise mapping of short tandem repeats

In this study, Müller and his team investigated the genome of stem cells that were derived from patient tissue. They were able to determine the length of the repeat regions and their epigenetic signature, a feat that had not been possible with conventional sequencing methods. The researchers also discovered that the length of the repetitive region could vary to a large degree, even among the cells of a single patient.

The researchers also tested their process with cells derived from patients that contained an expanded repeat in one of the two copies of the C9orf72 gene. This mutation leads to one of the most common monogenic causes of frontotemporal dementia and amyotrophic lateral sclerosis. "We were the first to map the entire epigenetics of extended and unchanged repeat regions in a single experiment," says Müller. Furthermore, the region of interest on the DNA molecule remained physically wholly unaltered. "We developed a unique method for the analysis of single molecules and for the darkest regions of our genome - that's what makes this so exciting for me."

Tiny pores scan single molecules

"Conventional methods are limited when it comes to highly repetitive DNA sequences. Not to mention the inability to simultaneously detect the epigenetic properties of repeats," says Björn Brändl, one of the first authors of the publication. That's why the scientists used Nanopore sequencing technology, which is capable of analyzing these regions. The DNA is fragmented, and each strand is threaded through one of a hundred tiny holes ("nanopores") on a silicon chip. At the same time, electrically charged particles flow through the pores and generate a current. When a DNA molecule moves through one of these pores, the current varies depending on the chemical properties of the DNA. These fluctuations of the electrical signal are enough for the computer to reconstruct the genetic sequence and the epigenetic chemical labels. This process takes place at each pore and, thus, each strand of DNA.

Genome editing tools and bioinformatics illuminate "dark matter"

Conventional sequencing methods analyze the entire genome of a patient. Now, the scientists designed a process to look at specific regions selectively. Brändl used the CRISPR-Cas system to cut DNA segments from the genome that contained the repeat region. These segments went through a few intermediate processing steps and were then funneled into the pores on the sequencing chip.

"If we had not pre-sorted the molecules in this way, their signal would have been drowned in the noise of the rest of the genome," says bioinformatician Pay Giesselmann. He had to develop an algorithm specifically for the interpretation of the electrical signals generated by the repeats: "Most algorithms fail because they do not expect the regular patterns of repetitive sequences." While Giesselmann's program "STRique" does not determine the genetic sequence itself, it counts the number of sequence repetitions with high precision. The program is freely available on the internet.

Numerous potential applications in research and the clinic

"With the CRISPR-Cas system and our algorithms, we can scrutinize any section of the genome - especially those regions that are particularly difficult to examine using conventional methods," says Müller, who is heading the project. "We created the tools that enable every researcher to explore the dark matter of the genome," says Müller. He sees great potential for basic research. "There is evidence that the repeats grow during the development of the nervous system, and we would like to take a closer look at this."

The physician also envisions numerous applications in clinical diagnostics. After all, repetitive regions are involved in the development of cancer, and the new method is relatively inexpensive and fast. Müller is determined to take the procedure to the next level: "We are very close to clinical application."
-end-
Original publication

Pay Giesselmann, Björn Brändl, Etienne Raimondeau, Rebecca Bowen, Christian Rohrandt, Rashmi Tandon, Helene Kretzmer, Günter Assum, Christina Galonska, Reiner Siebert, Ole Ammerpohl, Andrew Heron, Susanne A. Schneider, Julia Ladewig, Philipp Koch, Bernhard M. Schuldt, James E. Graham, Alexander Meissner, Franz-Josef Müller
Analysis of short tandem repeat expansions and their methylation state with nanopore sequencing.
Nature Biotechnology (2019)

Max-Planck-Gesellschaft

Related DNA Articles from Brightsurf:

A new twist on DNA origami
A team* of scientists from ASU and Shanghai Jiao Tong University (SJTU) led by Hao Yan, ASU's Milton Glick Professor in the School of Molecular Sciences, and director of the ASU Biodesign Institute's Center for Molecular Design and Biomimetics, has just announced the creation of a new type of meta-DNA structures that will open up the fields of optoelectronics (including information storage and encryption) as well as synthetic biology.

Solving a DNA mystery
''A watched pot never boils,'' as the saying goes, but that was not the case for UC Santa Barbara researchers watching a ''pot'' of liquids formed from DNA.

Junk DNA might be really, really useful for biocomputing
When you don't understand how things work, it's not unusual to think of them as just plain old junk.

Designing DNA from scratch: Engineering the functions of micrometer-sized DNA droplets
Scientists at Tokyo Institute of Technology (Tokyo Tech) have constructed ''DNA droplets'' comprising designed DNA nanostructures.

Does DNA in the water tell us how many fish are there?
Researchers have developed a new non-invasive method to count individual fish by measuring the concentration of environmental DNA in the water, which could be applied for quantitative monitoring of aquatic ecosystems.

Zigzag DNA
How the cell organizes DNA into tightly packed chromosomes. Nature publication by Delft University of Technology and EMBL Heidelberg.

Scientists now know what DNA's chaperone looks like
Researchers have discovered the structure of the FACT protein -- a mysterious protein central to the functioning of DNA.

DNA is like everything else: it's not what you have, but how you use it
A new paradigm for reading out genetic information in DNA is described by Dr.

A new spin on DNA
For decades, researchers have chased ways to study biological machines.

From face to DNA: New method aims to improve match between DNA sample and face database
Predicting what someone's face looks like based on a DNA sample remains a hard nut to crack for science.

Read More: DNA News and DNA Current Events
Brightsurf.com is a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for sites to earn advertising fees by advertising and linking to Amazon.com.