Nav: Home

New studies from the Center for Genome Architecture at Baylor explore 3-D structure of DNA

August 01, 2016

HOUSTON - (August 1, 2016) - In a set of papers published last week in Cell Systems, Dr. Erez Lieberman Aiden, assistant professor of molecular and human genetics and McNair Scholar at Baylor College of Medicine and director of the Center for Genome Architecture (TC4GA), and his colleagues introduce Juicer, an open-source tool used in three-dimensional (3-D) genome sequencing (Hi-C) processes.

Hi-C, invented by Aiden and collaborators in 2009, explores the three-dimensional structure of the genome, creating terabases of sequencing data resulting in high-resolution contact maps that comprehensively chart the loops that form when the genome folds up inside the nucleus of a cell.

In previous Hi-C experiments, Aiden and his team identified the sheer bandwidth of the data as a central challenge. Existing hardware and software simply could not process and analyze the massive amounts of data produced in these experiments, with a single map spanning billions of reads and trillions of base pairs.

To alleviate this bottleneck in data analysis, Aiden and his team at Baylor, led by Dr. Neva Durand, Muhammad Shamim and Ido Machol, designed Juicer, a fully-automated pipeline that allows users with little to no computational background to transform raw sequencing data into genome-wide maps of looping with a single click. Juicer produces the Hi-C file with loops and contact domains automatically annotated, which facilitates the visualization and analysis of the map and its structural features.

"The studies published in Cell Systems describe our team's new, end-to-end system for analysis of 3-D genome sequencing data. It is the first system of its kind, making it possible to map the loops in a mammalian genome in a fully automated fashion," said Durand, a senior scientist at TC4GA and co-first author on both new studies.

As a demonstration of the power of the new tool, Aiden and his colleagues created the deepest 3-D maps of the genome to date, spanning over three terabytes of data drawn from a single experimental condition.

But improvements in software weren't enough: adequate hardware is also a central challenge. The researchers tracked the performance of Juicer on four cluster systems, including a system based on Edico Genome's DRAGEN Bio-IT processing platform coupled with IBM's Power8 architecture.

Edico's DRAGEN platform accelerated the analysis of the massive data sets derived from this study of 3-D structures of DNA by nearly 20 fold, a dramatic speedup from all of other systems tested.

Machol, a co-author on both studies, noted that, "When we ran our pipeline on a hybrid DRAGEN/Power system, the data analysis was 20-fold faster than running the pipeline on an industry standard cluster. That kind of difference opens the door to many analyses that would have been very impractical before."

DRAGEN generates accelerated implementations of genome pipeline algorithms using a field-programmable gate array (FPGA). The platform is reconfigurable and flexible through remote downloads, allowing users to create custom algorithms and refine existing pipelines.

"Given the dramatic acceleration that we observed, we are excited about the extraordinary potential of FPGA technology in 3-D genomics." said Shamim, an M.D./Ph.D. student at Baylor and co-first author on the Juicer study.

Aiden, who is also a faculty member at Rice University, in the department of Computer Science and at the Center for Theoretical Biological Physics, commented on the experiment, saying, "The partnership between TC4GA and Edico Genome is a game-changer. The results that are possible using DRAGEN are more than a one-off exercise: they are a strong indicator of the future of the 3-D genomics field as a whole. We are confident that our collaboration will lead to a great deal of innovation both within the Texas Medical Center community, and beyond."

Added Pieter van Rooyen, Ph.D., chief executive officer of Edico Genome, "Dr. Aiden and his team's application of DRAGEN to accelerate Juicer is a great example of DRAGEN's effectiveness in processing massive amounts of raw sequencing data in minimal time, and without requiring any additional training or a post-graduate degree. We are continually working to optimize DRAGEN and expect the next version to be even faster than the speed we have already achieved."

Juicer is available as open source software and is compatible with multiple cluster operating systems, Edico's DRAGEN, and Amazon Web Services. It may be downloaded on the web at
Other contributors to this work include James T. Robinson, Jill P. Mesirov, and Eric S. Lander of the Broad Institute of Harvard and MIT, and Suhas Rao and Miriam Huntley, from The Center for Genome Architecture.

This work was supported by an NIH New Innovator Award (1DP2OD008540-01), the National Human Genome Research Institute (NHGRI) Centers of Excellence in Genomic Science (P50HG006193), an NVIDIA Research Center Award, an IBM University Challenge Award, a Google Research Award, a Cancer Prevention Research Institute of Texas Scholar Award (R1304), a McNair Medical Institute Scholar Award, the President's Early Career Award in Science and Engineering, and a grant from the National Science Foundation (NSF) Physics Frontiers Centers (Center for Theoretical Biological Physics).

The authors received grants from the Welch Foundation (to E.L.A.), the National Institute of General Medical Sciences (NIGMS R01GM074024 to J.P.M.), and NHGRI (HG003067 to E.S.L.). The Center for Genome Architecture is grateful to Janice, Robert, and Cary McNair for support.

Read the full papers online:

Juicer Provides a One-Click System for Analyzing Loop-Resolution HI-C Experiments:

Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom:

Baylor College of Medicine

Related Genome Articles:

Deciphering the walnut genome
New research could provide a major boost to the state's growing $1.6 billion walnut industry by making it easier to breed walnut trees better equipped to combat the soil-borne pathogens that now plague many of California's 4,800 growers.
Illuminating the genome
Development of a new molecular visualisation method, RNA-guided endonuclease -- in situ labelling (RGEN-ISL) for the CRISPR/Cas9-mediated labelling of genomic sequences in nuclei and chromosomes.
A genome under influence
References form the basis of our comprehension of the world: they enable us to measure the height of our children or the efficiency of a drug.
How a virus destabilizes the genome
New insights into how Kaposi's sarcoma-associated herpesvirus (KSHV) induces genome instability and promotes cell proliferation could lead to the development of novel antiviral therapies for KSHV-associated cancers, according to a study published Sept.
Better genome editing
Reich Group researchers develop a more efficient and precise method of in-cell genome editing.
More Genome News and Genome Current Events

Best Science Podcasts 2019

We have hand picked the best science podcasts for 2019. Sit back and enjoy new science podcasts updated daily from your favorite science news services and scientists.
Now Playing: TED Radio Hour

Erasing The Stigma
Many of us either cope with mental illness or know someone who does. But we still have a hard time talking about it. This hour, TED speakers explore ways to push past — and even erase — the stigma. Guests include musician and comedian Jordan Raskopoulos, neuroscientist and psychiatrist Thomas Insel, psychiatrist Dixon Chibanda, anxiety and depression researcher Olivia Remes, and entrepreneur Sangu Delle.
Now Playing: Science for the People

#537 Science Journalism, Hold the Hype
Everyone's seen a piece of science getting over-exaggerated in the media. Most people would be quick to blame journalists and big media for getting in wrong. In many cases, you'd be right. But there's other sources of hype in science journalism. and one of them can be found in the humble, and little-known press release. We're talking with Chris Chambers about doing science about science journalism, and where the hype creeps in. Related links: The association between exaggeration in health related science news and academic press releases: retrospective observational study Claims of causality in health news: a randomised trial This...