Next gen sequencing technology pinpoint 'on-off switches' in genomes
February 13, 2009
WALNUT CREEK, Calif.- Scientists from the U.S. Department of Energy (DOE) Joint Genome Institute (JGI), Lawrence Berkeley National Laboratory, and the University of California, San Diego have developed a set of molecular tools that provide important insight into the complex genomes of multicellular organisms. The strategy promises to clarify the longstanding mystery of the role played by vast stretches of DNA sequence that do not code for the functional units-genes-that nevertheless may have a powerful regulatory influence. The research is described in the 12 February edition of the journal Nature.
DOE bioenergy researchers have an interest in identifying these regulatory regions in plants, where proteins interact with DNA and exert control over gene expression and development, so that plants used as biomass "feedstocks" can be optimized for biofuels production.
"From the Human Genome Project we have a good idea where in the genome the protein-coding genes are located, but these constitute only about two percent of the human genome, the remaining 98 percent are non-coding sequence whose function is largely unknown," said Len Pennacchio, the paper's senior author and DOE JGI Genomic Technologies Department Head.
"Our approach employs next generation sequencing technology to find regulatory regions, the 'switches' on a genome-wide scale and much more cost effectively," said Pennacchio. "It's the next layer of knowledge that's been missing."
The DOE JGI was founded in 1997 to accelerate the completion of the HGP and completed the DOE's commitment to sequence three (5, 16, 19) of the 23 chromosomes, totaling 11 percent of the human genome, and published the analysis in Nature back in 2004.
In this newly published study, Pennacchio, lead authors DOE scientist Axel Visel and postdoctoral fellow Matthew Blow, and their colleagues, describe a shortcut for identifying gene regulatory regions or the molecular switches that turn on or off gene expression.
Using what's called ChIP-Sequencing or ChIP-Seq, chromatin immunoprecipitation (ChIP) is combined with massively parallel DNA sequencing to identify binding sites of DNA-associated proteins.
Traditionally researchers have relied on evolution to guide them to non-coding sequences that are likely to have a function-such as enhancing the expression of genes. Via the public genome databases, they would align the entire human genome code with that of other vertebrate species (e.g. other mammals, birds, frogs, fish) and then look for sequences that are conserved in evolution.
"Most protein-coding sequences show signs of conservation between species, but there is also a large number of non-coding sequences that have been surprisingly well conserved for tens or even hundreds of millions of years," said Visel. "This suggests that these regions, formerly thought to be "junk" DNA, actually have some functional relevance and are under selection because sequence changes reduce fitness of affected individuals. Using such sequence conservation, we have in previous studies identified enhancer candidate regions and shown in transgenic mouse experiments that these conserved non-coding regions are in fact often enhancers that are active during embryonic development. Conservation-based methods are relatively good at finding enhancers in the genome, but an important limitation is that they don't tell us where and when that particular enhancer would be active and thereby drive the expression of its neighboring target gene(s).
The older methods lacked specificity, Blow said. "For example, if we have a gene that is important both for brain and for limb development, we would not have been able to specifically identify the enhancer sequences near that gene that would drive the expression in the brain or limb, the only way to find out was to test these activities in experiments one-by-one, which is slow and can't be done on a genomic scale.
"Using this new method, we can directly identify a genome-wide set of enhancers that are active in a particular anatomical region or tissue at a particular time-point, which is an important advantage over conservation-based methods because in addition to telling us where an enhancer is located in the genome, it also provides an initial experimental characterization where we should expect this enhancer to be active."
The team used ChIP linked with a particular enhancer-associated protein, p300, then directed DOE JGI's massively parallel next generation sequencing capacity to map several thousand sites in mouse embryonic forebrain, midbrain and limb tissue. Over 80 of these fragments were tested in transgenic mouse experiments indicating an almost perfect success rate of p300-ChIP-Seq for identifying enhancers active in vivo.
"Enhancers are especially important for regulating genes during embryonic development," said Pennacchio. "They can regulate genes over long distances and switch on their target genes during very specific time-points and in very specific anatomical structures during development. There are several examples of mutations in such enhancers that cause disease in humans because genes are not expressed at the right time or in the right place anymore. A fundamental problem in studying such enhancers is that until recently we did not have effective tools to even find them in the genome on a large scale.
Pennacchio said that this new method will prove useful to the greater genomics and biomedical community for characterizing the role of the vast non-coding regions-dubbed genome "dark matter"-about which little is known.
"These datasets will also help to identify mutations in enhancers that play a role in human disease," Pennacchio said. "Human genetic studies indicate that in many cases disease is caused by mutations in non-coding sequences, but it has been difficult to study this in detail because the function of most non-coding sequences is poorly understood. Eventually, this will be useful for purposes including disease detection and personalized medicine."
With the rapidly increasing efficiency and cost-savings of the next generation sequencing technologies, a deluge of data from individual human genomes are being to come to light, to the point where whole-genome sequencing of patients may soon become a standard diagnostic tool.
"While progress is being made towards this goal, it is important to keep in mind that our current understanding of the genome has focused on protein-coding sequences," said Pennacchio. "Datasets like the one provided through this study will be important to understand the remaining 98 percent of the genome and what its role in health and disease is."
The published study provides an important proof of principle to establish and validate a new method in three different mouse tissues at a single embryonic time-point, Pennacchio said. "We can now generate genome-wide enhancer datasets directly from human tissues and compare genome-wide sets of enhancer activities between healthy people and people suffering from disease, which may reveal how enhancer activities change on a global scale in these disease states."
DOE/Joint Genome Institute

|
Genome: The Autobiography of a Species in 23 Chapters (P.S.)
by Matt Ridley (Author)
The genome's been mapped. But what does it mean? Arguably the most significant scientific discovery of the new century, the mapping of the twenty-three pairs of chromosomes that make up the human genome raises almost as many questions as it answers. Questions that will profoundly impact the way we think about disease, about longevity, and about free will. Questions that will affect the rest of your life. Genome offers extraordinary insight into the ramifications of this incredible breakthrough. By picking one newly discovered gene from each pair of chromosomes and telling its story, Matt Ridley recounts the history of our species and its ancestors from the dawn of life to the brink of future medicine. From Huntington's disease to cancer, from the applications of gene therapy to...
|

|
The Human Genome: Book of Essential Knowledge (Curiosity Guides)
by John Quackenbush (Author), John Sulston (Foreword)
The DNA sequence that comprises the human genome--the genetic blueprint found in each of our cells--is undoubtedly the greatest code ever to be broken. Completed at the dawn of a new millennium, the feat electrified both the scientific community and the general public with its tantalizing promise of new and better treatments for countless diseases, including Alzheimer's, cancer, diabetes, and Parkinson's. Yet what is arguably the most important discovery of our time has also opened a Pandora's box of questions about who we are as humans and how the unique information stored in our genomes can and might be used, making it all the more important for everyone to understand the new science of genomics. In The Curiosity Guide to the Human Genome, Dr. John Quackenbush, a renowned scientist and...
|

|
Genome
by Matt Ridley (Author)
|

|
Genomes 3
by T.A. Brown (Author)
Covering molecular genetics from the basics through to genome expression and molecular phylogenetics, Genomes 3 is the latest edition of this pioneering textbook. Updated to incorporate the recent major advances, Genomes 3 is an invaluable companion for any undergraduate throughout their studies in molecular genetics. Genomes 3 builds on the achievements of the previous two editions by putting genomes, rather than genes, at the centre of molecular genetics teaching. Recognising that molecular biology research was being driven more by genome sequencing and functional analysis than by research into genes, this approach has gathered momentum in recent years.
|

|
The $1,000 Genome: The Revolution in DNA Sequencing and the New Era of Personalized Medicine
by Kevin Davies (Author)
In 2000, President Bill Clinton signaled the completion of the Human Genome Project at a cost in excess of $2 billion. A decade later, the price for any of us to order our own personal genome sequence—a comprehensive map of the 3 billion letters in our DNA—is rapidly and inevitably dropping to just $1,000. Dozens of men and women—scientists, entrepreneurs, celebrities, and patients—have already been sequenced, pioneers in a bold new era of personalized genomic medicine. The $1,000 genome has long been considered the tipping point that would open the floodgates to this revolution. Do you have gene variants associated with Alzheimer’s or diabetes, heart disease or cancer? Which drugs should you consider taking for various diseases, and at what dosage? In the years to come, doctors...
|

|
Genome
by Magnus Somnium
Jack Thomas is amazed to find himself the head of the company of his dreams, nestled in the high tech backdrop of beautiful Boulder, Colorado. Built with his best friend and partner Frankie, the bio-genetics company has achieved far beyond their wildest dreams. The company is poised to revolutionize the analysis and treatment of cancer and other diseases. The only thing missing is Jack's high school sweetheart, Emily, who was brutally killed, her murderer never found. Until Samantha, a beautiful green-eyed psychic haunted by Emily’s ghost, shows up. Sam takes Jack on a whirl-wind ghost chase to find Emily’s killer. With help of PIP, the sexy Artificial Intelligence assistant, Jack risks his life using the latest genetic technology to delve into a terrifying...
|

|
A Primer of Genome Science, Third Edition
by Gibson (Author), Muse (Author)
Genome science has matured as a discipline to the point where it is now incorporated as a regular part of the genetics curriculum in universities. A Primer of Genome Science, Third Edition bridges the gap between standard genetics textbooks and highly specialized, technical, and advanced treatments of the subdisciplines. It provides an affordable and up-to-date introduction to the field that is suited to advanced undergraduate or early graduate courses. Bioinformatic principles and experimental strategies are explained side-by-side with the experimental methods, establishing a framework that allows teachers to explore topics and the literature at their own pace. The Primer is organized into six chapters dealing with the scope of genomics, genome sequencing, variation and complex traits,...
|

|
THE HUMAN GENOME, Third Edition: A User's Guide
by Julia E. Richards (Author), R. Scott Hawley (Author)
As genetic issues play a growing role in health practice and public policy, new knowledge in this field will continue to have significant implications for individuals and society. Written to communicate sound and modern science in an accessible way for professionals and students with various levels of scientific background, this thoroughly revised edition of The Human Genome contributes to creating a genetically literate research and clinical population. With case studies and introductory vignettes which illustrate a wide range of perspectives on complex topics in genetics and updated material on the latest research on disease-specific topics, this book serves as a valuable resource for students and working professionals alike.* Full-color illustrations enhance and reinforce key concepts...
|

|
Genetics: From Genes to Genomes (Hartwell, Genetics)
by Leland Hartwell (Author), Leroy Hood (Author), Michael Goldberg (Author), Ann Reynolds (Author), Lee Silver (Author)
Genetics: From Genes to Genomes is a cutting-edge, introductory genetics text authored by an unparalleled author team, including Nobel Prize winner, Leland Hartwell. The 4th edition continues to build upon the integration of Mendelian and molecular principles, providing students with the links between the early understanding of genetics and the new molecular discoveries that have changed the way the field of genetics is viewed. Users who purchase Connect Plus receive access to the full online ebook version of the textbook.
|

|
The Genome Generation
by Elizabeth Finkel (Author)
The year 2001 marked the moment when scientists first read the 3 billion letters of DNA that make up the human genome. This breakthrough begged questions such as What have we learned about evolution? How has it changed the way we practice medicine, grow crops, and breed livestock? and Is the genomic revolution an overhyped flop? Answering these and many other queries, this account covers revolutionary genetic developments in areas as diverse as medicine, agriculture, and evolution. From Botswana to Boston and from Australia to Mexico, the contributors to this work reveal what it means to be part of the genome generation.
|