Science Current Events | Science News | Brightsurf.com
 
Email a Friend Send to a friend
Printer Friendly Print Dog genome sequence and analysis published in Nature

Dog genome sequence and analysis published in Nature

December 08, 2005

Analysis unlocks genetic variation among dog breeds; evolutionary conservation with human reveals regulatory controls of key genes

An international research team led by scientists at the Broad Institute of MIT and Harvard announced today the completion of a high-quality genome sequence of the domestic dog, together with a catalog of 2.5 million specific genetic differences across several dog breeds. Published in the December 8 issue of Nature, the dog research sheds light on both the genetic similarities between dogs and humans and the genetic differences between dog breeds. Comparison of the dog and human DNA reveals key secrets about the regulation of the master genes that control embryonic development. Comparison among dogs also reveals the structure of genetic variation among breeds, which can now be used to unlock the basis of physical and behavioral differences, as well the genetic underpinnings of diseases common to domestic dogs and their human companions.




"Of the more than 5,500 mammals living today, dogs are arguably the most remarkable," said senior author Eric Lander, director of the Broad Institute, professor of biology at MIT and systems biology at Harvard Medical School, and a member of the Whitehead Institute for Biomedical Research. "The incredible physical and behavioral diversity of dogs - from Chihuahuas to Great Danes - is encoded in their genomes. It can uniquely help us understand embryonic development, neurobiology, human disease and the basis of evolution."

Similarities to humans

Dogs not only occupy a special place in human hearts, they also sit at a key branch point in the evolutionary tree relative to humans. By tracking evolution's genetic footprints through the dog, human and mouse genomes, the scientists found that humans share more of their ancestral DNA with dogs than with mice, confirming the utility of dog genetics for understanding human disease.

Most importantly, the comparison revealed the regions of the human genome that are most highly preserved across mammals. Roughly 5% of the human genome has been well preserved by evolution over the past 100 million years and must encode important biological functions. The researchers discovered that the most highly conserved of these sequences are not randomly distributed throughout the genome. Instead, they are crowded around just a tiny fraction (about 1%) of the genes that encode crucial regulatory proteins involved in development (such as transcription factors or axon guidance receptors). "The clustering of regulatory sequences is incredibly interesting," said Kerstin Lindblad-Toh, first author of the Nature paper and co-director of the genome sequencing and analysis program at Broad. "It means that a small subset of crucial human genes is under much more elaborate control than we had ever imagined."

Differences between dog breeds

Dogs were domesticated from gray wolves as long as 100,000 years ago, but selective breeding over the past few centuries has made modern dog breeds a testament to biological diversity. Obvious examples include the contrasting body sizes of 6-pound Chihuahuas and 120-pound Great Danes, the hyperactivity of Jack Russell terriers relative to mild-mannered basset hounds, and the herding instincts of Shetland sheepdogs compared with the protective proclivity of dalmatians.

Efforts to create the genetic tools needed to map important genes in dogs have gained momentum over the last 15 years, and already include a partial survey of the poodle genome. More than two years ago, Lindblad-Toh, Lander, and their colleagues embarked on a two-part project to assemble a complete map of the dog genome. First, they acquired high-quality DNA sequence from a female boxer named "Tasha," covering nearly 99% of the dog's genome. Using this information as a genetic 'compass,' they then sampled the genomes of 10 different dog breeds and other related canine species, including the gray wolf and coyote. By comparing these dogs, they pinpointed ~2.5 million individual genetic differences among breeds, called single nucleotide polymorphisms (SNPs), which serve as recognizable signposts that can be used to locate the genetic contributions to physical and behavioral traits, as well as disease.

Finally, the scientists used the SNP map to reconstruct how intense dog breeding has shaped the genome. They discovered that selective breeding carried large genomic regions of several million bases of DNA into breeds, creating 'haplotype blocks' that are ~100 times larger than seen in the human population. "The huge genomic regions should make it much easier to find the genes responsible for differences in body size, behavior and disease," said Lander. "Such studies will need many fewer markers than for human studies. It should be like hitting the side of a barn."

Mapping human disease-related genes in dogs

Breeding programs not only selected for desired traits, they also had the unintended consequence of predisposing many dog breeds to genetic diseases, including heart disease, cancer, blindness, cataracts, epilepsy, hip dysplasia and deafness. With the dog genome sequence and the SNP map, scientists around the world now have the tools to identify these disease genes.

Humans suffer from many of the same illnesses as their four-legged friends and even show similar symptoms, but the genetic underpinnings have proved difficult to trace. "The genetic contributions to many common diseases appear to be easier to uncover in dogs," said Lindblad-Toh. "If so, it is a significant step forward in understanding the roots of genetic disease in both dogs and humans."

For this work, the dog-owner community is an essential collaborator. "We deeply appreciate the generous cooperation of individual dog owners and breeders, breed clubs and veterinary schools in providing blood samples for genetic analysis and disease gene mapping," said Lindblad-Toh. "Without their interest and help we could not be doing this work."

Funding and data access

Sequencing of the dog genome began in June 2003, funded in large part by the National Human Genome Research Institute (NHGRI). The Broad Institute is part of NHGRI's Large-Scale Sequencing Research Network. NHGRI is one of 27 institutes and centers at the National Institutes of Health (NIH), an agency of the Department of Health and Human Services. The NHGRI Division of Extramural Research supports grants for research and for training and career development at sites nationwide. Information about NHGRI, including the dog genome initiative, can be found at: www.genome.gov.

Broad Institute of MIT and Harvard



Related Genome Sequence Current Events and Genome Sequence News Articles Genome Sequence Current Events and Genome Sequence News RSS Genome Sequence Current Events and Genome Sequence News RSS
Scientists at UA, collaborating institutions decode maize genome
Scientists from the University of Arizona led by Arizona Genomics Institute director Rod A. Wing and from collaborating institutions have deciphered the complete genetic code of the maize plant for the first time.

Maize cell wall genes identified, giving boost to biofuel research
Purdue University scientists have helped identify and group the genes thought to be responsible for cell wall development in maize, an effort that expands their ability to discover ways to produce the biomass best suited for biofuels production.

UCSD discovery allows scientists for the first time to experimentally annotate genomes
Over the last 20 years, the sequencing of the human genome, along with related organisms, has represented one of the largest scientific endeavors in the history of mankind.

First draft of the pig: Researchers sequence swine genome
A global collaborative has produced a first draft of the genome of a domesticated pig, an achievement that will lead to insights in agriculture, medicine, conservation and evolution.

Standards for a new genomic era
A team of geneticists at Los Alamos National Laboratory, together with a consortium of international researchers, has recently proposed a set of standards designed to elucidate the quality of publicly available genetic sequencing information.

Establishing standard definitions for genome sequences
In 1996, researchers from major genome sequencing centers around the world convened on the island of Bermuda and defined a finished genome as a gapless sequence with a nucleotide error rate of one or less in 10,000 bases.

Jumping genes, gene loss and genome dark matter
In research published today by Nature, an international team describes the finest map of changes to the structure of human genomes and a resource they have developed for researchers worldwide to look at the role of these changes in human disease.

Draft potato genome based on unique potato variety
The Potato Genome Sequencing Consortium (PGSC), an international team of scientists from industry and academia in 14 countries, has released a draft sequence of the potato genome with the help of a Virginia Tech researcher.

Scientists discover how to send insects off the scent of crops
Biotechnology and Biological Sciences Research Council (BBSRC)-funded research, published this week in Chemical Communication, describes how scientists have discovered molecules that could confuse insects' ability to detect plants by interfering with their sense of smell. This could reduce damage to crops by insect pests and contribute to food security.

New Sequencing Technique Could Boost Pine Beetle Fight, Improve Cancer Research
UBC researchers have helped developed a cheaper, faster way to compile draft genome sequences that could advance the fight against mountain pine beetle (MPB) infestation and improve cancer research.
More Genome Sequence Current Events and Genome Sequence News Articles
Bioinformatics: Sequence and Genome Analysis, Second Edition

Bioinformatics: Sequence and Genome Analysis, Second Edition
by David W. Mount (Author)

As more speciesÂ’ genomes are sequenced, computational analysis of these data has become increasingly important. The second, entirely updated edition of this widely praised textbook provides a comprehensive and critical examination of the computational methods needed for analyzing DNA, RNA, and protein data, as well as genomes. The book has been rewritten to make it more accessible to a wider audience, including advanced undergraduate and graduate students. New features include chapter guides and explanatory information panels and glossary terms. New chapters in this second edition cover statistical analysis of sequence alignments, computer programming for bioinformatics, and data management and mining. Practically oriented problems at the ends of chapters enhance the value of the book...

Computational Genome Analysis: An Introduction (Statistics for Biology and Health)

Computational Genome Analysis: An Introduction (Statistics for Biology and Health)
by Richard C. Deonier (Author), Simon Tavaré (Author), Michael S. Waterman (Author)

Computational Genome Analysis: An Introduction presents the foundations of key problems in computational molecular biology and bioinformatics. It focuses on computational and statistical principles applied to genomes, and introduces the mathematics and statistics that are crucial for understanding these applications. The book is appropriate for a one-semester course for advanced undergraduate or beginning graduate students, and it can also introduce computational biology to computer scientists, mathematicians, or biologists who are extending their interests into this exciting field.

This book features:

Topics organized around biological problems, such as sequence alignment and assembly, DNA signals, analysis of gene expression, and human genetic variation

...

Introduction to Computational Biology: Maps, Sequences and Genomes (Interdisciplinary Statistics)

Introduction to Computational Biology: Maps, Sequences and Genomes (Interdisciplinary Statistics)
by Michael S. Waterman (Author)

Biology is in the midst of a era yielding many significant discoveries and promising many more. Unique to this era is the exponential growth in the size of information-packed databases. Inspired by a pressing need to analyze that data, Introduction to Computational Biology explores a new area of expertise that emerged from this fertile field- the combination of biological and information sciences. This introduction describes the mathematical structure of biological data, especially from sequences and chromosomes. After a brief survey of molecular biology, it studies restriction maps of DNA, rough landmark maps of the underlying sequences, and clones and clone maps. It examines problems associated with reading DNA sequences and comparing sequences to finding common patterns. The author...

Large-scale Genome Sequence Processing

Large-scale Genome Sequence Processing
by Masahiro Kasahara (Author), Shinichi Morishita (Author)

Efficient computer programs have made it possible to elucidate and analyze large-scale genomic sequences. Fundamental tasks, such as the assembly of numerous whole-genome shotgun fragments, the alignment of complementary DNA sequences with a long genome, and the design of gene-specific primers or oligomers, require efficient algorithms and state-of-the-art implementation techniques. This textbook emphasizes basic software implementation techniques for processing large-scale genome sequences and provides executable sample programs.

  Private venture to sequence human genome launched.: An article from: Issues in Science and Technology
by National Academy of Sciences (Publisher)

This digital document is an article from Issues in Science and Technology, published by National Academy of Sciences on September 22, 1998. The length of the article is 497 words. The page length shown above is based on a typical 300-word page. The article is delivered in HTML format and is available in your Amazon.com Digital Locker immediately after purchase. You can view it with any web browser.

From the supplier: The launch of a private venture to sequence the entire human genome has raised concerns in Congress and the scientific community. The Institute for Genomic Research's partnership with Perkin-Elmer promises to sequence the human genome at less cost and in less time than the federally funded Human Genome Project. Scientists, however, argue that the venture will have...

  Introduction to Computational Biology: Maps, Sequences and Genomes
by Michael S. Waterman (Author)



  Perkin-Elmer to Sequence Genome: An article from: Instrument Business Outlook
by Strategic Directions International Inc. (SDI) (Publisher)

This digital document is an article from Instrument Business Outlook, published by Strategic Directions International Inc. (SDI) on May 15, 1998. The length of the article is 548 words. The page length shown above is based on a typical 300-word page. The article is delivered in HTML format and is available in your Amazon.com Digital Locker immediately after purchase. You can view it with any web browser.

Citation Details
Title: Perkin-Elmer to Sequence Genome
Publication: Instrument Business Outlook (Newsletter)
Date: May 15, 1998
Publisher: Strategic Directions International Inc. (SDI)
Volume: 7 Issue: 3 Page: N/A

Article Type: Article

Distributed by Thomson...

  Of dogs and men. (researchers at University of California at Berkeley working to sequence the genome of Canis familiaris; debate over sequencing the human ... An article from: The Hastings Center Report
by Bette-Jane Crigger (Author)

This digital document is an article from The Hastings Center Report, published by Hastings Center on May 1, 1991. The length of the article is 437 words. The page length shown above is based on a typical 300-word page. The article is delivered in HTML format and is available in your Amazon.com Digital Locker immediately after purchase. You can view it with any web browser.

Citation Details
Title: Of dogs and men. (researchers at University of California at Berkeley working to sequence the genome of Canis familiaris; debate over sequencing the human genome using tissue and blood samples from prominent persons, living or deceased)
Author: Bette-Jane Crigger
Publication: The Hastings Center Report (Refereed)
Date: May 1, 1991
Publisher: Hastings Center
Volume: v21 ...

  Genome sequence analysis: a survey.: An article from: Journal of Computer Science
by Hassan Mathkour (Author), Muneer Ahmad (Author)

This digital document is an article from Journal of Computer Science, published by Science Publications on September 1, 2009. The length of the article is 6439 words. The page length shown above is based on a typical 300-word page. The article is delivered in HTML format and is available immediately after purchase. You can view it with any web browser.

From the author: Key words: Genome, multi-lingual, approximate matching, nucleotide base pair, corpora, duplicate sequences

Citation Details
Title: Genome sequence analysis: a survey.
Author: Hassan Mathkour
Publication: Journal of Computer Science (Magazine/Journal)
Date: September 1, 2009
Publisher: Science Publications
Volume: 5 Issue: 9 Page: 651(10)

Distributed by Gale, a part of Cengage...

Advances in Statistical Methods for the Health Sciences: Applications to Cancer and AIDS Studies, Genome Sequence Analysis, and Survival Analysis (Statistics for Industry and Technology)

Advances in Statistical Methods for the Health Sciences: Applications to Cancer and AIDS Studies, Genome Sequence Analysis, and Survival Analysis (Statistics for Industry and Technology)
by Jean-Louis Auget (Editor), N. Balakrishnan (Editor), Mounir Mesbah (Editor), Geert Molenberghs (Editor)

Statistical methods have become increasingly important and now form integral part of research in the health sciences. Many sophisticated methodologies have been developed for specific applications and problems. This self-contained volume, an outgrowth of an "International Conference on Statistical Methods in Health Sciences," covers a wide range of topics pertaining to new statistical methods and novel applications in the health sciences.

The chapters, written by leading experts in their respective fields, are thematically divided into the following areas:

* Prognostic studies and general epidemiology

* Pharmacovigilance

* Quality of life

* Survival analysis

* Clustering

* Safety and efficacy assessment

*...

© 2009 BrightSurf.com