Genome 10K Project announces first 101 species for genome sequencing

November 30, 2010

SANTA CRUZ, CA--The Genome 10K Community of Scientists and BGI (formerly the Beijing Genomics Institute) of Shenzhen, China, have announced a plan to sequence the genomes of 101 vertebrate species within the next two years, the first of an eventual 10,000 species to be sequenced by the Genome 10K Project.

The Genome 10K Project (G10K) is an international effort to gather specimens of thousands of animals from zoos, museums, and university collections throughout the world, and then sequence the genome of each species to reveal its complete genetic heritage. The project aims to assemble a genomic zoo--a collection of DNA sequences for 10,000 vertebrate species (approximately one for every vertebrate genus) by 2015.

Genome 10K cofounder David Haussler, a professor of biomolecular engineering in the Baskin School of Engineering at UC Santa Cruz, said the first 101 species to be sequenced were selected from the Genome 10K database, which catalogs specimens suitable for sequencing from more than 16,000 vertebrate species, both living and recently extinct. (A list of the selected species is available for download.)

"These genomes will provide a new and exciting window into vertebrate biology," Haussler said. "The experiments of evolution represented in them carry critical information about how animals are built and how they function at the molecular level."

The planned sequencing will take place within the next 24 months at the BGI facilities in China. BGI has recently amassed the largest number of advanced DNA sequencing machines under one roof of any institute in the world and is committed to large-scale application of genome sequencing in science and medicine.

"With this joint pledge to accomplish whole-genome sequence assembly and annotation of 101 new vertebrate species--the first 1 percent of the Genome 10K target--we move much closer to providing students of biology the unabridged DNA code book for the wonder of living vertebrate forms," said project cofounder Stephen J. O'Brien, chief of the Laboratory of Genomic Diversity at the National Cancer Institute.

The sequencing project builds on a BGI project launched in January 2010, called the 1,000 Plant and Animal Reference Genome Project. According to BGI executive director Jun Wang, their project harnesses the leading sequencing platform, advanced technologies, and powerful bioinformatics analytical capability.

"The collaboration with G10K will not only significantly facilitate the progression of BGI's genomics research, but also benefit the development of global genomics researches and their relevant industrial application," Wang said.

The 101 vertebrate species chosen for this phase of the project were selected by a consortium of BGI scientists, their collaborators, sponsors, and the chairs of the Genome 10K taxonomy groups for mammals, birds, amphibians, reptiles, and fish. The species were chosen to capture phylogenetic diversity across the vertebrate radiations while avoiding overlap with public-sponsored whole-genome sequencing projects. Other considerations included the biological features that make a species interesting to humankind, the science community around each species, and the availability of vouchered specimens.

According to Genome 10K cofounder Oliver Ryder, director of genetics at the San Diego Zoo Institute for Conservation Research, the collection and banking of samples in international centers, such as San Diego's Frozen Zoo, sets the stage for rapid progress toward the sequencing goals in collaboration with major sequencing centers, such as BGI. "The data produced provide a major contribution to understanding aspects of the biology of many endangered species, thereby assisting in conservation efforts on their behalf," Ryder said.

In addition to the 101 species chosen for G10K, there are 120 vertebrate species already being sequenced in public-sector genome projects (download list here). The completion of 221 whole-genome sequences within a few years is a first step toward the G10K goal of obtaining high-quality whole-genome sequences, assembled and annotated, for 10,000 vertebrate species. To meet the quality goals of the G10K project, every species must have a reference genome sequence with chromosome-scale contiguity suitable for display on a genome browser and accurate enough for genome-based research.

Selection of future species for sequencing in the Genome 10K project will involve open-science-based input and collaboration, Haussler said. Haussler's team at UCSC's Center for Biomolecular Science and Engineering maintains the Genome 10K database and the UCSC Genome Browser.

The Genome 10K Project will hold a workshop in Santa Cruz, California, from March 14 to 18, 2011. This workshop will accomplish two goals: First, it will continue the mission to gather biological specimens of vertebrate species and describe them thoroughly on a public database. Second, it will gather genome assembly experts to develop a robust solution to the challenge of creating useful assembly with short reads and combined-scaffold assembly approaches.
-end-
The 2011 Genome 10K meetings are supported by the California Institute for Quantitative Biosciences and the American Genetic Association.

University of California - Santa Cruz

Related Genome Articles from Brightsurf:

Genome evolution goes digital
Dr. Alan Herbert from InsideOutBio describes ground-breaking research in a paper published online by Royal Society Open Science.

Breakthrough in genome visualization
Kadir Dede and Dr. Enno Ohlebusch at Ulm University in Germany have devised a method for constructing pan-genome subgraphs at different granularities without having to wait hours and days on end for the software to process the entire genome.

Sturgeon genome sequenced
Sturgeons lived on earth already 300 million years ago and yet their external appearance seems to have undergone very little change.

A sea monster's genome
The giant squid is an elusive giant, but its secrets are about to be revealed.

Deciphering the walnut genome
New research could provide a major boost to the state's growing $1.6 billion walnut industry by making it easier to breed walnut trees better equipped to combat the soil-borne pathogens that now plague many of California's 4,800 growers.

Illuminating the genome
Development of a new molecular visualisation method, RNA-guided endonuclease -- in situ labelling (RGEN-ISL) for the CRISPR/Cas9-mediated labelling of genomic sequences in nuclei and chromosomes.

A genome under influence
References form the basis of our comprehension of the world: they enable us to measure the height of our children or the efficiency of a drug.

How a virus destabilizes the genome
New insights into how Kaposi's sarcoma-associated herpesvirus (KSHV) induces genome instability and promotes cell proliferation could lead to the development of novel antiviral therapies for KSHV-associated cancers, according to a study published Sept.

Better genome editing
Reich Group researchers develop a more efficient and precise method of in-cell genome editing.

Unlocking the genome
A team led by Prof. Stein Aerts (VIB-KU Leuven) uncovers how access to relevant DNA regions is orchestrated in epithelial cells.

Read More: Genome News and Genome Current Events
Brightsurf.com is a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for sites to earn advertising fees by advertising and linking to Amazon.com.