Nav: Home

Largest Populus SNP dataset holds promise for biofuels, materials, metabolites

January 17, 2017

OAK RIDGE, Tenn., Jan. 17, 2017--Researchers at the Department of Energy's Oak Ridge National Laboratory (ORNL) have released the largest-ever single nucleotide polymorphism (SNP) dataset of genetic variations in poplar trees, information useful to plant scientists as well as researchers in the fields of biofuels, materials science, and secondary plant metabolism.

For nearly 10 years, researchers with DOE's BioEnergy Science Center (BESC), a DOE Bioenergy Research Center led by ORNL, have studied the genome of Populus--a fast-growing perennial tree recognized for its economic potential in biofuels production. Today, they released the Genome-Wide Association Study (GWAS) dataset that comprises more than 28 million single nucleotide polymorphisms, or SNPs, derived from approximately 900 resequenced poplar genotypes. Each SNP represents a variation in a single DNA nucleotide, or building block, and can act as a biological marker, helping scientists locate genes associated with certain characteristics, conditions, or diseases.

The data "gives us unprecedented statistical power to link DNA changes to phenotypes [physical traits]," said Gerald Tuskan, a corporate fellow and leader of the Plant Systems Biology group in ORNL's Biosciences Division. Tuskan will present the GWAS data today at the Plant & Animal Genome Conference in San Diego. The results of this analysis have been used to seek genetic control of cell-wall recalcitrance--a natural characteristic of plant cell walls that prevents the release of sugars under microbial conversion and inhibits biofuels production.

BESC scientists are also using the dataset to identify the molecular mechanisms controlling deposition of lignin in plant structures. Lignin, the polymer that strengthens plant cell walls, acts as a barrier to accessing cellulose and thereby preventing cellulose breakdown into simple sugars for fermentation.

With the new poplar GWAS dataset, "we can identify the genes and genetic variants [i.e. alleles] that move carbon through the lignin pathway, and then take that knowledge and, through genomic selection, develop plant materials that are tailored to work with microbes to yield the targeted product," Tuskan said. Such products include modified lignin customized for chemicals, polymers and materials. Although the dataset's most immediate applications are in plant science, ORNL researchers plan to use the GWAS data to inform bioscience work in areas such as cleaner, sustainable transportation fuels, carbon fiber for lightweight vehicles and alternatives to conventional plastics and building insulation materials.

Even the medical field could benefit from the work: ORNL researchers, for instance, have used the poplar GWAS to identify the genes that control callus formation, or cells covering a plant wound. The work has implications for cancer research.

"The genes related to callus formation are analogous to many genes involved in the formation of tumors in humans," Tuskan said. "This discovery, and the associated gene expression network surrounding such genes, could inform work related to the Cancer Moonshot," he added, referring to a federal initiative designed to speed progress in cancer research.

Tuskan, who holds a joint appointment at DOE's Joint Genome Institute in California, found inspiration for the work in the sequencing of the human genome about a decade ago. The researchers recognized how those types of studies could be used to address DOE challenges in carbon sequestration, bioprocessing and materials science.

Tuskan emphasized the importance of technological advances to the work. Sequencing capacity and computational abilities "made the work possible," he said. "We are working in the big data realm, and fortunately at the national lab we have the platforms and infrastructure to do this type of analysis."

As part of their work, the researchers used the computational resources available at ORNL through its Compute and Data Environment for Science (CADES) program within ORNL's Computing and Computational Sciences Directorate, as well as the Titan supercomputer at the Oak Ridge Leadership Computing Facility, a DOE Office of Science User Facility.

The research also involves monitoring and cataloging phenotypes of poplar trees in regions from southern British Columbia to central California. "None of the sophisticated genomics and computational science would mean anything without the fieldwork. The genetics, the computational science, and measuring and cataloging phenotypes are the three legs of the platform we stand on at BESC," Tuskan said.

The researchers plan to expand the existing dataset and collaborate with other scientific groups to collect and analyze additional phenotypes.

Other ORNL scientists involved in the project include Wellington Muchero, Jay Chen, Daniel Jacobson, and Tim Tschaplinski. Contributing scientists at the Joint Genome Institute, which performed all the genetic sequencing, were Dan Rokhsar, Wendy Schackwitz, and Jeremy Schmutz. Steve DiFazio at West Virginia University's Department of Biology was also involved in the project. Mark Davis and others at DOE's National Renewable Energy Laboratory made contributions in characterizing the biochemistry of plant cell walls.

The dataset is available at:
The project is supported by BESC, a multi-institutional (18 partners) research organization performing basic and applied science dedicated to improving yields of biofuels by focusing on the fundamental understanding and elimination of biomass recalcitrance. This multidisciplinary research encompasses the biological, chemical, physical, and computational sciences, as well as mathematics and engineering. BESC is one of three DOE Bioenergy Research Centers supported by DOE's Office of Science.

UT-Battelle manages ORNL for DOE's Office of Science. The Office of Science is the single largest supporter of basic research in the physical sciences in the United States, and is working to address some of the most pressing challenges of our time. For more information, please visit

DOE/Oak Ridge National Laboratory

Related Genome Articles:

Genome evolution goes digital
Dr. Alan Herbert from InsideOutBio describes ground-breaking research in a paper published online by Royal Society Open Science.
Breakthrough in genome visualization
Kadir Dede and Dr. Enno Ohlebusch at Ulm University in Germany have devised a method for constructing pan-genome subgraphs at different granularities without having to wait hours and days on end for the software to process the entire genome.
Sturgeon genome sequenced
Sturgeons lived on earth already 300 million years ago and yet their external appearance seems to have undergone very little change.
A sea monster's genome
The giant squid is an elusive giant, but its secrets are about to be revealed.
Deciphering the walnut genome
New research could provide a major boost to the state's growing $1.6 billion walnut industry by making it easier to breed walnut trees better equipped to combat the soil-borne pathogens that now plague many of California's 4,800 growers.
Illuminating the genome
Development of a new molecular visualisation method, RNA-guided endonuclease -- in situ labelling (RGEN-ISL) for the CRISPR/Cas9-mediated labelling of genomic sequences in nuclei and chromosomes.
A genome under influence
References form the basis of our comprehension of the world: they enable us to measure the height of our children or the efficiency of a drug.
How a virus destabilizes the genome
New insights into how Kaposi's sarcoma-associated herpesvirus (KSHV) induces genome instability and promotes cell proliferation could lead to the development of novel antiviral therapies for KSHV-associated cancers, according to a study published Sept.
Better genome editing
Reich Group researchers develop a more efficient and precise method of in-cell genome editing.
Unlocking the genome
A team led by Prof. Stein Aerts (VIB-KU Leuven) uncovers how access to relevant DNA regions is orchestrated in epithelial cells.
More Genome News and Genome Current Events

Trending Science News

Current Coronavirus (COVID-19) News

Top Science Podcasts

We have hand picked the top science podcasts of 2020.
Now Playing: TED Radio Hour

Listen Again: Meditations on Loneliness
Original broadcast date: April 24, 2020. We're a social species now living in isolation. But loneliness was a problem well before this era of social distancing. This hour, TED speakers explore how we can live and make peace with loneliness. Guests on the show include author and illustrator Jonny Sun, psychologist Susan Pinker, architect Grace Kim, and writer Suleika Jaouad.
Now Playing: Science for the People

#565 The Great Wide Indoors
We're all spending a bit more time indoors this summer than we probably figured. But did you ever stop to think about why the places we live and work as designed the way they are? And how they could be designed better? We're talking with Emily Anthes about her new book "The Great Indoors: The Surprising Science of how Buildings Shape our Behavior, Health and Happiness".
Now Playing: Radiolab

The Third. A TED Talk.
Jad gives a TED talk about his life as a journalist and how Radiolab has evolved over the years. Here's how TED described it:How do you end a story? Host of Radiolab Jad Abumrad tells how his search for an answer led him home to the mountains of Tennessee, where he met an unexpected teacher: Dolly Parton.Jad Nicholas Abumrad is a Lebanese-American radio host, composer and producer. He is the founder of the syndicated public radio program Radiolab, which is broadcast on over 600 radio stations nationwide and is downloaded more than 120 million times a year as a podcast. He also created More Perfect, a podcast that tells the stories behind the Supreme Court's most famous decisions. And most recently, Dolly Parton's America, a nine-episode podcast exploring the life and times of the iconic country music star. Abumrad has received three Peabody Awards and was named a MacArthur Fellow in 2011.