Biologists merge methods, results from different disciplines to find new meaning in old data
January 12, 2010
Durham, NC - A growing number of scientists are merging methods and results from different disciplines to extract new meaning from old data, says a team of researchers in a recent issue of Evolution.
As science becomes increasingly specialized and focused on new data, however, researchers who want to analyze previous findings may have a hard time getting funding and institutional support, the authors say. In a commentary piece in the journal Evolution, the authors argue for removing cultural and technological barriers to this process.
"By putting together pieces of prior research, it is possible to transform how you do science and open the doors to findings that previously were unattainable," said Brian Sidlauskas, a former postdoctoral researcher at the National Evolutionary Synthesis Center and lead author on the article. "But such an approach runs counter to the way science traditionally has been conducted, so pursuing synthetic science is somewhat risky."
"We need to reduce the risk, remove the barriers, and encourage more pursuit of synthesis," said Sidlauskas, now a professor at Oregon State University. "The potential is staggering," he added.
Some of the most important research of the last quarter-century, the authors argue, has resulted from "synthetic science" -an approach which combines concepts, tools, and data from multiple disciplines to produce new insights or discoveries.
They cite the work of J. John Sepkoski Jr., who over a 20-year period compiled a database of more than 37,000 entries tracking the first and last appearance of different organisms in the fossil record. The entries, they write, "cut across taxa, time, and geography to reveal emergent patterns over more than 500 million years of life that could not be extracted from the component data in isolation."
"That database led to previously undetermined knowledge of five separate mass extinctions through time, understanding of how major geologic events can increase or reduce biodiversity, the realization that near-shore environments produce a disproportionately large share of evolutionary novelty, and other findings," Sidlauskas said. "It also spawned a new field of synthetic paleobiology."
Sepkoski's data aggregation is one of four methods of synthesis the authors say can transform science. The others, including examples, are:
* Conceptual synthesis: The emerging discipline of evolutionary medicine is one example of how linking concepts from two distinct fields can yield new ways to approach scientific problems. For example, a recent study linked an increase in asthma rates to immune responses that might originally have helped our ancestors fend off parasites.
* Integrating methods: Integrating approaches and analyses from two distinct fields - such as genetics and evolutionary biology - has led to new ways to use modern DNA sequences. For example, researchers can now look into the past to understand the origin of genomes and reconstruct how their structure has changed over millions of years.
* Re-use of results: The authors also review a pair of landmark studies that - after combining hundreds of previous results - found that climate change alters species' distribution, abundance and morphology. These synthetic studies gathered more than 2,300 citations in just five years and substantially informed the current United States government policy on climate change.
Despite the promise, there are a number of cultural barriers to pursuing this kind of science, the researchers say. For one, it is difficult for young scientists to find appropriate training. In addition, peer review and journal publication tend to emphasize the analysis of new data rather than old, they argue. Funding from state and federal agencies is more frequently directed toward more conventional approaches, not to mention the institutional challenges with job searches, promotion and tenure - all of which are geared toward more traditional science.
The technological barriers also are daunting, but offer tantalizing potential, Sidlauskas said.
"When you're looking to synthesize data from several hundred individual studies, data formatting, storage, and accessibility become huge issues," he said. "There has been a growing movement by funding agencies and journals to permanently archive all raw data and materials in some kind of standardized format so they are not lost over time and can be used by researchers of the future."
"It's kind of an open-source approach to science," he added. "Data archives may require some kind of proprietary protection for a few months or years, but after a certain amount of time, they should become public domain. Only by saving the data that underlie today's science will we allow future scientists to use those data in ways that may far exceed what the original researchers envisioned."
National Evolutionary Synthesis Center (NESCent)
Related Science Data Current Events and Science Data News ArticlesTo teach scientific reproducibility, start young
The ability to duplicate an experiment and its results is a central tenet of the scientific method, but recent research has shown an alarming number of peer-reviewed papers are irreproducible.NASA's IRIS Spots Its Largest Solar Flare
On Jan. 28, 2014, NASA's Interface Region Imaging Spectrograph, or IRIS, witnessed its strongest solar flare since it launched in the summer of 2013. Solar flares are bursts of x-rays and light that stream out into space, but scientists don't yet know the fine details of what sets them off.World temperature records available via Google Earth
Climate researchers at the University of East Anglia have made the world's temperature records available via Google Earth.Mission Managers Hail Successful MAVEN Launch
NASA's Mars Atmosphere and Volatile Evolution (MAVEN) mission began with a smooth countdown and flawless launch from Cape Canaveral Air Force Station's Space Launch Complex 41.NASA's IRIS Telescope Offers First Glimpse of Sun's Mysterious Atmosphere
The moment when a telescope first opens its doors represents the culmination of years of work and planning -- while simultaneously laying the groundwork for a wealth of research and answers yet to come. It is a moment of excitement and perhaps even a little uncertainty.First global atlas of marine plankton reveals remarkable underwater world
Under the microscope, they look like they could be from another planet, but these microscopic organisms inhabit the depths of our oceans in nearly infinite numbers.First atlas on oceanic plankton
In an international collaborative project, scientists have recorded the times, places and concentrations of oceanic plankton occurrences worldwide. Their data has been collected in a global atlas that covers organisms from bacteria to krill.NASA's IRIS Spacecraft Is Fully Integrated
NASA's next Small Explorer (SMEX) mission to study the little-understood lower levels of the sun's atmosphere has been fully integrated and final testing is underway.
Novel approach to track migration of arctic-breeding avian species
Animals move around the globe in billions, sometimes - like the snow bunting - one of the iconic Arctic-breeding species, covering huge distances and enduring the most extreme frigid weather conditions.Experiment Finds Ulcer Bug's Achilles' Heel
Experiments at the U.S. Department of Energy's (DOE) SLAC National Accelerator Laboratory have revealed a potential new way to attack common stomach bacteria that cause ulcers and significantly increase the odds of developing stomach cancer.
More Science Data Current Events and Science Data News Articles
Data Science for Business: What you need to know about data mining and data-analytic thinking|
by Foster Provost (Author), Tom Fawcett (Author)
Written by renowned data science experts Foster Provost and Tom Fawcett, Data Science for Business introduces the fundamental principles of data science, and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect. This guide also helps you understand the many data-mining techniques in use today. Based on an MBA course Provost has taught at New York University over the past ten years, Data Science for Business provides examples of real-world business problems to illustrate these principles. You’ll not only learn how to improve communication between business stakeholders and data scientists, but also how participate intelligently in your company’s data science projects. You’ll also discover how to think...
Data Smart: Using Data Science to Transform Information into Insight|
by John W. Foreman (Author)
Data Science gets thrown around in the press like it's magic. Major retailers are predicting everything from when their customers are pregnant to when they want a new pair of Chuck Taylors. It's a brave new world where seemingly meaningless data can be transformed into valuable insight to drive smart business decisions.
But how does one exactly do data science? Do you have to hire one of these priests of the dark arts, the "data scientist," to extract this gold from your data? Nope.
Data science is little more than using straight-forward steps to process raw data into actionable insight. And in Data Smart, author and data scientist John Foreman will show you how that's done within the familiar environment of a spreadsheet.
What Is Data Science?|
by O'Reilly Media
We've all heard it: according to Hal Varian, statistics is the next sexy job. Five years ago, in What is Web 2.0, Tim O'Reilly said that "data is the next Intel Inside." But what does that statement mean? Why do we suddenly care about statistics and about data? This report examines the many sides of data science -- the technologies, the companies and the unique skill sets.
The web is full of "data-driven apps." Almost any e-commerce application is a data-driven application. There's a database behind a web front end, and middleware that talks to a number of other databases and data services (credit card processing companies, banks, and so on). But merely using data isn't really what we mean by "data science." A data application acquires its value from the data itself, and creates more...
Doing Data Science: Straight Talk from the Frontline|
by Cathy O'Neil (Author), Rachel Schutt (Author)
Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that’s so clouded in hype? This insightful book, based on Columbia University’s Introduction to Data Science class, tells you what you need to know. In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you’re familiar with linear algebra, probability, and statistics, and have programming experience, this book is an ideal introduction to data science. Topics include: Statistical inference, exploratory data analysis,...
How Data Science Is Transforming Health Care|
by O'Reilly Media
In the early days of the 20th century, department store magnate John
Wanamaker famously said, "I know that half of my advertising doesn't
work. The problem is that I don't know which half." That remained
basically true until Google transformed advertising with AdSense based
on new uses of data and analysis. The same might be said about health
care and it's poised to go through a similar transformation as new
tools, techniques, and data sources come on line. Soon we'll make
policy and resource decisions based on much better understanding of
what leads to the best outcomes, and we'll make medical decisions
based on a patient's specific biology. The result will be better
health at less cost.
This paper explores how data...
Building Data Science Teams|
As data science evolves to become a business necessity, the importance of assembling a strong and innovative data teams grows. In this in-depth report, data scientist DJ Patil explains the skills, perspectives, tools and processes that position data science teams for success.
What it means to be "data driven."
The unique roles of data scientists.
The four essential qualities of data scientists.
Patil's first-hand experience building the LinkedIn data science team.
Agile Data Science: Building Data Analytics Applications with Hadoop|
by Russell Jurney (Author)
Mining big data requires a deep investment in people and time. How can you be sure you’re building the right models? With this hands-on book, you’ll learn a flexible toolset and methodology for building effective analytics applications with Hadoop. Using lightweight tools such as Python, Apache Pig, and the D3.js library, your team will create an agile environment for exploring data, starting with an example application to mine your own email inboxes. You’ll learn an iterative approach that enables you to quickly change the kind of analysis you’re doing, depending on what the data is telling you. All example code in this book is available as working Heroku apps. Create analytics applications by using the agile big data development methodology Build value from your data in a...
Big Data: A Revolution That Will Transform How We Live, Work, and Think|
by Viktor Mayer-Schönberger (Author), Kenneth Cukier (Author)
Financial Times Business Book of the Year Finalist
“Illuminating and very timely . . . a fascinating — and sometimes alarming — survey of big data’s growing effect on just about everything: business, government, science and medicine, privacy, and even on the way we think.”
—New York Times
It seems like “big data” is in the news every day, as we read the latest examples of how powerful algorithms are teasing out the hidden connections between seemingly unrelated things. Whether it is used by the NSA to fight terrorism or by online retailers to predict customers’ buying patterns, big data is a revolution occurring around us, in the process of forever changing economics, science, culture, and the very way we think. But it also poses new threats, from the...
Data Science For Dummies (For Dummies (Computer/Tech))|
by Carl Anderson (Author)
Discover how data science can help you gain in-depth insight into your business – the easy way!Jobs in data science abound, but few people have the data science skills needed to fill these increasingly important roles in organizations. Data Science For Dummies is the perfect starting point for IT professionals and students interested in making sense of their organization’s massive data sets and applying their findings to real-world business scenarios. From uncovering rich data sources to managing large amounts of data within hardware and software limitations, ensuring consistency in reporting, merging various data sources, and beyond, you’ll develop the know-how you need to effectively interpret data and tell a story that can be understood by anyone in your organization.Provides a...
XML and Web Technologies for Data Sciences with R (Use R!)|
by Deborah Nolan (Author), Duncan Temple Lang (Author)