Science Current Events | Science News | Brightsurf.com
 

Biologists merge methods, results from different disciplines to find new meaning in old data

January 12, 2010
Durham, NC - A growing number of scientists are merging methods and results from different disciplines to extract new meaning from old data, says a team of researchers in a recent issue of Evolution.

As science becomes increasingly specialized and focused on new data, however, researchers who want to analyze previous findings may have a hard time getting funding and institutional support, the authors say. In a commentary piece in the journal Evolution, the authors argue for removing cultural and technological barriers to this process.

"By putting together pieces of prior research, it is possible to transform how you do science and open the doors to findings that previously were unattainable," said Brian Sidlauskas, a former postdoctoral researcher at the National Evolutionary Synthesis Center and lead author on the article. "But such an approach runs counter to the way science traditionally has been conducted, so pursuing synthetic science is somewhat risky."

"We need to reduce the risk, remove the barriers, and encourage more pursuit of synthesis," said Sidlauskas, now a professor at Oregon State University. "The potential is staggering," he added.

Some of the most important research of the last quarter-century, the authors argue, has resulted from "synthetic science" - an approach that combines concepts, tools, and data from multiple disciplines to produce new insights or discoveries.

They cite the work of J. John Sepkoski Jr., who over a 20-year period compiled a database of more than 37,000 entries tracking the first and last appearance of different organisms in the fossil record. The entries, they write, "cut across taxa, time, and geography to reveal emergent patterns over more than 500 million years of life that could not be extracted from the component data in isolation."

"That database led to previously undetermined knowledge of five separate mass extinctions through time, understanding of how major geologic events can increase or reduce biodiversity, the realization that near-shore environments produce a disproportionately large share of evolutionary novelty, and other findings," Sidlauskas said. "It also spawned a new field of synthetic paleobiology."

Sepkoski's data aggregation is one of four methods of synthesis the authors say can transform science. The others, including examples, are:

* Conceptual synthesis: The emerging discipline of evolutionary medicine shows how linking concepts from two distinct fields can yield new ways to approach scientific problems. For example, a recent study linked an increase in asthma rates to immune responses that might originally have helped our ancestors fend off parasites.

* Integrating methods: Combining approaches and analyses from two distinct fields - such as genetics and evolutionary biology - has led to new ways to use modern DNA sequences. For example, researchers can now look into the past to understand the origin of genomes and reconstruct how their structure has changed over millions of years.

* Re-use of results: The authors also review a pair of landmark studies that - after combining hundreds of previous results - found that climate change alters species' distribution, abundance and morphology. These synthetic studies gathered more than 2,300 citations in just five years and substantially informed the current United States government policy on climate change.

Despite the promise, there are a number of cultural barriers to pursuing this kind of science, the researchers say. For one, it is difficult for young scientists to find appropriate training. In addition, peer review and journal publication tend to emphasize the analysis of new data rather than old, they argue. Funding from state and federal agencies is more often directed toward conventional approaches. Job searches, promotion and tenure pose institutional challenges as well, since all are geared toward more traditional science.

The technological barriers are also daunting, but addressing them offers tantalizing potential, Sidlauskas said.

"When you're looking to synthesize data from several hundred individual studies, data formatting, storage, and accessibility become huge issues," he said. "There has been a growing movement by funding agencies and journals to permanently archive all raw data and materials in some kind of standardized format so they are not lost over time and can be used by researchers of the future."

"It's kind of an open-source approach to science," he added. "Data archives may require some kind of proprietary protection for a few months or years, but after a certain amount of time, they should become public domain. Only by saving the data that underlie today's science will we allow future scientists to use those data in ways that may far exceed what the original researchers envisioned."

National Evolutionary Synthesis Center (NESCent)




© 2014 BrightSurf.com