Science Current Events | Science News | Brightsurf.com
 

Biologists merge methods, results from different disciplines to find new meaning in old data

January 12, 2010

Durham, NC - A growing number of scientists are merging methods and results from different disciplines to extract new meaning from old data, says a team of researchers in a recent issue of Evolution.

As science becomes increasingly specialized and focused on new data, however, researchers who want to analyze previous findings may have a hard time getting funding and institutional support, the authors say. Their commentary argues for removing the cultural and technological barriers to this kind of synthesis.

"By putting together pieces of prior research, it is possible to transform how you do science and open the doors to findings that previously were unattainable," said Brian Sidlauskas, a former postdoctoral researcher at the National Evolutionary Synthesis Center and lead author on the article. "But such an approach runs counter to the way science traditionally has been conducted, so pursuing synthetic science is somewhat risky."

"We need to reduce the risk, remove the barriers, and encourage more pursuit of synthesis," said Sidlauskas, now a professor at Oregon State University. "The potential is staggering," he added.

Some of the most important research of the last quarter-century, the authors argue, has resulted from "synthetic science," an approach that combines concepts, tools, and data from multiple disciplines to produce new insights or discoveries.

They cite the work of J. John Sepkoski Jr., who over a 20-year period compiled a database of more than 37,000 entries tracking the first and last appearance of different organisms in the fossil record. The entries, they write, "cut across taxa, time, and geography to reveal emergent patterns over more than 500 million years of life that could not be extracted from the component data in isolation."

"That database led to previously undetermined knowledge of five separate mass extinctions through time, understanding of how major geologic events can increase or reduce biodiversity, the realization that near-shore environments produce a disproportionately large share of evolutionary novelty, and other findings," Sidlauskas said. "It also spawned a new field of synthetic paleobiology."

Sepkoski's data aggregation is one of four methods of synthesis the authors say can transform science. The other three, with examples, are:

* Conceptual synthesis: The emerging discipline of evolutionary medicine is one example of how linking concepts from two distinct fields can yield new ways to approach scientific problems. For example, a recent study linked an increase in asthma rates to immune responses that might originally have helped our ancestors fend off parasites.

* Integrating methods: Integrating approaches and analyses from two distinct fields - such as genetics and evolutionary biology - has led to new ways to use modern DNA sequences. For example, researchers can now look into the past to understand the origin of genomes and reconstruct how their structure has changed over millions of years.

* Re-use of results: The authors also review a pair of landmark studies that - after combining hundreds of previous results - found that climate change alters species' distribution, abundance and morphology. These synthetic studies gathered more than 2,300 citations in just five years and substantially informed current United States government policy on climate change.

Despite the promise, there are a number of cultural barriers to pursuing this kind of science, the researchers say. For one, it is difficult for young scientists to find appropriate training. In addition, peer review and journal publication tend to emphasize the analysis of new data rather than old, they argue. Funding from state and federal agencies more often goes to conventional approaches, and job searches, promotion and tenure are all geared toward more traditional science.

The technological barriers are also daunting, but overcoming them offers tantalizing potential, Sidlauskas said.

"When you're looking to synthesize data from several hundred individual studies, data formatting, storage, and accessibility become huge issues," he said. "There has been a growing movement by funding agencies and journals to permanently archive all raw data and materials in some kind of standardized format so they are not lost over time and can be used by researchers of the future."

"It's kind of an open-source approach to science," he added. "Data archives may require some kind of proprietary protection for a few months or years, but after a certain amount of time, they should become public domain. Only by saving the data that underlie today's science will we allow future scientists to use those data in ways that may far exceed what the original researchers envisioned."

National Evolutionary Synthesis Center (NESCent)



