Science Resources
Earth Science
Space Science
Life Science
Fields of Scientific Study
Medical Topics and Fields
Cancer Research
Nanotechnology Articles
RSS Feeds
|
 |
 |
 |
New tool enables powerful data analysis
January 08, 2009
A powerful computing tool that allows scientists to extract features and patterns from enormously large and complex sets of raw data has been developed by scientists at University of California, Davis, and Lawrence Livermore National Laboratory. The tool - a set of problem-solving calculations known as an algorithm - is compact enough to run on computers with as little as two gigabytes of memory. The team that developed this algorithm has already used it to probe a slew of phenomena represented by billions of data points, including analyzing and creating images of flame surfaces; searching for clusters and voids in a virtual universe experiment; and identifying and tracking pockets of fluid in a simulated mixing of two fluids.
"What we've developed is a workable system of handling any data in any dimension," said Attila Gyulassy, who led the five-year development effort while pursuing a PhD in computer science at UC Davis. "We expect this algorithm will become an integral part of a scientist's toolbox to answer questions about data."
A paper describing the new algorithm was published in the November-December issue of IEEE Transactions on Visualization and Computer Graphics.
Computers are widely used to perform simulations of real-world phenomena and to capture results of physical experiments and observations, storing this information as collections of numbers. But as the size of these data sets has burgeoned, hand-in-hand with computer capacity, analysis has grown increasingly difficult.
A mathematical tool to extract and visualize useful features from data sets has existed for nearly 40 years - in theory. Called the Morse-Smale complex, it partitions sets by similarity of features and encodes them into mathematical terms. But working with the Morse-Smale complex is not easy. "It's a powerful language. But a cost of that, is that using it meaningfully for practical applications is very difficult," Gyulassy said.
Gyulassy's algorithm divides data sets into parcels of cells, then analyzes each parcel separately using the Morse-Smale complex. Results of those computations are then merged together. As new parcels are created from merged parcels, they are analyzed and merged yet again. At each step, data that do not need to be stored in memory are discarded, drastically reducing the computing power required to run the calculations.
One of Gyulassy's tests of the algorithm was to use it to analyze and track the formation and movement of pockets of fluid in the simulated mixing of two fluids: one dense, one light. The complexity of this data set is so vast - it consists of more than one billion data points on a three-dimensional grid - it challenges even supercomputers, Gyulassy said. Yet the new algorithm with its streamlining features was able to perform the analysis on a laptop computer with just two gigabytes of memory. Although Gyulassy had to wait nearly 24 hours for the little machine to complete its calculations, at the end of this process he could pull up images in mere seconds to illustrate phenomena he was interested in, such as the branching of fluid pockets in the mixture.
Two main factors are driving the need for analysis of large data sets, said co-author Bernd Hamann: a surge in the use of powerful computers that can produce huge amounts of data, and an upswing in affordability and availability of sensing devices that researchers deploy in the field and lab to collect a profusion of data.
"Our data files are becoming larger and larger, while the scientist has less and less time to understand them," said Hamann, a professor of computer science and associate vice chancellor for research at UC Davis. "But what are the data good for if we don't have the means of applying mathematically sound and computationally efficient computer analysis tools to look for what is captured in them?"
Gyulassy is currently developing software that will allow others to put the algorithm to use. He expects the learning curve to be steep for this open-source product, "but if you just learn the minimal amount about what a Morse-Smale complex is," he said, "it will be pretty intuitive."
University of California - Davis
|
 |
Related Data Analysis Current Events and Data Analysis News Articles Data Analysis Current Events and Data Analysis News RSS Musical sensibility can help shape teaching, research education The underlying similarities between teaching, research and music can be a powerful metaphor for education and qualitative inquiry, according to a University of Illinois professor of education.
New therapy for vasculitis will help patients avoid infertility and cancer Researchers have identified that Rituxan, a drug previously approved for the treatment of non-Hodgkin's B cell lymphoma and rheumatoid arthritis, can treat severe ANCA-associated vasculitis as effectively as cyclophosphamide, the current standard therapy.
Carnegie Mellon researchers save electricity with low-power processors and flash memory Researchers at Carnegie Mellon University and Intel Labs Pittsburgh (ILP) have combined low-power, embedded processors typically used in netbooks with flash memory to create a server architecture that is fast, but far more energy efficient for data-intensive applications than the systems now used by major Internet services.
National report shines light on lupus 50-year treatment drought Today, The Lewin Group, a national health care consulting firm, issued recommendations on ways to overcome the barriers that have obstructed lupus drug development resulting in no new drug approval for this disease in more than 50 years - since the Eisenhower Administration.
Vanderbilt astronomers participate in new search for dark energy The most ambitious attempt yet to trace the history of the universe has seen "first light." The Baryon Oscillation Spectroscopic Survey (BOSS), part of the Sloan Digital Sky Survey III (SDSS-III), took its first astronomical data on the night of Sept. 14-15 at the Sloan Foundation telescope in New Mexico.
CU-Boulder space scientists set for final spacecraft flyby of Mercury NASA's MESSENGER spacecraft, which is toting an $8.7 million University of Colorado at Boulder instrument, will make its third and final flyby of Mercury on Sept. 29 -- a clever gravity-assist maneuver that will steer it into orbit around the rocky planet beginning in March 2011.
Superheavy Element 114 Confirmed: A Stepping Stone to the Island of Stability Scientists at the U.S. Department of Energy's Lawrence Berkeley National Laboratory have been able to confirm the production of the superheavy element 114, ten years after a group in Russia, at the Joint Institute for Nuclear Research in Dubna, first claimed to have made it.
Faster, cheaper way to find disease genes in human genome passes initial test University of Washington (UW) researchers have successfully developed a novel genome-analysis strategy for more rapid, lower cost discovery of possible gene-disease links.
NASA, CU-Boulder airborne expedition chases Arctic sea ice questions A small NASA aircraft completed its first successful science flight Thursday in partnership with the University of Colorado at Boulder as part of an expedition to study the receding Arctic sea ice and improve understanding of its life cycle and the long-term stability of the Arctic ice cover.
Reviews of microbial gene language published in special issue of Trends in Microbiology Ten articles describing how a universal language to describe genes is bringing benefits to the study of the microbial world have been published in a special issue of Trends in Microbiology, co-edited by Virginia Bioinformatics Institute professor Brett Tyler. More Data Analysis Current Events and Data Analysis News Articles
|
 |

|
Head First Data Analysis: A Learner's Guide to Big Numbers, Statistics, and Good Decisions
by Michael Milton (Author), Milton Michael (Author)
Today, interpreting data is a critical decision-making factor for businesses and organizations. If your job requires you to manage and analyze all kinds of data, turn to Head First Data Analysis, where you'll quickly learn how to collect and organize data, sort the distractions from the truth, find meaningful patterns, draw conclusions, predict the future, and present your findings to others. Whether you're a product developer researching the market viability of a new product or service, a marketing manager gauging or predicting the effectiveness of a campaign, a salesperson who needs data to support product presentations, or a lone entrepreneur responsible for all of these data-intensive functions and more, the unique approach in Head First Data Analysis is by far the most...
|

|
Data Analysis and Decision Making with Microsoft Excel, Revised, (with CD-ROM and Decision Tools and Statistic Tools Suite)
by S. Christian Albright (Author), Wayne Winston (Author), Christopher Zappe (Author)
Master data analysis, modeling, and spreadsheet use with DATA ANALYSIS AND DECISION MAKING WITH MICROSOFT EXCEL! With a teach-by-example approach, student-friendly writing style, and complete Excel integration, this quantitative methods text provides you with the tools you need to succeed. Margin notes, boxed-in definitions and formulas in the text, enhanced explanations in the text itself, and stated objectives for the examples found throughout the text make studying easy. Problem sets and cases provide realistic examples that enable you to see the relevance of the material to your future as a business leader. The CD-ROMs packaged with every new book include the following add-ins: the Palisade Decision Tools Suite (@RISK, StatTools, PrecisionTree, TopRank, and RISKOptimizer); and...
|

|
Data Analysis Using SQL and Excel
by Gordon S. Linoff (Author)
Useful business analysis requires you to effectively transform data into actionable information. This book helps you use SQL and Excel to extract business information from relational databases and use that data to define business dimensions, store transactions about customers, produce results, and more. Each chapter explains when and why to perform a particular type of business analysis in order to obtain useful results, how to design and perform the analysis using SQL and Excel, and what the results should look like.
|

|
Data Analysis 2nd
by Victoria Bernhardt (Author)
"Data Analysis for Continuous School Improvement" (First Edition, 1998, Second Edition, 2004) What separates successful schools from those that will not be successful in their reform efforts is the use of one, often neglected, essential element—data. With clear and concrete examples from both elementary and secondary schools, "Data Analysis for Continuous School Improvement" shows what data to gather and how to use data to improve all aspects of schools. "Data Analysis" enables you to find out where you are, where you want to be, and how to get there—sensibly, painlessly, and effectively. Schools are powerful organizations. Every day, across the United States, schools are impacting the lives of millions of children and the future of our very existence. Schools become even...
|

|
An Introduction to Statistical Methods and Data Analysis
by R. Lyman Ott (Author), Micheal T. Longnecker (Author)
Ott and Longnecker's AN INTRODUCTION TO STATISTICAL METHODS AND DATA ANALYSIS, Sixth Edition, provides a broad overview of statistical methods for readers who have little or no prior experience in statistics. The authors teach readers to solve problems encountered in research projects, to make decisions based on data in general settings, and to become critical readers of statistical analyses in research papers and in news reports. The first eleven chapters present material typically covered in a college-level introductory statistics course, as well as interesting case studies and examples. The remaining chapters cover regression modeling and design of experiments.
|

|
Microsoft Excel Data Analysis and Business Modeling (Bpg-Other)
by Wayne L. Winston (Author)
Now you can apply the techniques that business analysts at leading companies use to analyze and transform data into bottom line results. For more than 10 years, well-known consultant and business professor Wayne Winston has been teaching corporate clients and MBA candidates the most effective ways to use Microsoft Excel for data analysis, modeling, and decision making. This practical, business-focused guide delivers the best of Winston’s classroom experience to you in 70+ concise chapters, organized by real-world scenarios. Quickly find and apply exactly the information you need to solve a specific business problem—from asset allocation modeling to estimating exponential growth, forecasting sales, optimizing portfolios, and other critical functions. You also get all the book’s...
|

|
Intelligent Data Analysis
by Michael Berthold (Editor), David J. Hand (Editor)
This monograph is a detailed introductory presentation of the key classes of intelligent data analysis methods. The twelve coherently written chapters by leading experts provide complete coverage of the core issues. The first half of the book is devoted to the discussion of classical statistical issues, ranging from the basic concepts of probability, through general notions of inference, to advanced multivariate and time series methods, as well as a detailed discussion of the increasingly important Bayesian approaches and Support Vector Machines. The following chapters then concentrate on the area of machine learning and artificial intelligence and provide introductions into the topics of rule induction methods, neural networks, fuzzy logic, and stochastic search methods. The book...
|

|
Handbook of Statistical Analysis and Data Mining Applications
by Robert Nisbet (Author), John Elder IV (Author), Gary Miner (Author)
The Handbook of Statistical Analysis and Data Mining Applications is a comprehensive professional reference book that guides business analysts, scientists, engineers and researchers (both academic and industrial) through all stages of data analysis, model building and implementation. The Handbook helps one discern the technical and business problem, understand the strengths and weaknesses of modern data mining algorithms, and employ the right statistical methods for practical application. Use this book to address massive and complex datasets with novel statistical approaches and be able to objectively evaluate analyses and solutions. It has clear, intuitive explanations of the principles and tools for solving problems using modern analytic techniques, and discusses their application to...
|
|
|
Intelligent Data Analysis
by Ios Press
Provides a forum for the examination of issues related to the research & applications of Artificial Intelligence techniques in data analysis across a variety of disciplines.
|

|
Qualitative Data Analysis: An Expanded Sourcebook(2nd Edition)
by Matthew B. Miles (Author), Michael Huberman (Author)
In 1984, the first edition of Qualitative Data Analysis addressed a critical need faced by researchers in all fields of the human sciences - how to draw valid meaning from qualitative data. It provided methods of analysis that were practical, credible and reliable. This groundbreaking book has now been revised to take up where the first edition left off and account for the phenomenal expansion of qualitative inquiry since then. In this second edition, Miles and Huberman bring the art of qualitative data analysis up to date, adding hundreds of new techniques, ideas and references that draw on the experience of the authors and many colleagues in the design, testing and use of qualitative data analysis methods. Each method of data display and analysis is described and illustrated...
|
|