Geospatial knowledge-based verification and improvement of GlobeLand30

October 12, 2016

Global land cover (GLC) data with fine spatial resolution and high quality are essential for global environmental change research, earth system modeling, resource management and sustainable development planning. Assuring product quality has been one of the major challenges for all operational GLC mapping projects.

Although automated classification of remotely sensed images has advanced significantly over the last twenty years, many large-area land-cover mapping projects still cannot deliver high-quality data with a single automated routine. In particular, automated classification may introduce significant errors when applied to 30-m land-cover mapping at a global scale. Post-classification verification, in which potential errors in the preliminary automated results are identified and corrected, has therefore become a critical step for improving the quality of land-cover mapping results. Still, efficient tools for identifying and removing classification errors using geospatial knowledge are lacking.

In a recent study, a geospatial knowledge-based verification and improvement approach was developed and used to assure the data quality of GlobeLand30. It consists of a set of geospatial knowledge-based verification rules and a group of web-based supporting tools.

Natural conditions, human activities and ecological environments affect the geospatial distribution and temporal transformation of land cover. Geospatial knowledge about land cover and its change is summarized under three aspects: natural, cultural and temporal constraints. Verification rules are formulated to represent this knowledge, using the so-called production-rule method or a decision-tree approach.
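
To make the production-rule idea concrete, here is a minimal Python sketch of IF-THEN verification rules applied to a single pixel. It is an illustration only and not the rule base used in the study: the `Pixel` record, the `Rule` structure, both example rules and the 25-degree slope threshold are hypothetical assumptions chosen for readability.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Pixel:
    """Hypothetical record joining classification results with ancillary data."""
    label_2000: str     # land-cover class in the earlier epoch
    label_2010: str     # land-cover class in the later epoch
    elevation_m: float  # e.g. taken from an ancillary elevation model
    slope_deg: float


@dataclass(frozen=True)
class Rule:
    """One production rule: IF condition(pixel) THEN flag the pixel."""
    name: str
    aspect: str                         # "natural", "cultural" or "temporal"
    condition: Callable[[Pixel], bool]  # IF part: True when the pixel is suspicious
    message: str                        # THEN part: note for the quality inspector


# Illustrative rules only; the thresholds and class pairs are assumptions.
RULES: List[Rule] = [
    Rule(
        name="steep_cropland",
        aspect="natural",
        condition=lambda p: p.label_2010 == "cultivated land" and p.slope_deg > 25.0,
        message="cultivated land on a very steep slope",
    ),
    Rule(
        name="urban_reversion",
        aspect="temporal",
        condition=lambda p: (p.label_2000 == "artificial surfaces"
                             and p.label_2010 == "forest"),
        message="artificial surface reverting to forest within a decade",
    ),
]


def verify(pixel: Pixel, rules: List[Rule] = RULES) -> List[str]:
    """Return one flag per rule whose IF part fires for this pixel."""
    return [f"{r.aspect}/{r.name}: {r.message}" for r in rules if r.condition(pixel)]


if __name__ == "__main__":
    suspicious = Pixel("forest", "cultivated land", elevation_m=1200.0, slope_deg=32.0)
    for flag in verify(suspicious):
        print(flag)
```

A flag produced this way marks a potential error for a human inspector to check, rather than an automatic correction, which matches the verification role the rules play in the approach described here.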

The web system integrates heterogeneous and dispersed data resources (including primary Landsat-like images, various ancillary datasets and preliminary classification results) with external web services (such as Google Earth and Map World) (Figure 1). It provides a number of interactive tools to facilitate data sharing and manipulation, such as geo-browsing (zoom in/out and pan), synchronized visualization (maps in two split windows for comparison), annotation (attaching samples, papers, photos, etc.) and publication (publishing data services).

The verification and improvement of GlobeLand30 is designed as a collaborative process in which project managers, quality inspectors and data operators work together (Figure 2). With the support of the web system, potential classification errors are detected and corrected collaboratively. First, the project managers integrate the ancillary data in the web-based system, allocate the verification tasks and supervise the whole verification process. Second, quality inspectors verify the intermediate classification results, using knowledge-based rules and ancillary data to discover potentially misclassified regions where spatial or temporal inconsistency may occur. They may use high-resolution images from integrated external services to identify and annotate the classification errors. Finally, these annotations are published and sent to data operators for further modification.

The implementation of this knowledge-based approach has greatly improved the data quality of GlobeLand30 by identifying and removing classification errors. According to a third-party assessment, the overall accuracy of GlobeLand30-2010 reached 83.50% with a kappa coefficient of 0.78. In Italy, the overall accuracy of GlobeLand30 is better than 80%, and the accuracy of its water bodies in Thessaly, Greece is 91.9%. Together, these results indicate that the geospatial knowledge-based verification and improvement approach is feasible and reliable. It can also be applied to other large-scale land-cover mapping.
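
For readers unfamiliar with the two measures quoted above, the following Python sketch shows how overall accuracy and Cohen's kappa are conventionally computed from a confusion matrix. The function name and the 2x2 matrix are made up for illustration; they are not the data or the code of the third-party assessment.

```python
from typing import List, Sequence, Tuple


def overall_accuracy_and_kappa(matrix: Sequence[Sequence[int]]) -> Tuple[float, float]:
    """Compute overall accuracy and Cohen's kappa from a square confusion matrix.

    matrix[i][j] counts samples whose reference class is i and mapped class is j.
    """
    size = len(matrix)
    if size == 0 or any(len(row) != size for row in matrix):
        raise ValueError("confusion matrix must be square and non-empty")

    total = sum(sum(row) for row in matrix)
    if total == 0:
        raise ValueError("confusion matrix contains no samples")

    # Observed agreement: share of samples on the diagonal.
    observed = sum(matrix[i][i] for i in range(size)) / total

    # Agreement expected by chance, from the row and column totals.
    row_totals: List[int] = [sum(row) for row in matrix]
    col_totals: List[int] = [sum(matrix[i][j] for i in range(size)) for j in range(size)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / (total * total)

    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")

    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa


if __name__ == "__main__":
    # Made-up two-class example (e.g. water vs. non-water), 100 samples.
    example = [[40, 10],
               [5, 45]]
    accuracy, kappa = overall_accuracy_and_kappa(example)
    print(f"overall accuracy = {accuracy:.2f}, kappa = {kappa:.2f}")
```

Kappa discounts the agreement that would be expected by chance alone, which is why it is lower than the overall accuracy for the same matrix.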

This work was funded by the National Science Foundation of China (Project #41231172), among other sources. The relevant paper is published in Science China Earth Sciences.

See the article: Zhang Wei Wei, Chen Jun, Liao An Ping, et al. Geospatial knowledge-based verification and improvement of GlobeLand30[J]. Science China Earth Sciences, 2016, 59(9): 1709-1719.

Science China Press
