ABSTRACT Linked Data is known as one of the best solutions for multisource and heterogeneous web data integration and discovery in this era of Big Data. However, data interlinking, which… Click to show full abstract
ABSTRACT Linked Data is known as one of the best solutions for multisource and heterogeneous web data integration and discovery in this era of Big Data. However, data interlinking, which is the most valuable contribution of Linked Data, remains incomplete and inaccurate. This study proposes a multidimensional and quantitative interlinking approach for Linked Data in the geospatial domain. According to the characteristics and roles of geospatial data in data discovery, eight elementary data characteristics are adopted as data interlinking types. These elementary characteristics are further combined to form compound and overall data interlinking types. Each data interlinking type possesses one specific predicate to indicate the actual relationship of Linked Data and uses data similarity to represent the correlation degree quantitatively. Therefore, geospatial data interlinking can be expressed by a directed edge associated with a relation predicate and a similarity value. The approach transforms existing simple and qualitative geospatial data interlinking into complete and quantitative interlinking and promotes the establishment of high-quality and trusted Linked Geospatial Data. The approach is applied to build data intra-links in the Chinese National Earth System Scientific Data Sharing Network (NSTI-GEO) and data -links in NSTI-GEO with the Chinese Meteorological Data Network and National Population and Health Scientific Data Sharing Platform.
               
Click one of the above tabs to view related content.