Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/40143
Title: | Distributed Subweb Specifications for Traversing the Web | Authors: | Bogaerts, Bart; KETSMAN, Bas; Zeboudj, Younes; Taelman, Ruben; MOHAMED, Heba; Verborgh, Ruben |
Issue Date: | 2024 | Publisher: | CAMBRIDGE UNIV PRESS | Source: | Theory and Practice of Logic Programming, 24 (2), p. 394-420 | Abstract: | Link traversal-based query processing (LTQP), in which a SPARQL query is evaluated over a web of documents rather than a single dataset, is often seen as a theoretically interesting yet impractical technique. However, in a time where the hypercentralization of data has increasingly come under scrutiny, a decentralized Web of Data with a simple document-based interface is appealing, as it enables data publishers to control their data and access rights. While LTQP allows evaluating complex queries over such webs, it suffers from performance issues (due to the high number of documents containing data) as well as information quality concerns (due to the many sources providing such documents). In existing LTQP approaches, the burden of finding sources to query is entirely in the hands of the data consumer. In this paper, we argue that to solve these issues, data publishers should also be able to suggest sources of interest and guide the data consumer toward relevant and trustworthy data. We introduce a theoretical framework that enables such guided link traversal and study its properties. We illustrate with a theoretic example that this can improve query results and reduce the number of network requests. We evaluate our proposal experimentally on a virtual linked web with specifications and indeed observe that not just the data quality but also the efficiency of querying improves. | Notes: | Bogaerts, B (corresponding author), Vrije Univ Brussel, Ixelles, Belgium. Bart.Bogaerts@vub.be; bas.ketsman@vub.be; younes.zeboudj@vub.be; heba.mohamed@uhasselt.be; ruben.taelman@ugent.be; ruben.verborgh@ugent.be |
Keywords: | sparql;link traversal-based query processing;web of linked data | Document URI: | http://hdl.handle.net/1942/40143 | ISSN: | 1471-0684 | e-ISSN: | 1475-3081 | DOI: | 10.1017/S1471068423000054 | ISI #: | 000974088500001 | Rights: | The Author(s), 2023. Published by Cambridge University Press. This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited. | Category: | A1 | Type: | Journal Contribution | Validations: | ecoom 2024 |
Appears in Collections: | Research publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
distributed-subweb-specifications-for-traversing-the-web.pdf | Published version | 460.98 kB | Adobe PDF | View/Open |