Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/28069
Title: Parallel-Correctness and Parallel-Boundedness for Datalog Programs
Authors: NEVEN, Frank 
Schwentick, Thomas
Spinrath, Christopher
VANDEVOORT, Brecht 
Issue Date: 2019
Publisher: Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik
Source: Barcelo, Pablo; Calautti, Marco (Ed.). 22nd International Conference on Database Theory (ICDT 2019), Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik, p. 1-19 (Art N° 14)
Series/Report: Leibniz International Proceedings in Informatics (LIPIcs)
Series/Report no.: 127
Abstract: Recently, Ketsman et al. started the investigation of the parallel evaluation of recursive queries in the Massively Parallel Communication (MPC) model. Among other things, it was shown that parallel-correctness and parallel-boundedness for general Datalog programs are undecidable, by a reduction from the undecidable containment problem for Datalog. Furthermore, economic policies were introduced as a means to specify data distribution in a recursive setting. In this paper, we extend the latter framework to account for more general distributed evaluation strategies in terms of communication policies. We then show that the undecidability of parallel-correctness runs deeper: it already holds for fragments of Datalog, e.g., monadic and frontier-guarded Datalog, with a decidable containment problem, under relatively simple evaluation strategies. These simple evaluation strategies are defined w.r.t. data-moving distribution constraints. We then investigate restrictions of economic policies that yield decidability. In particular, we show that parallel-correctness is 2EXPTIME-complete for monadic and frontier-guarded Datalog under hash-based economic policies. Next, we consider restrictions of data-moving constraints and show that parallel-correctness and parallel-boundedness are 2EXPTIME-complete for frontier-guarded Datalog. Interestingly, distributed evaluation no longer preserves the usual containment relationships between fragments of Datalog. Indeed, not every monadic Datalog program is equivalent to a frontier-guarded one in the distributed setting. We illustrate the latter by considering two alternative settings, in one of which parallel-correctness is decidable for frontier-guarded Datalog but undecidable for monadic Datalog.
Keywords: Datalog; distributed databases; distributed evaluation; decision problems; complexity
Document URI: http://hdl.handle.net/1942/28069
ISBN: 9783959771016
DOI: 10.4230/LIPIcs.ICDT.2019.14
Rights: © Frank Neven, Thomas Schwentick, Christopher Spinrath, and Brecht Vandevoort; licensed under Creative Commons License CC-BY
Category: C1
Type: Proceedings Paper
Appears in Collections: Research publications

Files in This Item:
File: LIPIcs-ICDT-2019-14.pdf
Description: Published version
Size: 599.46 kB
Format: Adobe PDF