Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/28069
Title: | Parallel-Correctness and Parallel-Boundedness for Datalog Programs | Authors: | NEVEN, Frank Schwentick, Thomas Spinrath, Christopher VANDEVOORT, Brecht |
Issue Date: | 2019 | Publisher: | Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik | Source: | Barcelo, Pablo; Calautti, Marco (Ed.). 22nd International Conference on Database Theory (ICDT 2019), Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik,p. 1-19 (Art N° 14) | Series/Report: | Leibniz International Proceedings in Informatics (LIPIcs) | Series/Report no.: | 127 | Abstract: | Recently, Ketsman et al. started the investigation of the parallel evaluation of recursive queries in the Massively Parallel Communication (MPC) model. Among other things, it was shown that parallel-correctness and parallel-boundedness for general Datalog programs is undecidable, by a reduction from the undecidable containment problem for Datalog. Furthermore, economic policies were introduced as a means to specify data distribution in a recursive setting. In this paper, we extend the latter framework to account for more general distributed evaluation strategies in terms of communication policies. We then show that the undecidability of parallel-correctness runs deeper: it already holds for fragments of Datalog, e.g., monadic and frontier-guarded Datalog, with a decidable containment problem, under relatively simple evaluation strategies. These simple evaluation strategies are defined w.r.t. data-moving distribution constraints. We then investigate restrictions of economic policies that yield decidability. In particular, we show that parallel-correctness is 2EXPTIME-complete for monadic and frontier-guarded Datalog under hash-based economic policies. Next, we consider restrictions of data-moving constraints and show that parallel-correctness and parallel-boundedness are 2EXPTIME-complete for frontier-guarded Datalog. Interestingly, distributed evaluation no longer preserves the usual containment relationships between fragments of Datalog. Indeed, not every monadic Datalog program is equivalent to a frontier-guarded one in the distributed setting. We illustrate the latter by considering two alternative settings where in one of these parallel-correctness is decidable for frontier-guarded Datalog but undecidable for monadic Datalog. | Keywords: | Datalog; distributed databases; distributed evaluation; decision problems; complexity | Document URI: | http://hdl.handle.net/1942/28069 | ISBN: | 9783959771016 | DOI: | 10.4230/LIPIcs.ICDT.2019.14 | Rights: | © Frank Neven, Thomas Schwentick, Christopher Spinrath, and Brecht Vandevoort; licensed under Creative Commons License CC-BY | Category: | C1 | Type: | Proceedings Paper |
Appears in Collections: | Research publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
LIPIcs-ICDT-2019-14.pdf | Published version | 599.46 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.