Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/1417
Full metadata record
DC Field | Value | Language
dc.contributor.author | NEVEN, Frank | -
dc.contributor.author | VAN DE CRAEN, Dieter | -
dc.date.accessioned | 2007-05-03T09:36:17Z | -
dc.date.available | 2007-05-03T09:36:17Z | -
dc.date.issued | 2006 | -
dc.identifier.citation | Advances in Database Technology - Edbt 2006. p. 829-846 | -
dc.identifier.isbn | 0302-9743 | -
dc.identifier.uri | http://hdl.handle.net/1942/1417 | -
dc.description.abstract | Scientific data in the life sciences is distributed over various independent multi-format databases and is constantly expanding. We discuss a scenario where a life science research lab monitors over time the results of queries to remote databases beyond its control. Queries are registered at a local system and are executed daily in batch mode. The goal of the paper is to study evaluation strategies that minimize the total number of database accesses when evaluating all queries in bulk. We use an abstraction based on the relational model with fan-out constraints and conjunctive queries. We show that the above problem remains NP-hard in two restricted settings: queries of bounded depth and the scenario with a fixed schema. We further show that both restrictions taken together result in a tractable problem. As the constant for the latter algorithm is too high to be feasible in practice, we present four heuristic methods that are experimentally compared on randomly generated and biologically motivated schemas. Our algorithms are based on a greedy method and approximations for the shortest common supersequence problem. | -
dc.format.extent | 242981 bytes | -
dc.format.mimetype | application/pdf | -
dc.language.iso | en | -
dc.publisher | Springer-Verlag Berlin | -
dc.relation.ispartofseries | Lecture Notes in Computer Science | -
dc.subject.other | SEQUENCE DATA-BANK; GENBANK | -
dc.title | Optimizing monitoring queries over distributed data | -
dc.type | Journal Contribution | -
local.bibliographicCitation.authors | Neven, Frank | -
local.bibliographicCitation.authors | Van de Craen, Dieter | -
dc.identifier.epage | 846 | -
dc.identifier.spage | 829 | -
local.bibliographicCitation.jcat | A1 | -
local.type.refereed | Refereed | -
local.type.specified | Article | -
dc.bibliographicCitation.oldjcat | A1 | -
dc.identifier.doi | 10.1007/11687238 | -
dc.identifier.isi | 000237081600049 | -
item.accessRights | Open Access | -
item.fullcitation | NEVEN, Frank & VAN DE CRAEN, Dieter (2006) Optimizing monitoring queries over distributed data. In: Advances in Database Technology - Edbt 2006. p. 829-846. | -
item.contributor | NEVEN, Frank | -
item.contributor | VAN DE CRAEN, Dieter | -
item.fulltext | With Fulltext | -
item.validation | ecoom 2007 | -
Appears in Collections: Research publications
Files in This Item:
File | Description | Size | Format
edbt2006-neven.pdf | Peer-reviewed author version | 237.29 kB | Adobe PDF

Web of Science citations: 1 (checked on Apr 25, 2024)
Page view(s): 62 (checked on Sep 7, 2022)
Download(s): 200 (checked on Sep 7, 2022)

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.