Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/14242
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHELLINGS, Jelle-
dc.contributor.authorFletcher, George H. L.-
dc.contributor.authorHaverkort, Herman-
dc.date.accessioned2012-10-08T10:29:37Z-
dc.date.available2012-10-08T10:29:37Z-
dc.date.issued2012-
dc.identifier.citationSelçuk, Candan K.; Chen, Yi; Snodgrass, Richard T.; Gravano, Luis; Fuxman, Ariel (Ed.). SIGMOD'12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, p. 553-564-
dc.identifier.isbn978-1-4503-1247-9-
dc.identifier.urihttp://hdl.handle.net/1942/14242-
dc.description.abstractIn this paper we introduce the first efficient external-memory algorithm to compute the bisimilarity equivalence classes of a directed acyclic graph (DAG). DAGs are commonly used to model data in a wide variety of practical applications, ranging from XML documents and data provenance models, to web taxonomies and scientific workflows. In the study of efficient reasoning over massive graphs, the notion of node bisimilarity plays a central role. For example, grouping together bisimilar nodes in an XML data set is the first step in many sophisticated approaches to building indexing data structures for efficient XPath query evaluation. To date, however, only internal-memory bisimulation algorithms have been investigated. As the size of real-world DAG data sets often exceeds available main memory, storage in external memory becomes necessary. Hence, there is a practical need for an efficient approach to computing bisimulation in external memory. Our general algorithm has a worst-case IO-complexity of O(Sort(|N| + |E|)), where |N| and |E| are the numbers of nodes and edges, resp., in the data graph and Sort(n) is the number of accesses to external memory needed to sort an input of size n. We also study specializations of this algorithm to common variations of bisimulation for tree-structured XML data sets. We empirically verify efficient performance of the algorithms on graphs and XML documents having billions of nodes and edges, and find that the algorithms can process such graphs efficiently even when very limited internal memory is available. The proposed algorithms are simple enough for practical implementation and use, and open the door for further study of external-memory bisimulation algorithms. To this end, the full open-source C++ implementation has been made freely available.-
dc.language.isoen-
dc.publisherACM-
dc.rightsPermission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. SIGMOD ’12, May 20–24, 2012, Scottsdale, Arizona, USA. Copyright 2012 ACM 978-1-4503-1247-9/12/05 ...$10.00.-
dc.subject.otherbisimulation; graphs; external memory; I/O-
dc.titleEfficient External-Memory Bisimulation on DAGs-
dc.typeProceedings Paper-
local.bibliographicCitation.authorsSelçuk, Candan K.-
local.bibliographicCitation.authorsChen, Yi-
local.bibliographicCitation.authorsSnodgrass, Richard T.-
local.bibliographicCitation.authorsGravano, Luis-
local.bibliographicCitation.authorsFuxman, Ariel-
local.bibliographicCitation.conferencedate20-24 May 2012-
local.bibliographicCitation.conferencename2012 ACM SIGMOD/PODS-
local.bibliographicCitation.conferenceplaceScottsdale-Arizona, USA-
dc.identifier.epage564-
dc.identifier.spage553-
local.bibliographicCitation.jcatC1-
local.publisher.placeNew York, NY, USA-
local.type.refereedRefereed-
local.type.specifiedProceedings Paper-
dc.bibliographicCitation.oldjcatC2-
dc.identifier.doi10.1145/2213836.2213899-
local.bibliographicCitation.btitleSIGMOD'12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data-
item.fulltextWith Fulltext-
item.contributorHELLINGS, Jelle-
item.contributorFletcher, George H. L.-
item.contributorHaverkort, Herman-
item.fullcitationHELLINGS, Jelle; Fletcher, George H. L. & Haverkort, Herman (2012) Efficient External-Memory Bisimulation on DAGs. In: Selçuk, Candan K.; Chen, Yi; Snodgrass, Richard T.; Gravano, Luis; Fuxman, Ariel (Ed.). SIGMOD'12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, p. 553-564.-
item.accessRightsOpen Access-
Appears in Collections:Research publications
Files in This Item:
File Description SizeFormat 
p553.pdfPublished version686.3 kBAdobe PDFView/Open
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.