Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/16395
Full metadata record
DC FieldValueLanguage
dc.contributor.authorArenas, Marcelo-
dc.contributor.authorDAENEN, Jonny-
dc.contributor.authorNEVEN, Frank-
dc.contributor.authorUGARTE, Martin-
dc.contributor.authorVAN DEN BUSSCHE, Jan-
dc.contributor.authorVANSUMMEREN, Stijn-
dc.date.accessioned2014-03-06T08:43:32Z-
dc.date.available2014-03-06T08:43:32Z-
dc.date.issued2013-
dc.identifier.citationRoss, Kenneth A.; Srivastava, Divesh; Papadias, Dimitris (Ed.). Proceedings of the ACM SIGMOD International Conference on Management of Data, p. 61-72-
dc.identifier.isbn978-1-4503-2037-5-
dc.identifier.urihttp://hdl.handle.net/1942/16395-
dc.description.abstractA great deal of research into the learning of schemas from XML data has been conducted in recent years to enable the automatic discovery of XML Schemas from XML documents when no schema, or only a low-quality one is available. Unfortunately, and in strong contrast to, for instance, the relational model, the automatic discovery of even the simplest of XML constraints, namely XML keys, has been left largely unexplored in this context. A major obstacle here is the unavailability of a theory on reasoning about XML keys in the presence of XML schemas, which is needed to validate the quality of candidate keys. The present paper embarks on a fundamental study of such a theory and classifies the complexity of several crucial properties concerning XML keys in the presence of an XSD, like, for instance, testing for consistency, boundedness, satisfiability, universality, and equivalence. Of independent interest, novel results are obtained related to cardinality estimation of XPath result sets. A mining algorithm is then developed within the framework of levelwise search. The algorithm leverages known discovery algorithms for functional dependencies in the relational model, but incorporates the above mentioned properties to assess and refine the quality of derived keys. An experimental study on an extensive body of real world XML data evaluating the effectiveness of the proposed algorithm is provided.-
dc.description.sponsorshipHercules foundation, Flemish Government, Fondecyt Grant #1131049, FP7-ICT- 233599 and ERC grant agreement DIADEM, no. 246858.-
dc.language.isoen-
dc.publisherACM-
dc.rightsCopyright 2013 ACM 978-1-4503-2037-5/13/06 ...$15.00-
dc.subject.otherXML;key;mining-
dc.titleDiscovering XSD keys from XML data-
dc.typeProceedings Paper-
local.bibliographicCitation.authorsRoss, Kenneth A.-
local.bibliographicCitation.authorsSrivastava, Divesh-
local.bibliographicCitation.authorsPapadias, Dimitris-
local.bibliographicCitation.conferencedateJune 22-27, 2013-
local.bibliographicCitation.conferencenameSIGMOD '13-
local.bibliographicCitation.conferenceplaceNew York (NY), USA-
dc.identifier.epage72-
dc.identifier.spage61-
local.bibliographicCitation.jcatC1-
local.publisher.placeNew York, NY, USA-
local.type.refereedRefereed-
local.type.specifiedProceedings Paper-
dc.identifier.doi10.1145/2463676.2463705-
local.bibliographicCitation.btitleProceedings of the ACM SIGMOD International Conference on Management of Data-
item.contributorArenas, Marcelo-
item.contributorDAENEN, Jonny-
item.contributorNEVEN, Frank-
item.contributorUGARTE, Martin-
item.contributorVAN DEN BUSSCHE, Jan-
item.contributorVANSUMMEREN, Stijn-
item.fullcitationArenas, Marcelo; DAENEN, Jonny; NEVEN, Frank; UGARTE, Martin; VAN DEN BUSSCHE, Jan & VANSUMMEREN, Stijn (2013) Discovering XSD keys from XML data. In: Ross, Kenneth A.; Srivastava, Divesh; Papadias, Dimitris (Ed.). Proceedings of the ACM SIGMOD International Conference on Management of Data, p. 61-72.-
item.accessRightsOpen Access-
item.fulltextWith Fulltext-
Appears in Collections:Research publications
Files in This Item:
File Description SizeFormat 
sigfp248-arenas.pdfPeer-reviewed author version264.31 kBAdobe PDFView/Open
Show simple item record

SCOPUSTM   
Citations

6
checked on Sep 3, 2020

Page view(s)

70
checked on Sep 7, 2022

Download(s)

132
checked on Sep 7, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.