Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/12737
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | NEVEN, Frank | -
dc.contributor.author | FONTEYN, Dominique | -
dc.date.accessioned | 2011-11-25T09:06:35Z | -
dc.date.available | 2011-11-25T09:06:35Z | -
dc.date.issued | 2011 | -
dc.identifier.uri | http://hdl.handle.net/1942/12737 | -
dc.description.abstract | XML is one of the most popular languages for storing data on the web. Schemas let us specify the structure of XML documents, and the presence of a schema is used for automatic validation, among other things. However, half of the online XML fragments do not refer to a schema, and about two-thirds of the XSDs are not valid w.r.t. the W3C specifications. We therefore look for algorithms that infer an XSD from a set of XML fragments. In this thesis we explore such inference techniques. The problem boils down to inferring regular expressions. However, not all regular expressions can be learned from positive data only, so we restrict ourselves to single occurrence regular expressions (SOREs). We present iXSD for local SOXSDs. Next, we identify k-occurrence regular expressions (kOREs), which are harder to infer. We focus on hidden Markov models (HMMs) to infer kOREs with iDRegEx. We combine these algorithms to infer local k-OXSDs. We present a similarity measure for two XSDs, which is used to evaluate the experimental results. We see that the combined approach does not perform well on precision and generalisation, but performs rather well on similarity and runtime. | -
dc.language | nl | -
dc.language.iso | en | -
dc.publisher | tUL Diepenbeek | -
dc.title | Hidden Markov Modellen voor het infereren van XSDs [Hidden Markov Models for Inferring XSDs] | -
dc.type | Theses and Dissertations | -
local.bibliographicCitation.jcat | T2 | -
dc.description.notes | master in de informatica-databases [Master in Computer Science, Databases] | -
local.type.specified | Master thesis | -
dc.bibliographicCitation.oldjcat | D2 | -
item.accessRights | Closed Access | -
item.contributor | FONTEYN, Dominique | -
item.fulltext | No Fulltext | -
item.fullcitation | FONTEYN, Dominique (2011) Hidden Markov Modellen voor het infereren van XSDs. | -
Appears in Collections: Master theses