Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/12737
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | NEVEN, Frank | - |
dc.contributor.author | FONTEYN, Dominique | - |
dc.date.accessioned | 2011-11-25T09:06:35Z | - |
dc.date.available | 2011-11-25T09:06:35Z | - |
dc.date.issued | 2011 | - |
dc.identifier.uri | http://hdl.handle.net/1942/12737 | - |
dc.description.abstract | XML is the most popular languages for storing data on the web. Using schemas we can specify the structure of these documents. Its presence is used for automatic validation and. However, half of the online XML fragments do not refer to a schema and about two-thirds of the XSDs are not valid w.r.t. the W3C specifications. Thus we look for algorithms to infer an XSD for a set of XML fragments. In this thesis we explore inference techniques. This boils down to inferring regular expressions. However we cannot learn all regular expressions from positive data only and restrict us to SOREs. We present iXSD for local SOXSDs. Next we identify k-occurrence REs which are harder. We focus on HMMs to infer kOREs with iDRegEx. We combine these algorithms to infer local k-OXSDs. We present a similarity measure for two XSDs used for evaluating the experimental results. We see that it does not perform well on precision and generalisation but rather well on similarity and runtime. | - |
dc.language | nl | - |
dc.language.iso | en | - |
dc.publisher | tUL Diepenbeek | - |
dc.title | Hidden Markov Modellen voor het infereren van XSDs | - |
dc.type | Theses and Dissertations | - |
local.bibliographicCitation.jcat | T2 | - |
dc.description.notes | master in de informatica-databases | - |
local.type.specified | Master thesis | - |
dc.bibliographicCitation.oldjcat | D2 | - |
item.accessRights | Closed Access | - |
item.contributor | FONTEYN, Dominique | - |
item.fulltext | No Fulltext | - |
item.fullcitation | FONTEYN, Dominique (2011) Hidden Markov Modellen voor het infereren van XSDs. | - |
Appears in Collections: | Master theses |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.