Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/701
Full metadata record
DC Field | Value | Language
dc.contributor.author | Kosala, Raymond | -
dc.contributor.author | VAN DEN BUSSCHE, Jan | -
dc.contributor.author | Bruynooghe, Maurice | -
dc.contributor.author | Blockeel, Hendrik | -
dc.date.accessioned | 2005-04-08T09:40:38Z | -
dc.date.available | 2005-04-08T09:40:38Z | -
dc.date.issued | 2002 | -
dc.identifier.citation | Principles of Data Mining and Knowledge Discovery: 6th European Conference, PKDD 2002. p. 299-310 | -
dc.identifier.issn | 0302-9743 | -
dc.identifier.uri | http://hdl.handle.net/1942/701 | -
dc.description.abstract | Information extraction (IE) addresses the problem of extracting specific information from a collection of documents. Much of the previous work for IE from structured documents formatted in HTML or XML uses techniques for IE from strings, such as grammar and automata induction. However, such documents have a tree structure. Hence it is natural to investigate methods that are able to recognise and exploit this tree structure. We do this by exploring the use of tree automata for IE in structured documents. Experimental results on benchmark data sets show that our approach compares favorably with previous approaches. | -
dc.format.extent | 530269 bytes | -
dc.format.mimetype | application/pdf | -
dc.language.iso | en | -
dc.publisher | Springer-Verlag | -
dc.relation.ispartofseries | Lecture Notes in Computer Science | -
dc.title | Information Extraction in Structured Documents Using Tree Automata Induction | -
dc.type | Journal Contribution | -
local.bibliographicCitation.authors | Elomaa, T. | -
local.bibliographicCitation.authors | Mannila, H. | -
local.bibliographicCitation.authors | Toivonen, H. | -
local.bibliographicCitation.conferencedate | AUG 19-23, 2002 | -
local.bibliographicCitation.conferencename | Principles of Data Mining and Knowledge Discovery: 6th European Conference, PKDD 2002 | -
local.bibliographicCitation.conferenceplace | Helsinki, Finland | -
dc.identifier.epage | 310 | -
dc.identifier.spage | 299 | -
local.bibliographicCitation.jcat | A1 | -
local.type.refereed | Refereed | -
local.type.specified | Article | -
local.relation.ispartofseriesnr | 2431 | -
dc.bibliographicCitation.oldjcat | A2 | -
item.accessRights | Open Access | -
item.fulltext | With Fulltext | -
item.contributor | Kosala, Raymond | -
item.contributor | VAN DEN BUSSCHE, Jan | -
item.contributor | Bruynooghe, Maurice | -
item.contributor | Blockeel, Hendrik | -
item.fullcitation | Kosala, Raymond; VAN DEN BUSSCHE, Jan; Bruynooghe, Maurice & Blockeel, Hendrik (2002) Information Extraction in Structured Documents Using Tree Automata Induction. In: Principles of Data Mining and Knowledge Discovery: 6th European Conference, PKDD 2002. p. 299-310. | -
crisitem.journal.issn | 0302-9743 | -
Appears in Collections: Research publications
Files in This Item:
File | Description | Size | Format
datamining2.pdf | - | 517.84 kB | Adobe PDF