Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/45025
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLight, Dean-
dc.contributor.authorAiashy, Ahmad-
dc.contributor.authorDiab, Mahmoud-
dc.contributor.authorNachmias, Daniel-
dc.contributor.authorVANSUMMEREN, Stijn-
dc.contributor.authorKimelfeld, Benny-
dc.date.accessioned2025-01-09T10:46:27Z-
dc.date.available2025-01-09T10:46:27Z-
dc.date.issued2024-
dc.date.submitted2025-01-03T11:17:05Z-
dc.identifier.citationProceedings of the Vldb Endowment, 17 (12) , p. 4281 -4284-
dc.identifier.urihttp://hdl.handle.net/1942/45025-
dc.description.abstractDocument spanners have been proposed as a formal framework for declarative Information Extraction (IE) from text, following IE products from the industry and academia. Over the past decade, the framework has been studied thoroughly in terms of expressive power, complexity, and the ability to naturally combine text analysis with relational querying. This demonstration presents SPANNERLIB-a library for embedding document spanners in Python code. SPANNERLIB facilitates the development of IE programs by providing an implementation of Spannerlog (Datalog-based document spanners) that interacts with the Python code in two directions: rules can be embedded inside Python, and they can invoke custom Python code (e.g., calls to ML-based NLP models) via user-defined functions. The demonstration scenarios showcase IE programs, with increasing levels of complexity, within Jupyter Notebook.-
dc.description.sponsorshipThis work was funded by the Israel Science Foundation (ISF) under grant 768/19. The work of Dean Light and Benny Kimelfeld has been funded by the German Research Foundation (DFG) under grant KI 2348/1-1. S. Vansummeren was supported by the Bijzonder Onderzoeksfonds (BOF) of Hasselt University under Grant No. BOF20ZAP02 as well as the Flanders AI research program.-
dc.language.isoen-
dc.publisherASSOC COMPUTING MACHINERY-
dc.rightsThis work is licensed under the Creative Commons BY-NC-ND 4.0 International License. Visit https://creativecommons.org/licenses/by-nc-nd/4.0/ to view a copy of this license. For any use beyond those covered by this license, obtain permission by emailing info@vldb.org. Copyright is held by the owner/author(s). Publication rights licensed to the VLDB Endowment-
dc.titleSpannerLib: Embedding Declarative Information Extraction in an Imperative Workflow-
dc.typeJournal Contribution-
dc.identifier.epage4284-
dc.identifier.issue12-
dc.identifier.spage4281-
dc.identifier.volume17-
local.format.pages4-
local.bibliographicCitation.jcatA1-
dc.description.notesLight, D (corresponding author), Technion, Haifa, Israel.-
dc.description.notesdean.light92@gmail.com; ahmad-ai@campus.technion.ac.il;-
dc.description.notesmahmoud.diab@campus.technion.ac.il; nach.daniel@gmail.com;-
dc.description.notesstijn.vansummeren@uhasselt.be; bennyk@cs.technion.ac.il-
local.publisher.place1601 Broadway, 10th Floor, NEW YORK, NY USA-
local.type.refereedRefereed-
local.type.specifiedArticle-
dc.identifier.doi10.14778/3685800.3685855-
dc.identifier.isi001378223700007-
local.provider.typewosris-
local.description.affiliation[Light, Dean; Aiashy, Ahmad; Diab, Mahmoud; Nachmias, Daniel; Kimelfeld, Benny] Technion, Haifa, Israel.-
local.description.affiliation[Vansummeren, Stijn] UHasselt, Data Sci Inst, Diepenbeek, Belgium.-
local.uhasselt.internationalyes-
item.fulltextWith Fulltext-
item.contributorLight, Dean-
item.contributorAiashy, Ahmad-
item.contributorDiab, Mahmoud-
item.contributorNachmias, Daniel-
item.contributorVANSUMMEREN, Stijn-
item.contributorKimelfeld, Benny-
item.fullcitationLight, Dean; Aiashy, Ahmad; Diab, Mahmoud; Nachmias, Daniel; VANSUMMEREN, Stijn & Kimelfeld, Benny (2024) SpannerLib: Embedding Declarative Information Extraction in an Imperative Workflow. In: Proceedings of the Vldb Endowment, 17 (12) , p. 4281 -4284.-
item.accessRightsOpen Access-
crisitem.journal.issn2150-8097-
crisitem.journal.eissn2150-8097-
Appears in Collections:Research publications
Files in This Item:
File Description SizeFormat 
SpannerLib_ Embedding Declarative Information Extraction in an Imperative Workflow.pdfPublished version648.93 kBAdobe PDFView/Open
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.