Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/45025
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Light, Dean | - |
dc.contributor.author | Aiashy, Ahmad | - |
dc.contributor.author | Diab, Mahmoud | - |
dc.contributor.author | Nachmias, Daniel | - |
dc.contributor.author | VANSUMMEREN, Stijn | - |
dc.contributor.author | Kimelfeld, Benny | - |
dc.date.accessioned | 2025-01-09T10:46:27Z | - |
dc.date.available | 2025-01-09T10:46:27Z | - |
dc.date.issued | 2024 | - |
dc.date.submitted | 2025-01-03T11:17:05Z | - |
dc.identifier.citation | Proceedings of the Vldb Endowment, 17 (12) , p. 4281 -4284 | - |
dc.identifier.uri | http://hdl.handle.net/1942/45025 | - |
dc.description.abstract | Document spanners have been proposed as a formal framework for declarative Information Extraction (IE) from text, following IE products from the industry and academia. Over the past decade, the framework has been studied thoroughly in terms of expressive power, complexity, and the ability to naturally combine text analysis with relational querying. This demonstration presents SPANNERLIB-a library for embedding document spanners in Python code. SPANNERLIB facilitates the development of IE programs by providing an implementation of Spannerlog (Datalog-based document spanners) that interacts with the Python code in two directions: rules can be embedded inside Python, and they can invoke custom Python code (e.g., calls to ML-based NLP models) via user-defined functions. The demonstration scenarios showcase IE programs, with increasing levels of complexity, within Jupyter Notebook. | - |
dc.description.sponsorship | This work was funded by the Israel Science Foundation (ISF) under grant 768/19. The work of Dean Light and Benny Kimelfeld has been funded by the German Research Foundation (DFG) under grant KI 2348/1-1. S. Vansummeren was supported by the Bijzonder Onderzoeksfonds (BOF) of Hasselt University under Grant No. BOF20ZAP02 as well as the Flanders AI research program. | - |
dc.language.iso | en | - |
dc.publisher | ASSOC COMPUTING MACHINERY | - |
dc.rights | This work is licensed under the Creative Commons BY-NC-ND 4.0 International License. Visit https://creativecommons.org/licenses/by-nc-nd/4.0/ to view a copy of this license. For any use beyond those covered by this license, obtain permission by emailing info@vldb.org. Copyright is held by the owner/author(s). Publication rights licensed to the VLDB Endowment | - |
dc.title | SpannerLib: Embedding Declarative Information Extraction in an Imperative Workflow | - |
dc.type | Journal Contribution | - |
dc.identifier.epage | 4284 | - |
dc.identifier.issue | 12 | - |
dc.identifier.spage | 4281 | - |
dc.identifier.volume | 17 | - |
local.format.pages | 4 | - |
local.bibliographicCitation.jcat | A1 | - |
dc.description.notes | Light, D (corresponding author), Technion, Haifa, Israel. | - |
dc.description.notes | dean.light92@gmail.com; ahmad-ai@campus.technion.ac.il; | - |
dc.description.notes | mahmoud.diab@campus.technion.ac.il; nach.daniel@gmail.com; | - |
dc.description.notes | stijn.vansummeren@uhasselt.be; bennyk@cs.technion.ac.il | - |
local.publisher.place | 1601 Broadway, 10th Floor, NEW YORK, NY USA | - |
local.type.refereed | Refereed | - |
local.type.specified | Article | - |
dc.identifier.doi | 10.14778/3685800.3685855 | - |
dc.identifier.isi | 001378223700007 | - |
local.provider.type | wosris | - |
local.description.affiliation | [Light, Dean; Aiashy, Ahmad; Diab, Mahmoud; Nachmias, Daniel; Kimelfeld, Benny] Technion, Haifa, Israel. | - |
local.description.affiliation | [Vansummeren, Stijn] UHasselt, Data Sci Inst, Diepenbeek, Belgium. | - |
local.uhasselt.international | yes | - |
item.fulltext | With Fulltext | - |
item.contributor | Light, Dean | - |
item.contributor | Aiashy, Ahmad | - |
item.contributor | Diab, Mahmoud | - |
item.contributor | Nachmias, Daniel | - |
item.contributor | VANSUMMEREN, Stijn | - |
item.contributor | Kimelfeld, Benny | - |
item.fullcitation | Light, Dean; Aiashy, Ahmad; Diab, Mahmoud; Nachmias, Daniel; VANSUMMEREN, Stijn & Kimelfeld, Benny (2024) SpannerLib: Embedding Declarative Information Extraction in an Imperative Workflow. In: Proceedings of the Vldb Endowment, 17 (12) , p. 4281 -4284. | - |
item.accessRights | Open Access | - |
crisitem.journal.issn | 2150-8097 | - |
crisitem.journal.eissn | 2150-8097 | - |
Appears in Collections: | Research publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
SpannerLib_ Embedding Declarative Information Extraction in an Imperative Workflow.pdf | Published version | 648.93 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.