Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/35394
Title: Database Principles and Challenges in Text Analysis
Authors: DOLESCHAL, Johannes; Kimelfeld, Benny; MARTENS, Wim
Issue Date: 2021
Publisher: ASSOC COMPUTING MACHINERY
Source: SIGMOD RECORD, 50 (2), p. 6-17
Abstract: A common conceptual view of text analysis is that of a two-step process, where we first extract relations from text documents and then apply a relational query over the result. Hence, text analysis shares technical challenges with, and can draw ideas from, relational databases. A framework that formally instantiates this connection is that of the document spanners. In this article, we review recent advances in various research efforts that adapt fundamental database concepts to text analysis through the lens of document spanners. Among others, we discuss aspects of query evaluation, aggregate queries, provenance, and distributed query planning.
Notes: Doleschal, J (corresponding author), Univ Bayreuth, Bayreuth, Germany.; Doleschal, J (corresponding author), Hasselt Univ, Hasselt, Belgium.
johannes.doleschal@uni-bayreuth.de; bennyk@cs.technion.ac.il; wim.martens@uni-bayreuth.de
Document URI: http://hdl.handle.net/1942/35394
ISSN: 0163-5808
e-ISSN: 1943-5835
ISI #: WOS:000692565900002
Category: A1
Type: Journal Contribution
Validations: ecoom 2022
Appears in Collections: Research publications

Web of Science citations: 2 (checked on Apr 24, 2024)

Page views: 22 (checked on Sep 7, 2022)

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.