Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/39394
Full metadata record
DC Field | Value | Language
dc.contributor.author | Bonilla Huerfano, Johnatan | -
dc.contributor.author | Bouzouita, Miriam | -
dc.contributor.author | SEGUNDO DIAZ, Rosa Lilia | -
dc.date.accessioned | 2023-02-08T15:24:28Z | -
dc.date.available | 2023-02-08T15:24:28Z | -
dc.date.issued | 2022 | -
dc.date.submitted | 2023-02-08T10:42:49Z | -
dc.identifier.citation | Revista internacional de lingüística iberoamericana, XX (2 (40)), p. 77-96 | -
dc.identifier.uri | http://hdl.handle.net/1942/39394 | -
dc.description.abstract | This article presents the advances made in the construction of the ‘Annotated and Parsed Audible Corpus of Spoken Rural Spanish’. It presents the methodology for building a treebank to evaluate the accuracy of state-of-the-art part-of-speech taggers (spaCy, Stanza and UDPipe). It is shown that, when oral data is tagged, the accuracy is 0.93; none of the regional varieties presents a significant difference in accuracy over the others. Regarding the grammatical categories, interjections, proper nouns, adjectives, and auxiliaries have the lowest F values. Finally, some examples of the polyfunctionality of some grammatical categories and the fuzzy boundaries between them are discussed, such as the passive participle and the ambiguity between adverbs and subordinate conjunctions, which might affect accuracy. | -
dc.language.iso | es | -
dc.publisher | Iberoamericana Editorial Vervuert, Instituto Ibero-Americano de Berlín. | -
dc.relation.ispartofseries | Dialectología digital: innovaciones técnicas y metodológicas | -
dc.title | La construcción del Corpus Oral y Sonoro del Español Rural-Anotado y Parseado (COSER-AP): avances en el etiquetado de partes del discurso | -
dc.title.alternative | The Construction of the Annotated and Parsed Audible Corpus of Spoken Rural Spanish (COSER-AP): Advances in the Annotation of the Parts of Speech | -
dc.type | Journal Contribution | -
dc.identifier.epage | 96 | -
dc.identifier.issue | 2 (40) | -
dc.identifier.spage | 77 | -
dc.identifier.volume | XX | -
local.bibliographicCitation.jcat | A1 | -
local.type.refereed | Refereed | -
local.type.specified | Article | -
dc.identifier.eissn | | -
local.provider.type | bibtex | -
local.uhasselt.international | yes | -
item.fulltext | No Fulltext | -
item.accessRights | Closed Access | -
item.validation | vabb 2024 | -
item.contributor | Bonilla Huerfano, Johnatan | -
item.contributor | Bouzouita, Miriam | -
item.contributor | SEGUNDO DIAZ, Rosa Lilia | -
item.fullcitation | Bonilla Huerfano, Johnatan; Bouzouita, Miriam & SEGUNDO DIAZ, Rosa Lilia (2022) La construcción del Corpus Oral y Sonoro del Español Rural-Anotado y Parseado (COSER-AP): avances en el etiquetado de partes del discurso. In: Revista internacional de lingüística iberoamericana, XX (2 (40)), p. 77-96. | -
crisitem.journal.issn | 1579-9425 | -
Appears in Collections: Research publications

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.