Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/32466
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSánchez, Ricardo-
dc.contributor.authorBELLO GARCIA, Marilyn-
dc.contributor.authorMorell, Carlos-
dc.contributor.authorBello, Rafael-
dc.contributor.authorVANHOOF, Koen-
dc.date.accessioned2020-10-14T08:23:56Z-
dc.date.available2020-10-14T08:23:56Z-
dc.date.issued2019-
dc.date.submitted2020-10-13T23:45:12Z-
dc.identifier.citationProceedings of the 2nd International Conference of Information Processing CIPI - IOTAI 2019,-
dc.identifier.isbn9789593123723-
dc.identifier.urihttp://hdl.handle.net/1942/32466-
dc.description.abstractIn the last years, the amounts of data have increased considerably and therefore, it is becoming more complex to handle these volumes of information. Measuring the data quality is a pivotal aspect to assess the classifier's discriminatory power as the classifiers accuracy heavily depends on the data used to build the model. Multi-label classification is one specific type of classification problem, which has generated an increasing interest in recent years. However, there are no quality measures for multi-label datasets implemented in cluster computing frameworks to evaluate large datasets. This work aims to implement a measure of data quality for multi-label datasets based on Granular Computing under the Apache Spark framework. As a result, it was possible to calculate the values of the quality measure for the datasets, and even in relatively short times.-
dc.language.isoen-
dc.subject.otherapache spark-
dc.subject.otherQuality Measure-
dc.subject.othermulti-label classification-
dc.subject.otherMulti-label Classification-
dc.subject.otherApache Spark-
dc.subject.otherquality measure-
dc.titleA quality measure for multi-label datasets on the Apache Spark framework-
dc.typeProceedings Paper-
local.bibliographicCitation.conferencedate06/24/2019 - 06/28/2019-
local.bibliographicCitation.conferencenameInternational Workshop of Internet of Things & Artificial Intelligence-
local.bibliographicCitation.conferenceplaceCayos de Villa Clara, Cuba-
local.format.pages4-
local.bibliographicCitation.jcatC1-
local.type.refereedRefereed-
local.type.specifiedProceedings Paper-
dc.identifier.urlhttps://convencion.uclv.cu/event/2nd-international-conference-of-information-processing-cipi-iotai-2019-international-workshop-of-internet-of-things-artificial-intelligence-2019-06-24-2019-06-29-37/track/a-quality-measure-for-multi-label-datasets-on-the-apache-spark-framework-1642-
local.provider.typePdf-
local.bibliographicCitation.btitleProceedings of the 2nd International Conference of Information Processing CIPI - IOTAI 2019-
local.uhasselt.uhpubyes-
item.fulltextWith Fulltext-
item.fullcitationSánchez, Ricardo; BELLO GARCIA, Marilyn; Morell, Carlos; Bello, Rafael & VANHOOF, Koen (2019) A quality measure for multi-label datasets on the Apache Spark framework. In: Proceedings of the 2nd International Conference of Information Processing CIPI - IOTAI 2019,.-
item.contributorSánchez, Ricardo-
item.contributorBELLO GARCIA, Marilyn-
item.contributorMorell, Carlos-
item.contributorBello, Rafael-
item.contributorVANHOOF, Koen-
item.validationvabb 2023-
item.accessRightsOpen Access-
Appears in Collections:Research publications
Files in This Item:
File Description SizeFormat 
IoT-AI2019-3pag.pdfPublished version179.94 kBAdobe PDFView/Open
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.