Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/32466
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Sánchez, Ricardo | - |
dc.contributor.author | BELLO GARCIA, Marilyn | - |
dc.contributor.author | Morell, Carlos | - |
dc.contributor.author | Bello, Rafael | - |
dc.contributor.author | VANHOOF, Koen | - |
dc.date.accessioned | 2020-10-14T08:23:56Z | - |
dc.date.available | 2020-10-14T08:23:56Z | - |
dc.date.issued | 2019 | - |
dc.date.submitted | 2020-10-13T23:45:12Z | - |
dc.identifier.citation | Proceedings of the 2nd International Conference of Information Processing CIPI - IOTAI 2019, | - |
dc.identifier.isbn | 9789593123723 | - |
dc.identifier.uri | http://hdl.handle.net/1942/32466 | - |
dc.description.abstract | In recent years, the amount of data has increased considerably, making these volumes of information more complex to handle. Measuring data quality is pivotal for assessing a classifier's discriminatory power, as the classifier's accuracy depends heavily on the data used to build the model. Multi-label classification is a specific type of classification problem that has generated increasing interest in recent years. However, no quality measures for multi-label datasets have been implemented in cluster computing frameworks to evaluate large datasets. This work aims to implement a data quality measure for multi-label datasets, based on Granular Computing, on the Apache Spark framework. As a result, it was possible to calculate the values of the quality measure for the datasets in relatively short times. | - |
dc.language.iso | en | - |
dc.subject.other | Apache Spark | - |
dc.subject.other | quality measure | - |
dc.subject.other | multi-label classification | - |
dc.title | A QUALITY MEASURE FOR MULTI-LABEL DATASETS ON THE APACHE SPARK FRAMEWORK | - |
dc.type | Proceedings Paper | - |
local.bibliographicCitation.conferencedate | 06/24/2019 - 06/28/2019 | - |
local.bibliographicCitation.conferencename | International Workshop of Internet of Things & Artificial Intelligence | - |
local.bibliographicCitation.conferenceplace | Cayos de Villa Clara, Cuba | - |
local.format.pages | 4 | - |
local.bibliographicCitation.jcat | C1 | - |
local.type.refereed | Refereed | - |
local.type.specified | Proceedings Paper | - |
dc.identifier.url | https://convencion.uclv.cu/event/2nd-international-conference-of-information-processing-cipi-iotai-2019-international-workshop-of-internet-of-things-artificial-intelligence-2019-06-24-2019-06-29-37/track/a-quality-measure-for-multi-label-datasets-on-the-apache-spark-framework-1642 | - |
local.provider.type | - | |
local.bibliographicCitation.btitle | Proceedings of the 2nd International Conference of Information Processing CIPI - IOTAI 2019 | - |
local.uhasselt.uhpub | yes | - |
item.validation | vabb 2023 | - |
item.contributor | Sánchez, Ricardo | - |
item.contributor | BELLO GARCIA, Marilyn | - |
item.contributor | Morell, Carlos | - |
item.contributor | Bello, Rafael | - |
item.contributor | VANHOOF, Koen | - |
item.accessRights | Open Access | - |
item.fullcitation | Sánchez, Ricardo; BELLO GARCIA, Marilyn; Morell, Carlos; Bello, Rafael & VANHOOF, Koen (2019) A QUALITY MEASURE FOR MULTI-LABEL DATASETS ON THE APACHE SPARK FRAMEWORK. In: Proceedings of the 2nd International Conference of Information Processing CIPI - IOTAI 2019,. | - |
item.fulltext | With Fulltext | - |
Appears in Collections: | Research publications |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
IoT-AI2019-3pag.pdf | Published version | 179.94 kB | Adobe PDF |
Page view(s): 70 (checked on Sep 6, 2022)
Download(s): 8 (checked on Sep 6, 2022)
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.