Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/1511
Full metadata record
DC FieldValueLanguage
dc.contributor.authorJANSSENS, Davy-
dc.contributor.authorBRIJS, Tom-
dc.contributor.authorVANHOOF, Koen-
dc.contributor.authorWETS, Geert-
dc.date.accessioned2007-05-21T09:49:30Z-
dc.date.available2007-05-21T09:49:30Z-
dc.date.issued2006-
dc.identifier.citationCOMPUTERS & OPERATIONS RESEARCH, 33(11). p. 3107-3123-
dc.identifier.issn0305-0548-
dc.identifier.urihttp://hdl.handle.net/1942/1511-
dc.description.abstractDiscretization is defined as the process that divides continuous numeric values into intervals of discrete categorical values. In this article, the concept of cost-based discretization as a pre-processing step to the induction of a classifier is introduced in order to obtain an optimal multi-interval splitting for each numeric attribute. A transparent description of the method and the steps involved in cost-based discretization are given. The aim of this paper is to present this method and to assess the potential benefits of such an approach. Furthermore, its performance against two other well-known methods, i.e. entropy- and pure error-based discretization is examined. To this end, experiments on 14 data sets, taken from the UCI Repository on Machine Learning were carried out. In order to compare the different methods, the area under the Receiver Operating Characteristic (ROC) graph was used and tested on its level of significance. For most data sets the results show that cost-based discretization achieves satisfactory results when compared to entropy- and error-based discretization.-
dc.format.extent88957 bytes-
dc.format.mimetypeapplication/pdf-
dc.language.isoen-
dc.publisherElsevier-
dc.subject.otherDiscretization; ROC-curve; Cost-sensitive learning-
dc.titleEvaluating the performance of cost-based discretization versus entropy- and error-based discretization-
dc.typeJournal Contribution-
dc.identifier.epage3123-
dc.identifier.issue11-
dc.identifier.spage3107-
dc.identifier.volume33-
local.bibliographicCitation.jcatA1-
local.type.refereedRefereed-
local.type.specifiedArticle-
dc.bibliographicCitation.oldjcatA1-
dc.identifier.doi10.1016/j.cor.2005.01.022-
dc.identifier.isi000237180200004-
item.fulltextWith Fulltext-
item.fullcitationJANSSENS, Davy; BRIJS, Tom; VANHOOF, Koen & WETS, Geert (2006) Evaluating the performance of cost-based discretization versus entropy- and error-based discretization. In: COMPUTERS & OPERATIONS RESEARCH, 33(11). p. 3107-3123.-
item.accessRightsOpen Access-
item.contributorJANSSENS, Davy-
item.contributorBRIJS, Tom-
item.contributorVANHOOF, Koen-
item.contributorWETS, Geert-
item.validationecoom 2007-
crisitem.journal.issn0305-0548-
crisitem.journal.eissn1873-765X-
Appears in Collections:Research publications
Files in This Item:
File Description SizeFormat 
evaluating.pdfPeer-reviewed author version86.87 kBAdobe PDFView/Open
Show simple item record

SCOPUSTM   
Citations

33
checked on Sep 2, 2020

WEB OF SCIENCETM
Citations

27
checked on May 8, 2024

Page view(s)

60
checked on Jul 15, 2022

Download(s)

182
checked on Jul 15, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.