Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/26621
Full metadata record
DC FieldValueLanguage
dc.contributor.authorECTORS, Wim-
dc.contributor.authorREUMERS, Sofie-
dc.contributor.authorLEE, Won Do-
dc.contributor.authorKOCHAN, Bruno-
dc.contributor.authorJANSSENS, Davy-
dc.contributor.authorBELLEMANS, Tom-
dc.contributor.authorWETS, Geert-
dc.date.accessioned2018-08-07T07:28:19Z-
dc.date.available2018-08-07T07:28:19Z-
dc.date.issued2020-
dc.identifier.citationFuture generation computer systems, 110, p. 338-349.-
dc.identifier.issn0167-739X-
dc.identifier.urihttp://hdl.handle.net/1942/26621-
dc.description.abstractDespite the advantages, big transport data are characterized by a considerable disadvantage as well. Personal and activity-travel information are often lacking, making it necessary to deduce this information with data mining techniques. However, some studies predict many unique activity type classes (ATCs), while others merge multiple activity types into larger ATCs. This action enhances the activity inference estimation, but destroys important activity information. Previous studies do not provide a strong justification for this practice. An objectively optimized set of ATCs, balancing model prediction accuracy and preserving activity information from the original data, becomes essential. Previous research developed a classification methodology in which the optimal set of ATCs was identified by analyzing all possible ATC combinations. However, this approach is practically impossible in a finite amount of time for e.g. the US National Household Travel Survey (NHTS) 2009 data set, which comprises 36 ATCs (home activity excluded), since there would be 3.82 · 1030 unique combinations (an exponential increase). The aim of this paper is to optimize which original ATCs should be grouped into a new class, and this for data sets for which it is impossible or impractical to simply calculate all ATC combinations. The proposed method defines anoptimization parameter U (based on classification accuracy and information retention) which is maximized in an iterative local search algorithm. The optimal set of ATCs for the NHTS 2009 data set was determined. A comparison finds that this optimum is considerably better than many expert opinion activity type classification systems. Convergence was confirmed and large performance gains were found-
dc.language.isoen-
dc.publisherELSEVIER-
dc.rights2018 Elsevier B.V. All rights reserved.-
dc.subject.otherActivity type classification-
dc.subject.other(Big) Transport data annotation-
dc.subject.otherOptimal set of activity types-
dc.subject.otherLocal search algorithm-
dc.subject.otherClassification accuracy-
dc.subject.otherEntropy indices-
dc.titleOptimizing copious activity type classes based on classification accuracy and entropy retention-
dc.typeJournal Contribution-
dc.identifier.epage349-
dc.identifier.spage338-
dc.identifier.volume110-
local.format.pages12-
local.bibliographicCitation.jcatA1-
local.publisher.placeRADARWEG 29, 1043 NX AMSTERDAM, NETHERLANDS-
local.type.refereedRefereed-
local.type.specifiedArticle-
local.classdsPublValOverrule/author_version_not_expected-
dc.identifier.doi10.1016/j.future.2018.04.080-
dc.identifier.isi000541153400031-
dc.identifier.eissn1872-7115-
local.provider.typeWeb of Science-
local.uhasselt.internationalyes-
item.contributorECTORS, Wim-
item.contributorREUMERS, Sofie-
item.contributorLEE, Won Do-
item.contributorKOCHAN, Bruno-
item.contributorJANSSENS, Davy-
item.contributorBELLEMANS, Tom-
item.contributorWETS, Geert-
item.validationecoom 2022-
item.fullcitationECTORS, Wim; REUMERS, Sofie; LEE, Won Do; KOCHAN, Bruno; JANSSENS, Davy; BELLEMANS, Tom & WETS, Geert (2020) Optimizing copious activity type classes based on classification accuracy and entropy retention. In: Future generation computer systems, 110, p. 338-349..-
item.accessRightsOpen Access-
item.fulltextWith Fulltext-
crisitem.journal.issn0167-739X-
crisitem.journal.eissn1872-7115-
Appears in Collections:Research publications
Files in This Item:
File Description SizeFormat 
paper.pdfNon Peer-reviewed author version499.09 kBAdobe PDFView/Open
main.pdf
  Restricted Access
Published version1.15 MBAdobe PDFView/Open    Request a copy
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.