Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/757
Full metadata record
DC Field | Value | Language
--- | --- | ---
dc.contributor.author | EGGHE, Leo | -
dc.contributor.author | ROUSSEAU, Ronald | -
dc.date.accessioned | 2005-05-24T07:59:08Z | -
dc.date.available | 2005-05-24T07:59:08Z | -
dc.date.issued | 2004 | -
dc.identifier.citation | Journal of Information Science, 30(6). p. 509-519 | -
dc.identifier.issn | 0165-5515 | -
dc.identifier.uri | http://hdl.handle.net/1942/757 | -
dc.description.abstract | Similarity between objects (documents, persons, answers to a questionnaire, etc.) is generally determined through relations between representations of these objects. In the case of binary representations the presence of a property (e.g. an index term) carries a weight of one, its absence a weight of zero. In many similarity studies common zeros are ignored. This situation is called the zero insensitive case. In this article, however, we study the zero sensitive case. Clearly, answers to binary questionnaires (yes-no, encoded as 1-0) are zero sensitive, as people who answer ‘no’ to the same questions are more similar than those who give different answers. We present a wish list for such a zero sensitive approach to similarity. Making a difference between common zeros and common ones leads to an ‘identity-similarity’ theory. Hence, we move beyond a pure similarity theory. Two approaches to the problem of similarity measurement of presence-absence data, where common zeros matter and have the same effect as common ones, are presented. For the case that there is a difference between common ones and common zeros a totally new approach is proposed. In each case a coding approach is used, leading to new representations, which then lead to a similarity ranking. Examples of functions respecting these rankings are given. | -
dc.format.extent | 2124081 bytes | -
dc.format.mimetype | application/pdf | -
dc.language.iso | en | -
dc.publisher | Sage | -
dc.subject.other | Zero-sensitive similarity; absence-presence data; differences between identical representations | -
dc.title | An approach to similarity measurement of absence-presence data: the case that common zeros matter | -
dc.type | Journal Contribution | -
dc.identifier.epage | 519 | -
dc.identifier.issue | 6 | -
dc.identifier.spage | 509 | -
dc.identifier.volume | 30 | -
local.bibliographicCitation.jcat | A1 | -
local.type.refereed | Refereed | -
local.type.specified | Article | -
dc.bibliographicCitation.oldjcat | A1 | -
dc.identifier.doi | 10.1177/0165551504047827 | -
dc.identifier.isi | 000225754700004 | -
item.contributor | EGGHE, Leo | -
item.contributor | ROUSSEAU, Ronald | -
item.fullcitation | EGGHE, Leo & ROUSSEAU, Ronald (2004) An approach to similarity measurement of absence-presence data: the case that common zeros matter. In: Journal of Information Science, 30(6). p. 509-519. | -
item.accessRights | Restricted Access | -
item.fulltext | With Fulltext | -
item.validation | ecoom 2006 | -
crisitem.journal.issn | 0165-5515 | -
crisitem.journal.eissn | 1741-6485 | -
Appears in Collections: Research publications
Files in This Item:
File | Description | Size | Format
--- | --- | --- | ---
an approach.pdf | Non Peer-reviewed author version | 2.07 MB | Adobe PDF
approach 1.pdf (Restricted Access) | Published version | 151.73 kB | Adobe PDF
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.