Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/17947
Title: A new method for information retrieval, based on the theory of relative concentration
Authors: EGGHE, Leo 
Issue Date: 1990
Publisher: ACM
Source: Vidick, Jean-Luc (Ed.). Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval, p. 469-493
Abstract: This paper introduces a new method for information retrieval of documents that are represented by a vector. The novelty of the algorithm lies in the fact that no (generalized) p-norms are used as a matching function between the query and the document (as is done e.g. by Salton and others) but a function that measures the relative dispersion of the terms between a document and a query. This function originates from an earlier paper of the author where a good measure of relative concentration was introduced, used in informetrics to measure the degree of specialization of a journal w.r.t. the entire subject. This new information retrieval algorithm is shown to have many desirable properties (in the sense of the new Cater-Kraft wish list) including those of the original cosine-matching function of Salton. In addition the property of the cosine-matching function that, if one only uses weights 0 to 1, one is reduced to Boolean IR, is refined in the sense that one takes into consideration the broadness or specialization of a document and a query. Our new matching function satisfies these additional properties.
Document URI: http://hdl.handle.net/1942/17947
ISBN: 0-89791-408-2
DOI: 10.1145/96749.98254
Category: C1
Type: Proceedings Paper
Appears in Collections:Research publications

Show full item record

SCOPUSTM   
Citations

2
checked on Sep 3, 2020

Page view(s)

56
checked on Jun 16, 2023

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.