Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/17947
Title: A new method for information retrieval, based on the theory of relative concentration
Authors: EGGHE, Leo 
Issue Date: 1990
Publisher: ACM
Source: Vidick, Jean-Luc (Ed.). Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval, p. 469-493
Abstract: This paper introduces a new method for information retrieval of documents that are represented by a vector. The novelty of the algorithm lies in the fact that no (generalized) p-norms are used as a matching function between the query and the document (as is done e.g. by Salton and others) but a function that measures the relative dispersion of the terms between a document and a query. This function originates from an earlier paper of the author where a good measure of relative concentration was introduced, used in informetrics to measure the degree of specialization of a journal w.r.t. the entire subject. This new information retrieval algorithm is shown to have many desirable properties (in the sense of the new Cater-Kraft wish list) including those of the original cosine-matching function of Salton. In addition the property of the cosine-matching function that, if one only uses weights 0 to 1, one is reduced to Boolean IR, is refined in the sense that one takes into consideration the broadness or specialization of a document and a query. Our new matching function satisfies these additional properties.
Document URI: http://hdl.handle.net/1942/17947
ISBN: 0-89791-408-2
DOI: 10.1145/96749.98254
Category: C1
Type: Proceedings Paper
Appears in Collections:Research publications

Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.