Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/17947
Title: | A new method for information retrieval, based on the theory of relative concentration |
Authors: | EGGHE, Leo |
Issue Date: | 1990 |
Publisher: | ACM |
Source: | Vidick, Jean-Luc (Ed.). Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval, p. 469-493 |
Abstract: | This paper introduces a new method for information retrieval of documents that are represented by a vector. The novelty of the algorithm lies in the fact that no (generalized) p-norms are used as a matching function between the query and the document (as is done e.g. by Salton and others) but a function that measures the relative dispersion of the terms between a document and a query. This function originates from an earlier paper by the author, in which a good measure of relative concentration was introduced; it is used in informetrics to measure the degree of specialization of a journal w.r.t. the entire subject. This new information retrieval algorithm is shown to have many desirable properties (in the sense of the new Cater-Kraft wish list), including those of the original cosine-matching function of Salton. In addition, the property of the cosine-matching function that, if one only uses weights 0 or 1, one is reduced to Boolean IR, is refined in the sense that one takes into consideration the broadness or specialization of a document and a query. Our new matching function satisfies these additional properties. |
Document URI: | http://hdl.handle.net/1942/17947 |
ISBN: | 0-89791-408-2 |
DOI: | 10.1145/96749.98254 |
Category: | C1 |
Type: | Proceedings Paper |
Appears in Collections: | Research publications |