Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/787
Title: General study of the distribution of N-tuples of letters or words based on the distributions of the single letters or words
Authors: EGGHE, Leo 
Issue Date: 2000
Publisher: Elsevier
Source: Mathematical and Computer Modelling, 31(8-9). p. 35-41
Abstract: This paper establishes the general relation between the distribution of N-tuples of letters (e.g., N-truncations, N-grams) or words (e.g., N-word phrases) and the distributions of the single letters or words. Here the very general case is treated: the case where there is dependence on the place i in the N-tuple (i = 1,…, N) in the sense that, for each i = 1,…, N, a different distribution of the letters or words is supposed. Concrete calculations are performed in the important case of Zipfian distributions (i.e., power laws) for the single letters or words. In this case, we prove that the distribution of the N-tuples (N-fixed) is the sum of power laws.
Keywords: Zipf; N-word phrase; N-gram; N-truncation
Document URI: http://hdl.handle.net/1942/787
ISSN: 0895-7177
DOI: 10.1016/S0895-7177(00)00058-3
ISI #: 000087016000004
Category: A1
Type: Journal Contribution
Validations: ecoom 2001
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
general.pdfPeer-reviewed author version233.48 kBAdobe PDFView/Open
general 1.pdfPublished version452.84 kBAdobe PDFView/Open
Show full item record

SCOPUSTM   
Citations

4
checked on Sep 2, 2020

WEB OF SCIENCETM
Citations

5
checked on Apr 30, 2024

Page view(s)

54
checked on Sep 6, 2022

Download(s)

120
checked on Sep 6, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.