Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/743
Full metadata record
DC Field | Value | Language
dc.contributor.author | EGGHE, Leo | -
dc.date.accessioned | 2005-04-26T09:21:07Z | -
dc.date.available | 2005-04-26T09:21:07Z | -
dc.date.issued | 2005 | -
dc.identifier.citation | MATHEMATICAL AND COMPUTER MODELLING, 41. p. 807-823 | -
dc.identifier.issn | 0895-7177 | -
dc.identifier.uri | http://hdl.handle.net/1942/743 | -
dc.description.abstract | N-grams are generalized words consisting of N consecutive symbols (letters), as they are used in a text. N-word phrases are general concepts consisting of N consecutive words, also as used in a text. Given the rank-frequency function of single letters (i.e. 1-grams) or of single words (i.e. 1-word phrases) being Zipfian, we determine in this paper the exact rank-frequency function (i.e. the occurrence of N-grams or N-word phrases on each rank) and size-frequency distribution (i.e. the density of N-grams or N-word phrases on each occurrence density) of these N-grams and N-word phrases. This paper distinguishes itself from other ones on this topic by allowing no approximations in the calculations. This leads to an intricate rank-frequency function for N-grams and N-word phrases (as we knew before from unpublished calculations) but leads, surprisingly, to a very simple size-frequency function f(N) for N-grams or N-word phrases. | -
dc.format.extent | 299128 bytes | -
dc.format.mimetype | application/pdf | -
dc.language.iso | en | -
dc.publisher | ELSEVIER | -
dc.subject.other | N-gram; N-word phrase; Rank-frequency distribution; Size-frequency distribution; Zipfian distribution | -
dc.title | The exact rank-frequency function and size-frequency function of N-grams and N-word phrases with applications | -
dc.type | Journal Contribution | -
dc.identifier.epage | 823 | -
dc.identifier.spage | 807 | -
dc.identifier.volume | 41 | -
local.bibliographicCitation.jcat | A1 | -
local.type.refereed | Refereed | -
local.type.specified | Article | -
dc.bibliographicCitation.oldjcat | A1 | -
dc.identifier.doi | 10.1016/j.mcm.2003.12.016 | -
dc.identifier.isi | 000229364100015 | -
item.fullcitation | EGGHE, Leo (2005) The exact rank-frequency function and size-frequency function of N-grams and N-word phrases with applications. In: MATHEMATICAL AND COMPUTER MODELLING, 41. p. 807-823. | -
item.validation | ecoom 2006 | -
item.accessRights | Open Access | -
item.fulltext | With Fulltext | -
item.contributor | EGGHE, Leo | -
crisitem.journal.issn | 0895-7177 | -
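
Illustration of the terms used in the abstract: the following is a minimal Python sketch, not taken from the paper, that counts N-grams (or N-word phrases) in a sample and tabulates an empirical rank-frequency list and size-frequency table. The function names (`ngrams`, `rank_frequency`, `size_frequency`) and the sample string are assumptions made for illustration only. The paper derives these functions analytically from a Zipfian assumption; the sketch only counts occurrences in a given text.

```python
from collections import Counter

def ngrams(symbols, n):
    """Return every run of n consecutive symbols (letters or words) as a tuple."""
    return [tuple(symbols[i:i + n]) for i in range(len(symbols) - n + 1)]

def rank_frequency(items):
    """Occurrence counts in decreasing order: position r-1 holds the count at rank r."""
    return sorted(Counter(items).values(), reverse=True)

def size_frequency(items):
    """Map each occurrence count j to the number of distinct items occurring j times."""
    return dict(sorted(Counter(Counter(items).values()).items()))

# Hypothetical sample: 2-grams over the letters of a short string.
bigrams = ngrams("abracadabra", 2)
print(rank_frequency(bigrams))   # [2, 2, 2, 1, 1, 1, 1]
print(size_frequency(bigrams))   # {1: 4, 2: 3}
```

Passing a list of words instead of a string, for example `ngrams("the cat sat".split(), 2)`, yields N-word phrases in the same way.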
Appears in Collections: Research publications
Files in This Item:
File | Description | Size | Format
exact 1.pdf (Restricted Access) | Published version | 824.36 kB | Adobe PDF
exact 2.pdf | Peer-reviewed author version | 502.57 kB | Adobe PDF

Scopus citations: 2 (checked on Sep 3, 2020)
Web of Science citations: 2 (checked on Jul 18, 2024)
Page views: 50 (checked on Sep 5, 2022)
Downloads: 118 (checked on Sep 5, 2022)

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.