Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/743
Full metadata record
DC Field | Value | Language
dc.contributor.author | EGGHE, Leo | -
dc.date.accessioned | 2005-04-26T09:21:07Z | -
dc.date.available | 2005-04-26T09:21:07Z | -
dc.date.issued | 2005 | -
dc.identifier.citation | MATHEMATICAL AND COMPUTER MODELLING, 41. p. 807-823 | -
dc.identifier.issn | 0895-7177 | -
dc.identifier.uri | http://hdl.handle.net/1942/743 | -
dc.description.abstract | N-grams are generalized words consisting of N consecutive symbols (letters), as they are used in a text. N-word phrases are general concepts consisting of N consecutive words, also as used in a text. Given the rank-frequency function of single letters (i.e. 1-grams) or of single words (i.e. 1-word phrases) being Zipfian, we determine in this paper the exact rank-frequency function (i.e. the occurrence of N-grams or N-word phrases on each rank) and size-frequency distribution (i.e. the density of N-grams or N-word phrases on each occurrence density) of these N-grams and N-word phrases. This paper distinguishes itself from other ones on this topic by allowing no approximations in the calculations. This leads to an intricate rank-frequency function for N-grams and N-word phrases (as we knew before from unpublished calculations) but leads, surprisingly, to a very simple size-frequency function f(N) for N-grams or N-word phrases. | -
dc.format.extent | 299128 bytes | -
dc.format.mimetype | application/pdf | -
dc.language.iso | en | -
dc.publisher | ELSEVIER | -
dc.subject.other | N-gram; N-word phrase; Rank-frequency distribution; Size-frequency distribution; Zipfian distribution | -
dc.title | The exact rank-frequency function and size-frequency function of N-grams and N-word phrases with applications | -
dc.type | Journal Contribution | -
dc.identifier.epage | 823 | -
dc.identifier.spage | 807 | -
dc.identifier.volume | 41 | -
local.bibliographicCitation.jcat | A1 | -
local.type.refereed | Refereed | -
local.type.specified | Article | -
dc.bibliographicCitation.oldjcat | A1 | -
dc.identifier.doi | 10.1016/j.mcm.2003.12.016 | -
dc.identifier.isi | 000229364100015 | -
item.contributor | EGGHE, Leo | -
item.fullcitation | EGGHE, Leo (2005) The exact rank-frequency function and size-frequency function of N-grams and N-word phrases with applications. In: MATHEMATICAL AND COMPUTER MODELLING, 41. p. 807-823. | -
item.accessRights | Open Access | -
item.fulltext | With Fulltext | -
item.validation | ecoom 2006 | -
crisitem.journal.issn | 0895-7177 | -
Appears in Collections: Research publications
Files in This Item:
File | Description | Size | Format
exact 1.pdf (Restricted Access) | Published version | 824.36 kB | Adobe PDF
exact 2.pdf | Peer-reviewed author version | 502.57 kB | Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.