Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/743
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | EGGHE, Leo | - |
dc.date.accessioned | 2005-04-26T09:21:07Z | - |
dc.date.available | 2005-04-26T09:21:07Z | - |
dc.date.issued | 2005 | - |
dc.identifier.citation | MATHEMATICAL AND COMPUTER MODELLING, 41. p. 807-823 | - |
dc.identifier.issn | 0895-7177 | - |
dc.identifier.uri | http://hdl.handle.net/1942/743 | - |
dc.description.abstract | N-grams are generalized words consisting of N consecutive symbols (letters), as they are used in a text. N-word phrases are general concepts consisting of N consecutive words, also as used in a text. Given the rank-frequency function of single letters (i.e. 1-grams) or of single words (i.e. 1-word phrases) being Zipfian, we determine in this paper the exact rank-frequency function (i.e. the occurrence of N-grams or N-word phrases on each rank) and size-frequency distribution (i.e. the density of N-grams or N-word phrases on each occurrence density) of these N-grams and N-word phrases. This paper distinguishes itself from other ones on this topic by allowing no approximations in the calculations. This leads to an intricate rank-frequency function for N-grams and N-word phrases (as we knew before from unpublished calculations) but leads surprisingly, to a very simple size-frequency function f(N) for N-grams or N-word phrases. | - |
dc.format.extent | 299128 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.language.iso | en | - |
dc.publisher | ELSEVIER | - |
dc.subject.other | N-gram; N-word phrase; Rank-frequency distribution; Size-frequency distribution; Zipfian distribution | - |
dc.title | The exact rank-frequency function and size-frequency function of N-grams and N-word phrases with applications | - |
dc.type | Journal Contribution | - |
dc.identifier.epage | 823 | - |
dc.identifier.spage | 807 | - |
dc.identifier.volume | 41 | - |
local.bibliographicCitation.jcat | A1 | - |
local.type.refereed | Refereed | - |
local.type.specified | Article | - |
dc.bibliographicCitation.oldjcat | A1 | - |
dc.identifier.doi | 10.1016/j.mcm.2003.12.016 | - |
dc.identifier.isi | 000229364100015 | - |
item.contributor | EGGHE, Leo | - |
item.fullcitation | EGGHE, Leo (2005) The exact rank-frequency function and size-frequency function of N-grams and N-word phrases with applications. In: MATHEMATICAL AND COMPUTER MODELLING, 41. p. 807-823. | - |
item.accessRights | Open Access | - |
item.fulltext | With Fulltext | - |
item.validation | ecoom 2006 | - |
crisitem.journal.issn | 0895-7177 | - |
Appears in Collections: | Research publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
exact 1.pdf Restricted Access | Published version | 824.36 kB | Adobe PDF | View/Open Request a copy |
exact 2.pdf | Peer-reviewed author version | 502.57 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.