Assessing Word Commonness - Adding dispersion to frequency

Paulsen, Mikkel Ekeland

dc.contributor.author	Paulsen, Mikkel Ekeland
dc.date.accessioned	2023-01-26T08:27:25Z
dc.date.available	2023-01-26T08:27:25Z
dc.date.created	2022-12-02T11:45:26Z
dc.date.issued	2022
dc.identifier.issn	1384-6655
dc.identifier.uri	https://hdl.handle.net/11250/3046458
dc.description.abstract	The article investigates the two main corpus indicators of word commonness, frequency and dispersion, through a cross-validation analysis of frequency and four dispersion measures (‘Range’, ‘Chi-squared’, ‘Deviation of Proportions’ and ‘Juilland’s D’). The approach provides an estimation of the capacity of the named measures to predict the distribution of corpus items in an extracted language sample. Based on a dataset of 273 Norwegian compounds, the results show that especially Deviation of Proportions is a robust measure of dispersion that can be used in conjunction with frequency to substantiate assertions of word commonness based on corpus data. In addition, dispersion measures do not only reflect what sort of distribution the frequency statistic is generated from, but also how reliable the frequency estimation in the corpus sample is in terms of giving an accurate representation of frequency in the language variety that the corpus is sampled from.	en_US
dc.language.iso	eng	en_US
dc.publisher	John Benjamins Publishing	en_US
dc.title	Assessing Word Commonness - Adding dispersion to frequency	en_US
dc.type	Journal article	en_US
dc.type	Peer reviewed	en_US
dc.description.version	acceptedVersion	en_US
dc.rights.holder	Copyright John Benjamins Publishing Company	en_US
cristin.ispublished	true
cristin.fulltext	postprint
cristin.qualitycode	2
dc.identifier.doi	10.1075/ijcl.21037.eke
dc.identifier.cristin	2087698
dc.source.journal	International Journal of Corpus Linguistics	en_US
dc.identifier.citation	International Journal of Corpus Linguistics. 2022.	en_US

Tilhørende fil(er)

Filnavn:: Revised+Assessing+Word+Commonn ...
Størrelse:: 829.6Kb
Format:: PDF
Beskrivelse:: accepted version

Åpne

Denne innførselen finnes i følgende samling(er)

Department of Linguistics, Literary and Aestetic Studies [950]
Registrations from Cristin [9616]

Vis enkel innførsel