언어 텍스트에 나타나는 벤포드 법칙: 원리와 응용Benford's Law in Linguistic Texts: Its Princi- ple and Applications
- Other Titles
- Benford's Law in Linguistic Texts: Its Princi- ple and Applications
- Authors
- 홍정하
- Issue Date
- 2010
- Publisher
- 한국언어정보학회
- Keywords
- 벤포드 법칙(Benford' s Law); 텍스트(texts); 말뭉치(corpora); 형태소(morphemes); 빈도 목록(frequency lists); 텍스트 분포 원리(principle of text distribution); 복잡계(complex systems); 저빈도(low-frequency)
- Citation
- 언어와 정보, v.14, no.1, pp.145 - 163
- Indexed
- KCI
- Journal Title
- 언어와 정보
- Volume
- 14
- Number
- 1
- Start Page
- 145
- End Page
- 163
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/117592
- ISSN
- 1226-7430
- Abstract
- This paper aims to propose that Benford's Law, non-uniform distribution of the leading digits in lists of numbers from many real-life sources, also appears in linguistic texts. The rst digits in the frequency lists of morphemes from Sejong Morphologically Analyzed Corpora represent non-uniform distribution following Benford's Law, but showing complexity of numerical sources from complex systems like earthquakes. Benford's Law in texts is a principle re ecting regular distribution of low-frequency linguistic types, called LNRE(large number of rare events), and governing texts, corpora, or sample texts relatively independent of text sizes and the number of types. Although texts share a similar distribution pattern by Benford's Law, we can investigate non-uniform distribution slightly varied from text to text that provides useful applications to evaluate randomness of texts distribution focused on low-frequency types.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - ETC > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.