Structural Analysis of Korean Complex Nominals Using Language Resources (언어 자료를 활용한 한국어 복합명사 구조 분석)
- Authors
- 김동성
- Issue Date
- 2011
- Publisher
- 대한언어학회
- Keywords
- Morphology; Collocation; Complex Nominals; Computational Linguistics; Corpus Statistics; Computational Morphology; Endocentricity/Exocentricity; Bracketing
- Citation
- 언어학, v.19, no.3, pp.129 - 150
- Indexed
- KCI
- Journal Title
- 언어학
- Volume
- 19
- Number
- 3
- Start Page
- 129
- End Page
- 150
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/114538
- DOI
- 10.24303/lakdoi.2011.19.3.129
- ISSN
- 1225-7141
- Abstract
- This paper presents an analysis of the structure of Korean complex nominals. The analysis is of interest to both theoretical linguistics and language-processing areas such as information retrieval and speech synthesis. Our approach has three stages. First, we separate endocentric data from exocentric data using human intuition. Since exocentric data have no internal structure, we exclude them from further analysis. Second, we run a bracketing experiment on the endocentric data to represent the hierarchical structure of the constituent parts, using statistical collocation measurements based on the 10 million Sejong corpus. The last stage consists of several processes that determine head-modifier or predicate-argument relations, using the argument structure and selectional restrictions specified in the Sejong electronic dictionary. Our method draws not only on corpus-based materials but also on linguistic knowledge (with intuition-based judgement). The significance of our approach is that it shows how language resources can be used to apply linguistic knowledge in analyzing linguistic data.
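The bracketing stage mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which collocation measure the paper uses, so this example assumes pointwise mutual information (PMI) and a simple adjacency comparison for a three-noun compound. The function names, the English example words, and all counts are hypothetical and are not taken from the paper or from the Sejong corpus.

```python
import math


def pmi(pair_count, left_count, right_count, total):
    """Pointwise mutual information (in bits) of an adjacent word pair.

    Assumes all counts are nonzero; a real system would need smoothing.
    """
    p_pair = pair_count / total
    p_left = left_count / total
    p_right = right_count / total
    return math.log2(p_pair / (p_left * p_right))


def bracket(nouns, unigrams, bigrams, total):
    """Bracket a three-noun compound by comparing the two adjacent pairs.

    Returns ((n1, n2), n3) if the left pair is the stronger collocation,
    otherwise (n1, (n2, n3)).
    """
    n1, n2, n3 = nouns
    left = pmi(bigrams[(n1, n2)], unigrams[n1], unigrams[n2], total)
    right = pmi(bigrams[(n2, n3)], unigrams[n2], unigrams[n3], total)
    if left >= right:
        return ((n1, n2), n3)
    return (n1, (n2, n3))


# Hypothetical toy counts, invented for illustration only.
TOTAL = 10_000
UNIGRAMS = {"computer": 200, "science": 100, "department": 50}
BIGRAMS = {("computer", "science"): 40, ("science", "department"): 5}

result = bracket(("computer", "science", "department"), UNIGRAMS, BIGRAMS, TOTAL)
print(result)
```

With these invented counts the left pair has the higher PMI, so the compound is grouped as a left-branching structure. Other association measures (for example a log-likelihood ratio) could be swapped in for `pmi` without changing the comparison logic.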