Probabilistic Context-Free Grammar Rules based on Sejong Korean Treebank (세종 구문분석 말뭉치를 기반으로 한 확률 문맥자유문법 규칙)
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 최재웅 | - |
dc.contributor.author | 송상헌 | - |
dc.contributor.author | 전지은 | - |
dc.date.accessioned | 2021-09-09T14:57:38Z | - |
dc.date.available | 2021-09-09T14:57:38Z | - |
dc.date.created | 2021-06-17 | - |
dc.date.issued | 2008 | - |
dc.identifier.issn | 1226-8011 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/125070 | - |
dc.description.abstract | The Sejong Korean Treebank (SKT) was built as part of the ten-year, government-sponsored Sejong project, and a parsed Korean corpus of more than 80 million graphic words was released to the public at the end of 2007. The purpose of this paper is to extract Context-Free Grammar (CFG) rules from SKT and to draw linguistic generalizations from them. We introduce the extraction algorithm used in this study and show that it meets the minimal requirements for an objective extraction method in terms of precision and recall. Our discussion of the extracted CFG rules then proceeds in terms of the minimal tree structure, which consists of a mother node (MN) and its two daughter nodes (left daughter node, LDN; right daughter node, RDN). We arrive at various linguistic or stochastic generalizations that restrict the distribution of categories in the minimal tree structure for Korean, for example: 'In more than 95% of the cases that involve S, VP, NP, VNP, and AP, MN and RDN share the same category.' We also provide detailed statistical information on the basic properties of SKT and the CFG rules derived from it. | - |
dc.language | Korean | - |
dc.language.iso | ko | - |
dc.publisher | Research Institute for Language and Information, Korea University (고려대학교 언어정보연구소) | - |
dc.title | 세종 구문분석 말뭉치를 기반으로 한 확률 문맥자유문법 규칙 | - |
dc.title.alternative | Probabilistic Context-Free Grammar Rules based on Sejong Korean Treebank | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | 최재웅 | - |
dc.identifier.bibliographicCitation | 언어정보, no. 9, pp. 87-139 | - |
dc.relation.isPartOf | 언어정보 | - |
dc.citation.title | 언어정보 | - |
dc.citation.number | 9 | - |
dc.citation.startPage | 87 | - |
dc.citation.endPage | 139 | - |
dc.type.rims | ART | - |
dc.identifier.kciid | ART001343515 | - |
dc.description.journalClass | 2 | - |
dc.subject.keywordAuthor | Sejong Korean Treebank | - |
dc.subject.keywordAuthor | probabilistic context-free grammar rules | - |
dc.subject.keywordAuthor | corpus | - |
dc.subject.keywordAuthor | frequency | - |
dc.subject.keywordAuthor | parsed nodes | - |
dc.subject.keywordAuthor | trees | - |
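
The abstract describes counting CFG rules over minimal tree structures (a mother node with two daughter nodes) and measuring how often MN and RDN share a category. The following is a minimal sketch of that kind of computation, not the authors' extraction algorithm. It assumes trees are available as simple bracketed strings; the `TOY_TREES` data, the helper names (`parse`, `binary_rules`, `rule_probabilities`, `share_mn_equals_rdn`), and the relative-frequency estimate of rule probabilities are all illustrative assumptions rather than details taken from the paper.

```python
from collections import Counter


def parse(s):
    """Parse a bracketed tree such as '(S (NP a) (VP b))' into (label, children)."""
    tokens = s.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def node():
        nonlocal pos
        assert tokens[pos] == "("
        pos += 1
        label = tokens[pos]
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(node())       # nested subtree
            else:
                children.append(tokens[pos])  # leaf word
                pos += 1
        pos += 1  # consume the closing ")"
        return (label, children)

    return node()


def binary_rules(tree):
    """Yield (MN, LDN, RDN) for every node with exactly two subtree daughters."""
    label, children = tree
    subtrees = [c for c in children if isinstance(c, tuple)]
    if len(subtrees) == 2:
        yield (label, subtrees[0][0], subtrees[1][0])
    for c in subtrees:
        yield from binary_rules(c)


def rule_probabilities(rule_counts):
    """Relative-frequency estimate: count(MN -> LDN RDN) / count(MN)."""
    mother_totals = Counter()
    for (mn, _, _), n in rule_counts.items():
        mother_totals[mn] += n
    return {r: n / mother_totals[r[0]] for r, n in rule_counts.items()}


def share_mn_equals_rdn(rule_counts):
    """Fraction of rule tokens whose mother and right daughter share a category."""
    total = sum(rule_counts.values())
    same = sum(n for (mn, _, rdn), n in rule_counts.items() if mn == rdn)
    return same / total if total else 0.0


# Hypothetical toy trees, NOT taken from the Sejong Korean Treebank.
TOY_TREES = [
    "(S (NP w1) (VP (NP w2) (VP w3)))",
    "(VP (AP w4) (VP w5))",
]

counts = Counter(r for t in TOY_TREES for r in binary_rules(parse(t)))
probs = rule_probabilities(counts)

for (mn, ldn, rdn), p in sorted(probs.items()):
    print(f"{mn} -> {ldn} {rdn}  p={p:.2f}")
print(f"MN == RDN share: {share_mn_equals_rdn(counts):.2f}")
```

On real treebank data the category labels, the treatment of function tags, and any filtering of non-binary nodes would need to follow the conventions of the corpus itself.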