대규모 신문 기사의 자동 키워드 추출과 분석 -t-점수를 이용하여-
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 김일환 | - |
dc.contributor.author | 이도길 | - |
dc.date.accessioned | 2021-09-07T17:35:53Z | - |
dc.date.available | 2021-09-07T17:35:53Z | - |
dc.date.created | 2021-06-17 | - |
dc.date.issued | 2011 | - |
dc.identifier.issn | 1226-9123 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/113705 | - |
dc.description.abstract | Kim, Ilhwan & Lee, Do-Gil. 2011. 11. Automatic Keyword Extraction and Analysis from the Large Scale Newspaper Corpus Based on t-score. Korean Linguistics 53,145-194. As the type and size of documents radically increased in recent years, how to automatically extract proper keywords from those documents has also been important. This paper aims to propose an automatic method to extract keywords and to analyze their characteristics. The keywords are extracted from Trends 21 corpus, a collection of four major Korean daily newspapers issued from the year 2000 to 2009. We introduce t-score to measure the keywordness. The keywords were extracted from two aspects i.e. year and topic. We present the top 100 keywords for 6 topics and 10years. Also, to verify whether these keywords can be representatives of the texts, we compared them with the headline news of 2009. The two main contributions of this work are as follows: 1) this study can present keywords which are automatically extracted from large scaled corpora without any human intervention by the verifiable and objective method and 2) this study analyzed the characteristics of the keywords by topic and year. | - |
dc.language | Korean | - |
dc.language.iso | ko | - |
dc.publisher | 한국어학회 | - |
dc.title | 대규모 신문 기사의 자동 키워드 추출과 분석 -t-점수를 이용하여- | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | 김일환 | - |
dc.contributor.affiliatedAuthor | 이도길 | - |
dc.identifier.bibliographicCitation | 한국어학, v.53, pp.145 - 194 | - |
dc.relation.isPartOf | 한국어학 | - |
dc.citation.title | 한국어학 | - |
dc.citation.volume | 53 | - |
dc.citation.startPage | 145 | - |
dc.citation.endPage | 194 | - |
dc.type.rims | ART | - |
dc.identifier.kciid | ART001602390 | - |
dc.description.journalClass | 2 | - |
dc.description.journalRegisteredClass | kci | - |
dc.subject.keywordAuthor | 키워드(keyword) | - |
dc.subject.keywordAuthor | 키워드성(keywordness) | - |
dc.subject.keywordAuthor | 키워드 추출(extraction of keyword) | - |
dc.subject.keywordAuthor | 사용 빈도(frequency of use) | - |
dc.subject.keywordAuthor | t-점수(t-score) | - |
dc.subject.keywordAuthor | [물결 21] 코퍼스(Trends21 corpus | - |
dc.subject.keywordAuthor | 신문 기사(newspaper) | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(02841) 서울특별시 성북구 안암로 14502-3290-1114
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.