Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Minimizing Human Intervention for Constructing Korean Part-of-Speech Tagged Corpus

Authors
Lee, Do-GilHong, GumwonLee, Seok KeeRim, Hae-Chang
Issue Date
8월-2010
Publisher
IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG
Keywords
part-of-speech tagging; morphological analysis; part-of-speech tagged corpus
Citation
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, v.E93D, no.8, pp.2336 - 2338
Indexed
SCIE
SCOPUS
Journal Title
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Volume
E93D
Number
8
Start Page
2336
End Page
2338
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/115989
DOI
10.1587/transinf.E93.D.2336
ISSN
0916-8532
Abstract
The construction of annotated corpora requires considerable manual effort. This paper presents a pragmatic method to minimize human intervention for the construction of Korean part-of-speech (POS) tagged corpus. Instead of focusing on improving the performance of conventional automatic POS taggers, we devise a discriminative POS tagger which can selectively produce either a single analysis or multiple analyses based on the tagging reliability. The proposed approach uses two decision rules to judge the tagging reliability. Experimental results show that the proposed approach can effectively control the quality of corpus and the amount of manual annotation by the threshold value of the rule.
Files in This Item
There are no files associated with this item.
Appears in
Collections
Associate Research Center > Research Institute of Korean Studies > 1. Journal Articles
College of Informatics > Department of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE