An all-words sense tagging method for resource-deficient languages

Yi, Bong-Jun; Lee, Do-Gil; Rim, Hae-Chang

doi:10.1093/llc/fqw031


Full metadata record

DC Field	Value	Language
dc.contributor.author	Yi, Bong-Jun	-
dc.contributor.author	Lee, Do-Gil	-
dc.contributor.author	Rim, Hae-Chang	-
dc.date.accessioned	2021-09-03T02:14:13Z	-
dc.date.available	2021-09-03T02:14:13Z	-
dc.date.created	2021-06-16	-
dc.date.issued	2017-09	-
dc.identifier.issn	2055-7671	-
dc.identifier.uri	https://scholar.korea.ac.kr/handle/2021.sw.korea/82363	-
dc.description.abstract	All-words sense tagging is the task of determining the correct senses of all content words in a given text. Many methods utilizing various language resources, such as a machine-readable dictionary (MRD), a sense-tagged corpus, and WordNet, have been proposed for assigning senses to all words rather than to a small number of sample words. However, sense tagging methods that require vast resources cannot be used for resource-deficient languages. The conventional sense tagging method for resource-deficient languages, which utilizes only an MRD, suffers from low recall and low precision because it determines senses only when a gloss word in the dictionary exactly matches a context word. In this study, we propose an all-words sense tagging method that is effective for resource-deficient languages in particular. It requires an MRD, which is the essential resource for all-words sense tagging, and a raw corpus, which is easily acquired and freely available. The proposed sense tagging method attempts to find semantically related context words based on the co-occurrence information extracted from the raw corpus and utilizes these words for tagging the senses of the target word. The experimental results of an evaluation of the proposed sense tagging algorithm on a Korean test corpus consisting of approximately 15 million words show that it can tag senses to all content words automatically with high precision. Furthermore, we also show that a semantic concordancer can be developed based on the automatically sense-tagged corpus.	-
dc.language	English	-
dc.language.iso	en	-
dc.publisher	OXFORD UNIV PRESS	-
dc.title	An all-words sense tagging method for resource-deficient languages	-
dc.type	Article	-
dc.contributor.affiliatedAuthor	Lee, Do-Gil	-
dc.contributor.affiliatedAuthor	Rim, Hae-Chang	-
dc.identifier.doi	10.1093/llc/fqw031	-
dc.identifier.scopusid	2-s2.0-85028692839	-
dc.identifier.wosid	000417907800013	-
dc.identifier.bibliographicCitation	DIGITAL SCHOLARSHIP IN THE HUMANITIES, v.32, no.3, pp.672 - 688	-
dc.relation.isPartOf	DIGITAL SCHOLARSHIP IN THE HUMANITIES	-
dc.citation.title	DIGITAL SCHOLARSHIP IN THE HUMANITIES	-
dc.citation.volume	32	-
dc.citation.number	3	-
dc.citation.startPage	672	-
dc.citation.endPage	688	-
dc.type.rims	ART	-
dc.type.docType	Article	-
dc.description.journalClass	1	-
dc.description.journalRegisteredClass	ssci	-
dc.description.journalRegisteredClass	ahci	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Arts & Humanities - Other Topics	-
dc.relation.journalResearchArea	Linguistics	-
dc.relation.journalWebOfScienceCategory	Humanities, Multidisciplinary	-
dc.relation.journalWebOfScienceCategory	Linguistics	-
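
The abstract contrasts exact gloss matching with matching through co-occurrence information from a raw corpus. The following is a minimal sketch of that general idea, not the authors' algorithm: the record does not specify the relatedness measure, so a sentence-level Dice coefficient is assumed here, and all function names (`cooccurrence_counts`, `relatedness`, `tag_sense`) and the English toy data are hypothetical, chosen only for illustration.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(sentences):
    """Count words and unordered word pairs that share a sentence."""
    word_counts = Counter()
    pair_counts = Counter()
    for tokens in sentences:
        unique = sorted(set(tokens))
        word_counts.update(unique)
        pair_counts.update(combinations(unique, 2))
    return word_counts, pair_counts


def relatedness(a, b, word_counts, pair_counts):
    """Dice coefficient over sentence co-occurrence (an assumed measure)."""
    if a == b:
        return 1.0  # an exact match counts as maximally related
    pair = (a, b) if a < b else (b, a)
    denom = word_counts[a] + word_counts[b]
    return 2.0 * pair_counts[pair] / denom if denom else 0.0


def tag_sense(target, context, glosses, word_counts, pair_counts):
    """Pick the sense whose gloss words are most related to the context.

    Returns (sense_id, scores); sense_id is None when no context word is
    related to any gloss word, i.e. the tagger abstains.
    """
    scores = {}
    for sense_id, gloss_words in glosses[target].items():
        scores[sense_id] = sum(
            max(
                (relatedness(c, g, word_counts, pair_counts) for g in gloss_words),
                default=0.0,
            )
            for c in context
            if c != target
        )
    best = max(scores, key=scores.get)
    return (best if scores[best] > 0 else None), scores


# Toy raw corpus and toy dictionary glosses (illustrative data only).
RAW_CORPUS = [
    ["deposit", "money", "account"],
    ["loan", "money", "interest"],
    ["boat", "river", "water"],
    ["fish", "river", "water"],
]
GLOSSES = {
    "bank": {
        "bank_1": ["financial", "institution", "money"],
        "bank_2": ["land", "river", "edge"],
    }
}

WORD_COUNTS, PAIR_COUNTS = cooccurrence_counts(RAW_CORPUS)
sense, sense_scores = tag_sense(
    "bank", ["deposit", "bank", "loan"], GLOSSES, WORD_COUNTS, PAIR_COUNTS
)
print(sense, sense_scores)
```

In this toy context, neither "deposit" nor "loan" appears in any gloss, so exact matching alone would leave the target untagged; the co-occurrence statistics link both words to "money" and let the financial sense score above zero.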

Files in This Item: There are no files associated with this item.

Appears in Collections: Associate Research Center > Research Institute of Korean Studies > 1. Journal Articles; College of Informatics > Department of Computer Science and Engineering > 1. Journal Articles
