구어체 적응 사전 학습을 통한 한국어 감정 분류 성능 향상
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 이정훈 | - |
dc.contributor.author | 김동화 | - |
dc.contributor.author | 노영빈 | - |
dc.contributor.author | 강필성 | - |
dc.date.accessioned | 2022-11-05T05:41:50Z | - |
dc.date.available | 2022-11-05T05:41:50Z | - |
dc.date.created | 2022-11-04 | - |
dc.date.issued | 2021 | - |
dc.identifier.issn | 1225-0988 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/144771 | - |
dc.description.abstract | Language models (LMs) pretrained on a large text corpus and fine-tuned on a task data have a remarkable performance for document classification task. Recently, an adaptive pretraining method that re-pretrains the pretrained LMs using an additional dataset in the same domain with the given task to make up the domain discrepancy has reported significant performance improvements. However, current adaptive pretraining methods only focus on the domain gap between pretraining data and fine-tuning data. The writing style is also different because the pretraining data, e.g., Wikipedia, is written in a literary style, but the task data, e.g., customer review, is usually written in a colloquial style. In this work, we propose a colloquial-adaptive pretraining method that re-pretrains the pretrained LM with informal sentences to generalize the LM to colloquial style. We verify the proposed method based on multi-emotion classification datasets. The experimental results show that the proposed method yields improved classification performance on both low- and high-resource data. | - |
dc.language | Korean | - |
dc.language.iso | ko | - |
dc.publisher | 대한산업공학회 | - |
dc.title | 구어체 적응 사전 학습을 통한 한국어 감정 분류 성능 향상 | - |
dc.title.alternative | Improving Korean Emotion Classification via Colloquial-Adaptive Pretraining | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | 강필성 | - |
dc.identifier.bibliographicCitation | 대한산업공학회지, v.47, no.4, pp.342 - 350 | - |
dc.relation.isPartOf | 대한산업공학회지 | - |
dc.citation.title | 대한산업공학회지 | - |
dc.citation.volume | 47 | - |
dc.citation.number | 4 | - |
dc.citation.startPage | 342 | - |
dc.citation.endPage | 350 | - |
dc.type.rims | ART | - |
dc.identifier.kciid | ART002743794 | - |
dc.description.journalClass | 2 | - |
dc.description.journalRegisteredClass | kci | - |
dc.subject.keywordAuthor | Natural Language Processing | - |
dc.subject.keywordAuthor | Transfer Learning | - |
dc.subject.keywordAuthor | Adaptive Pretraining | - |
dc.subject.keywordAuthor | Multi-Emotion Classification | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
145 Anam-ro, Seongbuk-gu, Seoul, 02841, Korea+82-2-3290-2963
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.