최소대립 문장쌍을 활용한 한국어 사전학습모델의 통사 연구 활용 가능성 검증

박권식; 김성태; 송상헌

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

최소대립 문장쌍을 활용한 한국어 사전학습모델의 통사 연구 활용 가능성 검증Verification of Korean Pre-trained Models' Feasibility of Syntactic Research Using Pairwise Sentences

Other Titles: Verification of Korean Pre-trained Models' Feasibility of Syntactic Research Using Pairwise Sentences

Authors: 박권식; 김성태; 송상헌

Issue Date: 2021

Publisher: 한국언어정보학회

Keywords: BERT; acceptability judgment; correlation coefficient; deep learning; minimal pair

Citation: 언어와 정보, v.25, no.3, pp.1 - 21

Indexed: KCI

Journal Title: 언어와 정보

Volume: 25

Number: 3

Start Page: 1

End Page: 21

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/138318

ISSN: 1226-7430

Abstract: Syntactic studies make use of the minimally pairwise sentences as an argumentation tool, because the pairs allow us to pay attention to the constraints of interest. Likewise, it is helpful to use a set of minimal pairs in deep learning-based experiments for assessing the syntactic ability of neural language models. In this context, this study verifies whether the deep learning Korean model has the ability to properly distinguish the well-formed expressions and the corresponding ill-formed expressions. In the meanwhile, this study serves to examine the feasibility of the language resource constructed by the Korean government for deep learning architecture. The research is three-fold. First, we conducted an acceptability judgment testing to verify whether and how the language resource used in this study is indeed trustworthy. The results indicate that the judgments provided in the language resource converge with the judgments of our own experiment well enough. Second, we employed four Korean models such as mBERT, KoBERT, KR-BERT, KorBERT in order to evaluate how the language resource has a potentiality to predict the well-formedness of Korean expressions. The different models yield different results, the reason of which is fully discussed. Third, we made use of an independent test-set for evaluating the deep learning systems. It turns out that the results are still challenging, which implies that the current Korean models may have room for improvement to understand the syntactic phenomena.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Liberal Arts > Department of Linguistics > 1. Journal Articles

Show full item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :8,448,593; Today View :24,897

RSS_1.0 RSS_2.0 ATOM_1.0

(02841) 서울특별시 성북구 안암로 14502-3290-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE