Detailed Information

Verification of Korean Pre-trained Models' Feasibility of Syntactic Research Using Pairwise Sentences (최소대립 문장쌍을 활용한 한국어 사전학습모델의 통사 연구 활용 가능성 검증)

Other Titles
Verification of Korean Pre-trained Models' Feasibility of Syntactic Research Using Pairwise Sentences
Authors
박권식; 김성태; 송상헌
Issue Date
2021
Publisher
Korean Society for Language and Information (한국언어정보학회)
Keywords
BERT; acceptability judgment; correlation coefficient; deep learning; minimal pair
Citation
Language and Information (언어와 정보), v.25, no.3, pp.1-21
Indexed
KCI
Journal Title
Language and Information (언어와 정보)
Volume
25
Number
3
Start Page
1
End Page
21
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/138318
ISSN
1226-7430
Abstract
Syntactic studies use minimal pairs of sentences as an argumentation tool because such pairs isolate the constraint of interest. Likewise, a set of minimal pairs is useful in deep learning experiments that assess the syntactic ability of neural language models. In this context, this study examines whether Korean deep learning models can properly distinguish well-formed expressions from their ill-formed counterparts. At the same time, it examines the feasibility of using a language resource constructed by the Korean government with deep learning architectures. The research is threefold. First, we conducted an acceptability judgment experiment to verify whether, and to what extent, the language resource used in this study is trustworthy. The results indicate that the judgments provided in the language resource converge well enough with the judgments from our own experiment. Second, we employed four Korean models (mBERT, KoBERT, KR-BERT, and KorBERT) to evaluate how well the language resource can be used to predict the well-formedness of Korean expressions. The models yield different results, and the reasons for the differences are discussed in full. Third, we used an independent test set to evaluate the deep learning systems. The task remains challenging for these systems, which implies that current Korean models have room for improvement in handling syntactic phenomena.
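As a rough illustration of the kind of evaluation the abstract describes, the sketch below computes two quantities commonly used in minimal-pair studies: the share of pairs in which a model scores the well-formed sentence higher than its ill-formed counterpart, and a correlation coefficient between human ratings and model scores. The abstract does not state how sentences were scored or which coefficient was used, so everything here is an assumption: the function names, the use of Pearson's r, and the toy numbers are hypothetical and are not taken from the paper.

```python
from math import sqrt


def minimal_pair_accuracy(pairs):
    """Fraction of (good, bad) score pairs where the well-formed sentence scores higher."""
    correct = sum(1 for good, bad in pairs if good > bad)
    return correct / len(pairs)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical per-sentence model scores (e.g. log-probabilities), NOT results from the paper.
# Each tuple is (score of well-formed sentence, score of ill-formed sentence).
pair_scores = [(-12.1, -15.4), (-9.8, -9.1), (-20.3, -24.0), (-7.5, -11.2)]

# Hypothetical mean human acceptability ratings for the same eight sentences,
# listed in the same order as the flattened model scores below.
human_ratings = [4.6, 1.8, 3.9, 2.2, 4.1, 1.5, 4.8, 2.0]
model_scores = [score for pair in pair_scores for score in pair]

print("minimal-pair accuracy:", minimal_pair_accuracy(pair_scores))
print("Pearson r (human vs. model):", round(pearson(human_ratings, model_scores), 3))
```

With real data, `pair_scores` would come from whichever sentence-scoring procedure is applied to a model, and `human_ratings` from an acceptability judgment experiment.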
Appears in Collections
College of Liberal Arts > Department of Linguistics > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
