최소대립 문장쌍을 활용한 한국어 사전학습모델의 통사 연구 활용 가능성 검증Verification of Korean Pre-trained Models' Feasibility of Syntactic Research Using Pairwise Sentences
- Other Titles
- Verification of Korean Pre-trained Models' Feasibility of Syntactic Research Using Pairwise Sentences
- Authors
- 박권식; 김성태; 송상헌
- Issue Date
- 2021
- Publisher
- 한국언어정보학회
- Keywords
- BERT; acceptability judgment; correlation coefficient; deep learning; minimal pair
- Citation
- 언어와 정보, v.25, no.3, pp.1 - 21
- Indexed
- KCI
- Journal Title
- 언어와 정보
- Volume
- 25
- Number
- 3
- Start Page
- 1
- End Page
- 21
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/138318
- ISSN
- 1226-7430
- Abstract
- Syntactic studies make use of the minimally pairwise sentences as an argumentation tool, because the pairs allow us to pay attention to the constraints of interest. Likewise, it is helpful to use a set of minimal pairs in deep learning-based experiments for assessing the syntactic ability of neural language models. In this context, this study verifies whether the deep learning Korean model has the ability to properly distinguish the well-formed expressions and the corresponding ill-formed expressions. In the meanwhile, this study serves to examine the feasibility of the language resource constructed by the Korean government for deep learning architecture. The research is three-fold. First, we conducted an acceptability judgment testing to verify whether and how the language resource used in this study is indeed trustworthy. The results indicate that the judgments provided in the language resource converge with the judgments of our own experiment well enough. Second, we employed four Korean models such as mBERT, KoBERT, KR-BERT, KorBERT in order to evaluate how the language resource has a potentiality to predict the well-formedness of Korean expressions. The different models yield different results, the reason of which is fully discussed. Third, we made use of an independent test-set for evaluating the deep learning systems. It turns out that the results are still challenging, which implies that the current Korean models may have room for improvement to understand the syntactic phenomena.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Liberal Arts > Department of Linguistics > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.