Plain Template Insertion: Korean-Prompt-Based Engineering for Few-Shot Learnersopen access
- Authors
- Seo, Jaehyung; Moon, Hyeonseok; Lee, Chanhee; Eo, Sugyeong; Park, Chanjun; Kim, Jihoon; Chun, Changwoo; Lim, Heuiseok
- Issue Date
- 2022
- Publisher
- IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
- Keywords
- Learning systems; Training; Natural language processing; Data models; Modeling; Semantics; Predictive models; Prompt-based learning; natural language processing; language modeling; Korean language understanding; few-shot
- Citation
- IEEE ACCESS, v.10, pp.107587 - 107597
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE ACCESS
- Volume
- 10
- Start Page
- 107587
- End Page
- 107597
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/145578
- DOI
- 10.1109/ACCESS.2022.3213027
- ISSN
- 2169-3536
- Abstract
- Prompt-based learning is a method used for language models to interpret natural language by remembering the prior knowledge acquired and the training objective. Recent prompt-based few-shot learners have achieved superior performance by alleviating the catastrophic forgetting that occurs in pretrained language models. Few-shot learning contributes towards solving the data scarcity problem, an enormous challenge in AI systems and a significant consideration in natural language processing research. In spite of the significance of few-shot learning, research on Korean language-based few-shot learning is insufficient, and whether the prompt-based approach is appropriate for the Korean language has not been thoroughly verified. As a step toward realizing a Korean-prompt-based few-shot learner, we attempt to apply prompt engineering to the Korean language understanding benchmark dataset and introduce plain template insertion to overcome data scarcity in a more practical few-shot setting. The contributions of this study are as follows: (1) presumably, this is the first study to apply prompt-based few-shot learning to Korean benchmark datasets. With 32 few-shot settings, it improves performance by +14.88, +29.04, and +1.81 in the natural language inference, semantic textual similarity, and topic classification tasks. (2) We present prompt engineering, which merely inserts a plain template and increases data efficiency without training example selection, augmentation, reformulation, and retrieval. (3) Our approach is robust to the Korean prompt's contextual information and sentence structure and is applicable to both hard- and soft-prompt.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Graduate School > Department of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.