다국어 BERT를 활용한 한국어 자연어 질의의 SQL 변환
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 윤훈상 | - |
dc.contributor.author | 허재혁 | - |
dc.contributor.author | 김정섭 | - |
dc.contributor.author | 강필성 | - |
dc.date.accessioned | 2022-06-11T20:40:58Z | - |
dc.date.available | 2022-06-11T20:40:58Z | - |
dc.date.created | 2022-06-10 | - |
dc.date.issued | 2022 | - |
dc.identifier.issn | 1225-0988 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/142064 | - |
dc.description.abstract | Text-to-SQL is one of semantic parsing methods that converts natural language questions into SQL queries, and it aims to extract data from any relational database without knowledge of SQL query configuration. Although development of large amounts of datasets (WikiSQL, SPIDER) and development of pre-trained language models (BERT) contributed to the improvement of Text-to-SQL performance in English, language-specific dataset construction and model research have not been much progressed. Therefore, this study proposes a multilingual BERT-based Text-to-SQL methodology that converts the natural language question in Korean into SQL query for an English database. To this end, four strategies for translating Korean queries into English were explored, and their effectiveness was verified by applying each strategy to three text-to-SQL model structures. As a result of the experiment, it was confirmed that it showed a significant SQL generation performance even for Korean questions. The proposed methodology is meaningful in that it shows semantic inferences between database tables, column information, and questions composed of different languages are possible, and it is expected to support efficient database access by Korean users who lack proficiency in writing SQL queries. | - |
dc.language | Korean | - |
dc.language.iso | ko | - |
dc.publisher | 대한산업공학회 | - |
dc.title | 다국어 BERT를 활용한 한국어 자연어 질의의 SQL 변환 | - |
dc.title.alternative | Text-to-SQL for Korean Language based on Multilingual BERT | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | 강필성 | - |
dc.identifier.bibliographicCitation | 대한산업공학회지, v.48, no.1, pp.91 - 104 | - |
dc.relation.isPartOf | 대한산업공학회지 | - |
dc.citation.title | 대한산업공학회지 | - |
dc.citation.volume | 48 | - |
dc.citation.number | 1 | - |
dc.citation.startPage | 91 | - |
dc.citation.endPage | 104 | - |
dc.type.rims | ART | - |
dc.identifier.kciid | ART002810521 | - |
dc.description.journalClass | 2 | - |
dc.description.journalRegisteredClass | kci | - |
dc.subject.keywordAuthor | Text-to-SQL | - |
dc.subject.keywordAuthor | Multilingual BERT | - |
dc.subject.keywordAuthor | WikiSQL | - |
dc.subject.keywordAuthor | SQLova | - |
dc.subject.keywordAuthor | HydraNet | - |
dc.subject.keywordAuthor | Bridge | - |
dc.subject.keywordAuthor | Back Translation | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(02841) 서울특별시 성북구 안암로 14502-3290-1114
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.