Neural spelling correction: translating incorrect sentences to correct sentences for multimedia
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Park, Chanjun | - |
dc.contributor.author | Kim, Kuekyeng | - |
dc.contributor.author | Yang, YeongWook | - |
dc.contributor.author | Kang, Minho | - |
dc.contributor.author | Lim, Heuiseok | - |
dc.date.accessioned | 2022-04-02T23:40:34Z | - |
dc.date.available | 2022-04-02T23:40:34Z | - |
dc.date.created | 2022-04-01 | - |
dc.date.issued | 2021 | - |
dc.identifier.issn | 1380-7501 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/139627 | - |
dc.description.abstract | The aim of a spelling correction task is to detect spelling errors and automatically correct them. In this paper we aim to perform the Korean spelling correction task from a machine translation perspective, allowing it to overcome the limitations of cost, time and data. Based on a sequence to sequence model, the model aligns its source sentence with an 'error filled sentence' and its target sentence aligned to the correct counter part. Thus, 'translating' the error sentence to a correct sentence. For this research, we have also proposed three new data generation methods allowing the creation of multiple spelling correction parallel corpora from just a single monolingual corpus. Additionally, we discovered that applying the Copy Mechanism not only resolves the problem of overcorrection but even improves it. For this paper, we evaluated our model upon these aspects: Performance comparisons to other models and evaluation on overcorrection. The results show the proposed model to even out-perform other systems currently in commercial use. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | SPRINGER | - |
dc.title | Neural spelling correction: translating incorrect sentences to correct sentences for multimedia | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Lim, Heuiseok | - |
dc.identifier.doi | 10.1007/s11042-020-09148-2 | - |
dc.identifier.scopusid | 2-s2.0-85087374483 | - |
dc.identifier.wosid | 000543678900003 | - |
dc.identifier.bibliographicCitation | MULTIMEDIA TOOLS AND APPLICATIONS, v.80, no.26-27, pp.34591 - 34608 | - |
dc.relation.isPartOf | MULTIMEDIA TOOLS AND APPLICATIONS | - |
dc.citation.title | MULTIMEDIA TOOLS AND APPLICATIONS | - |
dc.citation.volume | 80 | - |
dc.citation.number | 26-27 | - |
dc.citation.startPage | 34591 | - |
dc.citation.endPage | 34608 | - |
dc.type.rims | ART | - |
dc.type.docType | Article; Early Access | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Software Engineering | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.subject.keywordAuthor | Korean spelling correction | - |
dc.subject.keywordAuthor | Automatic noise generation | - |
dc.subject.keywordAuthor | Neural machine translation | - |
dc.subject.keywordAuthor | Transformer | - |
dc.subject.keywordAuthor | Copy mechanism | - |
dc.subject.keywordAuthor | Overcorrection | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(02841) 서울특별시 성북구 안암로 14502-3290-1114
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.