Three-phase text error correction model for Korean SMS messages
- Authors
- Byun, J.; Park, S.-Y.; Lee, S.-W.; Rim, H.-C.
- Issue Date
- 2009
- Publisher
- Institute of Electronics, Information and Communication, Engineers, IEICE
- Keywords
- SMS messages; Spelling errors; Text error correction; Word spacing errors
- Citation
- IEICE Transactions on Information and Systems, v.E92-D, no.5, pp.1213 - 1217
- Indexed
- SCOPUS
- Journal Title
- IEICE Transactions on Information and Systems
- Volume
- E92-D
- Number
- 5
- Start Page
- 1213
- End Page
- 1217
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/121921
- DOI
- 10.1587/transinf.E92.D.1213
- ISSN
- 0916-8532
- Abstract
- In this paper, we propose a three-phase text error correction model consisting of a word spacing error correction phase, a syllablebased spelling error correction phase, and a word-based spelling error correction phase. In order to reduce the text error correction complexity, the proposed model corrects text errors step by step. With the aim of correcting word spacing errors, spelling errors, and mixed errors in SMS messages, the proposed model tries to separately manage the word spacing error correction phase and the spelling error correction phase. For the purpose of utilizing both the syllable-based approach covering various errors and the word-based approach correcting some specific errors accurately, the proposed model subdivides the spelling error correction phase into the syllable-based phase and the word-based phase. Experimental results show that the proposed model can improve the performance by solving the text error correction problem based on the divide-and-conquer strategy. Copyright © 2009 The Institute of Electronics, Information and Communication Engineers.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Informatics > Department of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.