Utilizing the Web for Automatic Word Spacing

Authors
Hong, Gumwon; Lee, Jeong-Hoon; Song, Young-In; Lee, Do-Gil; Rim, Hae-Chang
Issue Date
December 2009
Publisher
IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG
Keywords
word spacing; word segmentation
Citation
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, v.E92D, no.12, pp.2553 - 2556
Indexed
SCIE
SCOPUS
Journal Title
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Volume
E92D
Number
12
Start Page
2553
End Page
2556
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/118763
DOI
10.1587/transinf.E92.D.2553
ISSN
1745-1361
Abstract
This paper presents a new approach to word spacing problems: mining reliable words from the Web and using them as additional resources. Conventional approaches to automatic word spacing use noise-free data to train the parameters of word spacing models. However, the insufficiency and irrelevancy of training examples are always the main bottleneck in automatic word spacing. To mitigate the data-sparseness problem, this paper proposes an algorithm that discovers reliable words on the Web to expand the vocabulary, and a model that uses those words as additional resources. The proposed approach is simple and practical to adapt to new domains. Experimental results show that the proposed approach outperforms conventional word spacing approaches.
Appears in Collections
Associate Research Center > Research Institute of Korean Studies > 1. Journal Articles
College of Informatics > Department of Computer Science and Engineering > 1. Journal Articles
