단어와 자소 기반 합성곱 신경망을 이용한 문서 분류Text Classification based on Convolutional Neural Network with Word and Character Level
- Other Titles
- Text Classification based on Convolutional Neural Network with Word and Character Level
- Authors
- 모경현; 박재선; 장명준; 강필성
- Issue Date
- 2018
- Publisher
- 대한산업공학회
- Keywords
- Document Classification; Convolutional Neural Network; Word Embedding; Character Embedding; Naïve bayes; Logistic Regression
- Citation
- 대한산업공학회지, v.44, no.3, pp.180 - 188
- Indexed
- KCI
- Journal Title
- 대한산업공학회지
- Volume
- 44
- Number
- 3
- Start Page
- 180
- End Page
- 188
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/79760
- DOI
- 10.7232/JKIIE.2018.44.3.180
- ISSN
- 1225-0988
- Abstract
- Documents classification aims to analyze keywords or contextual meanings from a given document and classify them into specific categories. In order to successfully perform document classification, it is necessary to accurately extract the word information included in a given document. However, there are many variations of Korean words depending on the types of postposition, rooting and ending. In the case of online documents, these variations become even more severe. Considering the characteristics of these Korean documents, in this paper we propose a document classification method using both word and character information. By using character information, it is possible to consider information that was difficult to express by word set such as typos and emoticons in the document classification process. This model, which combines the features of the whole sentence obtained from the word information and the local features obtained from the character information, experimentally confirmed that it has higher classification performance than the existing models using only word information.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > School of Industrial and Management Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.