워드 임베딩과 단어 네트워크 분석을 활용한비지도학습 기반의 문서 다중 범주 가중치 산출 : 휴대폰 리뷰 사례를 중심으로
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 정재윤 | - |
dc.contributor.author | 모경현 | - |
dc.contributor.author | 서승완 | - |
dc.contributor.author | 김창엽 | - |
dc.contributor.author | 김해동 | - |
dc.contributor.author | 강필성 | - |
dc.date.accessioned | 2021-09-02T18:19:03Z | - |
dc.date.available | 2021-09-02T18:19:03Z | - |
dc.date.created | 2021-06-17 | - |
dc.date.issued | 2018 | - |
dc.identifier.issn | 1225-0988 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/79217 | - |
dc.description.abstract | Due to the increased amounts of online documents, there is a growing demand for text categorization thatcategorizes documents into predefined categories. Many approaches to this problem are based on supervisedmachine learning which couldn’t be applied to unlabeled data. However, large number of documents, such asonline cell phone reviews, have no category information and key categories are not predefined. To solve theseproblems, we propose unsupervised document multi-labeling method based on word embedding and wordnetwork analysis. After embedding words in a lower dimensional space using Word2Vec technique, we generatea weight matrix by calculating similarities between words. We create a word network using this matrix andextract the key categories from this network. With key category-weight matrix and co-occurrence matrix, wegenerate a document-category score matrix. To verify our proposed method, we collect 298,206 cell phonereviews from four review websites. Then, we compared the results of the proposed method with labeleddocuments from human cognitive perspective. | - |
dc.language | Korean | - |
dc.language.iso | ko | - |
dc.publisher | 대한산업공학회 | - |
dc.title | 워드 임베딩과 단어 네트워크 분석을 활용한비지도학습 기반의 문서 다중 범주 가중치 산출 : 휴대폰 리뷰 사례를 중심으로 | - |
dc.title.alternative | Unsupervised Document Multi-Category Weight Extraction based on Word Embedding and Word Network Analysis : A Case Study on Mobile Phone Reviews | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | 강필성 | - |
dc.identifier.doi | 10.7232/JKIIE.2018.44.6.442 | - |
dc.identifier.bibliographicCitation | 대한산업공학회지, v.44, no.6, pp.442 - 451 | - |
dc.relation.isPartOf | 대한산업공학회지 | - |
dc.citation.title | 대한산업공학회지 | - |
dc.citation.volume | 44 | - |
dc.citation.number | 6 | - |
dc.citation.startPage | 442 | - |
dc.citation.endPage | 451 | - |
dc.type.rims | ART | - |
dc.identifier.kciid | ART002412363 | - |
dc.description.journalClass | 2 | - |
dc.description.journalRegisteredClass | kci | - |
dc.subject.keywordAuthor | Word Embedding | - |
dc.subject.keywordAuthor | Unsupervised Learning | - |
dc.subject.keywordAuthor | Word Network Analysis | - |
dc.subject.keywordAuthor | Multi-Label Weight Extraction | - |
dc.subject.keywordAuthor | Text Mining | - |
dc.subject.keywordAuthor | Mobile Phone Reviews | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
145 Anam-ro, Seongbuk-gu, Seoul, 02841, Korea+82-2-3290-2963
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.