ConceptVector: Text Visual Analytics via Interactive Lexicon Building using Word Embedding
- Authors
- Park, Deokgun; Kim, Seungyeon; Lee, Jurim; Choo, Jaegul; Diakopoulos, Nicholas; Elmqvist, Niklas
- Issue Date
- 1월-2018
- Publisher
- IEEE COMPUTER SOC
- Keywords
- Text analytics; visual analytics; word embedding; text summarization; text classification; concepts
- Citation
- IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, v.24, no.1, pp.361 - 370
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS
- Volume
- 24
- Number
- 1
- Start Page
- 361
- End Page
- 370
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/78416
- DOI
- 10.1109/TVCG.2017.2744478
- ISSN
- 1077-2626
- Abstract
- Central to many text analysis methods is the notion of a concept: a set of semantically related keywords characterizing a specific object, phenomenon, or theme. Advances in word embedding allow building a concept from a small set of seed terms. However, naive application of such techniques may result in false positive errors because of the polysemy of natural language. To mitigate this problem, we present a visual analytics system called ConceptVector that guides a user in building such concepts and then using them to analyze documents. Document-analysis case studies with real-world datasets demonstrate the fine-grained analysis provided by ConceptVector. To support the elaborate modeling of concepts, we introduce a bipolar concept model and support for specifying irrelevant words. We validate the interactive lexicon building interface by a user study and expert reviews. Quantitative evaluation shows that the bipolar lexicon generated with our methods is comparable to human-generated ones.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Graduate School > Department of Artificial Intelligence > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.