Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Visual Thinking of Neural Networks: Interactive Text to Image Synthesis

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Hyunhee-
dc.contributor.authorKim, Gyeongmin-
dc.contributor.authorHur, Yuna-
dc.contributor.authorLim, Heuiseok-
dc.date.accessioned2021-12-08T07:41:53Z-
dc.date.available2021-12-08T07:41:53Z-
dc.date.created2021-08-30-
dc.date.issued2021-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/130278-
dc.description.abstractReasoning, a trait of cognitive intelligence, is regarded as a crucial ability that distinguishes humans from other species. However, neural networks now pose a challenge to this human ability. Text-to-image synthesis is a class of vision and linguistics, wherein the goal is to learn multimodal representations between the image and text features. Hence, it requires a high-level reasoning ability that understands the relationships between objects in the given text and generates high-quality images based on the understanding. Text-to-image translation can be termed as the visual thinking of neural networks. In this study, our model infers the complicated relationships between objects in the given text and generates the final image by leveraging the previous history. We define diverse novel adversarial loss functions and finally demonstrate the best one that elevates the reasoning ability of the text-to-image synthesis. Remarkably, most of our models possess their own reasoning ability. Quantitative and qualitative comparisons with several methods demonstrate the superiority of our approach.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleVisual Thinking of Neural Networks: Interactive Text to Image Synthesis-
dc.typeArticle-
dc.contributor.affiliatedAuthorLim, Heuiseok-
dc.identifier.doi10.1109/ACCESS.2021.3074973-
dc.identifier.scopusid2-s2.0-85104670342-
dc.identifier.wosid000645845000001-
dc.identifier.bibliographicCitationIEEE ACCESS, v.9, pp.64510 - 64523-
dc.relation.isPartOfIEEE ACCESS-
dc.citation.titleIEEE ACCESS-
dc.citation.volume9-
dc.citation.startPage64510-
dc.citation.endPage64523-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordPlusRECOGNITION MEMORY-
dc.subject.keywordPlusPICTURE-
dc.subject.keywordPlusWORDS-
dc.subject.keywordAuthorCognition-
dc.subject.keywordAuthorVisualization-
dc.subject.keywordAuthorNeural networks-
dc.subject.keywordAuthorGenerative adversarial networks-
dc.subject.keywordAuthorImage synthesis-
dc.subject.keywordAuthorImage registration-
dc.subject.keywordAuthorText recognition-
dc.subject.keywordAuthorGenerative adversarial networks-
dc.subject.keywordAuthorimage generation-
dc.subject.keywordAuthormultimodal learning-
dc.subject.keywordAuthormultimodal representation-
dc.subject.keywordAuthortext-to-image synthesis-
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE