Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Three-stream fusion network for first-person interaction recognition

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Ye-Ji-
dc.contributor.authorLee, Dong-Gyu-
dc.contributor.authorLee, Seong-Whan-
dc.date.accessioned2021-08-30T20:31:50Z-
dc.date.available2021-08-30T20:31:50Z-
dc.date.created2021-06-18-
dc.date.issued2020-07-
dc.identifier.issn0031-3203-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/54928-
dc.description.abstractFirst-person interaction recognition is a challenging task because of unstable video conditions resulting from the camera wearer's movement. For human interaction recognition from a first-person viewpoint, this paper proposes a three-stream fusion network with two main parts: three-stream architecture and three-stream correlation fusion. The three-stream architecture captures the characteristics of the target appearance, target motion, and camera ego-motion. Meanwhile the three-stream correlation fusion combines the feature map of each of the three streams to consider the correlations among the target appearance, target motion, and camera ego-motion. The fused feature vector is robust to the camera movement and compensates for the noise of the camera ego-motion. Short-term intervals are modeled using the fused feature vector, and a long short-term memory (LSTM) model considers the temporal dynamics of the video. We evaluated the proposed method on two public benchmark datasets to validate the effectiveness of our approach. The experimental results show that the proposed fusion method successfully generated a discriminative feature vector, and our network outperformed all competing activity recognition methods in first-person videos where considerable camera ego-motion occurs. (C) 2020 Published by Elsevier Ltd.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherELSEVIER SCI LTD-
dc.titleThree-stream fusion network for first-person interaction recognition-
dc.typeArticle-
dc.contributor.affiliatedAuthorLee, Seong-Whan-
dc.identifier.doi10.1016/j.patcog.2020.107279-
dc.identifier.scopusid2-s2.0-85079886700-
dc.identifier.wosid000530845000025-
dc.identifier.bibliographicCitationPATTERN RECOGNITION, v.103-
dc.relation.isPartOfPATTERN RECOGNITION-
dc.citation.titlePATTERN RECOGNITION-
dc.citation.volume103-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordAuthorFirst-person vision-
dc.subject.keywordAuthorFirst-person interaction recognition-
dc.subject.keywordAuthorThree-stream fusion network-
dc.subject.keywordAuthorThree-stream correlation fusion-
dc.subject.keywordAuthorCamera ego-motion-
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Seong Whan photo

Lee, Seong Whan
인공지능학과
Read more

Altmetrics

Total Views & Downloads

BROWSE