
Multi-Modal Recurrent Attention Networks for Facial Expression Recognition

Full metadata record
dc.contributor.author: Lee, Jiyoung
dc.contributor.author: Kim, Sunok
dc.contributor.author: Kim, Seungryong
dc.contributor.author: Sohn, Kwanghoon
dc.date.accessioned: 2021-08-31T16:09:02Z
dc.date.available: 2021-08-31T16:09:02Z
dc.date.created: 2021-06-18
dc.date.issued: 2020
dc.identifier.issn: 1057-7149
dc.identifier.uri: https://scholar.korea.ac.kr/handle/2021.sw.korea/59019
dc.description.abstract: Recent deep neural network-based methods have achieved state-of-the-art performance on various facial expression recognition tasks. Despite such progress, previous research on facial expression recognition has mainly focused on analyzing color recordings only. However, the complex emotions that people with different skin colors express through dynamic facial expressions under different lighting conditions can be fully understood only by integrating information from multi-modal videos. We present a novel method to estimate dimensional emotion states, where color, depth, and thermal recordings are used as multi-modal input. Our networks, called multi-modal recurrent attention networks (MRAN), learn spatiotemporal attention volumes to robustly recognize facial expressions based on attention-boosted feature volumes. We leverage the depth and thermal sequences as guidance priors for the color sequence to selectively focus on emotionally discriminative regions. We also introduce a novel benchmark for multi-modal facial expression recognition, termed multi-modal arousal-valence facial expression recognition (MAVFER), which consists of color, depth, and thermal recordings with corresponding continuous arousal-valence scores. Experimental results show that our method achieves state-of-the-art results in dimensional facial expression recognition on color datasets including RECOLA, SEWA, and AFEW, as well as on the multi-modal MAVFER dataset.
dc.language: English
dc.language.iso: en
dc.publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
dc.subject: DATABASE
dc.subject: EMOTION
dc.title: Multi-Modal Recurrent Attention Networks for Facial Expression Recognition
dc.type: Article
dc.contributor.affiliatedAuthor: Kim, Seungryong
dc.identifier.doi: 10.1109/TIP.2020.2996086
dc.identifier.wosid: 000546910100006
dc.identifier.bibliographicCitation: IEEE TRANSACTIONS ON IMAGE PROCESSING, v.29, pp.6977 - 6991
dc.relation.isPartOf: IEEE TRANSACTIONS ON IMAGE PROCESSING
dc.citation.title: IEEE TRANSACTIONS ON IMAGE PROCESSING
dc.citation.volume: 29
dc.citation.startPage: 6977
dc.citation.endPage: 6991
dc.type.rims: ART
dc.type.docType: Article
dc.description.journalClass: 1
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Computer Science
dc.relation.journalResearchArea: Engineering
dc.relation.journalWebOfScienceCategory: Computer Science, Artificial Intelligence
dc.relation.journalWebOfScienceCategory: Engineering, Electrical & Electronic
dc.subject.keywordPlus: DATABASE
dc.subject.keywordPlus: EMOTION
dc.subject.keywordAuthor: Face recognition
dc.subject.keywordAuthor: Image color analysis
dc.subject.keywordAuthor: Videos
dc.subject.keywordAuthor: Emotion recognition
dc.subject.keywordAuthor: Benchmark testing
dc.subject.keywordAuthor: Databases
dc.subject.keywordAuthor: Task analysis
dc.subject.keywordAuthor: Multi-modal facial expression recognition
dc.subject.keywordAuthor: dimensional (continuous) emotion recognition
dc.subject.keywordAuthor: attention mechanism
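
The abstract describes attention volumes in which depth and thermal features act as guidance priors that re-weight the color stream before a recurrent stage aggregates over time. The following is a minimal PyTorch sketch of that fusion pattern, for illustration only: it is not the authors' MRAN implementation, and the module names, channel sizes, sigmoid spatial attention, global average pooling, and GRU aggregator are all assumptions.

import torch
import torch.nn as nn


class GuidedAttentionFusion(nn.Module):
    """Hypothetical fusion: depth/thermal features guide attention over color features."""

    def __init__(self, channels: int = 64):
        super().__init__()
        # A 1x1 conv collapses the concatenated tri-modal features into one
        # attention logit per spatial location (an assumed design choice).
        self.attn = nn.Conv2d(3 * channels, 1, kernel_size=1)

    def forward(self, color, depth, thermal):
        # color/depth/thermal: (B, C, H, W) per-frame feature maps.
        logits = self.attn(torch.cat([color, depth, thermal], dim=1))
        weights = torch.sigmoid(logits)   # (B, 1, H, W) spatial attention map
        return color * weights            # attention-boosted color features


class MultiModalRecurrentSketch(nn.Module):
    """Toy end-to-end model: per-frame guided fusion, then a GRU over time."""

    def __init__(self, channels: int = 64, hidden: int = 128):
        super().__init__()
        self.fusion = GuidedAttentionFusion(channels)
        self.gru = nn.GRU(channels, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 2)  # continuous arousal-valence output

    def forward(self, color_seq, depth_seq, thermal_seq):
        # *_seq: (B, T, C, H, W) precomputed per-modality feature sequences.
        frames = []
        for t in range(color_seq.size(1)):
            fused = self.fusion(color_seq[:, t], depth_seq[:, t], thermal_seq[:, t])
            frames.append(fused.mean(dim=(2, 3)))  # global average pool -> (B, C)
        seq = torch.stack(frames, dim=1)           # (B, T, C)
        _, h = self.gru(seq)
        return self.head(h[-1])                    # (B, 2): arousal, valence


if __name__ == "__main__":
    model = MultiModalRecurrentSketch()
    x = [torch.randn(2, 8, 64, 32, 32) for _ in range(3)]  # color, depth, thermal
    print(model(*x).shape)  # torch.Size([2, 2])

The key point the sketch illustrates is that the attention map is computed from all three modalities but applied only to the color stream, matching the paper's description of depth and thermal sequences as guidance priors.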
Files in This Item
There are no files associated with this item.
Appears in Collections: Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
