Cooperative Multi-Agent Reinforcement Learning With Approximate Model Learning

Full metadata record
DC Field: Value
dc.contributor.author: Park, Young Joon
dc.contributor.author: Lee, Young Jae
dc.contributor.author: Kim, Seoung Bum
dc.date.accessioned: 2021-08-31T16:05:30Z
dc.date.available: 2021-08-31T16:05:30Z
dc.date.created: 2021-06-19
dc.date.issued: 2020
dc.identifier.issn: 2169-3536
dc.identifier.uri: https://scholar.korea.ac.kr/handle/2021.sw.korea/58989
dc.description.abstract: In multi-agent reinforcement learning, agents must learn a communication protocol to optimize collaborative policies and to mitigate unstable learning. Existing methods based on actor-critic networks address communication among agents. However, these methods have difficulty improving sample efficiency and learning robust policies, because the dynamics and nonstationarity of the environment are hard to capture as the policies of the other agents change. We propose a method for learning cooperative policies in multi-agent environments that accounts for communication among agents. The proposed method combines recurrent neural network-based actor-critic networks with deterministic policy gradients to centrally train decentralized policies. The actor networks let the agents communicate along forward and backward paths and determine their subsequent actions. The critic network helps train the actor networks by sending each actor gradient signals according to its contribution to the global reward. To address partial observability and unstable learning, we propose auxiliary prediction networks that approximate the state transitions and the reward function. We used multi-agent environments to demonstrate the usefulness and superiority of the proposed method by comparing it with existing multi-agent reinforcement learning methods, in terms of both learning efficiency and goal achievement in the test phase. The results show that the proposed method outperformed the alternatives.
dc.language: English
dc.language.iso: en
dc.publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
dc.subject: DYNAMICS
dc.subject: PERFORMANCE
dc.subject: FRAMEWORK
dc.title: Cooperative Multi-Agent Reinforcement Learning With Approximate Model Learning
dc.type: Article
dc.contributor.affiliatedAuthor: Kim, Seoung Bum
dc.identifier.doi: 10.1109/ACCESS.2020.3007219
dc.identifier.scopusid: 2-s2.0-85088690432
dc.identifier.wosid: 000554543000001
dc.identifier.bibliographicCitation: IEEE ACCESS, v.8, pp.125389 - 125400
dc.relation.isPartOf: IEEE ACCESS
dc.citation.title: IEEE ACCESS
dc.citation.volume: 8
dc.citation.startPage: 125389
dc.citation.endPage: 125400
dc.type.rims: ART
dc.type.docType: Article
dc.description.journalClass: 1
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Computer Science
dc.relation.journalResearchArea: Engineering
dc.relation.journalResearchArea: Telecommunications
dc.relation.journalWebOfScienceCategory: Computer Science, Information Systems
dc.relation.journalWebOfScienceCategory: Engineering, Electrical & Electronic
dc.relation.journalWebOfScienceCategory: Telecommunications
dc.subject.keywordPlus: DYNAMICS
dc.subject.keywordPlus: PERFORMANCE
dc.subject.keywordPlus: FRAMEWORK
dc.subject.keywordAuthor: Reinforcement learning
dc.subject.keywordAuthor: model-free method
dc.subject.keywordAuthor: multi-agent system
dc.subject.keywordAuthor: multi-agent cooperation
dc.subject.keywordAuthor: actor-critic method
dc.subject.keywordAuthor: deterministic policy gradient
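The abstract describes an architecture with three parts: actors that pass messages along forward and backward communication paths before emitting deterministic actions, a centralized critic over the joint observation-action pair, and auxiliary networks that predict state transitions and the reward. As a purely illustrative sketch of that shape (not the paper's implementation — all dimensions, weights, and names such as `actor_step` and `model_predict` are invented here, and the recurrent cells are reduced to single tanh layers):

```python
import numpy as np

rng = np.random.default_rng(0)
N_AGENTS, OBS_DIM, MSG_DIM, ACT_DIM, HID = 3, 4, 2, 2, 8

def linear(in_dim, out_dim):
    """Random weight matrix and zero bias for a toy layer."""
    return rng.normal(0, 0.1, (in_dim, out_dim)), np.zeros(out_dim)

def layer(x, W, b):
    return np.tanh(x @ W + b)

# Actor: each agent encodes its observation together with the incoming
# forward and backward messages, then re-emits a message and an action.
enc_W, enc_b = linear(OBS_DIM + 2 * MSG_DIM, HID)
msg_W, msg_b = linear(HID, MSG_DIM)
act_W, act_b = linear(HID, ACT_DIM)

def actor_step(obs):
    """obs: (N_AGENTS, OBS_DIM) -> deterministic actions (N_AGENTS, ACT_DIM)."""
    fwd = np.zeros(MSG_DIM)
    for i in range(N_AGENTS):            # forward communication pass
        h = layer(np.concatenate([obs[i], fwd, np.zeros(MSG_DIM)]), enc_W, enc_b)
        fwd = layer(h, msg_W, msg_b)
    bwd = np.zeros(MSG_DIM)
    actions = np.zeros((N_AGENTS, ACT_DIM))
    for i in reversed(range(N_AGENTS)):  # backward communication pass
        h = layer(np.concatenate([obs[i], fwd, bwd]), enc_W, enc_b)
        bwd = layer(h, msg_W, msg_b)
        actions[i] = layer(h, act_W, act_b)
    return actions

# Centralized critic: scores the joint observation-action pair, from which
# per-actor gradient signals would be derived during training.
cri_W, cri_b = linear(N_AGENTS * (OBS_DIM + ACT_DIM), 1)

def critic(obs, actions):
    joint = np.concatenate([obs.ravel(), actions.ravel()])
    return (joint @ cri_W + cri_b).item()

# Auxiliary prediction networks: approximate the state transition and the
# reward function from the same joint input.
dyn_W, dyn_b = linear(N_AGENTS * (OBS_DIM + ACT_DIM), N_AGENTS * OBS_DIM)
rew_W, rew_b = linear(N_AGENTS * (OBS_DIM + ACT_DIM), 1)

def model_predict(obs, actions):
    joint = np.concatenate([obs.ravel(), actions.ravel()])
    next_obs = (joint @ dyn_W + dyn_b).reshape(N_AGENTS, OBS_DIM)
    return next_obs, (joint @ rew_W + rew_b).item()

obs = rng.normal(size=(N_AGENTS, OBS_DIM))
acts = actor_step(obs)
q = critic(obs, acts)
pred_obs, pred_r = model_predict(obs, acts)
```

The auxiliary heads would be trained against observed next states and rewards, giving the actors and critic an extra learning signal under partial observability; the actual loss functions and recurrent details are in the paper (DOI 10.1109/ACCESS.2020.3007219).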
Files in This Item
There are no files associated with this item.
Appears in Collections: College of Engineering > School of Industrial and Management Engineering > 1. Journal Articles


Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

KIM, Seoung Bum
College of Engineering (School of Industrial and Management Engineering)
