Reinforcement Learning-Based Optimal Tracking Control of an Unknown Unmanned Surface Vehicle

Wang, Ning; Gao, Ying; Zhao, Hong; Ahn, Choon Ki

doi:10.1109/TNNLS.2020.3009214

Full metadata record

DC Field	Value	Language
dc.contributor.author	Wang, Ning	-
dc.contributor.author	Gao, Ying	-
dc.contributor.author	Zhao, Hong	-
dc.contributor.author	Ahn, Choon Ki	-
dc.date.accessioned	2021-11-17T16:41:14Z	-
dc.date.available	2021-11-17T16:41:14Z	-
dc.date.created	2021-08-30	-
dc.date.issued	2021-07	-
dc.identifier.issn	2162-237X	-
dc.identifier.uri	https://scholar.korea.ac.kr/handle/2021.sw.korea/127776	-
dc.description.abstract	In this article, a novel reinforcement learning-based optimal tracking control (RLOTC) scheme is established for an unmanned surface vehicle (USV) in the presence of complex unknowns, including dead-zone input nonlinearities, system dynamics, and disturbances. To be specific, dead-zone nonlinearities are decoupled to be input-dependent sloped controls and unknown biases that are encapsulated into lumped unknowns within tracking error dynamics. Neural network (NN) approximators are further deployed to adaptively identify complex unknowns and facilitate a Hamilton-Jacobi-Bellman (HJB) equation that formulates optimal tracking. In order to derive a practically optimal solution, an actor-critic reinforcement learning framework is built by employing adaptive NN identifiers to recursively approximate the total optimal policy and cost function. Eventually, theoretical analysis shows that the entire RLOTC scheme can render tracking errors that converge to an arbitrarily small neighborhood of the origin, subject to optimal cost. Simulation results and comprehensive comparisons on a prototype USV demonstrate remarkable effectiveness and superiority.	-
dc.language	English	-
dc.language.iso	en	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.subject	NONLINEAR-SYSTEMS	-
dc.subject	ADAPTIVE-CONTROL	-
dc.subject	ROBUST-CONTROL	-
dc.subject	ARCHITECTURE	-
dc.subject	ITERATION	-
dc.title	Reinforcement Learning-Based Optimal Tracking Control of an Unknown Unmanned Surface Vehicle	-
dc.type	Article	-
dc.contributor.affiliatedAuthor	Ahn, Choon Ki	-
dc.identifier.doi	10.1109/TNNLS.2020.3009214	-
dc.identifier.scopusid	2-s2.0-85095577821	-
dc.identifier.wosid	000670541500019	-
dc.identifier.bibliographicCitation	IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, v.32, no.7, pp.3034 - 3045	-
dc.relation.isPartOf	IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS	-
dc.citation.title	IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS	-
dc.citation.volume	32	-
dc.citation.number	7	-
dc.citation.startPage	3034	-
dc.citation.endPage	3045	-
dc.type.rims	ART	-
dc.type.docType	Article	-
dc.description.journalClass	1	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Computer Science, Hardware & Architecture	-
dc.relation.journalWebOfScienceCategory	Computer Science, Theory & Methods	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.subject.keywordPlus	NONLINEAR-SYSTEMS	-
dc.subject.keywordPlus	ADAPTIVE-CONTROL	-
dc.subject.keywordPlus	ROBUST-CONTROL	-
dc.subject.keywordPlus	ARCHITECTURE	-
dc.subject.keywordPlus	ITERATION	-
dc.subject.keywordAuthor	Optimal control	-
dc.subject.keywordAuthor	Artificial neural networks	-
dc.subject.keywordAuthor	Nonlinear systems	-
dc.subject.keywordAuthor	System dynamics	-
dc.subject.keywordAuthor	Vehicle dynamics	-
dc.subject.keywordAuthor	Mathematical model	-
dc.subject.keywordAuthor	Learning (artificial intelligence)	-
dc.subject.keywordAuthor	Completely unknown dynamics	-
dc.subject.keywordAuthor	optimal tracking control	-
dc.subject.keywordAuthor	reinforcement learning-based control	-
dc.subject.keywordAuthor	unknown dead-zone input nonlinearities	-
dc.subject.keywordAuthor	unmanned surface vehicle (USV)	-
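
The abstract states that dead-zone nonlinearities are decoupled into input-dependent sloped controls and unknown biases. This record does not give the paper's equations, so the following is only a minimal sketch of one common way such a split is written, assuming a piecewise-linear dead-zone model. The function names, slopes (`m_r`, `m_l`), and breakpoints (`b_r`, `b_l`) are hypothetical and chosen for illustration; they are not taken from the article.

```python
# Hypothetical piecewise-linear dead-zone model (an assumption, not the
# paper's definition): the output is zero inside (-b_l, b_r) and linear
# with slope m_r or m_l outside that band.
def dead_zone(u, m_r=1.2, m_l=0.8, b_r=0.5, b_l=0.4):
    if u >= b_r:
        return m_r * (u - b_r)
    if u <= -b_l:
        return m_l * (u + b_l)
    return 0.0


# Split the dead-zone output into an input-dependent slope times the
# input, plus a bounded bias term: dead_zone(u) = slope * u + bias.
def decompose(u, m_r=1.2, m_l=0.8, b_r=0.5, b_l=0.4):
    slope = m_r if u > 0 else m_l
    bias = dead_zone(u, m_r, m_l, b_r, b_l) - slope * u
    return slope, bias


if __name__ == "__main__":
    for u in (-2.0, -0.2, 0.0, 0.3, 2.0):
        slope, bias = decompose(u)
        print(f"u={u:+.1f}  D(u)={dead_zone(u):+.3f}  slope={slope}  bias={bias:+.3f}")
```

Under these assumptions the bias magnitude never exceeds the larger of `m_r * b_r` and `m_l * b_l`, which is why it can be treated as a bounded unknown and lumped together with other unmodeled terms.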

Appears in Collections: College of Engineering > School of Electrical Engineering > 1. Journal Articles

Related Researcher

Ahn, Choon Ki: College of Engineering (School of Electrical Engineering)
