AGV dispatching algorithm based on deep Q-network in CNC machines environment
- Authors
- Chang, Kyuchang; Park, Seung Hwan; Baek, Jun-Geol
- Issue Date
- Jun-2022
- Publisher
- TAYLOR & FRANCIS LTD
- Keywords
- Automated guided vehicle (AGV); dispatching algorithm; deep Q-network (DQN); reinforcement learning (RL); computerized numerical control (CNC) machines
- Citation
- INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, v.35, no.6, pp.662 - 677
- Indexed
- SCIE; SCOPUS
- Journal Title
- INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING
- Volume
- 35
- Number
- 6
- Start Page
- 662
- End Page
- 677
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/143991
- DOI
- 10.1080/0951192X.2021.1992669
- ISSN
- 0951-192X
- Abstract
- This research focuses on providing an optimal dispatching algorithm for an automated guided vehicle (AGV) in a mobile metal board manufacturing facility. The target process comprises multiple computerized numerical control (CNC) machines and an AGV. The AGV feeds materials to two rows of CNC machines that process metal boards, or conveys work in process between them. Because deriving a mathematically optimal working order is computationally expensive, simple dispatching rules have typically been applied in such environments. However, these rules are generally not optimal, and expert knowledge is required to decide which rule to choose. To overcome some of these disadvantages and increase productivity, a deep reinforcement learning (RL) algorithm is used to learn the AGV's dispatching policy. The target production line is modeled as a virtual grid-shaped workspace to develop a deep Q-network (DQN)-based dispatching algorithm. A convolutional neural network (CNN) takes raw pixels as input and outputs a value function estimating future rewards, and an agent is trained to learn the control policies. To create an elaborate dispatching strategy, the hyper-parameters of the DQN are tuned and a reasonable modeling method is determined experimentally. The proposed method automatically develops an optimal dispatching policy without requiring human control or prior expert knowledge. Compared with general heuristic dispatching rules, the results illustrate the improved performance of the proposed methodology.
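To make the idea behind the abstract concrete, the sketch below trains an agent to route a vehicle to a target cell in a small grid-shaped workspace by learning action values from rewards. It uses tabular Q-learning rather than the paper's CNN-based DQN, and the grid size, goal cell, and reward values are illustrative assumptions, not taken from the paper.

```python
import random

# Toy grid-world stand-in for a grid-shaped AGV workspace.
# Tabular Q-learning sketch, NOT the paper's CNN-based DQN:
# the state is the AGV's (row, col) cell and the goal is one
# fixed "machine" cell; all numbers here are assumptions.

GRID_ROWS, GRID_COLS = 4, 6                    # hypothetical workspace size
GOAL = (3, 5)                                  # hypothetical CNC machine cell
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right

def step(state, action):
    """Apply a move, clipping at the grid border; reward 1.0 at the goal,
    a small negative step cost otherwise."""
    r = min(max(state[0] + action[0], 0), GRID_ROWS - 1)
    c = min(max(state[1] + action[1], 0), GRID_COLS - 1)
    nxt = (r, c)
    return nxt, (1.0 if nxt == GOAL else -0.01), nxt == GOAL

def train(episodes=2000, alpha=0.5, gamma=0.95, eps=0.1, seed=0):
    """Epsilon-greedy Q-learning over (state, action) pairs."""
    rng = random.Random(seed)
    q = {}  # (state, action_index) -> estimated value
    for _ in range(episodes):
        s = (0, 0)
        for _ in range(100):
            if rng.random() < eps:
                a = rng.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q.get((s, i), 0.0))
            s2, reward, done = step(s, ACTIONS[a])
            best_next = max(q.get((s2, i), 0.0) for i in range(len(ACTIONS)))
            old = q.get((s, a), 0.0)
            q[(s, a)] = old + alpha * (reward + gamma * best_next - old)
            s = s2
            if done:
                break
    return q

def greedy_path(q, start=(0, 0), max_steps=50):
    """Roll out the learned greedy policy from `start`."""
    s, path = start, [start]
    for _ in range(max_steps):
        a = max(range(len(ACTIONS)), key=lambda i: q.get((s, i), 0.0))
        s, _, done = step(s, ACTIONS[a])
        path.append(s)
        if done:
            break
    return path
```

The paper's DQN replaces the lookup table `q` with a CNN that maps a pixel rendering of the workspace to action values, which is what makes the approach scale beyond toy state spaces like this one.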
- Appears in Collections
- College of Engineering > School of Industrial and Management Engineering > 1. Journal Articles