Rebalancing Docked Bicycle Sharing System with Approximate Dynamic Programming and Reinforcement Learning
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Seo, Young-Hyun | - |
dc.contributor.author | Kim, Dong-Kyu | - |
dc.contributor.author | Kang, Seungmo | - |
dc.contributor.author | Byon, Young-Ji | - |
dc.contributor.author | Kho, Seung-Young | - |
dc.date.accessioned | 2022-09-24T14:40:58Z | - |
dc.date.available | 2022-09-24T14:40:58Z | - |
dc.date.created | 2022-09-23 | - |
dc.date.issued | 2022-05-09 | - |
dc.identifier.issn | 0197-6729 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/143896 | - |
dc.description.abstract | The bicycle, an active transportation mode, has received increasing attention as an alternative in urban environments worldwide. However, effectively managing the stock levels of rental bicycles at each station is challenging as demand levels vary with time, particularly when users are allowed to return bicycles at any station. There is a need for system-wide management of bicycle stock levels by transporting available bicycles from one station to another. In this study, a bicycle rebalancing model based on a Markov decision process (MDP) is developed using a real-time dynamic programming method and reinforcement learning considering dynamic system characteristics. The pickup and return demands are stochastic and continuously changing. As a result, the proposed framework suggests the best operation option every 10 min based on the realized system variables and future demands predicted by the random forest method, minimizing the expected unmet demand. Moreover, we adopt custom prioritizing strategies to reduce the number of action candidates for the operator and the computational complexity for practicality in the MDP framework. Numerical experiments demonstrate that the proposed model outperforms existing methods, such as short-term rebalancing and static lookahead policies. Among the suggested prioritizing strategies, focusing on stations with a larger error in demand prediction was found to be the most effective. Additionally, the effects of various safety buffers were examined. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | WILEY-HINDAWI | - |
dc.title | Rebalancing Docked Bicycle Sharing System with Approximate Dynamic Programming and Reinforcement Learning | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Seo, Young-Hyun | - |
dc.contributor.affiliatedAuthor | Kang, Seungmo | - |
dc.identifier.doi | 10.1155/2022/2780711 | - |
dc.identifier.scopusid | 2-s2.0-85130727561 | - |
dc.identifier.wosid | 000803952900001 | - |
dc.identifier.bibliographicCitation | JOURNAL OF ADVANCED TRANSPORTATION, v.2022 | - |
dc.relation.isPartOf | JOURNAL OF ADVANCED TRANSPORTATION | - |
dc.citation.title | JOURNAL OF ADVANCED TRANSPORTATION | - |
dc.citation.volume | 2022 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | Y | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Transportation | - |
dc.relation.journalWebOfScienceCategory | Engineering, Civil | - |
dc.relation.journalWebOfScienceCategory | Transportation Science & Technology | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(02841) 서울특별시 성북구 안암로 14502-3290-1114
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.