Quality-Aware Deep Reinforcement Learning for Streaming in Infrastructure-Assisted Connected Vehicles
- Authors
- Yun, W.J.; Kwon, D.; Choi, M.; Kim, J.; Caire, G.; Molisch, A.F.
- Issue Date
- 2월-2022
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Adaptive systems; Base stations; Deep deterministic policy gradient; millimeter wave vehicular network; Optimization; Reinforcement learning; Roads; Streaming media; Vehicle dynamics; video streaming service
- Citation
- IEEE Transactions on Vehicular Technology, v.71, no.2, pp.2002 - 2017
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Transactions on Vehicular Technology
- Volume
- 71
- Number
- 2
- Start Page
- 2002
- End Page
- 2017
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/137857
- DOI
- 10.1109/TVT.2021.3134457
- ISSN
- 0018-9545
- Abstract
- This paper proposes a deep reinforcement learning-based video streaming scheme for mobility-aware vehicular networks, e.g., vehicles on the highway. We consider infrastructure-assisted and mmWave-based scenarios in which the macro base station (MBS) cannot directly provide the streaming service to vehicles due to the short range of mmWave beams so that small mmWave base stations (mBSs) along the road deliver the desired videos to users. Motivated by the fact that a video stream consists of sequential chunks and smooth streaming has to be provided while users pass through multiple mBSs at high speed, the MBS proactively pushes some video chunks of the desired contents to mBSs. This is done to support vehicles that are currently covered and/or will be by each mBS. We formulate the dynamic video delivery scheme that adaptively determines 1) which content, 2) what quality and 3) how many chunks to be proactively delivered from the MBS to mBSs using Markov decision process (MDP). Since it is difficult for the MBS to track all the channel conditions and the network states have extensive dimensions, we adopt the deep deterministic policy gradient (DDPG) algorithm for the DRL-based video delivery scheme. This paper finally shows that the DRL agent learns a streaming policy that pursues high average quality while limiting packet drops, avoiding playback stalls, reducing quality fluctuations and saving backhaul usage. IEEE
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > School of Electrical Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.