Quality-Aware Deep Reinforcement Learning for Streaming in Infrastructure-Assisted Connected Vehicles

Yun, W.J.; Kwon, D.; Choi, M.; Kim, J.; Caire, G.; Molisch, A.F.

doi:10.1109/TVT.2021.3134457

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Quality-Aware Deep Reinforcement Learning for Streaming in Infrastructure-Assisted Connected Vehicles

Authors: Yun, W.J.; Kwon, D.; Choi, M.; Kim, J.; Caire, G.; Molisch, A.F.

Issue Date: 2월-2022

Publisher: Institute of Electrical and Electronics Engineers Inc.

Keywords: Adaptive systems; Base stations; Deep deterministic policy gradient; millimeter wave vehicular network; Optimization; Reinforcement learning; Roads; Streaming media; Vehicle dynamics; video streaming service

Citation: IEEE Transactions on Vehicular Technology, v.71, no.2, pp.2002 - 2017

Indexed: SCIE
SCOPUS

Journal Title: IEEE Transactions on Vehicular Technology

Volume: 71

Number: 2

Start Page: 2002

End Page: 2017

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/137857

DOI: 10.1109/TVT.2021.3134457

ISSN: 0018-9545

Abstract: This paper proposes a deep reinforcement learning-based video streaming scheme for mobility-aware vehicular networks, e.g., vehicles on the highway. We consider infrastructure-assisted and mmWave-based scenarios in which the macro base station (MBS) cannot directly provide the streaming service to vehicles due to the short range of mmWave beams so that small mmWave base stations (mBSs) along the road deliver the desired videos to users. Motivated by the fact that a video stream consists of sequential chunks and smooth streaming has to be provided while users pass through multiple mBSs at high speed, the MBS proactively pushes some video chunks of the desired contents to mBSs. This is done to support vehicles that are currently covered and/or will be by each mBS. We formulate the dynamic video delivery scheme that adaptively determines 1) which content, 2) what quality and 3) how many chunks to be proactively delivered from the MBS to mBSs using Markov decision process (MDP). Since it is difficult for the MBS to track all the channel conditions and the network states have extensive dimensions, we adopt the deep deterministic policy gradient (DDPG) algorithm for the DRL-based video delivery scheme. This paper finally shows that the DRL agent learns a streaming policy that pursues high average quality while limiting packet drops, avoiding playback stalls, reducing quality fluctuations and saving backhaul usage. IEEE

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Engineering > School of Electrical Engineering > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Kim, Joong heon photo

Kim, Joong heon: 공과대학 (전기전자공학부)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :9,533,268; Today View :26,212

RSS_1.0 RSS_2.0 ATOM_1.0

(02841) 서울특별시 성북구 안암로 14502-3290-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE