Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

TensorLightning: A Traffic-Efficient Distributed Deep Learning on Commodity Spark Clusters

Authors
Lee, SeilKim, HanjooPark, JaehongJang, JaeheeJeong, Chang-SungYoon, Sungroh
Issue Date
2018
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Keywords
TensorLightning; deep learning; Apache Spark; distributed system; commodity servers
Citation
IEEE ACCESS, v.6, pp.27671 - 27680
Indexed
SCIE
SCOPUS
Journal Title
IEEE ACCESS
Volume
6
Start Page
27671
End Page
27680
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/80954
DOI
10.1109/ACCESS.2018.2842103
ISSN
2169-3536
Abstract
With the recent success of deep learning, the amount of data and computation continues to grow daily. Hence a distributed deep learning system that shares the training workload has been researched extensively. Although a scale-out distributed environment using commodity servers is widely used, not only is there a limit due to synchronous operation and communication traffic but also combining deep neural network (DNN) training with existing clusters often demands additional hardware and migration between different cluster frameworks or libraries, which is highly inefficient. Therefore, we propose TensorLightning which integrates the widely used data pipeline of Apache Spark with powerful deep learning libraries, Caffe and TensorFlow. TensorLightning embraces a brand-new parameter aggregation algorithm and parallel asynchronous parameter managing schemes to relieve communication discrepancies and overhead. We redesign the elastic averaging stochastic gradient descent algorithm with pruned and sparse form parameters. Our approach provides the fast and flexible DNN training with high accessibility. We evaluated our proposed framework with convolutional neural network and recurrent neural network models; the framework reduces network traffic by 67% with faster convergence.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Engineering > School of Electrical Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Jeong, Chang Sung photo

Jeong, Chang Sung
College of Engineering (School of Electrical Engineering)
Read more

Altmetrics

Total Views & Downloads

BROWSE