Robust and Communication-Efficient Federated Learning From Non-i.i.d. Data

Sattler, Felix; Wiedemann, Simon; Mueller, Klaus-Robert; Samek, Wojciech

doi:10.1109/TNNLS.2019.2944481

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Robust and Communication-Efficient Federated Learning From Non-i.i.d. Data

Authors: Sattler, Felix; Wiedemann, Simon; Mueller, Klaus-Robert; Samek, Wojciech

Issue Date: 9월-2020

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Training; Data models; Servers; Deep learning; Protocols; Training data; Distributed databases; Deep learning; distributed learning; efficient communication; federated learning; privacy-preserving machine learning

Citation: IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, v.31, no.9, pp.3400 - 3413

Indexed: SCIE
SCOPUS

Journal Title: IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Volume: 31

Number: 9

Start Page: 3400

End Page: 3413

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/53658

DOI: 10.1109/TNNLS.2019.2944481

ISSN: 2162-237X

Abstract: Federated learning allows multiple parties to jointly train a deep learning model on their combined data, without any of the participants having to reveal their local data to a centralized server. This form of privacy-preserving collaborative learning, however, comes at the cost of a significant communication overhead during training. To address this problem, several compression methods have been proposed in the distributed training literature that can reduce the amount of required communication by up to three orders of magnitude. These existing methods, however, are only of limited utility in the federated learning setting, as they either only compress the upstream communication from the clients to the server (leaving the downstream communication uncompressed) or only perform well under idealized conditions, such as i.i.d. distribution of the client data, which typically cannot be found in federated learning. In this article, we propose sparse ternary compression (STC), a new compression framework that is specifically designed to meet the requirements of the federated learning environment. STC extends the existing compression technique of top-k gradient sparsification with a novel mechanism to enable downstream compression as well as ternarization and optimal Golomb encoding of the weight updates. Our experiments on four different learning tasks demonstrate that STC distinctively outperforms federated averaging in common federated learning scenarios. These results advocate for a paradigm shift in federated optimization toward high-frequency low-bitwidth communication, in particular in the bandwidth-constrained learning environments.

Files in This Item: There are no files associated with this item.

Appears in Collections: Graduate School > Department of Artificial Intelligence > 1. Journal Articles

Show full item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :9,529,830; Today View :21,925

RSS_1.0 RSS_2.0 ATOM_1.0

(02841) 서울특별시 성북구 안암로 14502-3290-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE