I. Introduction
Federated Learning (FL) is a recent distributed Machine Learning (ML) paradigm in which clients collaboratively train an ML model without sharing their training data with a central server or with other participating clients. Practical applications of FL range from ‘cross-device’ scenarios, involving a very large number of unreliable clients that each hold a small number of samples, to ‘cross-silo’ scenarios involving fewer, more reliable clients that each hold more data [1]. FL has significant economic potential: cross-device tasks include mobile-keyboard next-word prediction [2], voice detection [3], and even proof-of-work for blockchain systems [4], while cross-silo tasks include hospitals jointly training healthcare models [5] and financial institutions building fraud detectors [6]. FL has been of particular interest for training large Deep Neural Networks (DNNs), owing to their state-of-the-art performance across a wide range of tasks.