
Partial Training Mechanism to Handle the Impact of Stragglers in Federated Learning with Heterogeneous Clients


Abstract:

Federated Learning (FL) allows distributed devices, known as clients, to train Machine Learning (ML) models collaboratively without sharing sensitive data. A defining characteristic of FL in mobile and IoT environments is system heterogeneity among clients, which range from low-end devices with constrained communication and computing resources to powerful devices with high-speed network access and dedicated GPUs. Because the server must wait for all the clients to communicate their updates, slow clients (a.k.a. stragglers) significantly increase the training time. To tackle this problem, we propose FedPulse, a Partial Training (PT) based mechanism that mitigates the effect of stragglers in FL. The idea is to reduce the training time by dynamically allocating smaller submodels to resource-constrained clients. Experimental results on well-known classification datasets show that the proposed solution outperforms other submodel allocation mechanisms and reduces the training time by up to 58% with an accuracy loss of less than 1% compared to FedAvg.
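This page describes FedPulse's mechanism only at a high level, so the sketch below is a minimal, hypothetical illustration of the general PT idea rather than the paper's actual algorithm: it assumes the static width slicing used by HeteroFL [10] (FedRolex [17] instead rolls the slice across rounds), and the names extract_submodel and aggregate_submodels, as well as the speed-proportional keep ratios, are illustrative assumptions, not the paper's method.

    import numpy as np

    def extract_submodel(global_weights, keep_ratio):
        """Keep the leading fraction of rows/columns of every weight matrix
        (HeteroFL-style static width slicing; FedPulse's rule may differ)."""
        sub = {}
        for name, w in global_weights.items():
            rows = max(1, int(w.shape[0] * keep_ratio))
            cols = max(1, int(w.shape[1] * keep_ratio))
            sub[name] = w[:rows, :cols].copy()
        return sub

    def aggregate_submodels(global_weights, updates):
        """Average each global entry over the clients whose submodel covers it;
        entries trained by no client this round keep their previous values."""
        merged = {}
        for name, w in global_weights.items():
            acc = np.zeros_like(w)
            cnt = np.zeros_like(w)
            for sub in updates:
                r, c = sub[name].shape
                acc[:r, :c] += sub[name]
                cnt[:r, :c] += 1.0
            out = w.copy()
            covered = cnt > 0
            out[covered] = acc[covered] / cnt[covered]
            merged[name] = out
        return merged

    # One synchronous round. The round time equals the slowest client's time,
    # so shrinking the stragglers' submodels shortens every round.
    global_weights = {"fc1": np.random.randn(64, 32), "fc2": np.random.randn(32, 10)}
    client_speeds = [1.0, 0.5, 0.1]                    # hypothetical relative speeds
    keep_ratios = [s / max(client_speeds) for s in client_speeds]

    updates = []
    for ratio in keep_ratios:
        sub = extract_submodel(global_weights, ratio)
        # A real client would run local SGD here; a small perturbation stands in.
        updates.append({k: v + 0.01 * np.random.randn(*v.shape) for k, v in sub.items()})

    global_weights = aggregate_submodels(global_weights, updates)

Averaging only over the covered entries is what lets the full-width model keep improving even though stragglers touch only a slice of it each round; per the abstract, FedPulse chooses such ratios dynamically, which is where it departs from static allocation schemes.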
Date of Conference: 26-29 June 2024
Date Added to IEEE Xplore: 31 October 2024
Conference Location: Paris, France

I. Introduction

The popularization of mobile and IoT devices allows large amounts of data to be collected at the network’s edge, enabling the training of large Machine Learning (ML) models for a wide range of applications, such as next-word prediction and object detection. Traditionally, edge devices send their local data to a powerful central server, which trains the model in a centralized manner. However, this approach raises serious privacy concerns: the local data might be sensitive, and regulations such as the General Data Protection Regulation (GDPR) may restrict data collection from those devices [1], making centralized training infeasible.

References
[1] C. Tankard, "What the GDPR means for businesses," Network Security, vol. 2016, no. 6, pp. 5-8, 2016.
[2] B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y Arcas, "Communication-efficient learning of deep networks from decentralized data," in Artificial Intelligence and Statistics, pp. 1273-1282, 2017.
[3] A. Imteaj, U. Thakker, S. Wang, J. Li, and M. H. Amini, "A survey on federated learning for resource-constrained IoT devices," IEEE Internet of Things Journal, vol. 9, no. 1, pp. 1-24, 2021.
[4] T. Li, A. K. Sahu, A. Talwalkar, and V. Smith, "Federated learning: Challenges, methods, and future directions," IEEE Signal Processing Magazine, vol. 37, no. 3, pp. 50-60, 2020.
[5] C. Xie, S. Koyejo, and I. Gupta, "Asynchronous federated optimization," 2019.
[6] W. Wu, L. He, W. Lin, R. Mao, C. Maple, and S. Jarvis, "SAFA: A semi-asynchronous protocol for fast federated learning with low overhead," IEEE Transactions on Computers, vol. 70, no. 5, pp. 655-668, 2020.
[7] B. Fan, S. Jiang, X. Su, and P. Hui, "Model-heterogeneous federated learning for Internet of Things: Enabling technologies and future directions," 2023.
[8] S. Caldas, J. Konečný, H. B. McMahan, and A. Talwalkar, "Expanding the reach of federated learning by reducing client resource requirements," 2018.
[9] N. Bouacida, J. Hou, H. Zang, and X. Liu, "Adaptive federated dropout: Improving communication efficiency and generalization for federated learning," CoRR, vol. abs/2011.04050, 2020.
[10] E. Diao, J. Ding, and V. Tarokh, "HeteroFL: Computation and communication efficient federated learning for heterogeneous clients," 2020.
[11] K. Pfeiffer, M. Rapp, R. Khalili, and J. Henkel, "Federated learning for computationally constrained heterogeneous devices: A survey," ACM Computing Surveys, 2023.
[12] T. Nishio and R. Yonetani, "Client selection for federated learning with heterogeneous resources in mobile edge," in IEEE International Conference on Communications (ICC), pp. 1-7, 2019.
[13] Z. Chai, A. Ali, S. Zawad, S. Truex, A. Anwar, N. Baracaldo, et al., "TiFL: A tier-based federated learning system," in 29th International Symposium on High-Performance Parallel and Distributed Computing, pp. 125-136, 2020.
[14] A. Reisizadeh, I. Tziotis, H. Hassani, A. Mokhtari, and R. Pedarsani, "Straggler-resilient federated learning: Leveraging the interplay between statistical accuracy and system heterogeneity," IEEE Journal on Selected Areas in Information Theory, vol. 3, no. 2, pp. 197-205, 2022.
[15] P. Han, S. Wang, and K. K. Leung, "Adaptive gradient sparsification for efficient federated learning: An online learning approach," in IEEE 40th International Conference on Distributed Computing Systems (ICDCS), pp. 300-310, 2020.
[16] M. M. Amiri, D. Gunduz, S. R. Kulkarni, and H. V. Poor, "Federated learning with quantized global model updates," 2020.
[17] S. Alam, L. Liu, M. Yan, and M. Zhang, "FedRolex: Model-heterogeneous federated learning with rolling sub-model extraction," Advances in Neural Information Processing Systems, vol. 35, pp. 29677-29690, 2022.
[18] D. J. Beutel, T. Topal, A. Mathur, X. Qiu, J. Fernandez-Marques, Y. Gao, L. Sani, K. H. Li, T. Parcollet, P. P. B. de Gusmão, et al., "Flower: A friendly federated learning research framework," 2020.
