Loading [MathJax]/extensions/MathZoom.js
Evaluation and Optimization of Distributed Machine Learning Techniques for Internet of Things | IEEE Journals & Magazine | IEEE Xplore

Evaluation and Optimization of Distributed Machine Learning Techniques for Internet of Things


Abstract:

Federated learning (FL) and split learning (SL) are state-of-the-art distributed machine learning techniques to enable machine learning training without accessing raw dat...Show More

Abstract:

Federated learning (FL) and split learning (SL) are state-of-the-art distributed machine learning techniques to enable machine learning training without accessing raw data on clients or end devices. However, their comparative training performance under real-world resource-restricted Internet of Things (IoT) device settings remains barely studied. This work provides empirical comparisons of FL and SL in real-world IoT settings regarding (i) learning performance with heterogeneous data distributions and (ii) on-device execution overhead. Our analyses in this work demonstrate that the learning performance of SL is better than FL under an imbalanced data distribution but worse than FL under an extreme non-IID data distribution. Recently, FL and SL are combined to form splitfed learning (SFL) to leverage each of their benefits (e.g., parallel training of FL and lightweight on-device computation requirement of SL). Our work considers FL, SL, and SFL, and mounts them on Raspberry Pi devices to evaluate their performance, including training time, communication overhead, power consumption, and memory usage with resource-restricted IoT devices. Besides evaluations, we apply two optimizations. First, we generalize SFL by carefully examining the possibility of a hybrid type of model training at the server-side. The generalized SFL merges sequential (dependent) and parallel (independent) processes of model training and thus is beneficial to a system with a large scale of IoT devices, specifically at the server-side operations. Second, we propose pragmatic techniques to substantially reduce the communication overhead by up to four times for the SL and (generalized) SFL.
Published in: IEEE Transactions on Computers ( Volume: 71, Issue: 10, 01 October 2022)
Page(s): 2538 - 2552
Date of Publication: 15 December 2021

ISSN Information:

Funding Agency:

References is not available for this document.

1 Introduction

Due to its stunning performance, deep learning (DL) has enabled various applications ranging from image classification, object detection, speech recognition to disease diagnosis, financial fraud detection [1], [2], [3]. One major factor in achieving high accuracy is usually to leverage big data to learn high-level features. The intuitive means is to gather the data centrally and then perform the DL model training. However, data can often be highly private or sensitive. For example, data collected from medical sensors [4] and microphones [5] would be such cases. Consequently, users may resist sharing their data with service/cloud providers to build a DL model. In addition, the data aggregator must pay great attention to the data regulations such as General Data Protection Regulation (GDPR) [6], and California Privacy Rights Act (CPRA) [7]. On the other hand, the centralized data might be mishandled or improperly managed by service providers—e.g., incidentally accessed by unauthorized parties [8], or used for unsolicited analytic, or compromised through the network and system security vulnerabilities—resulting in data breach [9], [10]. Therefore, there is a demand for training DL models without aggregating and accessing sensitive raw data that reside in the client-side [11], [12], [13], [14], [15].

Select All
1.
Y. LeCun, Y. Bengio and G. Hinton, "Deep learning", Nature, vol. 521, no. 7553, pp. 436-444, 2015.
2.
M. Bakator and D. Radosav, "Deep learning and medical diagnosis: A review of literature", Multimodal Technol. Interact., vol. 2, no. 3, 2018.
3.
T. A. Tang, L. Mhamdi, D. McLernon, S. A. Raza Zaidi and M. Ghogho, "Deep learning approach for network intrusion detection in software defined networking", Proc. Int. Conf. Wireless Netw. Mobile Commun., pp. 258-263, 2016.
4.
L. Huang, A. L. Shea, H. Qian, A. Masurkar, H. Deng and D. Liu, "Patient clustering improves efficiency of federated machine learning to predict mortality and hospital stay time using distributed electronic medical records", J. Biomed. Inform., vol. 99, 2019.
5.
"InvenSense announces worlds smallest lowest power I2S microphone for IoT and wearable markets".
6.
"General data protection regulation", Feb. 2016, [online] Available: https://gdpr-info.eu/.
7.
"California privacy rights act", Feb. 2020, [online] Available: https://www.cookiebot.com/en/cpra/.
8.
"Amazon alexa user receives 1700 audio recordings of a stranger through human error".
9.
S. Shastri, M. Wasserman and V. Chidambaram, "The seven sins of personal-data processing systems under GDPR", Proc. 11th USENIX Conf. Hot Topics Cloud Comput., 2019.
10.
"Major biometrics data leak impacts UK metropolitan police banks enterprise companies".
11.
H. Brendan McMahan, "A survey of algorithms and analysis for adaptive online learning", J. Mach. Learn. Res., vol. 18, pp. 90:1-90:50, 2017.
12.
K. Bonawitz et al., "Towards federated learning at scale: System design", Proc. Mach. Learn. Syst., vol. 1, pp. 374-388, 2019.
13.
W. Wu, L. He, W. Lin, R. Mao, C. Maple and S. A. Jarvis, "SAFA: A semi-asynchronous protocol for fast federated learning with low overhead", IEEE Trans. Comput., vol. 70, no. 5, pp. 655-668, May 2021.
14.
O. Gupta and R. Raskar, "Distributed learning of deep neural network over multiple agents", J. Netw. Comput. Appl., vol. 116, pp. 1-8, 2018.
15.
P. Vepakomma, T. Swedish, R. Raskar, O. Gupta and A. Dubey, "No peek: A survey of private distributed deep learning", CoRR, vol. abs/1812.03288, 2018.
16.
C. Thapa, M. A. P. Chamikara and S. Camtepe, "Splitfed: When federated learning meets split learning", 2020, [online] Available: https://arxiv.org/pdf/2004.12088.pdf.
17.
J. Chen and X. Ran, "Deep learning with edge computing: A review", Proc. IEEE, vol. 107, no. 8, pp. 1655-1674, Aug. 2019.
18.
"Teach learn and make with raspberry Pi".
19.
"Federated learning of a recurrent neural network on raspberry pis", Feb. 2019, [online] Available: https://blog.openmined.org/federated-learning-of-a-rnn-on-raspberry-pis/.
20.
X. Zhang, M. Hu, J. Xia, T. Wei, M. Chen and S. Hu, "Efficient federated learning for cloud-based AIoT applications", IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst., vol. 40, no. 11, pp. 2211-2223, Nov. 2021.
21.
Y. Gao et al., "End-to-end evaluation of federated learning and split learning for internet of things", Proc. 39th Int. Symp. Reliable Distrib. Syst., pp. 91-100, 2020.
22.
B. McMahan et al., "Communication-efficient learning of deep networks from decentralized data", pp. 1273-1282, 2017.
23.
T. Li, A. K. Sahu, A. Talwalkar and V. Smith, "Federated learning: Challenges methods and future directions", vol. 37, pp. 50-60, 2020.
24.
P. Vepakomma, O. Gupta, T. Swedish and R. Raskar, "Split learning for health: Distributed deep learning without sharing raw patient data", 2018, [online] Available: http://arxiv.org/abs/1812.00564.
25.
C. Thapa, M. A. P. Chamikara and S. Camtepe, "Advancements of federated learning towards privacy preservation: from federated learning to split learning", 2020, [online] Available: .
26.
S. Kiranyaz, O. Avci, O. Abdeljaber, T. Ince, M. Gabbouj and D. J. Inman, "1D convolutional neural networks and applications: A survey", vol. 151, 2021.
27.
A. Musaddiq, Y. Bin Zikria, O. Hahm, H. Yu, A. K. Bashir and S. Won Kim, "A survey on resource management in IoT operating systems", IEEE Access, vol. 6, pp. 8459-8482, 2018.
28.
A. G. Howard et al., "Mobilenets: Efficient convolutional neural networks for mobile vision applications", 2017.
29.
A. Krizhevsky et al., "Learning multiple layers of features from tiny images" in , Princeton, New Jersey:Citeseer, 2009.
30.
Contact IEEE to Subscribe

References

References is not available for this document.