I. Introduction
Nowadays, Artificial Intelligence (AI) and Machine Learning (ML) have achieved great success in a variety of applications, including image classification, machine translation and speech recognition [1]-[5]. Considering the increasing volume of training data and the growing complexity of training models, Distributed Machine Learning (DML), in particular DML with data parallelism, has become a common practice to train large amounts of data in an acceptable amount of time [6]-[12].