I. Introduction
Recent advancements in smart devices (e.g., new chip-sets in phones and smart cars) have made them capable of collecting large amounts of data in real-time. In the example case of smart cars, collection of more than 4 TB of data per day is predicted [1]. Utilizing this data for training a machine learning (ML) model (e.g., for driving assistance) is the main motivation for implementing distributed ML techniques over the network edge in 6G-and-beyond systems [2].