I. Introduction
Wireless unmanned aerial vehicles (UAVs) network is foreseen to be an integral component in the upcoming sixth-generation (6G) networks [1], [2], which has the potential to support various applications, such as video streaming and disaster surveillance. In these scenarios, the UAVs (users) fly over the target area under the control of the base stations (BSs) to collect data (e.g., images and videos) and then transmit them to the BS for data processing. Each user can observe its local environment and collects a sub-dataset that only contains partial environment information, so all the users should transmit their local sub-datasets to the BS for integrated data processing (e.g., training machine learning (ML) models, including deep neural network (DNN) and convolutional neural network (CNN)). However, the transmission of large datasets causes high communication overhead and large energy consumption for the users, and may potentially reveal user privacy [3]. Thanks to the increased computational capability brought by GPUs, distributed learning becomes more attractive by enabling local learning at users and model aggregation at the BS via only sharing model parameters rather than raw data.