I. Introduction
Cellular-connected unmanned aerial vehicle (UAV) networks are becoming integral components of sixth-generation (6G) networks, showcasing various applications (e.g., capture images and videos, and collect various types of sensing data) [1]. The rapid advancement of Artificial Intelligence (AI) technologies has led to AI-aided mobile traffic prediction schemes, previously explored for cost-efficient network resource management in terrestrial networks [2], [3], [4], [5], are now being adapted for UAV networks. By considering the distributed nature of mobile networks and privacy issues, one focus of recent studies is the use of Federated Learning (FL) [6] to create mobile traffic prediction models collaboratively. FL enables clients, such as multi-access edge computing (MEC) servers, to conduct local training and transmit local model parameters instead of entire datasets for server aggregation [7]. This approach helps reduce large transmission delays and processing bottlenecks associated with centralized learning (CL) servers. However, FL imposes a significant computational load on clients for local training [8]. This is particularly challenging when dealing with complex models like deep learning (DL), especially in UAV networks, where limited battery power and computational resources are major constraints.