I. Introduction
These years, we are witnessing our planet warming up at an unprecedented rate, causing irreversible change to the climate [1]. Unfortunately, the recent advance in AI plays an increasing role in this tragedy [2]. For example, training a large transformer model would generate the same amount of carbon emissions as five fuel vehicles in their entire lifetime [3]. The study in [4] shows that 40% of the energy comes from centralized cooling while training is performed in data centers. On the other hand, Federated Learning (FL) rises as a new paradigm to preserve user privacy by conducting training on distributed mobile/edge devices [6], [7], [9], [10]. It is shown that federated learning potentially offers better energy efficiency because centralized cooling is no longer required [5].