I. Introduction
Cloud services have been increasingly adopted by organisations and individuals [1]. This trend is accelerated by the roll-out of 5G mobile networks and the pandemic. Cloud vendors are motivated to provide a diverse range of services appealing to customers. As a major cloud vendor, we have observed the rapid growth of cloud traffic in recent years. We start experiencing more often CPU overloading at our NFV-based middleboxes on the path of east-west traffic (VM-VM) (Fig. 3). By looking into individual CPU overloading occurrences, we found that CPU overloading was caused by heavy-hitter flows (Top-1 flows in Fig. 2). In a multi-tenant cloud, due to infrastructure sharing, CPU overloading may lead to packet losses and long latency across all the tenants whose traffic goes through the CPU, even if the majority of the tenants do not contribute to the heavy hitters.