Energy-Efficient Online Scheduling of Transformer Inference Services on GPU Servers | IEEE Journals & Magazine | IEEE Xplore