I. Introduction
The proliferation of large language models (LLMs) marks a monumental leap in the realms of artificial intelligence and natural language processing. These models, with their deep structures and vast parameter sizes, offer capabilities that redefine the benchmarks of machine-human interactions for 6G [2]. However, the very nature of their size and intricacy means they cannot be effortlessly deployed, especially in constrained environments like mobile devices [3].