I. Introduction
Recently, the emergence of Large Language Model (LLM) has propelled artificial intelligence (AI) into unprecedented realms [1], ushering in a new era of text generation capabilities. The advent of LLMs not only advances AI in the realm of textual comprehension and generation but also catalyzes developments in related fields. Their integration with robotics undeniably injects fresh vigor into embodied intelligence.