I. Introduction
Modern computing systems become increasingly complex to satisfy the growing performance demanded by applications. As the number of transistors available on a single chip increases to billions or even larger numbers, chip multiprocessor (CMP) is becoming an attractive platform for high-performance and low-power applications. In a complex CMP system, the communication efficiency among and inside chips is as important as the computation efficiency of individual processors in the system.