I. Introduction
Compiler optimization have been proven numerous times again and again with the capability to improve system performance by generating the most efficient binary codes for a particular system architecture. Various studies have shown that an efficient code properly optimized by a good compiler in regard to the target architecture of choice not only improves execution speed, but also improves system power consumption [1], reduces memory subsystem power consumption [2], [3], improves cache and memory performance [4], [5] and have indirect effect in improving I/O performance [6]. While these optimizations have been proven to be beneficial for the system, little to no cost were needed for the upgrade.