I. Introduction
High-level synthesis (HLS) tools have recently reached commercial maturity, enabling high hardware design productivity for field-programmable gate array (FPGA) technology. However, for many applications, there is still a considerable gap between the quality of results produced by HLS tools and those obtained by manual optimized RTL design. Computational bottlenecks are typically located in some critical loops of high-level programs, and hence loop pipelining has emerged as one of the preeminent optimization techniques in HLS.
Motivational code.