I. Introduction
With scaling benefits ending, more and more designers are now working on Domain-Specific Architectures (DSAs). Especially, because of the recent phenomenal popularity of Artificial Intelligence, the dedicated hardware accelerators with both performance enhancement and energy efficiency for Deep Neural Networks (DNNs) have received tremendous interest [1] [2] [3] [4] both in academia and industry. Among previously proposed custom DNN accelerator architectures, 2D spatial array architecture is a prominent choice [1] [2]. vector-based execution array used in such as NVDLA [5].