I. Introduction
Thanks to the availability of large amounts of data and increased computational power, CNNs have replaced multiple state-of-the-art techniques in a wide variety of fields, from image analysis, to audio processing. Despite the indubitable gains that CNNs offer in several tasks, a complete understanding of all the intricate and hidden processes that lie behind a CNN-based model has not been reached yet. For instance, researchers are still investigating whether learned features are interpretable [1]. Other authors are studying which portion of a CNN input actually triggers a specific classification result [2]–[4]. Answering these additional questions does not only help to develop more accurate solutions, but it also makes CNNs results easier to explain.