I. Introduction
Deep architectural models such as convolutional neural networks (CNNs) and generative adversarial networks (GANs) enable the generation of high-dimensional data such as images, [1], [2], [3], [4], [5], [6] and videos [7], [8], [9], [10], [11], [12]. These models can manipulate the given high-dimensional data conditioned on the desired manipulation. For example, image manipulation and editing architectures [13], [14], [15], [16] allow users to transfer the style from another image.