I. Introduction
Generative statistical models of shape and appearance variations built based on the ideas presented in a series of seminal papers [1]–[4] have a long history in biomedical image analysis [5]. Over the years, they have been applied to many different kinds of shape and appearance representations including point sets [2], dense deformation fields [6], level-sets [7], image patches [8], and full images [1], [9]. Largely independent of the actual type of data representation used, those models always rely on the same general idea: Valid instances of the data modeled lie close to or in a low-dimensional manifold embedded in a high-dimensional space. More specifically, this manifold is assumed to be an affine subspace of the embedding space. The translation vector of the subspace is usually the sample mean of the training population and its orthonormal basis encodes the main directions of variation seen in the training set. Due to their linear nature, simple and computationally efficient closed-form solutions for the transport between the latent subspace and the embedding space exist.