1. INTRODUCTION
Real-world datasets are often heterogeneous and incomplete, meaning they contain different types of data that range from continuous to ordinal and categorical, with missing values at random locations. For example, electronic health records of hospitals might contain different clinical measurements, diagnoses, and demographic information about their patients [1]. They do not only contain different numerical lab values but also variables like race and blood type that are categorical. Moreover, the two types are often with missing values for various reasons.