I. Introduction
In recent years and in many fields, ever more datasets have been generated for similar purposes. However, because they are from different sources, they are not directly comparable. In an effort to bring the information in different datasets closer to comparability, we want a generic way to express the information content of one dataset relative to another. The information content of interest may be the equivalent independent sample size for a set of dependent data, or the minimum number of parameters required for good data summarization, or may be a representation of the data in term of a different model. In the latter case, we get a virtual sample that is as similar as possible to the original dataset.