I. Introduction
Machine learning (ML) models are widely used in many fields, such as spam detection, image classification, and natural language processing [1], [2]. The accuracy of models is closely related to the quality of the training dataset, in addition to well-designed ML algorithms. An experimental study with datasets of 300 million images at Google [3] demonstrates that the performance of models increases as the order of magnitude of training data grows. However, training datasets are usually held by multiple organizations and contain sensitive information. For example, a company wants to build a model to discern the most appropriate time for advertising. The training datasets used for learning the model are extracted from the consumer purchase data recorded by several online shopping sites, and the consumer data contains sensitive information about consumers. Therefore, it is important to protect data privacy in training ML models.