Privacy-preserving Training Algorithm for Naive Bayes Classifiers


Abstract:

The growing popularity of machine learning (ML), which benefits from high-quality training datasets collected from multiple organizations, raises natural questions about the privacy guarantees that can be provided in such settings. Our work tackles this problem in the context of multi-party secure ML, wherein multiple organizations provide their sensitive datasets to a data user and train a Naive Bayes (NB) model with the data user. We propose PPNB, a privacy-preserving scheme for training NB models based on Homomorphic Cryptosystem (HC) and Differential Privacy (DP). PPNB achieves a balanced tradeoff between efficiency and accuracy in multi-party secure ML, enabling flexible switching among different tradeoffs via parameter tuning. Extensive experimental results validate the effectiveness of PPNB.
Date of Conference: 16-20 May 2022
Date Added to IEEE Xplore: 11 August 2022
Conference Location: Seoul, Korea, Republic of


I. Introduction

Machine learning (ML) models are widely used in many fields, such as spam detection, image classification, and natural language processing [1], [2]. The accuracy of a model depends not only on a well-designed ML algorithm but also on the quality of its training dataset. An experimental study at Google with a dataset of 300 million images [3] demonstrates that model performance increases as the order of magnitude of training data grows. However, training datasets are usually held by multiple organizations and contain sensitive information. For example, consider a company that wants to build a model to discern the most appropriate time for advertising. The training datasets for the model are extracted from consumer purchase records held by several online shopping sites, and those records contain sensitive information about consumers. It is therefore important to protect data privacy when training ML models.
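To make the DP component of such a scheme concrete, the sketch below trains a categorical Naive Bayes model from class and feature counts and perturbs each count with Laplace noise before the counts are released. This is an illustrative, simplified sketch only: it shows the count-perturbation idea behind DP-based NB training, not the actual PPNB protocol, and it omits the homomorphic-encryption aggregation step entirely. All function names (`dp_nb_train`, `dp_nb_predict`) and the per-count sensitivity assumption are ours, not from the paper.

```python
import math
import random

def dp_nb_train(samples, labels, epsilon, seed=0):
    """Train a categorical Naive Bayes model from noisy counts.

    Each released count is perturbed with Laplace(1/epsilon) noise;
    adding or removing one record changes any single count by at most 1,
    so sensitivity 1 is assumed per released count. Illustrative only.
    """
    rng = random.Random(seed)

    def laplace(scale):
        # Inverse-CDF sampling of a Laplace(0, scale) random variable.
        u = rng.random() - 0.5
        return -scale * math.copysign(math.log(1 - 2 * abs(u)), u)

    classes = sorted(set(labels))
    n_features = len(samples[0])
    scale = 1.0 / epsilon

    class_counts = {}   # noisy count of records per class
    feat_counts = {}    # noisy count of (class, feature index, value)
    for c in classes:
        idx = [i for i, y in enumerate(labels) if y == c]
        # Clamp to a small positive floor so logs stay defined.
        class_counts[c] = max(len(idx) + laplace(scale), 1e-9)
        for j in range(n_features):
            values = {samples[i][j] for i in range(len(samples))}
            for v in values:
                cnt = sum(1 for i in idx if samples[i][j] == v)
                feat_counts[(c, j, v)] = max(cnt + laplace(scale), 1e-9)

    total = sum(class_counts.values())
    priors = {c: class_counts[c] / total for c in classes}
    return priors, class_counts, feat_counts

def dp_nb_predict(x, priors, class_counts, feat_counts):
    """Standard NB prediction over the noisy model: argmax of log-posterior."""
    best, best_score = None, -math.inf
    for c in priors:
        score = math.log(priors[c])
        for j, v in enumerate(x):
            num = feat_counts.get((c, j, v), 1e-9)
            score += math.log(num / class_counts[c])
        if score > best_score:
            best_score, best = score, c
    return best
```

With a large epsilon the noise is negligible and predictions match plain Naive Bayes; shrinking epsilon strengthens the privacy guarantee at the cost of noisier counts and lower accuracy, which mirrors the efficiency/accuracy tuning discussed in the abstract.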

