I. Introduction
Classification, the task of assigning a sample to a category based on its statistical characteristics in order to predict unknown variables or class labels, has become one of the most fundamental topics in data mining and machine learning [1]–[9], [and references therein], as supervised learning algorithms are extensively employed to categorize new instances using the statistical structure of a training sample. Bayesian networks [1]–[3] are commonly used to address classification problems, in which a learning algorithm attempts to build a classifier from a given training sample with class labels. The primary goal of a classification algorithm is therefore to construct a classifier, and the Naive Bayes (NB) classifier stands out among existing classification algorithms not only for its simplicity [1]–[7] but also for its robust performance [1]–[3], [5]–[7]. In this context, it is worth emphasizing that NB has been applied to both binary and multi-class classification problems in many real-world applications and performs remarkably well [10], [and references therein]. However, the NB classifier has two major shortcomings [11]. One is unreliable probability estimation; the other, and perhaps the more significant, is the unrealistic assumption of conditional independence among attributes. Enhancements proposed to address this latter shortcoming include feature weighting [12]–[19].
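To make the conditional-independence assumption concrete, the following minimal sketch (our own illustration; the function names and toy data are not from any cited work) estimates class priors P(c) and per-attribute conditionals P(x_i | c) from counts with Laplace smoothing, then scores a test instance as P(c | x) ∝ P(c) ∏_i P(x_i | c):

```python
import math
from collections import Counter, defaultdict

def train_nb(X, y, alpha=1.0):
    """Fit a categorical Naive Bayes model from attribute tuples X and labels y."""
    class_counts = Counter(y)
    # cond_counts[(i, v, c)] = how often attribute i takes value v in class c
    cond_counts = defaultdict(float)
    values = defaultdict(set)  # distinct values seen for each attribute index
    for x, c in zip(X, y):
        for i, v in enumerate(x):
            cond_counts[(i, v, c)] += 1
            values[i].add(v)
    return class_counts, cond_counts, values, alpha, len(y)

def predict_nb(model, x):
    """Return the class maximizing log P(c) + sum_i log P(x_i | c)."""
    class_counts, cond_counts, values, alpha, n = model
    best, best_score = None, float("-inf")
    for c, nc in class_counts.items():
        score = math.log(nc / n)  # log prior
        for i, v in enumerate(x):
            # Laplace-smoothed conditional probability P(x_i = v | c)
            num = cond_counts[(i, v, c)] + alpha
            den = nc + alpha * len(values[i])
            score += math.log(num / den)
        if score > best_score:
            best, best_score = c, score
    return best

# Toy usage: two categorical attributes (outlook, temperature)
X = [("sunny", "hot"), ("sunny", "mild"), ("rain", "mild"), ("rain", "cool")]
y = ["no", "no", "yes", "yes"]
model = train_nb(X, y)
print(predict_nb(model, ("rain", "mild")))  # → yes
```

The product over attributes is exactly where the independence assumption enters: each P(x_i | c) is estimated separately, ignoring correlations among attributes, which is what feature-weighting enhancements attempt to compensate for.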