I. Introduction
In data-based modeling, two of the main aims are good generalization and model interpretability. Model generalization refers to the model's ability to accurately approximate the system output for unseen data. Fundamental to the evaluation of model generalization capability is the concept of cross-validation [1], and one commonly used version is the so-called leave-one-out (LOO) cross-validation. For linear-in-the-parameters models, the leave-one-out mean square error (LOOMSE) can be calculated, by making use of the Sherman–Morrison–Woodbury theorem [2], without actually splitting the training dataset and estimating the associated models. The orthogonal forward regression (OFR) algorithm efficiently constructs parsimonious models [3] by selecting regressors according to their contribution to maximizing the model error reduction ratio (ERR). By incorporating the analytical expression of the LOO errors into the OFR framework, the LOOMSE was proposed [4] as a model term selection criterion, based on the idea that model generalization can be optimized sequentially within the model construction process. Because the LOOMSE measures the expected performance on new data, it reaches an optimal value at a certain model size, so the OFR-based construction procedure terminates automatically without the use of a separate stopping criterion [4]. Similarly, for the two-class classification problem, sparse kernel classifiers can be constructed by sequentially minimizing the LOO misclassification rate (MR) [5].
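To make the analytical LOO computation concrete, the sketch below illustrates the classical PRESS identity that underlies it: for a least-squares fit of a linear-in-the-parameters model, the i-th LOO residual equals the ordinary residual divided by one minus the i-th leverage, so no model is ever refitted. This is a minimal illustration of the identity only, not the OFR algorithm of [3], [4]; the data, dimensions, and noise level are arbitrary choices for demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy linear-in-the-parameters model y = Phi @ theta + noise.
# N samples, M candidate regressors (illustrative sizes only).
N, M = 50, 5
Phi = rng.standard_normal((N, M))
theta_true = rng.standard_normal(M)
y = Phi @ theta_true + 0.1 * rng.standard_normal(N)

# Ordinary least-squares fit on the full training set.
theta, *_ = np.linalg.lstsq(Phi, y, rcond=None)
residuals = y - Phi @ theta

# Leverages h_ii = diag(Phi (Phi^T Phi)^{-1} Phi^T). The PRESS /
# Sherman-Morrison-Woodbury identity gives the i-th LOO residual
# as e_i / (1 - h_ii), so the N held-out fits are never performed.
H_diag = np.einsum('ij,ji->i', Phi, np.linalg.solve(Phi.T @ Phi, Phi.T))
loo_residuals = residuals / (1.0 - H_diag)
loomse = np.mean(loo_residuals ** 2)

# Brute-force check: actually refit N times with one sample held out.
brute = []
for i in range(N):
    mask = np.arange(N) != i
    th_i, *_ = np.linalg.lstsq(Phi[mask], y[mask], rcond=None)
    brute.append(y[i] - Phi[i] @ th_i)
assert np.allclose(loo_residuals, brute)

print(f"Analytical LOOMSE = {loomse:.6f}")
```

In an OFR-style construction, this quantity would be re-evaluated as each candidate term is added, and the search would stop once the LOOMSE ceases to decrease, which is what allows the procedure in [4] to terminate without a separate stopping criterion.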