Journals & Magazines >IEEE Transactions on Cybernet... >Volume: 48 Issue: 2

WoCE: A framework for Clustering Ensemble by Exploiting the Wisdom of Crowds Theory

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

The wisdom of crowds (WOCs), as a theory in the social science, gets a new paradigm in computer science. The WOC theory explains that the aggregate decision made by a gro...Show More

Metadata

Abstract:

The wisdom of crowds (WOCs), as a theory in the social science, gets a new paradigm in computer science. The WOC theory explains that the aggregate decision made by a group is often better than those of its individual members if specific conditions are satisfied. This paper presents a novel framework for unsupervised and semisupervised cluster ensemble by exploiting the WOC theory. We employ four conditions in the WOC theory, i.e., diversity, independency, decentralization, and aggregation, to guide both constructing of individual clustering results and final combination for clustering ensemble. First, independency criterion, as a novel mapping system on the raw data set, removes the correlation between features on our proposed method. Then, decentralization as a novel mechanism generates high quality individual clustering results. Next, uniformity as a new diversity metric evaluates the generated clustering results. Further, weighted evidence accumulation clustering method is proposed for the final aggregation without using thresholding procedure. Experimental study on varied data sets demonstrates that the proposed approach achieves superior performance to state-of-the-art methods.

Published in: IEEE Transactions on Cybernetics ( Volume: 48, Issue: 2, February 2018)

Page(s): 486 - 499

Date of Publication: 04 January 2017

ISSN Information:

PubMed ID: 28060718

DOI: 10.1109/TCYB.2016.2642999

Funding Agency:

Contents

I. Introduction

Clustering, the art of discovering meaningful patterns in the unlabeled data sets, is one of the main tasks in machine learning. Semisupervised clustering is a branch of clustering methods that uses prior supervision information, such as labeled data, known data associations, or pairwise constraints, to aid the clustering process. This paper focuses on pairwise constraints, i.e., pairs of instances known as belonging to the same cluster (must-link constraints) or different clusters (cannot-link constraints). Pairwise constraints arise naturally in many real tasks and have been widely used in semisupervised clustering. There is a wide range of issues in the clustering methods. For instance, individual clustering algorithms provide different accuracies in a complex data set because they generate the clustering results by optimizing a local or global function instead of natural relations between data points [1]–[4]. As another example, pairwise constraints often result in highly unstable clustering performance, whereas they have the potential to improve clustering accuracy in practice [5], [6].

References is not available for this document.

MIT Libraries

MIT Libraries

WoCE: A framework for Clustering Ensemble by Exploiting the Wisdom of Crowds Theory

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

WoCE: A framework for Clustering Ensemble by Exploiting the Wisdom of Crowds Theory

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References