Active transductive KNN for sparsely labeled text classification | IEEE Conference Publication | IEEE Xplore

Active transductive KNN for sparsely labeled text classification


Abstract:

Sparsely labeled classification may exist in many real-world applications and it is more challenging than the problems most existing semi-supervised learning/active learn...Show More

Abstract:

Sparsely labeled classification may exist in many real-world applications and it is more challenging than the problems most existing semi-supervised learning/active learning algorithms considered. In this paper, an active transductive framework is proposed for sparsely labeled text classification. It integrates the advantages of semi-supervised learning and active learning, and employs several techniques to cope with the training data bias and sparsity. A batch mode active learning strategy is used to enhance the performance of semi-supervised learning. The fusion of active learning with rechecking strategy, as well as the employment of common feature extraction technique, makes our framework robust to the training data bias and sparsity. Experimental results on several real data sets show that the proposed classification framework is more effective and efficient for sparsely labeled text classification compared with several state-of-the-art methods.
Date of Conference: 20-24 November 2012
Date Added to IEEE Xplore: 22 April 2013
ISBN Information:
Conference Location: Kobe, Japan
No metrics found for this document.

I. Introduction

Sparsely labeled classification may exist in many real-world applications such as content-based image retrieval, online web-page recommendation, object identification and text categorization, where the abundant unlabeled instances are available but the labeled ones are fairly expensive to obtain since manually labeling the training data for a machine learning algorithm is a tedious and time-consuming process, and even unpractical(e.g., online web-page recommendation). Correspondingly, one important challenge for large-scale text categorization is how to reduce the number of labeled documents that are required for building reliable text classifier.

Usage
Select a Year
2024

View as

Total usage sinceApr 2013:129
00.20.40.60.811.2JanFebMarAprMayJunJulAugSepOctNovDec000000000010
Year Total:1
Data is updated monthly. Usage includes PDF downloads and HTML views.
Contact IEEE to Subscribe

References

References is not available for this document.