Loading [a11y]/accessibility-menu.js
Improving Chinese text categorization by outlier learning | IEEE Conference Publication | IEEE Xplore

Improving Chinese text categorization by outlier learning


Abstract:

Text categorization is one of the typical machine learning tasks that suffer from an incomplete training data problem. A main reason is the existence of outliers in train...Show More

Abstract:

Text categorization is one of the typical machine learning tasks that suffer from an incomplete training data problem. A main reason is the existence of outliers in training data, such as non-sense documents, documents mislabeled or lying on the border between different categories, and documents that are out of the defined categories, etc. Therefore, in a text categorization task, outlier learning technique could be adopted to improve text categorization. In this paper, an outlier learning based text categorization system is proposed, where AdaBoost algorithm is adopted for outlier identifying. Simulation results reveal that the new system is successful in improving learning performance for text categorization.
Date of Conference: 30 October 2005 - 01 November 2005
Date Added to IEEE Xplore: 27 February 2006
Print ISBN:0-7803-9361-9
Conference Location: Wuhan, China
Citations are not available for this document.

Getting results...

Contact IEEE to Subscribe

References

References is not available for this document.