Variable Entropy of Noise in Evaluation of Effectiveness of Context Usage by Machine Learning Methods

Abstract:

Solving a problem requires knowledge of its context, that is, the information needed to solve it. With the advent of Big Data and ML, huge amounts of data are collected in the hope of developing systems that will then find solutions to specified problems. This requires algorithms to discover and effectively use the context hidden within the provided data. A system more adept at this task can be regarded as more computationally aware of the requirements, data sources and methods that are important for solving given problems. This is a highly desirable property which helps to achieve goals. In this article we present a method for estimating the effectiveness of context usage by machine learning algorithms. It is based on a comparison of machine learning models trained on data containing various forms of injected context created with the use of noise of varying entropy levels. Finally, we give the results of applying this method to selected machine learning algorithms and benchmark problems from the ICxS Contextual Data repository.

Published in: 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

Date of Conference: 07-10 October 2018

Date Added to IEEE Xplore: 17 January 2019

DOI: 10.1109/SMC.2018.00129

Conference Location: Miyazaki, Japan

I. Introduction

We define the context as the set of elements of the data which are needed to solve the problem defined by that data. In the case of classification problems represented by real-life benchmark data sets, one can say that a given data set includes the whole context if one finds an algorithm that solves the problem with 100% accuracy by analyzing only that data. But if we do not know such an algorithm, we also do not know what portion of the context exists within the data. This is because we cannot say whether the limited accuracy of ML models on a given data set is caused by the lack of some part of the context within the data, or by the fact that the data includes all the information needed to solve the problem but known training algorithms cannot find it or properly use it. In such a situation, until we know the perfect solution, we do not know whether it can exist or how close known methods are to the best possible solution. This makes most real-life benchmark data sets poorly suited to evaluating the properties of ML algorithms.
