Loading [MathJax]/extensions/MathMenu.js
Effect of Training Data Order for Machine Learning | IEEE Conference Publication | IEEE Xplore

Effect of Training Data Order for Machine Learning


Abstract:

For many Machine Learning algorithms on supervised learning problems, the order of training data samples can affect the quality of the derived model and the accuracy of p...Show More

Abstract:

For many Machine Learning algorithms on supervised learning problems, the order of training data samples can affect the quality of the derived model and the accuracy of predictions. This paper describes a project to quantify this effect, and to statistically quantify the variation exhibited by several algorithms using permutations of a given training data set. It is demonstrated that this variation can be quite significant, and that training data set ordering should be an important consideration when approaching a classification task.
Date of Conference: 05-07 December 2019
Date Added to IEEE Xplore: 20 April 2020
ISBN Information:
Conference Location: Las Vegas, NV, USA
Citations are not available for this document.

1. Background

Supervised Machine Learning for classification is the process wherein an algorithm develops a method of assigning class labels to input data based on example input / output pairs, or “training data”. Many such algorithms exist and have demonstrated success in a variety of contexts. Some of these algorithms are training data order invariant, meaning the same classification model will result from the same training data regardless of the order in which the individual samples are presented to the algorithm. Others, however, can vary based on the training data order.

Cites in Papers - |

Cites in Papers - IEEE (3)

Select All
1.
Zaur Kh. Kalazhokov, Yan T. Makoveichuk, "Methodology for Training Data Science Skills Based on Competitions on the Kaggle Platform", 2023 Seminar on Information Computing and Processing (ICP), pp.94-96, 2023.
2.
Majdi Richa, Jean-Christophe Prévotet, Mickaël Dardaillon, Mohamad Mroué, Abed Ellatif Samhat, "Automated Training Data Construction using Measurements for High-Level Learning-Based FPGA Power Modeling", 2022 International Conference on Smart Systems and Power Management (IC2SPM), pp.170-175, 2022.
3.
Mukhamad Angga Gumilang, Trismayanti Dwi Puspitasari, Hermawan Arief Putranto, Abdul Kholiq, Asep Samsudin, "Sentiment Analysis Based on Tweet Reply at Public Figure Account using Machine Learning and Latent Semantic Analysis", 2022 8th International Conference on Science and Technology (ICST), vol.1, pp.1-6, 2022.

Cites in Papers - Other Publishers (6)

1.
Hamed Esmaeili, Maryam Avateffazeli, Meysam Haghshenas, Reza Rizvi, "A Hybrid Framework for Characterizing and Benchmarking Fatigue S‐N Curves in Aluminum Alloys by Integrating Empirical and Data‐Driven Approaches", Fatigue & Fracture of Engineering Materials & Structures, 2024.
2.
Majdi Richa, Jean-Christophe Prévotet, Mickaël Dardaillon, Mohamad Mroué, Abed Ellatif Samhat, "High-Level Online Power Monitoring of FPGA IP Based on Machine Learning", Design and Architecture for Signal and Image Processing, vol.13879, pp.107, 2023.
3.
Poonam, Neera Batra, "Evaluation of Various Machine Learning Based Existing Stress Prediction Support Systems (SPSSs) for COVID-19 Pandemic", Advanced Network Technologies and Intelligent Computing, vol.1798, pp.408, 2023.
4.
J.T. Fel, C.T. Ellis, N.B. Turk-Browne, "Automated and manual segmentation of the hippocampus in human infants", Developmental Cognitive Neuroscience, pp.101203, 2023.
5.
Claudia Mazo, Claudia Aura, Arman Rahman, William M. Gallagher, Catherine Mooney, "Application of Artificial Intelligence Techniques to Predict Risk of Recurrence of Breast Cancer: A Systematic Review", Journal of Personalized Medicine, vol.12, no.9, pp.1496, 2022.
6.
Vo Van Hai, Ho Le Thi Kim Nhung, Huynh Thai Hoc, "Empirical Evidence in Early Stage Software Effort Estimation Using Data Flow Diagram", Software Engineering and Algorithms, vol.230, pp.632, 2021.
Contact IEEE to Subscribe

References

References is not available for this document.