Introduction
Machine learning is the process of separating information from data so that it may be transformed into useful goals. Moreover, machine learning along with computer vision has augmented domains such as medical diagnostics, scientific research, and statistical data analysis. Also, it is being used in Email intelligence, social network, banking and finance, and personal smart assistant. People are influenced by how machine learning and artificial intelligence have changed our life by making them easy. Is it correct? In daily life directly or indirectly, the machine learning models are ruling humans, from what to eat, where to go and what to read too. But from where do these predictions or suggestions come from? We design machine learning models to do this all, train, and test it on large datasets. The datasets are the collection of data collected from various sources. But the threat for the machine learning algorithms is the dataset has the chance of poisoning attacks. The poisoning attacks mean adding bad data to the dataset used to train and test or misleading the training models. Data poisoning attacks have the potential to reduce the accuracy of algorithms, which could be detrimental. The machine learning models expect to have the highest accuracy possible. If data set with poisoning attacks will be used to train the healthcare diagnosis algorithms, banking, financing, evolution, and assessments, other decision-sensitive areas, might be harmful financially, physically as well as mentally. To anticipate and deal with poisoning attacks detection must be done. Detection of poisoning attacks on datasets is beneficial to distinguish between true data and poisoned data. The goal is to identify the deep learning method Recurrent Neural Network (RNN), the Naive Baye’s Classifier (NBC), and the support vector machine (SVM) as the models with the highest accuracy for detecting data poisoning attacks.