I. Introduction
Lending Club Bank, a prominent online financial institution in the USA, founded in 2007, is the nation’s leading peer-to-peer lending platform, connecting borrowers with investors. Acknowledged as a worldwide leader in person-to-person lending, this financial institution boasts the pioneering achievement of obtaining regulatory approval from the Securities and Exchange Commission (SEC) for its innovative investment products. This paper delves into the utilization of Exploratory Data Analysis (EDA) and Machine Learning to address real-world business challenges. Its focus lies in the meticulous analysis of risk dynamics within the realm of banking and financial services [1], shedding light on the strategic implementation of data-driven methodologies to mitigate the potential for monetary setbacks while extending credit to clientele. In order to make correct predictions, we used three different classifiers, of which the logistic regression classifier [2] was the best. A similar study has been conducted in [3] on a dataset with 1500 rows and 18 features using logistic regression, but in real-time, the data collected to draw accurate inferences is much larger and cannot be handled by traditional data handling techniques. As the amount of data grew and the importance of advanced computing resources became evident, the requirement to embrace big data became crucial [4]. Utilizing a framework that can handle big data and execute machine learning or deep learning tasks on that data can prove to be highly advantageous for developers.