I. INTRODUCTION
The mortality rate from melanoma has continuously increased over three decades, reporting four to five deaths per 100,000 population [1]. Dermoscopy is the primary tool to diagnose lesions, resulting in limited diagnostic performance due to the visual complexity of different types of lesions and requiring expert input. There are primary two types of skin lesions: Benign (further consisting of nevi, basal, and squamous) and Melanoma. All of them present high intraclass variations, such as their color, texture, shape, and size, making it difficult to distinguish between them. The criteria named ABCDE [2] was introduced to assess the characteristics of lesion type; however, an expert requires intensive knowledge to analyze these properties. The human performance for detecting melanoma is less than 80% [3], with a mean sensitivity of 40% [4], and even less in the case of more complex lesion patterns. Additionally, the traditional clinical diagnostic approaches are expensive, prolonged diagnosis timeframes, and are unavailable to many experts [5]. The mortality rate and complications can be drastically reduced if the disease is detected early. In recent years, traditional machine learning techniques (supervised and unsupervised classification algorithms) [6] achieved remarkable performance in different areas, which are extended to the medical field. In addition, deep learning models represented by CNN gained great success by offering a high detection rate for complex data sets. In the current state of deep learning research towards melanoma classification, there have been no comparative analyses of deep learning performance with raw and processed data to identify whether deep learning models are more effective with cleaned and segmented data.