I. Introduction
Machine learning have shown outstanding performance in a variety of applications since the recent advent of largescale datasets. This success is contingent on the availability of large amounts of labeled data [1]-[3], which is both costly and time-consuming to gather [4]-[6]. Many techniques are introduced in the literature to reduce the high labeling cost by using non-expert annotators Amazon’s Mechanical Turk [4][8]; however, the use of nun experts often results in falsely labeled data also known as noisy labels [4]– [7]. In real-world datasets, the percentage of incorrect labels has been observed to range from 5% to 38% [8]– [12].