1. INTRODUCTION
Image classification is a research area in computer vision that has gained great attention in recent years mainly to tackle object classification and detection problems [1], [2], [3]. Object states, on the contrary, have not been considered as much as object classification in recent literature. Moreover, object states require further analysis especially for robotics-based applications. Robotic manipulation, task planning, and grasping require knowledge and constant feedback about the state of the environment and objects. For instance, if a robot chef wants to perform the task of chopping an onion, it has to grasp the whole onion, cut it into half, recognize its new state (sliced), grasp it accordingly, and cut it into smaller parts while continuously monitoring the state. Ultimately, the robot needs to recognize the desired state and understand when it has reached the end of the procedure (e.g. chopping). The problem of states has been analyzed in several previous works [4], [5], [6]. Similar to [6] we will address the issue of states in cooking related images.