I. Introduction
Object detection and localization have recently attracted considerable interest in the computer vision field. These tasks are typically addressed with deep learning, which is known to require large quantities of data. The data must be annotated, meaning that the bounding box information of the different objects present in each image is stored. Annotation is mostly done manually: a human annotator draws each bounding box using a tool, and the bounding box information is then stored. Larger projects can rely on detailed annotation processes with numerous phases and users in varied roles. In smaller projects with limited resources, however, such processes can be difficult to implement.