1. Introduction
Automated instrument identification from surgical video frames will aid the development of intelligent applications in instrument tracking, pose estimation, augmented reality overlay, visual servoing, analysis of surgical phase, and will aid development of safety mechanisms for surgery. In addition to benefiting different components in surgery, the position information of instruments also helps in measuring the context-awareness of a surgeon, potentially helping to reduce human errors. Bounding box recognition of instruments is not sufficient due to the coarse boundaries generated around the regions of interest. The dense prediction of tool versus background through semantic segmentation enhances safety by parsing the whole scene without suppressing occluded or slanted objects.