I. Introduction
Nowadays, images are being taken and shared to be commented on an unprecedented rate among social networks like Facebook, Twitter, and Flickr. Images in these social media platforms do not exist in isolation and most images on the web carry rich text information including informative and semantic signals like who takes the photo, and where and with whom. Therefore, it is desirable to explore social media context, especially text context information jointly with pixel information, to aid visual recognition tasks on images.