Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions | IEEE Conference Publication | IEEE Xplore