Multimodal Explanations by Predicting Counterfactuality in Videos | IEEE Conference Publication | IEEE Xplore