MTCA: A Multimodal Summarization Model Based on Two-Stream Cross Attention | IEEE Conference Publication