Multi-modal Remote Sensing Image Description Based on Word Embedding and Self-Attention Mechanism | IEEE Conference Publication | IEEE Xplore