Multimodal Visual-Tactile Representation Learning through Self-Supervised Contrastive Pre-Training | IEEE Conference Publication | IEEE Xplore