Toward Matrix Multiplication for Deep Learning Inference on the Xilinx Versal | IEEE Conference Publication | IEEE Xplore