Low-bit Quantization of Neural Networks for Efficient Inference | IEEE Conference Publication | IEEE Xplore