Video object detection based on the spatial-temporal convolution feature memory model | IEEE Conference Publication | IEEE Xplore