Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos | IEEE Conference Publication | IEEE Xplore