Elevating Perception: Unified Recognition Framework and Vision-Language Pre-Training Using Three-Dimensional Image Reconstruction | IEEE Conference Publication | IEEE Xplore