Early or Late Fusion Matters: Efficient RGB-D Fusion in Vision Transformers for 3D Object Recognition | IEEE Conference Publication | IEEE Xplore