Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection | IEEE Conference Publication | IEEE Xplore