Self-Training Vision Language BERTs With a Unified Conditional Model | IEEE Journals & Magazine | IEEE Xplore