1. INTRODUCTION
Humans can focus their auditory attention on a sound of interest in a complex acoustic environment [1]. Researchers have attempted to endow machines with a similar capability through audio source separation, the process of separating all audio sources from their mixture. Audio source separation includes speech separation [2]–[4], music separation [5]–[7], and universal sound separation [8]–[10].