1. Introduction
After the breakthrough of deep learning technology [1]–[8], speech recognition accuracy has improved dramatically. Recently, speech recognition systems are widely used not only in smart phones and Personal Computers (PCs) but also in standalone devices in far-field environments. Examples include voice assistant systems such as Amazon Alexa, Google Home [9], [10], and Samsung Bixby [11].