1. Introduction
Deep neural networks (DNNs) have become the dominant acoustic model for speech recognition [1], [2], [3], [4], [5]. The two main frameworks for incorporating DNNs into an existing HMM-based decoder are the hybrid [6] and tandem [7] approaches. In this paper, we focus on the hybrid approach.