I. Introduction
Dexterous in-hand manipulation is one of the essential functions of robots in human-robot interaction, intelligent manufacturing, telemanipulation, and assisted living. Still, it is hard to solve due to the high degrees of freedom in control and the complex interaction with the object. Deep Reinforcement Learning (DRL) [1] has shown its abilities [2] to solve dexterous in-hand manipulation thanks to its learning capability, which enables the robot to find a control policy by interacting with the environment through exploration.