I. Introduction
Learning to perform grasping with a multi-finger hand has been a long-standing problem in robotics [1], [2], [3], [4]. Using a dexterous hand instead of a parallel gripper offers the robot the flexibility on operating with daily life objects like humans do, but also largely increases the difficulty given the large Degree-of-Freedom of the dexterous hand. A typical method for this task is a 2-step paradigm including grasp pose estimations following by motion planning [5], [6], [7]. Recent works have also studied on using Reinforcement Learning with human demonstration guidance for grasping [8], [9].