I. Introduction
The goal of visual servoing (VS) techniques is to control a dynamic system, such as a robot, by using the data provided by one or multiple cameras [13], [6]. Classical approaches rely on the extraction, tracking and matching of a set of visual features. These features, generally points, lines, or moments, are used as inputs to a control law that allows a robust positioning of the robot. While there has been progress in extracting and tracking the relevant features, a new approach called direct visual servoing (DVS, eg, [10], [7], [16]) was introduced recently to consider the image as a whole, which does not require anymore feature extraction nor tracking. The main drawback of DVS is its small convergence domain compared to classical techniques, which is due to the high non-linearities of the cost function to be minimized.