1 Introduction
A long-standing goal of machine vision has been to build a system which is able to recognize many different kinds of objects in a cluttered world. Although the general problem remains unsolved, progress has been made on restricted versions of this goal. One succesful special case considers the problem of detecting individual instances of highly textured objects, such as magazine covers or toys, despite clutter, occlusion, and affine transformations. The method exploits features which are invariant to various transformations, yet which are very specific to a particular object [24], [31]. This can be used to solve tasks such as “find an object that looks just like this one,” where the user presents a specific instance, but it cannot be used to solve tasks such as “find an object that looks like a car,” which requires learning an appearance model of a generic car.