1. Introduction
While the field of visual category recognition has seen rapid progress in recent years, much remains to be done to reach human level performance. The best current approaches can deal with 100 or so categories, e.g. the CUReT dataset for materials, and the Caltech-101 dataset for objects; this is still a long way from the the estimate of 30,000 or so categories that humans can distinguish. Another significant feature of human visual recognition is that it can be trained with very few examples, cf. machine learning approaches to digits and faces currently require hundreds if not thousands of examples.