1. Introduction
Many high-performance object detection algorithms operate top-down and do not exploit grouping or segmentation processes. The best algorithms [9], [26] in the PASCAL VOC challenge [8] fall into this category as do top systems for important applications such as finding people in images [2] and detecting pedestrians specifically [6], [7], [22]. When object segmentation is desired as an output, it is often obtained in a post-processing step, for example, by aligning the predictions of a top-down detector to image contours [2].