1. Introduction
This paper introduces semantic texton forests, and demonstrates their use for image categorization and semantic segmentation; see Figure 1. Our aim is to show that one can build powerful texton codebooks without computing expensive filter-banks or descriptors, and without performing costly k-means clustering and nearest-neighbor assignment. Semantic texton forests (s TFS) fulfill both criteria. They are randomized decision forests that use only simple pixel comparisons on local image patches, performing both an implicit hierarchical clustering into semantic textons and an explicit local classification of the patch category. Our results show that STFs improve the state-of-the-art in both quantitative performance and execution speed. Semantic texton forests. (a) Test image, with ground truth in-set. Semantic texton forests very efficiently compute (b) a set of semantic textons per pixel and (c) a rough local segmentation prior. Our algorithm uses both textons and priors as features to give coherent semantic segmentation (d), and even finds the building unmarked in the ground truth. Colors show texton indices in (b), but categories corresponding to the ground truth in (c) and (d).