1. Introduction
In recent years discrete optimization has emerged as an important tool in solving Computer Vision problems. This has primarily been the result of the increasing use of energy minimization algorithms such as graph cuts [4], [11], tree-reweighted message passing [10], [25] and variants of belief propagation (BP) [17], [26]. These algorithms allow us to perform approximate inference (i.e. obtain the MAP estimate) on graphical models such as Markov Random Fields (MRF) and Conditional Random Fields (CRF) [14].