1 Introduction
Image and video segmentation is useful in many applications for identifying regions of interest in a scene or annotating the data. The MPEG-4 standard needs segmentation for object-based video coding. However, the problem of unsupervised segmentation is ill-defined because semantic objects do not usually correspond to homogeneous spatiotemporal regions in color, texture, or motion. Some of the recent work in image segmentation include stochastic model-based approaches [1], [6], [13], [17], [24], [25], morphological watershed-based region growing [18], energy diffusion [14], and graph partitioning [20]. The work on video segmentation includes motion-based segmentation [3], [19], [21], [23], spatial segmentation and motion tracking [8], [22], moving objects extraction [12], [15], and region growing using spatiotemporal similarity [4], [16]. Quantitative evaluation methods have also been suggested [2].