I. Introduction
After SHOT boundary detection, a common next step in content-based video analysis is to group the shots into different scene clusters, each comprising shots with similar contents. One or a few shots can then be selected from each scene cluster to form a compact representation of the video content for such purpose as video indexing, browsing, and understanding [1].