I. Introduction
Video interpretation, or video understanding, has been one of the most exciting research areas in recent years [1]–[3]. Users generally remember video content in terms of events or stories, so there is a need to organize video content into small, logical units that correspond to the conceptual chunks in users' memory. Such a unit is called a story. News video is a specific kind of video, and segmenting it into story units is an important step towards effective processing and management of large news video archives. Story segmentation is not easy, however, because video data is essentially a combination of text, video, sound, images, and other media. These media are not isolated; they are semantically associated with one another. We therefore analyze the comprehensive semantic information contained in the video data in order to obtain video clips that meet users' requirements.