1 Introduction
OVER the last decade, we have witnessed an explosive growth in the data accessible on the Web. However, there are two fundamental issues regarding the effectiveness of information gathering from the Web: mismatch and overload. Mismatch means some useful and interesting data has been overlooked, whereas overload means some gathered data is not what users want.