Extracting important layout features from the web page content | IEEE Conference Publication | IEEE Xplore