Automatic Content Extraction on Semi-structured Documents | IEEE Conference Publication | IEEE Xplore