ProbClean: A probabilistic duplicate detection system | IEEE Conference Publication | IEEE Xplore