Models and algorithms for duplicate document detection | IEEE Conference Publication | IEEE Xplore