Detecting similar HTML documents using a fuzzy set information retrieval approach (IEEE Conference Publication)