An improved KNN text classification algorithm based on Simhash | IEEE Conference Publication | IEEE Xplore