Measuring documents similarity in large corpus using MapReduce algorithm | IEEE Conference Publication | IEEE Xplore