Abstract
Recently, using old or irrelevant images in microblogs to spread false rumors has become increasingly rampant. Therefore, tracking and verifying the sources of images has become essential. In order to solve this problem, this paper provides a real-time, large-scale duplicate image detection method based on multi-feature fusion. This method firstly uses multi-feature fusion to improve retrieval accuracy. Then, by Hbase optimization, it uses a bloom filter and range query to improve retrieval efficiency. Experimental results show that, compared with existing algorithms, this method has higher precision and recall rates. Meanwhile, real-time responsiveness and scalability of the approach also meet real-world needs.











Similar content being viewed by others
Change history
24 February 2017
An erratum to this article has been published.
References
Wang, S.G., Liu, Z.P., Sun, Q.B., et al.: Towards an accurate evaluation of quality of cloud service in service-oriented cloud computing. J. Intell. Manuf. 25, 283–291 (2014)
Wang, S.G., Zheng, Z.B., Wu, Z.P., et al.: Reputation measurement and malicious feedback rating prevention in web service recommendation systems. IEEE Trans. Serv. Comput. 8, 755–767 (2015)
Wang, S.G., Huang, L., Hsu, C.-H., et al.: Collaboration reputation for trustworthy web service selection in social networks. J. Comput. Syst. Sci. 82(1), 130–143 (2016)
Wang B., Li Z.W., Li, M.J. et al.: Large-scale duplicate detection for web image search. In: IEEE International Conference on Multimedia and Expo, pp. 353–356 (2006)
Changick, K.: Content-based image copy detection. Signal Process Image Commun. 18(3), 169–184 (2003)
Li, H.F., Xu, Z.H., Zhou, F.H., et al.: A robust image copy detection scheme using ordinal measure of full DCT coefficients. J. Comput. Res. Dev. 47(10), 1812–1822 (2010)
Chang, E.Y., Wang, J.Z., Li, C., RIME: A replicated image detector for the world-wide-web. In: Proceedings of SPIE Symposium of Voice, Video, and Data Communications, pp. 68–77, Boston, USA, (1998)
Wu, M.N., Lin, C.C., Chang, C.C., Image copy detection with rotating tolerance. In: Proceedings of Computational Intelligence and Security (CIS), pp. 464–469 (2005)
Zhou, F.H., Li, X.W., Xu, Z.H., et al.: Image copy detection with rotation and scaling tolerance. J. Comput. Res. Dev. 46(8), 1349–1356 (2009)
Hsiao, J.H., Chen, C.S., Chien, L.F., Chen, M.S.: A new approach to image copy detection based of extended feature sets. IEEE Trans. Image Process 16(8), 2069–2079 (2007)
Feng, L., Wu, J., Sl, Liu, et al.: Global correlation descriptor: a novel image representation for image retrieval. J. Vis. Commun. Image Represent. 33, 104–114 (2015)
Jegou, H., Perronnin, F., Douze, M., et al.: Aggregating local image descriptors into compact codes. IEEE Trans. Pattern Anal. Mach. Intell. 34, 1704–1716 (2012)
Lowe, D.G. Object recognition from local scale-invariant features. In: Proceedings of the 7th International Conference on computer Vision, pp. 1150–1157 (1999)
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 27, 1615–1630 (2005)
Bosch, A., Zisserman, A., Munoz, X. Scene classification via pLSA. In: Proceedings of the 9th European Conference on Computer Vision, pp. 517–530 (2006)
Ke, Y., Sukthankar, R. PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE computer society conference on Computer Vision and Pattern Recognition, pp. 506–513 (2004)
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Proceedings of the 9th European Conference on Computer Vision, vol 1, pp. 404–417 (2006)
Bauer, J., Sunderhauf N., Protzel P. Comparing several implementations of two recently published feature detectors. In: International Conference on Intelligent and Autonomous Systems, pp. 143–148 (2007)
Matas, J., Chum, O., Urban, M., et al. Robust wide baseline stereo from maximally stable extremal region. In: Proceedings of the 13th British Machine Vision Conference (BMVC), pp. 384–393 (2002)
Calonder, M., Lepetit, V., Strecha, C., et al. BRIEF: binary robust independent elementary features. In: Proceedings of the 11th European Conference on Computer Vision(ECCV), pp. 778–792 (2010)
Rublee, E., Rabaud, V., Konolige, K., et al. ORB: an efficient alternative to SIFT or SURF. In: Proceedings of IEEE International Conference on Computer Vision (ICCV), pp. 2564–2572 (2011)
Tola, E., Lepetit, V., Fua, P. A fast local descriptor for dense matching. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Alahi, A., Ortiz, R., Vandergheynst, P. FREAK: fast retina keypoints. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 510–517 (2012)
Yang, X., Cheng, K.: Local difference binary for ultrafast and distinctive feature description. IEEE Trans. Pattern Anal. Mach. Intell. 36(1), 188–194 (2014)
Indyk, P., Motwani, R. Approximate nearest neighbors: towards removing the curse of dimensionality. In: Proceedings of the 30th Annual ACM Symposium on Theory of Computing, pp. 604–613 (1998)
Broder, A.Z. On the resemblance and containment of documents. In: Proceedings of Compression and Complexity of Sequences, pp. 21–29 (1997)
Moses, S. Similarity estimation techniques from rounding algorithms. In: Proceedings of the 34th Annual ACM Symposium on Theory of Computing, pp. 380–388 (2002)
Shakhnarovich, G., Viola, P.A., Darrell, T. Fast pose estimation with parameter sensitive hashing. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 750–759 (2003)
Torralba, A., Fergus, R., Weiss, Y. Small codes and large image databases for recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8 (2008)
Zhang, D., Wang, J., Cai, D., et al. Self-taught hashing for fast similarity search. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 18–25 (2010)
Zou, F.H., Feng, H., Ling, H.F., et al.: Nonnegative spare coding induced hashing for image copy detection. J. Neurocomput. 105, 81–89 (2013)
Ren, X.F., Ramanan, D.: Histograms of sparse codes for object detection. IEEE Conf. Comput. Vis. Pattern 9(4), 3246–3253 (2013)
Wang, J., Kumar, S., Chang, S.F. Semi-supervised hashing for scalable image retrieval. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3424–3431 (2010)
Bauml, M., Tapaswi, M., Stiefelhagen, R.: Semi-supervised learning with constraints for person identification in multimedia data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern, pp. 3602–3609 (2013)
Yuan, P.S., Sha, C.F., Wang, X.L., et al.: C-approximate nearest neighbor query algorithm based on learning for high-dimensional data. J. Softw. 23(8), 2018–2031 (2012)
Acknowledgments
This research is supported by the National Natural Science Foundation of China (Grant Nos. U1504608 and 81501548) and the Foundation and Cutting-Edge Technologies Research Program of Henan Province (132300410430).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chen, M., Li, Y., Zhang, Z. et al. Real-time, large-scale duplicate image detection method based on multi-feature fusion. J Real-Time Image Proc 13, 557–570 (2017). https://doi.org/10.1007/s11554-016-0632-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11554-016-0632-9