skip to main content
10.1145/2771937.2771938acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Ultra-Fast Similarity Search Using Ternary Content Addressable Memory

Published:31 May 2015Publication History

ABSTRACT

Similarity search, and specifically the nearest-neighbor search (NN) problem is widely used in many fields of computer science such as machine learning, computer vision and databases. However, in many settings such searches are known to suffer from the notorious curse of dimensionality, where running time grows exponentially with d. This causes severe performance degradation when working in high-dimensional spaces. Approximate techniques such as locality-sensitive hashing [2] improve the performance of the search, but are still computationally intensive.

In this paper we propose a new way to solve this problem using a special hardware device called ternary content addressable memory (TCAM). TCAM is an associative memory, which is a special type of computer memory that is widely used in switches and routers for very high speed search applications. We show that the TCAM computational model can be leveraged and adjusted to solve NN search problems in a single TCAM lookup cycle, and with linear space. This concept does not suffer from the curse of dimensionality and is shown to improve the best known approaches for NN by more than four orders of magnitude. Simulation results demonstrate dramatic improvement over the best known approaches for NN, and suggest that TCAM devices may play a critical role in future large-scale databases and cloud applications.

References

  1. M. Aly, M. Munich, and P. Perona. Indexing in large scale image collections: Scaling properties and benchmark. In WACV, pages 418--425, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Andoni and P. Indyk. Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Comm. of the ACM, 51(1):117--122, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. The r*-tree: An efficient and robust access method for points and rectangles. In SIGMOD, pages 322--331, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. S. Beis and D. G. Lowe. Shape indexing using approximate nearest-neighbour search in high-dimensional spaces. In CVPR, pages 1000--1006, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. L. Bentley. Multidimensional divide-and-conquer. Comm. of the ACM, 23(4):214--229, 1980. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Borodin, R. Ostrovsky, and Y. Rabani. Lower bounds for high dimensional nearest neighbor search and related problems. In STOC, pages 312--321, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Brown and D. G. Lowe. Recognising panoramas. In ICCV, volume 3, page 1218, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. I. Corp. Intel Xeon processor E7-4870, 2011. http://ark.intel.com/products/53579/.Google ScholarGoogle Scholar
  9. V. Garcia, E. Debreuve, F. Nielsen, and M. Barlaud. K-nearest neighbor search: Fast GPU-based implementations and application to high-dimensional feature matching. In ICIP, pages 3757--3760, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  10. F. Gray. Pulse code communication. US Patent 2,632,058, March 17 1953 (filed November 13 1947).Google ScholarGoogle Scholar
  11. C. Inc. NEURON search processors, 2014. http://bit.ly/1uSaH8q.Google ScholarGoogle Scholar
  12. P. Indyk and R. Motwani. Approximate nearest neighbors: Towards removing the curse of dimensionality. In STOC, pages 604--613, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. W. Jiang, Q. Wang, and V. K. Prasanna. Beyond TCAMs: An SRAM-based parallel multi-pipeline architecture for terabit IP lookup. In INFOCOM, pages 1786--1794, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  14. K. Lakshminarayanan, A. Rangarajan, and S. Venkatachary. Algorithms for advanced packet classification with ternary CAMs. In SIGCOMM, pages 193--204, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. L. Liang, C. Liu, Y.-Q. Xu, B. Guo, and H.-Y. Shum. Real-time texture synthesis by patch-based sampling. ACM ToG, 20(3):127--150, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Lloyd. Least squares quantization in pcm. IEEE Trans. Inf. Theor., 28(2):129--137, Sep 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D. G. Lowe. Object recognition from local scale-invariant features. In ICCV, pages 1150--1157, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Nvidia. Tesla K80 GPU accelerator, Nov. 2014. http://bit.ly/1C69bVd.Google ScholarGoogle Scholar
  19. A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV, 42:145--175, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Open Networking Foundation. OpenFlow Switch Specification Version 1.3.2, April 2013.Google ScholarGoogle Scholar
  21. J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In CVPR, pages 1--8, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  22. V. Ravikumar and R. N. Mahapatra. TCAM architecture for IP lookup using prefix properties. Micro, IEEE, 24(2):60--69, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Renesas Electronics America Inc. 20Mbit QUAD-search content addressable memory. http://bit.ly/18hYySx.Google ScholarGoogle Scholar
  24. B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman. Labelme: A database and web-based tool for image annotation. IJCV, 77(1--3):157--173, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. H. Samet. Foundations of multidimensional and metric data structures. Morgan Kaufmann, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. R. Shinde, A. Goel, P. Gupta, and D. Dutta. Similarity search and locality sensitive hashing using ternary content addressable memories. In SIGMOD, pages 375--386, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. R. Weber, H.-J. Schek, and S. Blott. A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In VLDB, pages 194--205, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Ultra-Fast Similarity Search Using Ternary Content Addressable Memory

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              DaMoN'15: Proceedings of the 11th International Workshop on Data Management on New Hardware
              May 2015
              100 pages
              ISBN:9781450336383
              DOI:10.1145/2771937

              Copyright © 2015 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 31 May 2015

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article
              • Research
              • Refereed limited

              Acceptance Rates

              DaMoN'15 Paper Acceptance Rate12of16submissions,75%Overall Acceptance Rate80of102submissions,78%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader