skip to main content
10.1145/3064176.3064190acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article

High-Throughput Subset Matching on Commodity GPU-Based Systems

Published: 23 April 2017 Publication History

Abstract

Large-scale information processing often relies on subset matching for data classification and routing. Examples are publish/subscribe and stream processing systems, database systems, social media, and information-centric networking. For instance, an advanced Twitter-like messaging service where users might follow specific publishers as well as specific topics encoded as tag sets must join a stream of published messages with the users and their preferred tag sets so that the user tag set is a subset of the message tags.
Subset matching is an old but also notoriously difficult problem. We present TagMatch, a system that solves this problem by taking advantage of a hybrid CPU/GPU stream processing architecture. TagMatch targets large-scale applications with thousands of matching operations per seconds against hundreds of millions of tag sets. We evaluate TagMatch on an advanced message streaming application, with very positive results both in absolute terms and in comparison with existing systems. As a notable example, our experiments demonstrate that TagMatch running on a single, commodity machine with two GPUs can easily sustain the traffic throughput of Twitter even augmented with expressive tag-based selection.

References

[1]
B. Ahlgren, C. Dannewitz, C. Imbrenda, D. Kutscher, and B. Ohlman. A survey of information-centric networking. IEEE Communications Magazine, 50(7):26--36, 2012.
[2]
P. Bouros, N. Mamoulis, S. Ge, and M. Terrovitis. Set containment join revisited. Knowledge and Information Systems, pages 1--28, 2015.
[3]
C.-Y. Chan and Y. E. Ioannidis. Bitmap index design and evaluation. In Proceedings of the International Conference on Management of Data, SIGMOD '98, pages 355--366, 1998.
[4]
A. Farroukh, E. Ferzli, N. Tajuddin, and H.-A. Jacobsen. Parallel event processing for content-based publish/subscribe systems. In Proceedings of the International Conference on Distributed Event-Based Systems, DEBS '09, pages 8:1--8:4, 2009.
[5]
S. Helmer and G. Moerkotte. Evaluation of main memory join algorithms for joins with set comparison join predicates. In Proceedings of the International Conference on Very Large Data Bases, VLDB '97, pages 386--395, 1997.
[6]
L. Hong, G. Convertino, and E. H. Chi. Language matters in twitter: A large scale study. In ICWSM, 2011.
[7]
R. Jampani and V. Pudi. Using prefix-trees for efficiently computing set joins. In Proceedings of the International Conference on Database Systems for Advanced Applications, DASFAA '05, pages 761--772, 2005.
[8]
H. Kwak, C. Lee, H. Park, and S. Moon. What is twitter, a social network or a news media? In Proceedings of the 19th International Conference on World Wide Web, WWW'10, pages 591--600, 2010.
[9]
Y. Luo, G. H. L. Fletcher, J. Hidders, and P. D. Bra. Efficient and scalable trie-based algorithms for computing set containment relations. In Proceedings of the International Conference on Data Engineering, ICDE '15, pages 303--314, 2015.
[10]
N. Mamoulis. Efficient processing of joins on set-valued attributes. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, SIGMOD '03, pages 157--168, 2003.
[11]
A. Margara and G. Cugola. High-performance publish-subscribe matching using parallel hardware. IEEE Transactions on Parallel and Distributed Systems, 25(1):126--135, 2014.
[12]
S. Melnik and H. Garcia-Molina. Adaptive algorithms for set containment joins. ACM Transactions on Database Systems, 28 (1):56--99, 2003.
[13]
T. Morzy and M. Zakrzewicz. Group bitmap index: A structure for association rules retrieval. In Proceedings of the International Conference on Knowledge Discovery and Data Mining, KDD '98, pages 284--288, 1998.
[14]
M. Papalini, A. Carzaniga, K. Khazaei, and A. L. Wolf. Scalable routing for tag-based information-centric networking. In Proceedings of the 1st International Conference on Information-centric Networking, ICN '14, pages 17--26, 2014.
[15]
M. Papalini, K. Khazaei, A. Carzaniga, and D. Rogora. High throughput forwarding for ICN with descriptors and locators. In Proceedings of the 2016 Symposium on Architectures for Networking and Communications Systems, ANCS '16, pages 43--54, 2016.
[16]
D. Perino, M. Varvello, L. Linguaglossa, R. Laufer, and R. Boislaigue. Caesar: A content router for high-speed forwarding on content names. In Proceedings of the Symposium on Architectures for Networking and Communications Systems, ANCS '14, pages 137--148, 2014.
[17]
K. Ramasamy, J. M. Patel, J. F. Naughton, and R. Kaushik. Set containment joins: The good, the bad and the ugly. In Proceedings of the International Conference on Very Large Data Bases, VLDB '00, pages 351--362, 2000.
[18]
R. Rantzau. Processing frequent itemset discovery queries by division and set containment join operators. In Proceedings of the SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, DMKD '03, pages 20--27, 2003.
[19]
R. L. Rivest. Partial-match retrieval algorithms. SIAM Journal on Computing, 5(1):19--50, 1976.
[20]
K. H. Tsoi, I. Papagiannis, M. Migliavacca, W. Luk, and P. Pietzuch. Accelerating publish/subscribe matching on reconfigurable supercomputing platforms. In Many-Core and Reconfigurable Supercomputing Conference, MRSC '10, 2010.
[21]
Y. Wang, Y. Zu, T. Zhang, K. Peng, Q. Dong, B. Liu, W. Meng, H. Dai, X. Tian, Z. Xu, H. Wu, and D. Yang. Wire speed name lookup: A gpu-based approach. In Proceedings of the USENIX Conference on Networked Systems Design and Implementation, NSDI '13, pages 199--212, 2013.
[22]
Y. Wang, B. Xu, D. Tai, J. Lu, T. Zhang, H. Dai, B. Zhang, and B. Liu. Fast name lookup for named data networking. In Proceedings of the International Symposium of Quality of Service, IWQoS '14, pages 198--207, 2014.
[23]
T. W. Yan and H. Garcia-Molina. Index structures for selective dissemination of information under the Boolean model. ACM Transactions on Database Systems, 19(2):332--364, 1994.
[24]
H. Yuan and P. Crowley. Reliably scalable name prefix lookup. In Proceedings of the Symposium on Architectures for Networking and Communications Systems, ANCS '15, pages 111--121, 2015.

Cited By

View all
  • (2019)A Parallel-Computing Algorithm for High-Energy Physics Particle Tracking and Decoding Using GPU ArchitecturesIEEE Access10.1109/ACCESS.2019.29272617(91612-91626)Online publication date: 2019

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
EuroSys '17: Proceedings of the Twelfth European Conference on Computer Systems
April 2017
648 pages
ISBN:9781450349383
DOI:10.1145/3064176
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 April 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. GPU-based processing
  2. Subset matching
  3. message selection and dissemination

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

EuroSys '17
Sponsor:
EuroSys '17: Twelfth EuroSys Conference 2017
April 23 - 26, 2017
Belgrade, Serbia

Acceptance Rates

Overall Acceptance Rate 241 of 1,308 submissions, 18%

Upcoming Conference

EuroSys '25
Twentieth European Conference on Computer Systems
March 30 - April 3, 2025
Rotterdam , Netherlands

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2019)A Parallel-Computing Algorithm for High-Energy Physics Particle Tracking and Decoding Using GPU ArchitecturesIEEE Access10.1109/ACCESS.2019.29272617(91612-91626)Online publication date: 2019

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media