research-article

Fast, expressive top-k matching

Authors:
William Culhane

Purdue University, West Lafayette, IN

Purdue University, West Lafayette, IN
View Profile

,
K. R. Jayaram

IBM Research, Yorktown Heights, NY

IBM Research, Yorktown Heights, NY
View Profile

,
Patrick Eugster

Purdue University, West Lafayette, IN

Purdue University, West Lafayette, IN
View Profile

Middleware '14: Proceedings of the 15th International Middleware ConferenceDecember 2014Pages 73–84https://doi.org/10.1145/2663165.2663326

Published:08 December 2014Publication History

Middleware '14: Proceedings of the 15th International Middleware Conference

Pages 73–84

ABSTRACT

Top-k matching is a fundamental problem underlying on-line advertising platforms, mobile social networks, etc. Distributed processes (e.g., advertisers) specify predicates, which we call subscriptions, for events (e.g., user actions) they wish to react to. Subscriptions define weights for elementary constraints on individual event attributes and do not require that events match all constraints. An event is multicast only to the processes with the k highest match scores for that event -- this score is the aggregation of the weights of all constraints in a subscription matching the event.

However, state-of-the-art approaches to top-k matching support only rigid models of events and subscriptions, which leads to suboptimal matches. We present a novel model of weighted top-k matching which is more expressive than the state-of-the-art, and a corresponding efficient algorithm. Our model supports attributes with intervals, weights specified by producers of events or by subscriptions, negative weights, prorating of matched constraints, and the ability to vary scores dynamically with system parameters. Our algorithm exhibits time and space complexities which are competitive with state-of-the-art algorithms regardless of our added expressiveness -- O(M log N + S log k) and O(M N + k) respectively, with N the number of constraints, M the number of event attributes, and S the number of matching constraints.

Through empirical evaluation with both statistically generated and real-world data we demonstrate that our algorithm is (a) equally or more efficient and scalable than the state-of-the art without exploiting our added expressiveness, and it (b) significantly outperforms existing approaches upgraded -- if possible at all -- to match our expressiveness.

References

M. Aguilera, R. Strom, D. Sturman, M. Astley, and T. Chandra. Matching Events in a Content-based Subscription System. In PODC, 1999. Google ScholarDigital Library
L. Arge and J. Vitter. Optimal Dynamic Interval Management in External Memory. In FOCS, 1996. Google ScholarDigital Library
A. Bhalgat, J. Feldman, and V. Mirrokni. Online Allocation of Display Ads with Smooth Delivery. In KDD, 2012. Google ScholarDigital Library
P. Cao and Z. Wang. Efficient Top-K Query Calculation in Distributed Networks. In PODC, 2004. Google ScholarDigital Library
S.-C. Chu. Viral Advertising in Social Media: Participation in Facebook Groups and Responses among College-aged Users. Journal of Interactive Advertising, 12(1):30--43, 2011.Google ScholarCross Ref
T. Cormen, R. Rivest, C. Leiserson, and C. Stein. Introduction to Algorithms. MIT Press, 2011. Google ScholarDigital Library
W. Culhane, K. Kogan, C. Jayalath, and P. Eugster. LOOM: Optimal Aggregation Overlays for In-Memory Big Data Processing. In HotCloud, 2014. Google ScholarDigital Library
M. Dylla, I. Miliaraki, and M. Theobald. Top-k Query Processing in Probabilistic Databases with Non-materialized Views. In ICDE, 2013. Google ScholarDigital Library
R. Fagin. Combining Fuzzy Information from Multiple Systems. In PODS, 1996. Google ScholarDigital Library
R. Fagin, A. Lotem, and M. Naor. Optimal Aggregation Algorithms for Middleware. In PODS, 2001. Google ScholarDigital Library
R. Fagin and E. L. Wimmers. A Formula for Incorporating Weights into Scoring Rules. Theoretical Computer Science, 239(2):309--338, 2000. Google ScholarDigital Library
A. Goldfarb and C. E. Tucker. Online Advertising, Behavioral Targeting, and Privacy. Commun. ACM, 54(5):25--27, 2011. Google ScholarDigital Library
IMDB. IMDB Movie Ratings. http://www.imdb.com/interfaces.Google Scholar
A. Machanavajjhala, E. Vee, M. Garofalakis, and J. Shanmugasundaram. Scalable Ranked Publish/Subscribe. In VLDB, 2008. Google ScholarDigital Library
M. McGowan. Facebook Rolling Out Video Ads to News Feeds Social Network Gives Brands Four Demographics to Target with 15-Second Spots. Adweek. http://www.adweek.com/news/technology/facebook-rolling-out-video-ads-news-feeds-149239, May 2013.Google Scholar
M. Prigg. Dislike: Over HALF of Facebook Users Say they are Fed Up with Constant Adverts and Sponsored Posts. Mail Online. (shortcut) http://unsourced.org/art/6978, July 2012.Google Scholar
M. Sadoghi and H. Jacobsen. Relevance Matters: Capitalizing on Less (Top-k Matching in Publish/Subscribe). In ICDE, 2012. Google ScholarDigital Library
C. Song, Z. Li, and T. Ge. Top-K Oracle: A New Way to Present Top-K Tuples for Uncertain Data. In ICDE, 2013.Google Scholar
M. A. Stelzner. Social Media Marketing Industry Report. Social Media Examiner. http://www.socialmediaexaminer.com/SocialMediaMarketingReport2011.pdf, 2011.Google Scholar
Yahoo! C15 - Yahoo! Music User Ratings of Musical Tracks, Albums, Artists and Genres v 1.0. Yahoo! Webscope http://webscope.sandbox.yahoo.com.Google Scholar
J. Yan, N. Liu, G. Wang, W. Zhang, Y. Jiang, and Z. Chen. How Much Can Behavioral Targeting Help Online Advertising? In WWW, 2009. Google ScholarDigital Library

Recommendations

Fast Template Matching With Polynomials

Template matching is widely used for many applications in image and signal processing. This paper proposes a novel template matching algorithm, called algebraic template matching. Given a template and an input image, algebraic template matching ...
Read More
A fast bit-parallel multi-patterns string matching algorithm for biological sequences
ISB '10: Proceedings of the International Symposium on Biocomputing

The problem of searching occurrences of a pattern P[0...m-1] in the text T[0...n-1>with m ≤ n, where the symbols of P and T are drawn from some alphabet Σ of size σ, is called exact string matching problem. In the present day, pattern matching is a ...
Read More
A Dual-Bound Algorithm for Very Fast and Exact Template Matching

Recently proposed fast template matching techniques employ rejection schemes derived from lower bounds on the match measure. This paper generalizes that idea and shows that in addition to lower bounds, upper bounds on the match measure can be used to ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
Middleware '14: Proceedings of the 15th International Middleware Conference
December 2014
334 pages
ISBN:9781450327855
DOI:10.1145/2663165
General Chair:
Laurent Réveillère
LaBRI, University of Bordeaux, France
,
Program Chairs:
Lucy Cherkasova
HP Labs, USA
,
François Taïani
Université de Rennes 1 / IRISA, France
Copyright © 2014 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 8 December 2014
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
Conference

Acceptance Rates
Middleware '14 Paper Acceptance Rate27of144submissions,19%Overall Acceptance Rate203of948submissions,21%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 186
  Total Downloads
- Downloads (Last 12 months)4
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Fast, expressive top-k matching

Middleware '14: Proceedings of the 15th International Middleware Conference

ABSTRACT

References

Cited By

Recommendations

Fast Template Matching With Polynomials

A fast bit-parallel multi-patterns string matching algorithm for biological sequences

A Dual-Bound Algorithm for Very Fast and Exact Template Matching

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Fast, expressive top-k matching

Middleware '14: Proceedings of the 15th International Middleware Conference

ABSTRACT

References

Cited By

Recommendations

Fast Template Matching With Polynomials

A fast bit-parallel multi-patterns string matching algorithm for biological sequences

A Dual-Bound Algorithm for Very Fast and Exact Template Matching

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media