skip to main content
10.1145/2663165.2663326acmconferencesArticle/Chapter ViewAbstractPublication PagesmiddlewareConference Proceedingsconference-collections
research-article

Fast, expressive top-k matching

Published:08 December 2014Publication History

ABSTRACT

Top-k matching is a fundamental problem underlying on-line advertising platforms, mobile social networks, etc. Distributed processes (e.g., advertisers) specify predicates, which we call subscriptions, for events (e.g., user actions) they wish to react to. Subscriptions define weights for elementary constraints on individual event attributes and do not require that events match all constraints. An event is multicast only to the processes with the k highest match scores for that event -- this score is the aggregation of the weights of all constraints in a subscription matching the event.

However, state-of-the-art approaches to top-k matching support only rigid models of events and subscriptions, which leads to suboptimal matches. We present a novel model of weighted top-k matching which is more expressive than the state-of-the-art, and a corresponding efficient algorithm. Our model supports attributes with intervals, weights specified by producers of events or by subscriptions, negative weights, prorating of matched constraints, and the ability to vary scores dynamically with system parameters. Our algorithm exhibits time and space complexities which are competitive with state-of-the-art algorithms regardless of our added expressiveness -- O(M log N + S log k) and O(M N + k) respectively, with N the number of constraints, M the number of event attributes, and S the number of matching constraints.

Through empirical evaluation with both statistically generated and real-world data we demonstrate that our algorithm is (a) equally or more efficient and scalable than the state-of-the art without exploiting our added expressiveness, and it (b) significantly outperforms existing approaches upgraded -- if possible at all -- to match our expressiveness.

References

  1. M. Aguilera, R. Strom, D. Sturman, M. Astley, and T. Chandra. Matching Events in a Content-based Subscription System. In PODC, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Arge and J. Vitter. Optimal Dynamic Interval Management in External Memory. In FOCS, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Bhalgat, J. Feldman, and V. Mirrokni. Online Allocation of Display Ads with Smooth Delivery. In KDD, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. Cao and Z. Wang. Efficient Top-K Query Calculation in Distributed Networks. In PODC, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S.-C. Chu. Viral Advertising in Social Media: Participation in Facebook Groups and Responses among College-aged Users. Journal of Interactive Advertising, 12(1):30--43, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  6. T. Cormen, R. Rivest, C. Leiserson, and C. Stein. Introduction to Algorithms. MIT Press, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. W. Culhane, K. Kogan, C. Jayalath, and P. Eugster. LOOM: Optimal Aggregation Overlays for In-Memory Big Data Processing. In HotCloud, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Dylla, I. Miliaraki, and M. Theobald. Top-k Query Processing in Probabilistic Databases with Non-materialized Views. In ICDE, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. R. Fagin. Combining Fuzzy Information from Multiple Systems. In PODS, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. R. Fagin, A. Lotem, and M. Naor. Optimal Aggregation Algorithms for Middleware. In PODS, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Fagin and E. L. Wimmers. A Formula for Incorporating Weights into Scoring Rules. Theoretical Computer Science, 239(2):309--338, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Goldfarb and C. E. Tucker. Online Advertising, Behavioral Targeting, and Privacy. Commun. ACM, 54(5):25--27, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. IMDB. IMDB Movie Ratings. http://www.imdb.com/interfaces.Google ScholarGoogle Scholar
  14. A. Machanavajjhala, E. Vee, M. Garofalakis, and J. Shanmugasundaram. Scalable Ranked Publish/Subscribe. In VLDB, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. McGowan. Facebook Rolling Out Video Ads to News Feeds Social Network Gives Brands Four Demographics to Target with 15-Second Spots. Adweek. http://www.adweek.com/news/technology/facebook-rolling-out-video-ads-news-feeds-149239, May 2013.Google ScholarGoogle Scholar
  16. M. Prigg. Dislike: Over HALF of Facebook Users Say they are Fed Up with Constant Adverts and Sponsored Posts. Mail Online. (shortcut) http://unsourced.org/art/6978, July 2012.Google ScholarGoogle Scholar
  17. M. Sadoghi and H. Jacobsen. Relevance Matters: Capitalizing on Less (Top-k Matching in Publish/Subscribe). In ICDE, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. C. Song, Z. Li, and T. Ge. Top-K Oracle: A New Way to Present Top-K Tuples for Uncertain Data. In ICDE, 2013.Google ScholarGoogle Scholar
  19. M. A. Stelzner. Social Media Marketing Industry Report. Social Media Examiner. http://www.socialmediaexaminer.com/SocialMediaMarketingReport2011.pdf, 2011.Google ScholarGoogle Scholar
  20. Yahoo! C15 - Yahoo! Music User Ratings of Musical Tracks, Albums, Artists and Genres v 1.0. Yahoo! Webscope http://webscope.sandbox.yahoo.com.Google ScholarGoogle Scholar
  21. J. Yan, N. Liu, G. Wang, W. Zhang, Y. Jiang, and Z. Chen. How Much Can Behavioral Targeting Help Online Advertising? In WWW, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    Middleware '14: Proceedings of the 15th International Middleware Conference
    December 2014
    334 pages
    ISBN:9781450327855
    DOI:10.1145/2663165

    Copyright © 2014 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 8 December 2014

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    Middleware '14 Paper Acceptance Rate27of144submissions,19%Overall Acceptance Rate203of948submissions,21%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader