skip to main content
10.1145/2740908.2742008acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Scalable Preference Learning from Data Streams

Published:18 May 2015Publication History

ABSTRACT

We study the task of learning the preferences of online readers of news, based on their past choices. Previous work has shown that it is possible to model this situation as a competition between articles, where the most appealing articles of the day are those selected by the most users. The appeal of an article can be computed from its textual content, and the evaluation function can be learned from training data. In this paper, we show how this task can benefit from an efficient algorithm, based on hashing representations, which enables it to be deployed on high intensity data streams. We demonstrate the effectiveness of this approach on four real world news streams, compare it with standard approaches, and describe a new online demonstration based on this technology.

References

  1. Christopher M Bishop. Pattern Recognition and Machine Learning, volume 1. Springer New York, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Léon Bottou. Stochastic Gradient Descent Tricks. In Neural Networks: Tricks of the Trade, pages 421--436. Springer, 2012.Google ScholarGoogle Scholar
  3. Christopher JC Burges. Dimension Reduction. Now Publishers Inc, 2010.Google ScholarGoogle Scholar
  4. Graham Cormode and S Muthukrishnan. An Improved Data Stream Summary: The Count-Min Sketch and its Applications. Journal of Algorithms, 55(1):58--75, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Anirban Dasgupta, Ravi Kumar, and Tamás Sarlós. A Sparse Johnson-Lindenstrauss Transform. In Proceedings of the Forty-Second ACM Symposium on Theory of Computing, pages 341--350. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. I. Flaounas, O. Ali, M. Turchi, T. Snowsill, F. Nicart, T. De Bie, and N. Cristianini. NOAM: News Outlets Analysis and Monitoring System. In SIGMOD 2011, pages 1275--1278. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Ilias Flaounas, Thomas Lansdall-Welfare, Panagiota Antonakaki, and Nello Cristianini. The Anatomy of a Modular System for Media Content Analysis. CoRR, abs/1402.6208, 2014.Google ScholarGoogle Scholar
  9. Elena Hensinger, Ilias Flaounas, and Nello Cristianini. Modelling and Predicting News Popularity. Pattern Analysis and Applications, 16(4):623--635, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Thorsten Joachims. Learning to Classify Text using Support Vector Machines: Methods, Theory and Algorithms. Kluwer Academic Publishers, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Thorsten Joachims. Optimizing Search Engines using Clickthrough Data. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 133--142. ACM, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Christopher D Manning, Prabhakar Raghavan, and Hinrich Schütze. Introduction to Information Retrieval, volume 1. Cambridge University Press Cambridge, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Ricardo Nanculef, Ilias Flaounas, and Nello Cristianini. Efficient Classification of Multi-labelled Text Streams by Clashing. Expert Systems with Applications, 2014.Google ScholarGoogle Scholar
  14. Stephen Robertson. Understanding Inverse Document Frequency: on Theoretical Arguments for IDF. Journal of documentation, 60(5):503--520, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  15. Evan Sandhaus. The New York Times Annotated Corpus. Linguistic Data Consortium, Philadelphia, 6(12), 2008.Google ScholarGoogle Scholar
  16. Kilian Weinberger, Anirban Dasgupta, John Langford, Alex Smola, and Josh Attenberg. Feature Hashing for Large Scale Multitask Learning. In Proceedings of the 26th Annual International Conference on Machine Learning. ACM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Scalable Preference Learning from Data Streams

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      WWW '15 Companion: Proceedings of the 24th International Conference on World Wide Web
      May 2015
      1602 pages
      ISBN:9781450334730
      DOI:10.1145/2740908

      Copyright © 2015 Copyright is held by the International World Wide Web Conference Committee (IW3C2)

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 May 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,899of8,196submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader