skip to main content
10.1145/1871437.1871677acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
poster

Online stratified sampling: evaluating classifiers at web-scale

Published:26 October 2010Publication History

ABSTRACT

Deploying a classifier to large-scale systems such as the web requires careful feature design and performance evaluation. Evaluation is particularly challenging because these large collections frequently change. In this paper we adapt stratified sampling techniques to evaluate the precision of classifiers deployed in large-scale systems. We investigate different types of stratification strategies, and then we derive a new online sampling algorithm that incrementally approximates the theoretical optimal disproportionate sampling strategy. In experiments, the proposed algorithm significantly outperforms both simple random sampling as well as other types of stratified sampling, with an average reduction of about 20% in labeling effort to reach the same confidence and interval-bounds on precision

References

  1. J. Allan, B. Carterette, J. A. Aslam, V. Pavlu, B. Dachev, and E. Kanoulas. Million query track 2007 overview. In E. M. Voorhees and L. P. Buckland, editors, The Sixteenth Text REtrieval Conference Proceedings (TREC 2007). National Institute of Standards and Technology, December 2008. NIST Special Publication SP 500-274.Google ScholarGoogle Scholar
  2. P. N. Bennett. Using asymmetric distributions to improve text classifier probability estimates. In SIGIR `03, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Carletta. Assessing agreement in classification tasks: the kappa statistic. Computational Linguistics, 22(2):249--254, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Chaudhuri, G. Das, and V. Narasayya. Optimized stratified sampling for approximate query processing. ACM TODS, 32(2), 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. G. Cormack and T. Lynam. Online supervised spam filter evaluation. ACM TOIS, 25(3), 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P. Dixon, A. Ellison, and N. Gotelli. Improving the precision of estimates of the frequency of rare events. Ecology, 86(5), 2005.Google ScholarGoogle Scholar
  7. S. Fernandes, C. Kamienski, J. Kelner, D. Mariz, and D. Sadok. A stratified traffic sampling methodology for seeing the big picture. Computer Networks, 52:2677--2689, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. X. He, L. Duan, Y. Zhou, and B. Dom. Threshold selection for web-page classification with highly skewed class distribution. In WWW `09, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. L. Kish. Survey Sampling. John Wiley & Sons, Inc., 1965.Google ScholarGoogle Scholar
  10. D. D. Lewis. Evaluating and optimizing autonomous text classification systems. In SIGIR `95, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Netscape Communication Corporation. Open directory project. http://www.dmoz.org.Google ScholarGoogle Scholar
  12. J. C. Platt. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In A. J. Smola, P. Bartlett, B. Scholkopf, and D. Schuurmans, editors, Advances in Large Margin Classifiers. MIT Press, 1999.Google ScholarGoogle Scholar
  13. S. K. Thompson. Sampling. Wiley-Interscience, 2002.Google ScholarGoogle Scholar
  14. E. Yilmaz, E. Kanoulas, and J. A. Aslam. A simple and efficient sampling method for estimating AP and NDCG. In SIGIR `08, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. B. Zadrozny. Learning and evaluating classifiers under sample selection bias. In ICML `04, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. B. Zadrozny and C. Elkan. Reducing multiclass to binary by coupling probability estimates. In KDD '02, 2002.Google ScholarGoogle Scholar
  17. T. Zseby. Stratification strategies for sampling-based non-intrusive measurements of one-way delay. In Passive and Active Measurement Workshop (PAM 2003), 2003.Google ScholarGoogle Scholar

Index Terms

  1. Online stratified sampling: evaluating classifiers at web-scale

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '10: Proceedings of the 19th ACM international conference on Information and knowledge management
      October 2010
      2036 pages
      ISBN:9781450300995
      DOI:10.1145/1871437

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 October 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader