skip to main content
10.1145/1081870.1081911acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

SVM selective sampling for ranking with application to data retrieval

Published:21 August 2005Publication History

ABSTRACT

Learning ranking (or preference) functions has been a major issue in the machine learning community and has produced many applications in information retrieval. SVMs (Support Vector Machines) - a classification and regression methodology - have also shown excellent performance in learning ranking functions. They effectively learn ranking functions of high generalization based on the "large-margin" principle and also systematically support nonlinear ranking by the "kernel trick". In this paper, we propose an SVM selective sampling technique for learning ranking functions. SVM selective sampling (or active learning with SVM) has been studied in the context of classification. Such techniques reduce the labeling effort in learning classification functions by selecting only the most informative samples to be labeled. However, they are not extendable to learning ranking functions, as the labeled data in ranking is relative ordering, or partial orders of data. Our proposed sampling technique effectively learns an accurate SVM ranking function with fewer partial orders. We apply our sampling technique to the data retrieval application, which enables fuzzy search on relational databases by interacting with users for learning their preferences. Experimental results show a significant reduction of the labeling effort in inducing accurate ranking functions.

References

  1. K. Brinker. Active learning of label ranking functions. In Proc. Int. Conf. Machine Learning (ICML'04), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. N. Bruno, L. Gravano, and A. Marian. Evaluating top-k queries over web-accessible databases. In Proc. Int. Conf. Data Engineering (ICDE'02), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. C. J. C. Burges. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2:121--167, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. E. Chang and S. Tong. Support vector machine active learning for image retrieval. In ACM Multimedia 2001, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Christianini and J. Shawe-Taylor. An Introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. W. W. Cohen, R. E. Schapire, and Y. Singer. Learning to order things. In Proc. Advances in Neural Information Processing Systems (NIPS'98), 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Furnkranz and E. Hullermeier. Pairwise preference learning and ranking. In Proc. European Conf. Machine Learning (ECML'03), 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Har-Peled, D. Roth, and D. Zimak. Constraint classification: A new approach to multiclass classification and ranking. In Proc. Advances in Neural Information Processing Systems (NIPS'02), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. R. Herbrich, T. Graepel, and K. Obermayer, editors. Large margin rank boundaries for ordinal regression. MIT-Press, 2000.Google ScholarGoogle Scholar
  10. V. Hristidis, N. Koudas, and Y. Papakonstantinou. PREFER: A system for the efficient execution of multi-parametric ranked queries. Proceedings ACM SIGMOD International Conference on Management of Data, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Joachims. Optimizing search engines using clickthrough data. In Proc. ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining (KDD'02), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. G. Schohn and D. Cohn. Less is more: Active learning with support vector machines. In Proc. Int. Conf. Machine Learning (ICML'00), pages 839--846, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. S. Tong and D. Koller. Support vector machine active learning with applications to text classification. In Proc. Int. Conf. Machine Learning (ICML'00), pages 999--1006, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. V. N. Vapnik. Statistical Learning Theory. John Wiley and Sons, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. H. Yu, S. Hwang, and K. C.-C. Chang. Rankfp: A framework for supporting rank formulation and processing. In Proc. Int. Conf. Data Engineering (ICDE'05), 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. SVM selective sampling for ranking with application to data retrieval

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
        August 2005
        844 pages
        ISBN:159593135X
        DOI:10.1145/1081870

        Copyright © 2005 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 21 August 2005

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader