skip to main content
10.1145/1526709.1526715acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Fast dynamic reranking in large graphs

Published:20 April 2009Publication History

ABSTRACT

In this paper we consider the problem of re-ranking search results by incorporating user feedback. We present a graph theoretic measure for discriminating irrelevant results from relevant results using a few labeled examples provided by the user. The key intuition is that nodes relatively closer (in graph topology) to the relevant nodes than the irrelevant nodes are more likely to be relevant. We present a simple sampling algorithm to evaluate this measure at specific nodes of interest, and an efficient branch and bound algorithm to compute the top k nodes from the entire graph under this measure. On quantifiable prediction tasks the introduced measure outperforms other diffusion-based proximity measures which take only the positive relevance feedback into account. On the Entity-Relation graph built from the authors and papers of the entire DBLP citation corpus (1.4 million nodes and 2.2 million edges) our branch and bound algorithm takes about 1.5 seconds to retrieve the top 10 nodes w.r.t. this measure with 10 labeled nodes.

References

  1. J. Abernethy, O. Chapelle, and C. Castillo. Web spam identification through content and hyperlinks. In Proc. AIRWEB, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Aldous and J. A. Fill. Reversible Markov Chains. 2001.Google ScholarGoogle Scholar
  3. A. Balmin, V. Hristidis, and Y. Papakonstantinou. ObjectRank: Authority-based keyword search in databases. In VLDB, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. In Proc. WWW, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Chakrabarti. Dynamic personalized pagerank in entity-relation graphs. In WWW, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Y. Freund, R. Iyer, R. E. Schapire, and Y. Singer. An efficient boosting algorithm for combining preferences. J. Mach. Learn. Res., 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. Grady. Random walks for image segmentation. PAMI, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Z. Gyongyi, H. Garcia--Molina, and J. Pedersen. Combating web spam with trustrank. In Proc. VLDB, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. G. Jeh and J. Widom. Scaling personalized web search. In Stanford University Technical Report, 2002.Google ScholarGoogle Scholar
  10. R. Jin, H. Valizadegan, and H. Li. Ranking refinement and its application to information retrieval. In WWW, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Joshi, R. Kumar, B. Reed, and A. Tomkins. Anchor-based proximity measures. In Proc. WWW, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. I. Koutis and G. L. Miller. A linear work, o(n1=6) time, parallel algorithm for solving planar laplacians. In Proc. SODA, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Levin, D. Lischinski, and Y. Weiss. Colorization using optimization. ACM Transactions on Graphics, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Ntoulas, M. Najork, M. Manasse, and D. Fetterly. Detecting spam web pages through content analysis. In Proc. WWW, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. P. Sarkar and A. Moore. A tractable approach to finding closest truncated-commute-time neighbors in large graphs. In Proc. UAI, 2007.Google ScholarGoogle Scholar
  16. P. Sarkar, A. W. Moore, and A. Prakash. Fast incremental proximity search in large graphs. In ICML, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D. Spielman and S. Teng. Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In Proceedings of the STOC'04, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Z. Xu, R. Akella, and Y. Zhang. Incorporating diversity and density in active learning for relevance feedback. In ECIR, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. X. Zhu, Z. Ghahramani, and J. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In ICML, volume 20, 2003.Google ScholarGoogle Scholar

Index Terms

  1. Fast dynamic reranking in large graphs

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader