DOI: 10.1145/1498759.1498829
research-article

Effective latent space graph-based re-ranking model with global consistency

Published: 09 February 2009

Abstract

Re-ranking algorithms have recently become popular in web search and data mining. However, these algorithms typically treat content and link information separately. Inspired by graph-based machine learning algorithms, we propose a novel and general framework for re-ranking that regularizes the smoothness of ranking scores over a graph, together with a regularizer on the initial ranking scores (obtained from the base ranker). The intuition behind the model is global consistency over the graph: similar entities are likely to have the same ranking scores with respect to a query. Our approach simultaneously incorporates content and other explicit or implicit link information in a latent space graph. A unified re-ranking algorithm is then run on this graph with respect to the query. To illustrate the methodology, we apply the framework to literature retrieval and expert finding on DBLP bibliography data. We compare the proposed method with the initial language model method and with a PageRank-style re-ranking method, and we evaluate it with varying graphs and settings. Experimental results show that the improvement achieved by the proposed method is consistent and promising.
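The abstract does not give the objective function, so the following is only a minimal sketch of one common way to realize "smoothness over the graph plus fidelity to the initial scores": the iterative propagation used in local-and-global-consistency style methods. The function name `rerank`, the trade-off parameter `alpha`, the symmetric normalization, and the toy graph are all assumptions made for illustration, not the authors' implementation.

```python
import math


def rerank(weights, initial_scores, alpha=0.5, iterations=100):
    """Smooth base-ranker scores over a weighted undirected graph.

    weights: symmetric n x n list of lists of non-negative edge weights
             (hypothetical similarity or link strengths between entities).
    initial_scores: one base-ranker score per node.
    alpha: trade-off in [0, 1); larger values favour graph smoothness,
           smaller values keep scores closer to the initial ranking.
    """
    n = len(initial_scores)
    # Node degrees; isolated nodes get degree 1.0 to avoid division by zero.
    degrees = [sum(row) or 1.0 for row in weights]
    # Symmetrically normalised affinity S = D^{-1/2} W D^{-1/2}.
    norm = [
        [weights[i][j] / math.sqrt(degrees[i] * degrees[j]) for j in range(n)]
        for i in range(n)
    ]
    scores = list(initial_scores)
    for _ in range(iterations):
        # f <- alpha * S f + (1 - alpha) * f0
        scores = [
            alpha * sum(norm[i][j] * scores[j] for j in range(n))
            + (1 - alpha) * initial_scores[i]
            for i in range(n)
        ]
    return scores


# Toy example (assumed data): nodes 0 and 1 are linked, node 2 is isolated.
toy_weights = [[0, 1, 0], [1, 0, 0], [0, 0, 0]]
toy_initial = [1.0, 0.0, 0.4]
toy_scores = rerank(toy_weights, toy_initial, alpha=0.5)
```

In this toy graph, node 1 starts below node 2 in the base ranking but ends above it after propagation, because it is linked to the highly scored node 0. For alpha below 1 the iteration converges to the fixed point of f = alpha * S f + (1 - alpha) * f0, which can also be obtained by solving the corresponding sparse linear system directly.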

      Published In

      WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data Mining
      February 2009
      314 pages
      ISBN:9781605583907
      DOI:10.1145/1498759
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. DBLP
      2. expert finding
      3. graph-based re-ranking model
      4. latent space
      5. regularization

      Qualifiers

      • Research-article

      Conference

      WSDM'09

      Acceptance Rates

      Overall Acceptance Rate 498 of 2,863 submissions, 17%

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months): 10
      • Downloads (Last 6 weeks): 2
      Reflects downloads up to 08 Feb 2025

      Citations

      Cited By

      • (2023) Personalized Re-ranking for Recommendation with Mask Pretraining. Data Science and Engineering 8:4 (357-367). DOI: 10.1007/s41019-023-00219-6. Online publication date: 2-Sep-2023
      • (2022) Scene Re-ranking for Recommendation. 2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP) (1-6). DOI: 10.1109/MMSP55362.2022.9949116. Online publication date: 26-Sep-2022
      • (2017) Utilizing human processing for fuzzy-based military situation awareness based on social media. 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) (1-6). DOI: 10.1109/FUZZ-IEEE.2017.8015709. Online publication date: Jul-2017
      • (2017) Multimodal Data fusion for SRGPS antenna motion error reduction. Multimedia Tools and Applications 76:9 (12035-12050). DOI: 10.1007/s11042-016-3972-3. Online publication date: 1-May-2017
      • (2017) A Normalized Framework Based on Multiple Relationships for Document Re-ranking. Information Retrieval (122-135). DOI: 10.1007/978-3-319-68699-8_10. Online publication date: 21-Oct-2017
      • (2016) Landmark Reranking for Smart Travel Guide Systems by Combining and Analyzing Diverse Media. IEEE Transactions on Systems, Man, and Cybernetics: Systems 46:11 (1492-1504). DOI: 10.1109/TSMC.2016.2523948. Online publication date: Nov-2016
      • (2016) Web Image Search Re-Ranking With Click-Based Similarity and Typicality. IEEE Transactions on Image Processing 25:10 (4617-4630). DOI: 10.1109/TIP.2016.2593653. Online publication date: 1-Oct-2016
      • (2015) cluTM: Content and Link Integrated Topic Model on Heterogeneous Information Networks. Web-Age Information Management (207-218). DOI: 10.1007/978-3-319-21042-1_17. Online publication date: 6-Jun-2015
      • (2014) Multimedia search reranking. ACM Computing Surveys 46:3 (1-38). DOI: 10.1145/2536798. Online publication date: 1-Jan-2014
      • (2014) Exploring and inferring user-user pseudo-friendship for sentiment analysis with heterogeneous networks. Statistical Analysis and Data Mining 7:4 (308-321). DOI: 10.1002/sam.11223. Online publication date: 1-Aug-2014