research-article

Effective latent space graph-based re-ranking model with global consistency

Authors:

Michael R. Lyu,

Irwin KingAuthors Info & Claims

WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data Mining

Pages 212 - 221

https://doi.org/10.1145/1498759.1498829

Published: 09 February 2009 Publication History

Abstract

Recently the re-ranking algorithms have been quite popular for web search and data mining. However, one of the issues is that those algorithms treat the content and link information individually. Inspired by graph-based machine learning algorithms, we propose a novel and general framework to model the re-ranking algorithm, by regularizing the smoothness of ranking scores over the graph, along with a regularizer on the initial ranking scores (which are obtained by the base ranker). The intuition behind the model is the global consistency over the graph: similar entities are likely to have the same ranking scores with respect to a query. Our approach simultaneously incorporates the content with other explicit or implicit link information in a latent space graph. Then an effective unified re-ranking algorithm is performed on the graph with respect to the query. To illustrate our methodology, we apply the framework to literature retrieval and expert finding applications on DBLP bibliography data. We compare the proposed method with the initial language model method and another PageRank-style re-ranking method. Also, we evaluate the proposed method with varying graphs and settings. Experimental results show that the improvement in our proposed method is consistent and promising.

References

[1]

Dblp bibliography. URL:http://www.informatik.uni-trier.de/~ey/db/.

[2]

Expert lists. URL:http://keg.cs.tsinghua.edu.cn/project/psn/dataset.html.

[3]

A. Agarwal and S. Chakrabarti. Learning random walks to rank nodes in graphs. In Proceedings of the 24th International Conference on Machine Learning, pages 9--16, 2007.

Digital Library

[4]

A. Agarwal, S. Chakrabarti, and S. Aggarwal. Learning to rank networked entities. In Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 14--23, 2006.

Digital Library

[5]

R. Baeza-Yates, B. Ribeiro-Neto, et al. Modern information retrieval. Addison-Wesley Harlow, England, 1999.

Digital Library

[6]

K. Balog, L. Azzopardi, and M. de Rijke. Formal models for expert finding in enterprise corpora. In Proceedings of the 29th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 43--50, 2006.

Digital Library

[7]

C. Buckley and E. M. Voorhees. Retrieval evaluation with incomplete information. In Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 25--32, 2004.

Digital Library

[8]

C. J. C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. N. Hullender. Learning to rank using gradient descent. In Proceedings of the 22nd International Conference on Machine Learning, pages 89--96, 2005.

Digital Library

[9]

Y. Cao, J. Liu, S. Bao, and H. Li. Research on expert search at enterprise track of trec 2005. In Proceedings of TREC 2005, 2005.

[10]

D. Cohn and H. Chang. Learning to probabilistically identify authoritative documents. In Proceedings of the 17th International Conference on Machine Learning, pages 167--174, 2000.

Digital Library

[11]

D. A. Cohn and T. Hofmann. The missing link - a probabilistic model of document content and hypertext connectivity. In Advances in Neural Information Processing Systems, pages 430--436, 2000.

[12]

T. Davis. Direct Methods for Sparse Linear Systems. Society for Industrial Mathematics, 2006.

Digital Library

[13]

S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6):391--407, 1990.

[14]

H. Deng, I. King, and M. R. Lyu. Formal Models for Expert Finding on DBLP Bibliography Data. In Proceedings of the 8th IEEE International Conference on Data Mining, 2008.

Digital Library

[15]

F. Diaz. Regularizing ad hoc retrieval scores. In Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, pages 672--679, 2005.

Digital Library

[16]

H. Fang and C. Zhai. Probabilistic models for expert finding. Proceedings of the 29th European Conference on Information Retrieval (ECIR), 2007.

Digital Library

[17]

T. Hofmann. Probabilistic latent semantic indexing. In Proceedings of the 22nd ACM SIGIR Conference on Research and Development in Information Retrieval, pages 50--57, 1999.

Digital Library

[18]

R. Jin, H. Valizadegan, and H. Li. Ranking refinement and its application to information retrieval. In Proceedings of the 17th International Conference on World Wide Web, pages 397--406, 2008.

Digital Library

[19]

J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM), 46(5):604--632, 1999.

Digital Library

[20]

O. Kurland and L. Lee. Pagerank without hyperlinks: structural re-ranking using links induced by language models. In Proceedings of the 28nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 306--313, 2005.

Digital Library

[21]

O. Kurland and L. Lee. Respect my authority!: Hits without hyperlinks, utilizing cluster-based language models. In Proceedings of the 29nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 83--90, 2006.

Digital Library

[22]

Q. Mei, D. Cai, D. Zhang, and C. Zhai. Topic modeling with network regularization. In Proceedings of the 17th International Conference on World Wide Web, pages 101--110, 2008.

Digital Library

[23]

E. Minkov, W. W. Cohen, and A. Y. Ng. Contextual search and name disambiguation in email using graphs. In Proceedings of the 29th ACM SIGIR Conference on Research and Development in Information Retrieval, pages 27--34, 2006.

Digital Library

[24]

Z. Nie, Y. Zhang, J.-R. Wen, and W.-Y. Ma. Object-level ranking: bringing order to web objects. In Proceedings of the 14th International Conference on World Wide Web, pages 567--574, 2005.

Digital Library

[25]

L. Page and S. Brin. The anatomy of a large-scale hypertextual web search engine. In Proceedings of the 7th International Conference on World Wide Web, 98, 1998.

Digital Library

[26]

D. Petkova and W. B. Croft. Hierarchical language models for expert finding in enterprise corpora. In 18th IEEE International Conference on Tools with Artificial Intelligence, pages 599--608, 2006.

Digital Library

[27]

J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. In ACM SIGIR Conference on Research and Development in Information Retrieval, pages 275--281, 1998.

Digital Library

[28]

T. Qin, T.-Y. Liu, X.-D. Zhang, D.-S. Wang, W.-Y. Xiong, and H. Li. Learning to rank relational objects and its application to web search. In Proceedings of the 17th International Conference on World Wide Web, pages 407--416, 2008.

Digital Library

[29]

A. Smola and R. Kondor. Kernels and regularization on graphs. Conference on Learning Theory, COLT/KW, 2003.

[30]

C. Zhai and J. D. Lafferty. Two-stage language models for information retrieval. In Proceedings of the 25th ACM SIGIR Conference on Research and Development in Information Retrieval, pages 49--56, 2002.

Digital Library

[31]

C. Zhai and J. D. Lafferty. A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst., 22(2):179--214, 2004.

Digital Library

[32]

B. Zhang, H. Li, Y. Liu, L. Ji, W. Xi, W. Fan, Z. Chen, and W.-Y. Ma. Improving web search results using affinity graph. In Proceedings of the 28th ACM SIGIR Conference on Research and Development in Information Retrieval, pages 504--511, 2005.

Digital Library

[33]

D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Schölkopf. Learning with local and global consistency. In Advances in Neural Information Processing Systems, 2003.

Digital Library

[34]

D. Zhou, S. Zhu, K. Yu, X. Song, B. L. Tseng, H. Zha, and C. L. Giles. Learning multiple graphs for document recommendations. In Proceedings of the 17th International Conference on World Wide Web, pages 141--150, 2008.

Digital Library

[35]

S. Zhu, K. Yu, Y. Chi, and Y. Gong. Combining content and link for classification using matrix factorization. In Proceedings of the 30th ACM SIGIR Conference on Research and Development in Information Retrieval, pages 487--494, 2007.

Digital Library

[36]

X. Zhu, Z. Ghahramani, and J. D. Lafferty. Semi-supervised learning using gaussian fields and harmonic functions. In Proceedings of the 20th International Conference on Machine Learning, pages 912--919, 2003.

Digital Library

Cited By

Han PZhou SYu JXu ZChen LShang S(2023)Personalized Re-ranking for Recommendation with Mask PretrainingData Science and Engineering10.1007/s41019-023-00219-68:4(357-367)Online publication date: 2-Sep-2023
https://doi.org/10.1007/s41019-023-00219-6
Han PShang S(2022)Scene Re-ranking for Recommendation2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP)10.1109/MMSP55362.2022.9949116(1-6)Online publication date: 26-Sep-2022
https://doi.org/10.1109/MMSP55362.2022.9949116
Miao STang Z(2017)Utilizing human processing for fuzzy-based military situation awareness based on social media2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)10.1109/FUZZ-IEEE.2017.8015709(1-6)Online publication date: Jul-2017
https://doi.org/10.1109/FUZZ-IEEE.2017.8015709
Show More Cited By

Index Terms

Effective latent space graph-based re-ranking model with global consistency
1. Information systems
  1. Information retrieval
    1. Information retrieval query processing
    2. Retrieval models and ranking

Recommendations

A General Model for Mutual Ranking Systems
ACIIDS 2014: Proceedings, Part I, of the 6th Asian Conference on Intelligent Information and Database Systems - Volume 8397

Ranking has been applied in many domains using recommendation systems such as search engine, e-commerce, and so on. We will introduce and study N-linear mutual ranking, which can rank n classes of objects at once. The ranking scores of these classes are ...
The impact of author ranking in a library catalogue
BooksOnline '11: Proceedings of the 4th ACM workshop on Online books, complementary social media and crowdsourcing

The field of information retrieval has witnessed over 50 years of research on retrieval methods for metadata descriptions and controlled indexing languages, the prototypical example being the library catalogue. It seems only natural to resort to ...
Finding what is missing from a digital library: A case study in the Computer Science field

This article proposes a process to retrieve the URL of a document for which metadata records exist in a digital library catalog but a pointer to the full text of the document is not available. The process uses results from queries submitted to Web ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data Mining

February 2009

314 pages

ISBN:9781605583907

DOI:10.1145/1498759

Editors:
Ricardo Baeza-Yates
Yahoo! Research, Spain
,
Paolo Boldi
Universita degli Studi di Milano, Italy
,
Berthier Ribeiro-Neto
Google Engineering, Brazil & CS Dept., Univ. Fed. de Minas Gerais, Brazil
,
B. Barla Cambazoglu
Yahoo! Research

Copyright © 2009 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMOD: ACM Special Interest Group on Management of Data
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Yahoo! Research
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
Nokia
Google Inc.
SIGIR: ACM Special Interest Group on Information Retrieval
Microsoft: Microsoft

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 February 2009

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Research Grants Council, University Grants Committee, Hong Kong

Conference

WSDM'09

Sponsor:

WSDM'09: Second ACM International Conference on Web Search and Web Data Mining

February 9 - 12, 2009

Barcelona, Spain

Acceptance Rates

Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

34
Total Citations
View Citations
531
Total Downloads

Downloads (Last 12 months)10
Downloads (Last 6 weeks)2

Reflects downloads up to 09 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Han PZhou SYu JXu ZChen LShang S(2023)Personalized Re-ranking for Recommendation with Mask PretrainingData Science and Engineering10.1007/s41019-023-00219-68:4(357-367)Online publication date: 2-Sep-2023
https://doi.org/10.1007/s41019-023-00219-6
Han PShang S(2022)Scene Re-ranking for Recommendation2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP)10.1109/MMSP55362.2022.9949116(1-6)Online publication date: 26-Sep-2022
https://doi.org/10.1109/MMSP55362.2022.9949116
Miao STang Z(2017)Utilizing human processing for fuzzy-based military situation awareness based on social media2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)10.1109/FUZZ-IEEE.2017.8015709(1-6)Online publication date: Jul-2017
https://doi.org/10.1109/FUZZ-IEEE.2017.8015709
Feng TMao X(2017)Multimodal Data fusion for SRGPS antenna motion error reductionMultimedia Tools and Applications10.1007/s11042-016-3972-376:9(12035-12050)Online publication date: 1-May-2017
https://dl.acm.org/doi/10.1007/s11042-016-3972-3
Zhao WZhou D(2017)A Normalized Framework Based on Multiple Relationships for Document Re-rankingInformation Retrieval10.1007/978-3-319-68699-8_10(122-135)Online publication date: 21-Oct-2017
https://doi.org/10.1007/978-3-319-68699-8_10
Shen JShen JMei TGao X(2016)Landmark Reranking for Smart Travel Guide Systems by Combining and Analyzing Diverse MediaIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2016.252394846:11(1492-1504)Online publication date: Nov-2016
https://doi.org/10.1109/TSMC.2016.2523948
Yang XMei TZhang YLiu JSatoh S(2016)Web Image Search Re-Ranking With Click-Based Similarity and TypicalityIEEE Transactions on Image Processing10.1109/TIP.2016.259365325:10(4617-4630)Online publication date: 1-Oct-2016
https://dl.acm.org/doi/10.1109/TIP.2016.2593653
Wang QPeng ZWang SYu PLi QHong X(2015)cluTM: Content and Link Integrated Topic Model on Heterogeneous Information NetworksWeb-Age Information Management10.1007/978-3-319-21042-1_17(207-218)Online publication date: 6-Jun-2015
https://doi.org/10.1007/978-3-319-21042-1_17
Mei TRui YLi STian Q(2014)Multimedia search rerankingACM Computing Surveys10.1145/253679846:3(1-38)Online publication date: 1-Jan-2014
https://dl.acm.org/doi/10.1145/2536798
Deng HHan JLi HJi HWang HLu Y(2014)Exploring and inferring user-user pseudo-friendship for sentiment analysis with heterogeneous networksStatistical Analysis and Data Mining10.1002/sam.112237:4(308-321)Online publication date: 1-Aug-2014
https://dl.acm.org/doi/10.1002/sam.11223
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten