Abstract
The amount of information on the web is growing rapidly, and search engines that rely on keyword matching usually return too many low quality matches. To improve search results, a challenging task for search engines is how to effectively calculate a relevance ranking for each web page. This paper discusses in what order a search engine should return the URLs it has produced in response to a user’s query, so as to show more relevant pages first. Emphasis is given on the ranking functions adopted by WebGather that take link structure and user popularity factors into account. Experimental results are also presented to evaluate the proposed strategy.
Similar content being viewed by others
References
Steve Lawrence, C Lee Giles. Accessibility of information on the Web.Nature, July 8, 1999, 400: 107–109.
Zhou Limin, Liu Jianguo, Chen Baojue. WebGather: A Chinese-English WWW Search engineSoftware Journal, 1998, (Supplement): 504–510.
Liu Jianguo, Lei Ming, Wang Jianyong, Chen Baojue. Digging for gold on the Web: Experience with the WebGather. InProceedings of the Fourth International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region, IEEE Computer Society Press May 14–17, 2000, pp.751–755.
Brin S, Page L. The anatomy of a large-scale hypertextual Web search engine. InProc. 7th World Wide Web Conference, Elsevier Science, Amsterdam, 1998, pp. 107–117.
Larry Page, Page Rank: Bring Order to the Web. http://www.pcd.stanford.edu/~page/papers/pagerrank/ppframe.htm.
Soumen Chakrabati, Byron E. Dom. Mining the Web’s link structureComputer, Aug., 1999, pp.60–67.
Gary Culliss. User popularity ranked search engine. http://www.infonortics.com/searchengines/boston1999/culliss/index.htm
Lei Ming, Liu Jianguo, Wang Jianyong, Chen Baojue. A model for dynamic information updating in lexicon based search engine.Journal of Computer Research and Development, 2000, 37(10) 1265–1270 (in Chinese)
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported by the National “9th Five-Year Plan” Key Technology Project (No.96-743-01-05-01) and the NKBRSF of China (No. G199032706).
LEI Ming is currently working in Baidu. com incorporation. He got his B.S. and M.S. degrees from Peking University in 1997 and 2000, respectively, both in computer science. His research interests include computer network and web search engine.
WANG Jianyong is an assistant processor of the Department of Computer Science and Technology at Peking University, P.R. China. He received his B.S. degree from Lanzhou University in 1991 and his Ph.D. degree from Institute of Computing Technology, The Chinese Academy of Sciences in 1999, both in computer science. His research interests include distributed systems and algorithms, and web information retrieval.
CHEN Baojue is a professor of the Department of Computer Science and Technology, Peking University. Her recent research interests include OSI networking techniques, middleware design, and Web-based applications and services.
LI Xiaoming got his B.E. degree from Harbin Institute of Technology in 1982, and the Ph.D. degree from Stevens Institute of Technology in 1986. Currently he is interested in high performance networking and distributed processing technologies.
Rights and permissions
About this article
Cite this article
Lei, M., Wang, J., Chen, B. et al. Improved relevance ranking in WebGather. J. Comput. Sci. & Technol. 16, 410–417 (2001). https://doi.org/10.1007/BF02948958
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02948958