Using Web Archive for Improving Search Engine Results

Jatowt, Adam; Kawai, Yukiko; Tanaka, Katsumi

doi:10.1007/11610113_91

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3841)

Included in the following conference series: Asia-Pacific Web Conference

Abstract

Search engines affect page popularity by making it difficult for currently unpopular pages to reach the top ranks in search results, because people tend to visit and link to the pages that already rank highest. We address this problem by analyzing the past content of web pages, as preserved in a web archive. Our approach is based on the observation that the quality of this content greatly affects link accumulation and hence the final rank of a page. We propose detecting the content that had the greatest impact on the link accumulation of top-ranked pages and using it to identify high-quality but unpopular web pages, which would then be assigned higher ranks.
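
The abstract gives no algorithmic detail, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes each archived snapshot of a top-ranked page comes with in-link counts at the start and end of the period in which it was live, weights snapshot terms by the links gained in that period, and ranks candidate pages by cosine similarity to the resulting profile. The function names, the tuple layout, and the bag-of-words similarity measure are all hypothetical choices made for illustration.

```python
from collections import Counter
from math import sqrt


def term_vector(text):
    """Bag-of-words term counts for one page snapshot."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def influential_profile(snapshots):
    """Weight each archived snapshot by the in-links gained while it was live.

    `snapshots` is a list of (text, inlinks_at_start, inlinks_at_end) tuples
    for top-ranked pages; this layout is an assumption for illustration.
    """
    profile = Counter()
    for text, start, end in snapshots:
        gain = max(end - start, 0)  # ignore periods in which links were lost
        for term, count in term_vector(text).items():
            profile[term] += gain * count
    return profile


def rerank(candidates, profile):
    """Order (url, text) candidates by similarity to the link-attracting profile."""
    scored = [(cosine(term_vector(text), profile), url) for url, text in candidates]
    return [url for score, url in sorted(scored, reverse=True)]
```

Under these assumptions, a candidate page whose current content resembles the snapshots that preceded the largest link gains is moved ahead of candidates that do not, regardless of how many links it has today.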

Author information

Authors and Affiliations

National Institute of Information and Communications Technology, 3-5 Hikaridai, Seika-cho, Soraku-gun, 619-0289, Kyoto, Japan
Adam Jatowt, Yukiko Kawai & Katsumi Tanaka
Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, Sakyo-ku, 606-8501, Kyoto, Japan
Katsumi Tanaka

Editor information

Editors and Affiliations

School of ITEE, The University of Queensland, Australia
Xiaofang Zhou
School of Computer Science and Technology, Heilongjiang University, China
Jianzhong Li
School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, QLD, Australia
Heng Tao Shen
Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, 153-8505, Tokyo, Japan
Masaru Kitsuregawa
Victoria University, Australia
Yanchun Zhang

About this paper

Cite this paper

Jatowt, A., Kawai, Y., Tanaka, K. (2006). Using Web Archive for Improving Search Engine Results. In: Zhou, X., Li, J., Shen, H.T., Kitsuregawa, M., Zhang, Y. (eds) Frontiers of WWW Research and Development - APWeb 2006. APWeb 2006. Lecture Notes in Computer Science, vol 3841. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11610113_91

DOI: https://doi.org/10.1007/11610113_91
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31142-3
Online ISBN: 978-3-540-32437-9
eBook Packages: Computer Science, Computer Science (R0)
