New Quality Metrics for Web Search Results

  • Conference paper
Web Information Systems and Technologies (WEBIST 2008)

Abstract

Web search results are increasingly important in our daily lives. But what can be said about their quality, especially when the query concerns a controversial issue? The traditional information retrieval metrics of precision and recall provide little insight in the case of web information retrieval. In this paper we examine two new ways of evaluating the quality of search results: coverage and independence. We give examples of how these new metrics can be calculated and what their values reveal about the two major search engines, Google and Yahoo. We have found evidence of low coverage for commercial and medical controversial queries, and high coverage for a highly contested political query. Given that search engines are unwilling to tune their search results manually, except in a few cases that have become the source of bad publicity, low coverage and independence reveal the efforts of dedicated groups to manipulate the search results.
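The abstract names coverage and independence but does not define them, so the following is only an illustrative sketch under stated assumptions, not the authors' method. It assumes that coverage is the fraction of known viewpoints on a query that appear among the returned results, and that independence is the fraction of results hosted on distinct hosts. The function names, the viewpoint labels, and the example URLs are all hypothetical.

```python
from urllib.parse import urlparse


def coverage(result_labels, known_viewpoints):
    """Assumed definition: share of known viewpoints seen at least once in the results."""
    viewpoints = set(known_viewpoints)
    if not viewpoints:
        raise ValueError("known_viewpoints must not be empty")
    seen = set(result_labels) & viewpoints
    return len(seen) / len(viewpoints)


def independence(result_urls):
    """Assumed definition: share of results that come from distinct hosts."""
    if not result_urls:
        raise ValueError("result_urls must not be empty")
    hosts = {urlparse(url).netloc.lower() for url in result_urls}
    return len(hosts) / len(result_urls)


# Hypothetical top results for a controversial query, hand-labeled by viewpoint.
labels = ["pro", "pro", "neutral"]
urls = [
    "http://a.example/1",
    "http://a.example/2",
    "http://b.example/x",
    "http://c.example/",
]

print(coverage(labels, ["pro", "con", "neutral"]))
print(independence(urls))
```

Under these assumed definitions, a result list dominated by one viewpoint or by one host scores low, which is the kind of pattern the abstract associates with manipulation by dedicated groups.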

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Metaxas, P.T., Ivanova, L., Mustafaraj, E. (2009). New Quality Metrics for Web Search Results. In: Cordeiro, J., Hammoudi, S., Filipe, J. (eds) Web Information Systems and Technologies. WEBIST 2008. Lecture Notes in Business Information Processing, vol 18. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01344-7_21

  • DOI: https://doi.org/10.1007/978-3-642-01344-7_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-01343-0

  • Online ISBN: 978-3-642-01344-7

  • eBook Packages: Computer Science, Computer Science (R0)
