Skip to main content

A Topic-Specific Web Search System Focusing on Quality Pages

  • Conference paper
Research and Advanced Technology for Digital Libraries (ECDL 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6273))

Included in the following conference series:

  • 1615 Accesses

Abstract

We describe a topic-specific Web search system focused on quality pages and argue that there is a need for such quality-based topic-specific search tools. The first implementation of the search system is available on the Web and it deals with climate change. The key idea is to crawl (using a focused crawling technique) in known trusted sites and in sites that are connected to them. We also discuss the further development of the system and our future research. Our project plan involves building a larger quality-based Web search system dealing with many globally significant topics (in addition to climate change).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems 30(1-7), 107–117 (1998)

    Article  Google Scholar 

  2. Widyantoro, D.: Toward the development of next generation search engine. In: International Conference on Electrical Engineering and Informatics, Bandung, Indonesia (2007)

    Google Scholar 

  3. Griffiths, K., Christensen, H.: The quality and accessibility of Australian depression sites on the World Wide Web. Medical Journal of Australia 176, S97–S104 (2002)

    Google Scholar 

  4. Krones, C., Böhm, G., Ruhl, K., Stumpf, M., Klinge, U, Schumpelick, V.: Inguinal hernia on the Internet: A critical comparison of Germany and the U.K. Hernia 8(1), 47–52 (2004)

    Google Scholar 

  5. Chakrabarti, S., van den Berg, M., Dom, B.: Focused crawling: a new approach to topic-specific Web resource discovery. In: Eighth International World Wide Web Conference, Toronto, May 11-14 (1999)

    Google Scholar 

  6. Pirkola, A., Talvensaari, T.: Addressing the limited scope problem of focused crawling using a result merging approach. In: 25th Annual ACM Symposium on Applied Computing (ACM SAC), Sierre, Switzerland, March 22 - 26, pp. 1735–1740 (2010)

    Google Scholar 

  7. Tang, T., Hawking, D., Craswell, N., Griffiths, K.: Focused crawling for both topical relevance and quality of medical information. In: Fourteenth ACM International Conference on Information and Knowledge Management, CIKM 2005 (2005)

    Google Scholar 

  8. Webometrics University Ranking, http://www.webometrics.info/

  9. Nalanda iVia Focused Crawler, http://ivia.ucr.edu/

  10. Lemur search engine, http://www.lemurproject.org/

  11. Apache Lucene, http://lucene.apache.org/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pirkola, A., Talvensaari, T. (2010). A Topic-Specific Web Search System Focusing on Quality Pages. In: Lalmas, M., Jose, J., Rauber, A., Sebastiani, F., Frommholz, I. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2010. Lecture Notes in Computer Science, vol 6273. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15464-5_64

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15464-5_64

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15463-8

  • Online ISBN: 978-3-642-15464-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics