Skip to main content

Teaching Web Information Retrieval to Computer Science Students: Concrete Approach and Its Analysis

  • Chapter
  • First Online:
Teaching and Learning in Information Retrieval

Part of the book series: The Information Retrieval Series ((INRE,volume 31))

Abstract

In this paper I report about my experience in teaching Web Information Retrieval (IR) at graduate level. I have been teaching a Web IR course for two Master’s degrees in Computer Science and Information Technology at Udine University for the last 5 years. I present the topics I am currently teaching in my course, briefly describing the syllabus, discussing the available textbooks and reading material, and term projects for students. I provide some evidence, gathered by means of a questionnaire, about students’ satisfaction with the course.

I also discuss in detail a crucial choice that every lecturer of Web IR has to make, namely whether (1) to teach the classical pre-Web IR issues first and present the Web-specific issues later, or (2) to teach directly the Web IR discipline per se. The first approach has the advantages of building on prerequisite knowledge, of presenting the historical development of the discipline, and probably appears more natural to most lecturers, who have followed the historical development of the field. Conversely, the second approach has the advantage of concentrating on a more modern view of the field, and probably leads to a higher motivation in the students, since the more appealing Web issues are dealt with at the beginning of the course. By discussing textbook support, analyzing related disciplines, and reporting on students’ and lecturers’ feedback, I provide some evidence that the first approach seems preferable.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    I rely on the soft/hard science dichotomy. Although it has obvious limitations and it is quite rough, it is also well known (http://en.wikipedia.org/wiki/Hard_and_soft_science) and it will do here.

  2. 2.

    I am still teaching the same course this academic year, with almost no changes and about 20 students, but I am not presenting any data from this year.

  3. 3.

    In Italy students can try the exam when they want after course end, even in the subsequent years.

  4. 4.

    Indeed the first and the third ones are more Web data mining than WIR books; however, the two disciplines overlap quite a lot and the boundaries between them are not so clear.

  5. 5.

    I did not perform any deep statistical analysis of these data. I just note that using the median, instead of the mean would lead to similar considerations, and that I do not analyze variance (or interquartile range) values, nor statistical significance.

References

  • Albert R, Barabasi A-L (1999) Emergence of scaling in random networks. Science 286:509–512

    Article  MathSciNet  MATH  Google Scholar 

  • Albert R, Jeong H, Barabasi A-L (1999) Internet: diameter of the world-wide web. Nature 401:130–131

    Article  Google Scholar 

  • Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. ACM, New York

    Google Scholar 

  • Barroso LA, Dean J, Hölzle U (2003) Web search for a planet: the Google cluster architecture. IEEE Micro 23(2):22–28

    Article  Google Scholar 

  • Belew RK (2000) Finding out about. Cambridge University Press, Cambridge, MA

    MATH  Google Scholar 

  • Bharat K, Chang B-W, Henzinger M, Ruhl M (2001) Who links to whom: mining linkage between web sites. In: IEEE International Conference on Data Mining (ICDM ’01), San Jose, CA, 29 Nov–2 Dec 2001

    Google Scholar 

  • Blair DC (1990) Language and representation in information retrieval. Elsevier, New York

    Google Scholar 

  • Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. WWW 7:107–117

    Google Scholar 

  • Broder A, Kumar R, Maghoul F, Raghavan P, Rajagopalan S, Stata R, Tomkins A, Wiener J (2000) Graph structure in the web. Comput Netw 33(1–6):309–320

    Article  Google Scholar 

  • Chakrabarti S (2003) Mining the web. Morgan Kaufmann, San Francisco

    Google Scholar 

  • Cho J, Garcia-Molina H (2000) The evolution of the web and implications for an incremental crawler. In: Proceedings of the 26th VLDB. http://rose.cs.ucla.edu/˜cho/papers/cho-evol.pdf

  • Croft WB, Metzler D, Strohman T (2009) Search engines – information retrieval in practice. Addison-Wesley, Harlow

    Google Scholar 

  • Fetterly D, Manasse M, Najork M, Wiener JL (2004) A large-scale study of the evolution of web pages. Softw Pract Exp 34:213–237

    Article  Google Scholar 

  • Frakes WB, Baeza-Yates R (1992) Information retrieval: data structures and algorithms. Prentice-Hall, Englewood Cliffs, NJ

    Google Scholar 

  • Grossman DA, Frieder O (2004) Information retrieval: algorithms and heuristics, 2nd edn. Springer, Heidelberg

    Google Scholar 

  • Gulli A, Signorini A (2005) The indexable web is more than 11.5 billion pages. In: WWW14. http://www.cs.uiowa.edu/˜asignori/web-size/

  • IEEE & ACM (2001). The Joint Task Force on Computing Curricula – Computing curricula 2001: Computer Science – Final Report, 15 Dec 2001. http://acm.org/education/curric_vols/cc2001.pdf

  • Ingwersen P (1992) Information retrieval interaction. Taylor Graham, London

    Google Scholar 

  • Ingwersen P, Järvelin K (2005) The TURN: integration of information seeking and retrieval in context. Springer, Heidelberg

    MATH  Google Scholar 

  • Kleinberg J (1999) Authoritative sources in a hyperlinked environment. J ACM 46(5):604–632

    Article  MathSciNet  MATH  Google Scholar 

  • Korfhage RR (1997) Information storage and retrieval. Wiley, New York

    Google Scholar 

  • Lawrence S, Giles CL (1998) Searching the world wide web. Science 280:98–100

    Article  Google Scholar 

  • Lawrence S, Giles CL (1999) Accessibility of information on the web. Nature 400:107–109

    Article  Google Scholar 

  • Levene M (2006) An introduction to search engines and Web navigation. Addison-Wesley, Harlow

    Google Scholar 

  • Liu B (2007) Web data mining – exploring hyperlinks, contents, and usage data. Springer, Heidelberg

    MATH  Google Scholar 

  • Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval. Cambridge University Press, Cambridge, MA

    Book  MATH  Google Scholar 

  • Marchionini G (1997) Information seeking in electronic environments. Cambridge University Press, Cambridge, MA

    Google Scholar 

  • Mizzaro S (2007) Teaching of Web information retrieval: Web first or IR first? In: MacFarlane A, Fernández-Luna JM, Ounis I, Huete JF (eds) Proceedings of the first international workshop on teaching and learning in information retrieval (TLIR 2007). http://www.bcs.org/server.php?show=nav.00100v00500300100d001

  • Page L, Brin S, Motwani R, Winograd T (1998) The PageRank citation ranking: bringing order to the web. http://www-db.stanford.edu/˜backrub/pageranksub.ps

  • Salton G (1989) Automatic text processing – the transformation, analysis, and retrieval of information by computer. Addison-Wesley, Harlow

    Google Scholar 

  • Salton G, McGill MJ (1984) Introduction to modern Information retrieval. McGraw-Hill, London

    Google Scholar 

  • van Rijsbergen CJ (1979) Information retrieval, 2nd edn. Butterworths, London

    Google Scholar 

  • van Rijsbergen CJ (2004) The geometry of information retrieval. Cambridge University Press, Cambridge, MA

    Book  MATH  Google Scholar 

  • Witten IH, Moffat A, Bell TC (1999) Managing gigabytes – compressing and indexing documents and images, 2nd edn. Morgan Kaufmann, San Francisco

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Stefano Mizzaro .

Editor information

Editors and Affiliations

Appendix A. The Questionnaire

Appendix A. The Questionnaire

This questionnaire aims at providing to the lecturer useful data to improve the course. The questionnaire is fully anonymous and it is not used as an evaluation of the student.

Web Information Retrieval: Final Questionnaire

The questionnaire is divided into two parts gathering, respectively, a-priori background knowledge and a-posteriori feelings and impressions on the course. Mark with a circle the numbers corresponding to your answers, remembering that:

1 means I strongly disagree

2 means I partially disagree

3 means I do not have a clear opinion

4 means I partially agree

5 means I strongly agree

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Mizzaro, S. (2011). Teaching Web Information Retrieval to Computer Science Students: Concrete Approach and Its Analysis. In: Efthimiadis, E., Fernández-Luna, J., Huete, J., MacFarlane, A. (eds) Teaching and Learning in Information Retrieval. The Information Retrieval Series, vol 31. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22511-6_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-22511-6_10

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-22510-9

  • Online ISBN: 978-3-642-22511-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics