Abstract
In this paper I report about my experience in teaching Web Information Retrieval (IR) at graduate level. I have been teaching a Web IR course for two Master’s degrees in Computer Science and Information Technology at Udine University for the last 5 years. I present the topics I am currently teaching in my course, briefly describing the syllabus, discussing the available textbooks and reading material, and term projects for students. I provide some evidence, gathered by means of a questionnaire, about students’ satisfaction with the course.
I also discuss in detail a crucial choice that every lecturer of Web IR has to make, namely whether (1) to teach the classical pre-Web IR issues first and present the Web-specific issues later, or (2) to teach directly the Web IR discipline per se. The first approach has the advantages of building on prerequisite knowledge, of presenting the historical development of the discipline, and probably appears more natural to most lecturers, who have followed the historical development of the field. Conversely, the second approach has the advantage of concentrating on a more modern view of the field, and probably leads to a higher motivation in the students, since the more appealing Web issues are dealt with at the beginning of the course. By discussing textbook support, analyzing related disciplines, and reporting on students’ and lecturers’ feedback, I provide some evidence that the first approach seems preferable.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
I rely on the soft/hard science dichotomy. Although it has obvious limitations and it is quite rough, it is also well known (http://en.wikipedia.org/wiki/Hard_and_soft_science) and it will do here.
- 2.
I am still teaching the same course this academic year, with almost no changes and about 20 students, but I am not presenting any data from this year.
- 3.
In Italy students can try the exam when they want after course end, even in the subsequent years.
- 4.
Indeed the first and the third ones are more Web data mining than WIR books; however, the two disciplines overlap quite a lot and the boundaries between them are not so clear.
- 5.
I did not perform any deep statistical analysis of these data. I just note that using the median, instead of the mean would lead to similar considerations, and that I do not analyze variance (or interquartile range) values, nor statistical significance.
References
Albert R, Barabasi A-L (1999) Emergence of scaling in random networks. Science 286:509–512
Albert R, Jeong H, Barabasi A-L (1999) Internet: diameter of the world-wide web. Nature 401:130–131
Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. ACM, New York
Barroso LA, Dean J, Hölzle U (2003) Web search for a planet: the Google cluster architecture. IEEE Micro 23(2):22–28
Belew RK (2000) Finding out about. Cambridge University Press, Cambridge, MA
Bharat K, Chang B-W, Henzinger M, Ruhl M (2001) Who links to whom: mining linkage between web sites. In: IEEE International Conference on Data Mining (ICDM ’01), San Jose, CA, 29 Nov–2 Dec 2001
Blair DC (1990) Language and representation in information retrieval. Elsevier, New York
Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. WWW 7:107–117
Broder A, Kumar R, Maghoul F, Raghavan P, Rajagopalan S, Stata R, Tomkins A, Wiener J (2000) Graph structure in the web. Comput Netw 33(1–6):309–320
Chakrabarti S (2003) Mining the web. Morgan Kaufmann, San Francisco
Cho J, Garcia-Molina H (2000) The evolution of the web and implications for an incremental crawler. In: Proceedings of the 26th VLDB. http://rose.cs.ucla.edu/˜cho/papers/cho-evol.pdf
Croft WB, Metzler D, Strohman T (2009) Search engines – information retrieval in practice. Addison-Wesley, Harlow
Fetterly D, Manasse M, Najork M, Wiener JL (2004) A large-scale study of the evolution of web pages. Softw Pract Exp 34:213–237
Frakes WB, Baeza-Yates R (1992) Information retrieval: data structures and algorithms. Prentice-Hall, Englewood Cliffs, NJ
Grossman DA, Frieder O (2004) Information retrieval: algorithms and heuristics, 2nd edn. Springer, Heidelberg
Gulli A, Signorini A (2005) The indexable web is more than 11.5 billion pages. In: WWW14. http://www.cs.uiowa.edu/˜asignori/web-size/
IEEE & ACM (2001). The Joint Task Force on Computing Curricula – Computing curricula 2001: Computer Science – Final Report, 15 Dec 2001. http://acm.org/education/curric_vols/cc2001.pdf
Ingwersen P (1992) Information retrieval interaction. Taylor Graham, London
Ingwersen P, Järvelin K (2005) The TURN: integration of information seeking and retrieval in context. Springer, Heidelberg
Kleinberg J (1999) Authoritative sources in a hyperlinked environment. J ACM 46(5):604–632
Korfhage RR (1997) Information storage and retrieval. Wiley, New York
Lawrence S, Giles CL (1998) Searching the world wide web. Science 280:98–100
Lawrence S, Giles CL (1999) Accessibility of information on the web. Nature 400:107–109
Levene M (2006) An introduction to search engines and Web navigation. Addison-Wesley, Harlow
Liu B (2007) Web data mining – exploring hyperlinks, contents, and usage data. Springer, Heidelberg
Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval. Cambridge University Press, Cambridge, MA
Marchionini G (1997) Information seeking in electronic environments. Cambridge University Press, Cambridge, MA
Mizzaro S (2007) Teaching of Web information retrieval: Web first or IR first? In: MacFarlane A, Fernández-Luna JM, Ounis I, Huete JF (eds) Proceedings of the first international workshop on teaching and learning in information retrieval (TLIR 2007). http://www.bcs.org/server.php?show=nav.00100v00500300100d001
Page L, Brin S, Motwani R, Winograd T (1998) The PageRank citation ranking: bringing order to the web. http://www-db.stanford.edu/˜backrub/pageranksub.ps
Salton G (1989) Automatic text processing – the transformation, analysis, and retrieval of information by computer. Addison-Wesley, Harlow
Salton G, McGill MJ (1984) Introduction to modern Information retrieval. McGraw-Hill, London
van Rijsbergen CJ (1979) Information retrieval, 2nd edn. Butterworths, London
van Rijsbergen CJ (2004) The geometry of information retrieval. Cambridge University Press, Cambridge, MA
Witten IH, Moffat A, Bell TC (1999) Managing gigabytes – compressing and indexing documents and images, 2nd edn. Morgan Kaufmann, San Francisco
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix A. The Questionnaire
Appendix A. The Questionnaire
This questionnaire aims at providing to the lecturer useful data to improve the course. The questionnaire is fully anonymous and it is not used as an evaluation of the student.
Web Information Retrieval: Final Questionnaire
The questionnaire is divided into two parts gathering, respectively, a-priori background knowledge and a-posteriori feelings and impressions on the course. Mark with a circle the numbers corresponding to your answers, remembering that:
1 means I strongly disagree
2 means I partially disagree
3 means I do not have a clear opinion
4 means I partially agree
5 means I strongly agree
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Mizzaro, S. (2011). Teaching Web Information Retrieval to Computer Science Students: Concrete Approach and Its Analysis. In: Efthimiadis, E., Fernández-Luna, J., Huete, J., MacFarlane, A. (eds) Teaching and Learning in Information Retrieval. The Information Retrieval Series, vol 31. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22511-6_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-22511-6_10
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22510-9
Online ISBN: 978-3-642-22511-6
eBook Packages: Computer ScienceComputer Science (R0)