Skip to main content

The Importance of Document Ranking and User-Generated Content for Faceted Search and Book Suggestions

  • Conference paper
  • 578 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7424))

Abstract

In this paper we describe our participation in INEX 2011 in the Books and Social Search Track and the Data Centric Track. For the Books and Social Search Track we focus on the impact of different document representations of book metadata for book search, using either professional metadata, user-generated content or both. We evaluate the retrieval results against ground truths derived from the recommendations in the LibraryThing discussion groups and from relevance judgements obtained from Amazon Mechanical Turk. Our findings show that standard retrieval models perform better on user-generated metadata than on professional metadata. For the Data Centric Track we focus on the selection of a restricted set of facets and facet values that would optimally guide the user toward relevant information in the Internet Movie Database (IMDb). We explore different methods for effective result summarisation by means of weighted aggregation. These weighted aggregations are used to achieve maximal coverage of search results, while at the same time penalising overlap between sets of documents that are summarised by different facet values. We found that weighted result aggregation combined with redundancy avoidance results in a compact summary of available relevant information.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bates, M.J.: Task Force Recommendation 2.3 Research and Design Review: Improving user access to library catalog and portal information. In: LoC Bicentennial Conf. on Bibliographic Control for the New Millennium (2003)

    Google Scholar 

  2. Ben-Yitzhak, O., Golbandi, N., Har’El, N., Lempel, R.: Beyond basic faceted search. In: WSDM 2008 (2008)

    Google Scholar 

  3. Buckland, M.: Vocabulary as a Central Concept in Library and Information Science. In: Digital Libraries: Interdisciplinary Concepts, Challenges, and Opportunities. Proceedings of CoLIS3 (1999)

    Google Scholar 

  4. Cleverdon, C.W.: The Cranfield tests on index language devices. Aslib 19, 173–192 (1967)

    Article  Google Scholar 

  5. Gross, T., Taylor, A.G.: What Have We Got to Lose? The Effect of Controlled Vocabulary on Keyword Searching Results. College & Research Libraries 66(3) (2005)

    Google Scholar 

  6. Hearst, M.A., Elliott, A., English, J., Sinha, R., Swearingen, K., Yee, K.-P.: Finding the flow in web site search. Communications of the ACM 45, 42–49 (2002)

    Article  Google Scholar 

  7. Lancaster, F.W.: Vocabulary control for information retrieval, 2nd edn. Information Resources Press, Arlington (1986)

    Google Scholar 

  8. Li, C., Yan, N., Roy, S.B., Lisham, L., Das, G.: Facetedpedia: Dynamic generation of query-dependent faceted interfaces for wikipedia. In: Proceedings of WWW 2010 (2010)

    Google Scholar 

  9. Mathes, A.: Folksonomies - cooperative classification and communication through shared metadata (December 2004), http://www.adammathes.com/academic/computer-mediated-communication/folksonomies.html

  10. Peters, I., Schumann, L., Terliesner, J., Stock, W.G.: Retrieval Effectiveness of Tagging Systems. In: Grove, A. (ed.) Proceedings of the 74th ASIS&T Annual Meeting, vol. 48 (2011)

    Google Scholar 

  11. Schuth, A., Marx, M.: Evaluation Methods for Rankings of Facetvalues for Faceted Search. In: Forner, P., Gonzalo, J., Kekäläinen, J., Lalmas, M., de Rijke, M. (eds.) CLEF 2011. LNCS, vol. 6941, pp. 131–136. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  12. Strohman, T., Metzler, D., Turtle, H., Croft, W.B.: Indri: a language-model based search engine for complex queries. In: Proceedings of the International Conference on Intelligent Analysis (2005)

    Google Scholar 

  13. Svenonius, E.: Unanswered questions in the design of controlled vocabularies. JASIS 37(5), 331–340 (1986)

    Google Scholar 

  14. Tunkelang, D.: Faceted Search. Morgan and Claypool Publishers (2009)

    Google Scholar 

  15. Wang, Q., Ramírez, G., Marx, M., Theobald, M., Kamps, J.: Overview of the INEX 2011 Data Centric Track. In: Geva, S., Kamps, J., Schenkel, R. (eds.) INEX 2011. LNCS, vol. 7424, pp. 118–137. Springer, Heidelberg (2012)

    Google Scholar 

  16. Yi, K., Chan, L.M.: Linking folksonomy to Library of Congress subject headings: an exploratory study. Journal of Documentation 65(6), 872–900 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Adriaans, F., Kamps, J., Koolen, M. (2012). The Importance of Document Ranking and User-Generated Content for Faceted Search and Book Suggestions. In: Geva, S., Kamps, J., Schenkel, R. (eds) Focused Retrieval of Content and Structure. INEX 2011. Lecture Notes in Computer Science, vol 7424. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35734-3_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35734-3_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35733-6

  • Online ISBN: 978-3-642-35734-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics