
Faceted Wikipedia Search

  • Conference paper
Business Information Systems (BIS 2010)

Abstract

Besides free text, Wikipedia articles contain various types of structured information in the form of wiki markup. The type of wiki content most valuable for search is the Wikipedia infobox, which displays an article’s most relevant facts as a table of attribute-value pairs on the top right-hand side of the Wikipedia page. Wikipedia’s own search engine does not use infobox data, and standard Web search engines such as Google or Yahoo do not take advantage of it either. In this paper, we present Faceted Wikipedia Search, an alternative search interface for Wikipedia that uses infobox data to let users ask complex questions against Wikipedia knowledge. By allowing users to query Wikipedia like a structured database, Faceted Wikipedia Search helps them to truly exploit Wikipedia’s collective intelligence.
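
To make the idea of querying infobox data like a structured database concrete, the following is a minimal sketch of faceted filtering over attribute-value pairs. It is not the system described in the paper: the `ARTICLES` data, the attribute names, and the helper functions `select` and `facet_counts` are hypothetical, and the values are simplified for illustration.

```python
from collections import Counter

# Hypothetical, simplified infobox data: article title -> attribute-value pairs.
ARTICLES = {
    "Rhine": {"type": "River", "country": "Germany", "length_km": 1230},
    "Elbe": {"type": "River", "country": "Germany", "length_km": 1094},
    "Seine": {"type": "River", "country": "France", "length_km": 777},
    "Zugspitze": {"type": "Mountain", "country": "Germany", "elevation_m": 2962},
}


def select(articles, **constraints):
    """Keep only the articles whose infobox matches every facet constraint."""
    return {
        title: box
        for title, box in articles.items()
        if all(box.get(attr) == value for attr, value in constraints.items())
    }


def facet_counts(articles, attribute):
    """Count how many of the given articles carry each value of an attribute."""
    return Counter(box[attribute] for box in articles.values() if attribute in box)


# Narrow the result set one facet at a time, as a user would by clicking facets.
rivers = select(ARTICLES, type="River")
german_rivers = select(rivers, country="Germany")

# Facet counts tell the user which refinements remain and how many hits each has.
country_counts = facet_counts(rivers, "country")
```

Each call to `select` corresponds to choosing one facet value, and `facet_counts` yields the remaining refinement options for the current result set. A real faceted search engine would typically compute these counts from an index rather than by scanning every article.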



Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hahn, R. et al. (2010). Faceted Wikipedia Search. In: Abramowicz, W., Tolksdorf, R. (eds) Business Information Systems. BIS 2010. Lecture Notes in Business Information Processing, vol 47. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12814-1_1


  • DOI: https://doi.org/10.1007/978-3-642-12814-1_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12813-4

  • Online ISBN: 978-3-642-12814-1

  • eBook Packages: Computer Science (R0)
