Abstract
Besides free text, Wikipedia articles contain various types of structured information in the form of wiki markup. The wiki content most valuable for search is the infobox, which displays an article’s most relevant facts as a table of attribute-value pairs in the top right-hand corner of the Wikipedia page. Infobox data is not used by Wikipedia’s own search engine, and standard Web search engines such as Google or Yahoo do not take advantage of it either. In this paper, we present Faceted Wikipedia Search, an alternative search interface for Wikipedia that leverages infobox data to let users pose complex queries against Wikipedia knowledge. By allowing users to query Wikipedia like a structured database, Faceted Wikipedia Search helps them to truly exploit Wikipedia’s collective intelligence.
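As a rough illustration of what querying infobox data "like a structured database" can mean, the following minimal sketch filters articles by attribute-value pairs and counts the remaining values of a facet. It is not the system described in the paper: the `ARTICLES` data, the attribute names, and the `select` and `facet_counts` helpers are all assumptions made up for this example.

```python
from collections import Counter

# Hypothetical, hand-written infobox data (illustrative only):
# each article title maps to its attribute-value pairs.
ARTICLES = {
    "Berlin": {"type": "city", "country": "Germany"},
    "Hamburg": {"type": "city", "country": "Germany"},
    "Vienna": {"type": "city", "country": "Austria"},
    "Brocken": {"type": "mountain", "country": "Germany"},
}


def select(articles, **facets):
    """Return sorted titles whose infobox matches every given facet value."""
    return sorted(
        title
        for title, box in articles.items()
        if all(box.get(attr) == value for attr, value in facets.items())
    )


def facet_counts(articles, titles, attribute):
    """Count the values of one attribute across the current result set."""
    return Counter(
        articles[t][attribute] for t in titles if attribute in articles[t]
    )


# Narrow the result set step by step, as a faceted interface would.
cities = select(ARTICLES, type="city")
german_cities = select(ARTICLES, type="city", country="Germany")
country_facet = facet_counts(ARTICLES, cities, "country")
```

Each added facet value narrows the result set, and the counts tell the user which further refinements are still available.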
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hahn, R. et al. (2010). Faceted Wikipedia Search. In: Abramowicz, W., Tolksdorf, R. (eds) Business Information Systems. BIS 2010. Lecture Notes in Business Information Processing, vol 47. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12814-1_1
DOI: https://doi.org/10.1007/978-3-642-12814-1_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12813-4
Online ISBN: 978-3-642-12814-1
eBook Packages: Computer Science, Computer Science (R0)