Extracting Knowledge Using Wikipedia Semi-structured Resources

Firoozeh, Nazanin

doi:10.1007/978-3-319-41754-7_22

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9612)

Included in the following conference series:

International Conference on Applications of Natural Language to Information Systems

Abstract

Automatic knowledge discovery has been an active research field for years. Knowledge can be extracted from sources with different data structures, using different types of resources. In this paper, we propose a pattern-based extraction approach that exploits Wikipedia semi-structured data to extract the implicit knowledge behind any unstructured text. The approach first identifies the concepts in the studied text and then extracts their corresponding common-sense and basic knowledge. We explored the effectiveness of our knowledge extraction model on textual sources from the city domain. An initial evaluation indicates that the approach performs well.

Notes

1. Example: http://en.Wikipedia.org/w/index.php?action=raw&title=Paris.
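
The raw-page URL in the note above returns a page's wikitext, which is where semi-structured elements such as infoboxes live. The following is a minimal sketch of what pattern-based extraction over such wikitext could look like. It is not the paper's implementation: the function names, the regular expressions, and the SAMPLE text are all illustrative assumptions, and the sample is hand-written rather than copied from Wikipedia.

```python
import re
from urllib.parse import urlencode

# Endpoint taken from the note above (host written in lowercase).
RAW_ENDPOINT = "http://en.wikipedia.org/w/index.php"


def raw_url(title: str) -> str:
    """Build the raw-wikitext URL for a page title (hypothetical helper)."""
    return RAW_ENDPOINT + "?" + urlencode({"action": "raw", "title": title})


# Assumed pattern for one infobox line of the form "| key = value".
FIELD = re.compile(r"^\s*\|\s*([^=|\n]+?)\s*=\s*(.*?)\s*$", re.MULTILINE)
# Assumed pattern for wiki links: [[Target]] or [[Target|Label]].
LINK = re.compile(r"\[\[(?:[^|\]]*\|)?([^\]]+)\]\]")


def infobox_fields(wikitext: str) -> dict:
    """Collect key/value pairs from infobox-style lines (illustrative only)."""
    fields = {}
    for key, value in FIELD.findall(wikitext):
        # Replace each wiki link with its visible label.
        value = LINK.sub(r"\1", value)
        if value:
            fields[key] = value
    return fields


# Hand-written sample, NOT the actual content of the Paris page.
SAMPLE = """{{Infobox settlement
| name = Paris
| country = [[France]]
| region = [[Île-de-France (region)|Île-de-France]]
}}"""

if __name__ == "__main__":
    print(raw_url("Paris"))
    print(infobox_fields(SAMPLE))
```

Real infoboxes contain nested templates, references, and multi-line values that these two patterns do not handle, so a production extractor would need a proper wikitext parser or a richer pattern set.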

Author information

Authors and Affiliations

Laboratoire d’Informatique de Paris-Nord, Université Paris 13, Pixalione SAS, Paris, France
Nazanin Firoozeh

Corresponding author

Correspondence to Nazanin Firoozeh.

Editor information

Editors and Affiliations

Conservatoire National des Arts et Métiers, Paris, France
Elisabeth Métais
University of Salford, Salford, United Kingdom
Farid Meziane
University of Salford, Salford, United Kingdom
Mohamad Saraee
Oakland University, Rochester, Michigan, USA
Vijayan Sugumaran
University of Salford, Salford, United Kingdom
Sunil Vadera

Cite this paper

Firoozeh, N. (2016). Extracting Knowledge Using Wikipedia Semi-structured Resources. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds) Natural Language Processing and Information Systems. NLDB 2016. Lecture Notes in Computer Science, vol 9612. Springer, Cham. https://doi.org/10.1007/978-3-319-41754-7_22

DOI: https://doi.org/10.1007/978-3-319-41754-7_22
Published: 17 June 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41753-0
Online ISBN: 978-3-319-41754-7
eBook Packages: Computer Science (R0)