Relation Extraction between Related Concepts by Combining Wikipedia and Web Information for Japanese Language

  • Conference paper
Information Retrieval Technology (AIRS 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6458))

Abstract

Constructing a large-scale ontology that covers many named entities, domain-specific terms, and the relations among these concepts is one of the essential technologies for the next-generation, semantics-based Web. Recently, a number of studies have proposed automated ontology construction methods that exploit Wikipedia's wide coverage of concepts. However, because these methods target formal relations such as is-a and a-part-of relations, the resulting ontologies cover only a narrow range of the relations among concepts. In this work, we aim at automated ontology construction with wide coverage of both concepts and their relations by combining information on the Web with Wikipedia. We propose a relation extraction method that receives pairs of related concepts from an association thesaurus extracted from Wikipedia and extracts their relations from the Web.
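
The abstract does not spell out the extraction procedure, so the following is only a rough illustration of the general idea, not the authors' method. It assumes a simple pattern-based approach: given a pair of related concepts (as an association thesaurus might supply) and some Web text snippets, it collects the text that appears between the two concepts and counts how often each candidate occurs. The function name `extract_relation_candidates`, the `max_gap` parameter, and the sample pair and snippets are all hypothetical.

```python
from collections import Counter


def extract_relation_candidates(pair, snippets, max_gap=30):
    """Count the text spans found between two concepts in the snippets.

    Hypothetical helper for illustration only: it uses plain substring
    search, with no morphological or dependency analysis.
    """
    a, b = pair
    counts = Counter()
    for text in snippets:
        # Try both orders, since either concept may appear first.
        for first, second in ((a, b), (b, a)):
            start = text.find(first)
            if start == -1:
                continue
            gap_start = start + len(first)
            end = text.find(second, gap_start)
            if end == -1:
                continue
            gap = text[gap_start:end].strip()
            # Keep only short, non-empty spans as relation candidates.
            if gap and len(gap) <= max_gap:
                counts[(first, gap, second)] += 1
    return counts


# Hypothetical input: one related-concept pair and three made-up snippets.
pairs = [("Kyoto", "Japan")]
snippets = [
    "Kyoto is a city in Japan.",
    "Kyoto is a city in Japan known for its temples.",
    "Japan moved its capital from Kyoto to Tokyo.",
]

for pair in pairs:
    candidates = extract_relation_candidates(pair, snippets)
    for triple, n in candidates.most_common():
        print(triple, n)
```

Counting candidates across many snippets is one simple way to separate recurring relation phrases from incidental co-occurrences. A realistic system for Japanese text would likely need linguistic analysis beyond substring matching.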

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shirakawa, M., Nakayama, K., Aramaki, E., Hara, T., Nishio, S. (2010). Relation Extraction between Related Concepts by Combining Wikipedia and Web Information for Japanese Language. In: Cheng, PJ., Kan, MY., Lam, W., Nakov, P. (eds) Information Retrieval Technology. AIRS 2010. Lecture Notes in Computer Science, vol 6458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17187-1_30

  • DOI: https://doi.org/10.1007/978-3-642-17187-1_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17186-4

  • Online ISBN: 978-3-642-17187-1

  • eBook Packages: Computer Science, Computer Science (R0)
