Relation Extraction between Related Concepts by Combining Wikipedia and Web Information for Japanese Language

Shirakawa, Masumi; Nakayama, Kotaro; Aramaki, Eiji; Hara, Takahiro; Nishio, Shojiro

doi:10.1007/978-3-642-17187-1_30

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6458)

Included in the following conference series:

Asia Information Retrieval Symposium

Abstract

Constructing a huge-scale ontology that covers many named entities, domain-specific terms, and the relations among these concepts is one of the essential technologies for the next-generation Web based on semantics. Recently, a number of studies have proposed automated ontology construction methods that exploit the wide coverage of concepts in Wikipedia. However, because they tried to extract formal relations such as is-a and part-of relations, the generated ontologies have only a narrow coverage of the relations among concepts. In this work, we aim at automated ontology construction with a wide coverage of both concepts and their relations by combining information on the Web with Wikipedia. We propose a relation extraction method that receives pairs of related concepts from an association thesaurus extracted from Wikipedia and extracts their relations from the Web.
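The general idea in the abstract, taking a pair of related concepts and looking for text on the Web that links them, can be illustrated with a minimal sketch. This is not the authors' method, whose details the abstract does not give. The function name `candidate_relations`, the in-memory `sentences` list standing in for Web search results, and the use of plain substring matching in place of any linguistic analysis are all assumptions made for illustration.

```python
from collections import Counter


def candidate_relations(pair, sentences):
    """Count the text spans that link the two concepts of `pair`.

    Hypothetical helper: `sentences` stands in for Web snippets, and
    matching is plain substring search, not linguistic analysis.
    """
    a, b = pair
    counts = Counter()
    for s in sentences:
        i, j = s.find(a), s.find(b)
        if i < 0 or j < 0:
            continue  # sentence does not mention both concepts
        # Order the two mentions so the span between them can be sliced out.
        (start, first), (end, _) = sorted([(i, a), (j, b)])
        span = s[start + len(first):end].strip()
        if span:
            counts[span] += 1
    return counts


# Toy data; in the setting the abstract describes, the pair would come
# from the Wikipedia association thesaurus and the sentences from the Web.
sentences = [
    "Osaka University is located in Suita.",
    "Suita is home to Osaka University.",
    "Osaka University is located in Suita, Osaka.",
    "Kyoto is a city.",
]
relations = candidate_relations(("Osaka University", "Suita"), sentences)
top = relations.most_common(1)
print(top)
```

The sketch discards the direction of each relation (which concept comes first in the sentence) and does no filtering of noisy spans; a practical system would need to handle both.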

Author information

Authors and Affiliations

Department of Multimedia Engineering, Graduate School of Information Science and Technology, Osaka University, 1-5 Yamadaoka, Suita, Osaka, 565-0871, Japan
Masumi Shirakawa, Takahiro Hara & Shojiro Nishio
The Center for Knowledge Structuring, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
Kotaro Nakayama & Eiji Aramaki

Editor information

Editors and Affiliations

Department of Computer Science and Information Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei, 10617, Taiwan R.O.C.
Pu-Jen Cheng
School of Computing, National University of Singapore (NUS), Computing 1, 13 Computing Drive, 117417, Singapore
Min-Yen Kan
Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong, China
Wai Lam
School of Computing, Computing 1, National University of Singapore (NUS), 13 Computing Drive, 117417, Singapore
Preslav Nakov

About this paper

Cite this paper

Shirakawa, M., Nakayama, K., Aramaki, E., Hara, T., Nishio, S. (2010). Relation Extraction between Related Concepts by Combining Wikipedia and Web Information for Japanese Language. In: Cheng, PJ., Kan, MY., Lam, W., Nakov, P. (eds) Information Retrieval Technology. AIRS 2010. Lecture Notes in Computer Science, vol 6458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17187-1_30

DOI: https://doi.org/10.1007/978-3-642-17187-1_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17186-4
Online ISBN: 978-3-642-17187-1
eBook Packages: Computer Science, Computer Science (R0)
