Abstract
Constructing a large-scale ontology that covers many named entities, domain-specific terms, and the relations among these concepts is one of the essential technologies for the next-generation, semantics-based Web. Recently, a number of studies have proposed automated ontology construction methods that exploit the wide coverage of concepts in Wikipedia. However, because these methods target formal relations such as is-a and a-part-of relations, the generated ontologies have only a narrow coverage of the relations among concepts. In this work, we aim at automated ontology construction with wide coverage of both concepts and their relations by combining information on the Web with Wikipedia. We propose a relation extraction method that receives pairs of related concepts from an association thesaurus extracted from Wikipedia and extracts their relations from the Web.
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shirakawa, M., Nakayama, K., Aramaki, E., Hara, T., Nishio, S. (2010). Relation Extraction between Related Concepts by Combining Wikipedia and Web Information for Japanese Language. In: Cheng, PJ., Kan, MY., Lam, W., Nakov, P. (eds) Information Retrieval Technology. AIRS 2010. Lecture Notes in Computer Science, vol 6458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17187-1_30
DOI: https://doi.org/10.1007/978-3-642-17187-1_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17186-4
Online ISBN: 978-3-642-17187-1
eBook Packages: Computer Science, Computer Science (R0)