Abstract
The purpose of this study is to establish a method for identifying semantic relations in compound nouns in patent documents using linguistic information. Grammatical and semantic features play a key role in analyzing these relations. As grammatical features, we used information about immediately succeeding case particles, adjective-forming suffixes, and the verb suru ("do"); as semantic features, we used the concept classifications in the EDR dictionary. Using both grammatical and semantic features, the fully automated statistical method performed well, reaching 84%. This result shows the advantage of our method.
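As a rough illustration of the two kinds of features named in the abstract, the following minimal sketch counts what immediately follows a noun in a tokenized corpus and adds concept classes from a lookup table. It is not the authors' implementation: the particle, suffix, and suru inventories, the function names, the pre-tokenized input, and the concept label "action" are all assumptions made for the example, and the statistical classifier itself is not shown because the abstract does not specify it.

```python
from collections import Counter

# Hypothetical inventories for illustration only; the paper's actual lists
# are not given in the abstract.
CASE_PARTICLES = {"が", "を", "に", "で", "と", "へ", "から", "の"}
ADJ_SUFFIXES = {"的", "な"}
SURU_FORMS = {"する"}


def grammatical_features(noun, tokens):
    """Count the token types that immediately follow `noun` in `tokens`."""
    counts = Counter()
    for current, following in zip(tokens, tokens[1:]):
        if current != noun:
            continue
        if following in CASE_PARTICLES:
            counts["particle:" + following] += 1
        elif following in ADJ_SUFFIXES:
            counts["adj_suffix:" + following] += 1
        elif following in SURU_FORMS:
            counts["suru"] += 1
    return counts


def feature_vector(noun, tokens, concept_classes):
    """Merge grammatical counts with binary concept-class indicators.

    `concept_classes` is a plain dict standing in for a dictionary lookup;
    it is not the EDR dictionary's real interface.
    """
    features = dict(grammatical_features(noun, tokens))
    for concept in concept_classes.get(noun, []):
        features["concept:" + concept] = 1
    return features


# Toy, hand-tokenized input (assumed, not taken from the paper's data).
tokens = ["処理", "を", "行う", "処理", "する", "自動", "的", "に"]
vector = feature_vector("処理", tokens, {"処理": ["action"]})
```

A vector like this, built for each constituent of a compound noun, is the kind of input a statistical classifier of semantic relations could take.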
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Uchiyama, K., Aihara, S., Ishizaki, S. (2008). Identifying Semantic Relations in Japanese Compound Nouns for Patent Documents Analysis. In: Tokunaga, T., Ortega, A. (eds) Large-Scale Knowledge Resources. Construction and Application. LKR 2008. Lecture Notes in Computer Science, vol 4938. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78159-2_8
DOI: https://doi.org/10.1007/978-3-540-78159-2_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78158-5
Online ISBN: 978-3-540-78159-2
eBook Packages: Computer Science, Computer Science (R0)