Automatic Relation Extraction with Model Order Selection and Discriminative Label Identification

Jinxiu, Chen; Donghong, Ji; Lim, Tan Chew; Zhengyu, Niu

doi:10.1007/11562214_35

Automatic Relation Extraction with Model Order Selection and Discriminative Label Identification

Chen Jinxiu²²,
Ji Donghong²²,
Tan Chew Lim²³ &
…
Niu Zhengyu²²

Conference paper

1554 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3651))

Abstract

In this paper, we study the problem of unsupervised relation extraction based on model order identification and discriminative feature analysis. The model order identification is achieved by stability-based clustering and used to infer the number of the relation types between entity pairs automatically. The discriminative feature analysis is used to find discriminative feature words to name the relation types. Experiments on ACE corpus show that the method is promising.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Defense Advanced Research Projects Agency.: Proceedings of the Sixth Message Understanding Conference (MUC-6). Morgan Kaufmann Publishers, Inc., San Francisco (1995)
Google Scholar
Califf, M.E., Mooney, R.J.: Relational Learning of Pattern-Match Rules for Information Extraction. AAAI, Menlo Park (1999)
Google Scholar
Brin, S.: Extracting patterns and relations from world wide web. In: Proc. of WebDB Workshop at 6th International Conference on Extending Database Technology, pp. 172–183 (1998)
Google Scholar
Sudo, K., Sekine, S., Grishman, R.: An Improved Extraction Pattern Representation Model for Automatic IE Pattern Acquisition. In: Proceedings of ACL, Sapporo, Japan (2003)
Google Scholar
Yangarber, R., Grishman, R., Tapanainen, P., Huttunen, S.: Unsupervised discovery of scenario-level patterns for information extraction. In: Proceedings of the Applied Natural Language Processing Conference, Seattle, WA (2000)
Google Scholar
Agichtein, E., Gravano, L.: Snowball: Extracting Relations from large Plain-Text Collections. In: Proc. of the 5^th ACM International Conference on Digital Libraries (2000)
Google Scholar
Hasegawa, T., Sekine, S., Grishman, R.: Discovering Relations among Named Entities from Large Corpora. In: Proceeding of Conference ACL, Barcelona, Spain (2004)
Google Scholar
Zelenko, D., Aone, C., Richardella, A.: Kernel Methods for Relation Extraction. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia (2002)
Google Scholar
Soderland, S.: Learning information extraction rules for semi-structured and free text. Machine Learning 31(1-3), 233–272 (1999)
Article Google Scholar
Lange, T., Braun, M., Roth, V., Buhmann, J.M.: Stability-Based Model Selection. Advances in Neural Information Processing Systems 15 (2002)
Google Scholar
Levine, E., Domany, E.: Resampling Method for Unsupervised Estimation of Cluster Calidity. Neural Computation 13, 2573–2593 (2001)
Article MATH Google Scholar
Niu, Z., Ji, D., Tan, C.L.: Document Clustering Based on Cluster Validation. In: CIKM 2004, Washington, DC, USA, November 8-13 (2004)
Google Scholar
Roth, V., Lange, T.: Feature Selection in Clustering Problems. In: NIPS 2003 workshop (2003)
Google Scholar
Fung, G.P.C., Yu, J.X., Lu, H.: Discriminative Category Matching: Efficient Text Classification for Huge Document Collections. In: Proceedings of the IEEE International Conference on Data Mining (ICDM), Maebashi City, Japan, December 09-12 (2002)
Google Scholar
Lin, D.: Using syntactic dependency as a local context to resolve word sense ambiguity. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, Madrid, July 1997, pp. 64–71 (1997)
Google Scholar
Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal (August 1995)
Google Scholar
Pedersen, T., Patwardhan, S., Michelizzi, J.: WordNet:Similarity-Measuring the Relatedness of Concepts. AAAI, Menlo Park (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Chen Jinxiu, Ji Donghong & Niu Zhengyu
Department of Computer Science, National University of Singapore, 117543, Singapore
Tan Chew Lim

Authors

Chen Jinxiu
View author publications
You can also search for this author in PubMed Google Scholar
Ji Donghong
View author publications
You can also search for this author in PubMed Google Scholar
Tan Chew Lim
View author publications
You can also search for this author in PubMed Google Scholar
Niu Zhengyu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Language Technology, Macquarie University, 2019, Sydney, NSW, Australia
Robert Dale
Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong
Kam-Fai Wong
Institute for Infocomm Research, 21, Heng Mui Keng Terrace, 119613, Singapore
Jian Su
Language Information Sciences Research Centre, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
Oi Yee Kwong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jinxiu, C., Donghong, J., Lim, T.C., Zhengyu, N. (2005). Automatic Relation Extraction with Model Order Selection and Discriminative Label Identification. In: Dale, R., Wong, KF., Su, J., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2005. IJCNLP 2005. Lecture Notes in Computer Science(), vol 3651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562214_35

Download citation

DOI: https://doi.org/10.1007/11562214_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29172-5
Online ISBN: 978-3-540-31724-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics