Self-supervised Relation Extraction Using UMLS

Roller, Roland; Stevenson, Mark

doi:10.1007/978-3-319-11382-1_12

Roland Roller²² &
Mark Stevenson²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8685))

Included in the following conference series:

International Conference of the Cross-Language Evaluation Forum for European Languages

1166 Accesses

Abstract

Self-supervised relation extraction uses a knowledge base to automatically annotate a training corpus which is then used to train a classifier. This approach has been successfully applied to different domains using a range of knowledge bases. This paper applies the approach to the biomedical domain using UMLS, a large biomedical knowledge base containing millions of concepts and relations among them. The approach is evaluated using two different techniques. The presented results are promising and indicate that UMLS is a useful resource for semi-supervised relation extraction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Weakly Supervised Relation Extraction

Relation Extraction

BioRel: towards large-scale biomedical relation extraction

Article Open access 16 December 2020

References

Agichtein, E., Gravano, L.: Snowball: extracting relations from large plain-text collections. In: Proceedings of the Fifth ACM Conference on Digital libraries, DL 2000, pp. 85–94 (2000)
Google Scholar
Aronson, A., Lang, F.: An overview of MetaMap: historical perspective and recent advances. Journal of the American Medical Association 17(3), 229–236 (2010)
Google Scholar
Björne, J., Salakoski, T.: Generalizing biomedical event extraction. In: Proceedings of BioNLP Shared Task 2011 Workshop, pp. 183–191. Association for Computational Linguistics, Portland (2011)
Google Scholar
Björne, J., Salakoski, T.: Tees 2.1: Automated annotation scheme learning in the bionlp 2013 shared task. In: Proceedings of the BioNLP Shared Task 2013 Workshop, pp. 16–25. Association for Computational Linguistics, Sofia (2013)
Google Scholar
Brin, S.: Extracting patterns and relations from the world wide web. In: Atzeni, P., Mendelzon, A.O., Mecca, G. (eds.) WebDB 1998. LNCS, vol. 1590, pp. 172–183. Springer, Heidelberg (1999)
Chapter Google Scholar
Charniak, E., Johnson, M.: Coarse-to-fine n-best parsing and maxent discriminative reranking. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL 2005, pp. 173–180 (2005)
Google Scholar
Collins, M., Duffy, N.: New ranking algorithms for parsing and tagging: Kernels over discrete structures, and the voted perceptron. In: Proceedings of 40th Annual Meeting of the Association for Computational Linguistics, pp. 263–270. Association for Computational Linguistics, Philadelphia (2002)
Google Scholar
Craven, M., Kumlien, J.: Constructing biological knowledge bases by extracting information from text sources. In: Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology (ISMB), pp. 77–86. AAAI Press (1999)
Google Scholar
Dietterich, T.G., Lathrop, R.H., Lozano-Perez, T., Pharmaceutical, A.: Solving the multiple-instance problem with axis-parallel rectangles. Artificial Intelligence 89, 31–71 (1997)
Article MATH Google Scholar
Hoffmann, R., Zhang, C., Ling, X., Zettlemoyer, L., Weld, D.S.: Knowledge-based weak supervision for information extraction of overlapping relations. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, ACL 2011, pp. 541–550 (2011)
Google Scholar
Hoffmann, R., Zhang, C., Weld, D.S.: Learning 5000 relational extractors. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL 2010, pp. 286–295 (2010)
Google Scholar
Joachims, T.: Making Large-scale Support Vector Machine Learning Practical. In: Advances in Kernel Methods, pp. 169–184 (1999)
Google Scholar
Klein, D., Manning, C.D.: Accurate unlexicalized parsing. In: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pp. 423–430 (2003)
Google Scholar
Mintz, M., Bills, S., Snow, R., Jurafsky, D.: Distant supervision for relation extraction without labeled data. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: ACL 2009, vol. 2, pp. 1003–1011 (2009)
Google Scholar
Moschitti, A.: Making tree kernels practical for natural language learning. In: EACL, pp. 113–120 (2006)
Google Scholar
Porter, M.F.: An Algorithm for Suffix Stripping. In: Readings in Information Retrieval, pp. 313–316 (1997)
Google Scholar
Riedel, S., McClosky, D., Surdeanu, M., McCallum, A.: D. Manning, C.: Model combination for event extraction in bionlp 2011. In: Proceedings of BioNLP Shared Task 2011 Workshop, pp. 51–55. Association for Computational Linguistics, Portland (2011)
Google Scholar
Riedel, S., Yao, L., McCallum, A.: Modeling relations and their mentions without labeled text. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds.) ECML PKDD 2010, Part III. LNCS, vol. 6323, pp. 148–163. Springer, Heidelberg (2010)
Chapter Google Scholar
Segura-Bedmar, I., Martínez, P., de Pablo-Sánchez, C.: Using a shallow linguistic kernel for drug-drug interaction extraction. Journal of Biomedical Informatics 44(5), 789–804 (2011)
Article Google Scholar
Snow, R., Jurafsky, D., Ng, A.Y.: Learning syntactic patterns for automatic hypernym discovery. In: Advances in Neural Information Processing Systems (NIPS 2004) (November 2004)
Google Scholar
Takamatsu, S., Sato, I., Nakagawa, H.: Reducing wrong labels in distant supervision for relation extraction. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, ACL 2012, vol. 1, pp. 721–729 (2012)
Google Scholar
Thomas, P., Neves, M., Solt, I., Tikk, D., Leser, U.: Relation extraction for drug-drug interactions using ensemble learning. In: DDIExtraction2011: First Challenge Task: Drug-Drug Interaction Extraction at SEPLN 2011, vol. 4, pp. 11–18 (2011)
Google Scholar
Thomas, I.P., Solt, Klinger, R., Leser, U.: Learning protein protein interaction extraction using distant supervision. In: Proceedings of Robust Unsupervised and Semi-Supervised Methods in Natural Language Processing, pp. 34–41 (2011)
Google Scholar
Xu, W., Hoffmann, R., Zhao, L., Grishman, R.: Filling knowledge base gaps for distant supervision of relation extraction. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 665–670. Association for Computational Linguistics, Sofia (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello, S1 4DP, Sheffield, England
Roland Roller & Mark Stevenson

Authors

Roland Roller
View author publications
You can also search for this author in PubMed Google Scholar
Mark Stevenson
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Google Inc., Brandschenkestraße 110, 8002, Zurich, Switzerland
Evangelos Kanoulas
Institute of Software Technology and Interactive Systems, Vienna University of Technology, Favoritenstrasse 9-11, 1040, Vienna, Austria
Mihai Lupu
Information School, University of Sheffield, Sheffield, UK
Paul Clough
Department of Computer Science and IT, RMIT University, 3000, Melbourne, VIC, Australia
Mark Sanderson
Department of Computing, Edge Hill University, L39 4QP, Ormskirk, Lancashire, UK
Mark Hall
Vienna University of Technology, Austria
Allan Hanbury
Information School, University of Sheffield, Regent Court, 211 Portobello, S1 4DP, Sheffield, UK
Elaine Toms

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Roller, R., Stevenson, M. (2014). Self-supervised Relation Extraction Using UMLS. In: Kanoulas, E., et al. Information Access Evaluation. Multilinguality, Multimodality, and Interaction. CLEF 2014. Lecture Notes in Computer Science, vol 8685. Springer, Cham. https://doi.org/10.1007/978-3-319-11382-1_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-11382-1_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11381-4
Online ISBN: 978-3-319-11382-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics