Biomedical Event Trigger Detection Based on Hybrid Methods Integrating Word Embeddings

Li, Lishuang; Qin, Meiyue; Huang, Degen

doi:10.1007/978-981-10-3168-7_7

Lishuang Li¹⁶,
Meiyue Qin¹⁶ &
Degen Huang¹⁶

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 650))

Included in the following conference series:

China Conference on Knowledge Graph and Semantic Computing

1493 Accesses

Abstract

Trigger detection as the preceding task is of great importance in biomedical event extraction. By now, most of the state-of-the-art systems have been based on single classifiers, and the words encoded by one-hot are unable to represent the semantic information. In this paper, we utilize hybrid methods integrating word embeddings to get higher performance. In hybrid methods, first, multiple single classifiers are constructed based on rich manual features including dependency and syntactic parsed results. Then multiple predicting results are integrated by set operation, voting and stacking method. Hybrid methods can take advantage of the difference among classifiers and make up for their deficiencies and thus improve performance. Word embeddings are learnt from large scale unlabeled texts and integrated as unsupervised features into other rich features based on dependency parse graphs, and thus a lot of semantic information can be represented. Experimental results show our method outperforms the state-of-the-art systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Biomedical event trigger detection by dependency-based word embedding

Article Open access 10 August 2016

A multiple distributed representation method based on neural network for biomedical event extraction

Article Open access 20 December 2017

Biomedical event trigger extraction based on multi-layer residual BiLSTM and contextualized word representations

Article 10 April 2021

References

Björne, J., Heimonen, J., Ginter, F., Airola, A., Pahikkala, T., Salakoski, T.: Extracting complex biological events with rich graph-based feature sets. In: Proceedings of Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task, pp. 10–18. ACL, Boulder, Colorado (2009)
Google Scholar
Martinez, D., Baldwin, T.: Word sense disambiguation for event trigger word detection in biomedicine. BMC Bioinform. 12(Suppl. 2), S4 (2011)
Article Google Scholar
Zhang, Y., Lin, H., Yang, Z., Wang, J., Li, Y.: Biomolecular event trigger detection using neighborhood hash features. J. Theoret. Biol. 318, 22–28 (2013)
Article Google Scholar
Majumder, A.: Multiple features based approach to extract bio-molecular event triggers using conditional random field. Int. J. Intell. Syst. Appl. 4(12), 41–47 (2012)
Google Scholar
Wang, J., Wu, Y., Lin, H., Yang, Z.: Biological event trigger extraction based on deep parsing. Comput. Eng. 39, 25–30 (2013)
Google Scholar
Domingos, P.: A few useful things to know about machine learning. Commun. ACM 55(10), 78–87 (2012)
Article Google Scholar
Li, L., Fan, W., Huang, D., Dang, Y., Sun, J.: Boosting performance of gene mention tagging system by hybrid methods. J. Biomed. Inf. 45(1), 156–164 (2012)
Article Google Scholar
Crammer, K., Dekel, O., Keshet, J., Shalev-Shwartz, S., Singer, Y.: Online passive-aggressive algorithms. J. Mach. Learn. Res. 7, 551–585 (2006)
MathSciNet MATH Google Scholar
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)
Article MATH Google Scholar
Tang, B., Cao, H., Wang, X., Chen, Q., Xu, H.: Evaluating word representation features in biomedical named entity recognition tasks. BioMed. Res. Int. 2014, Article ID 240403, 1–6 (2014). Hindawi Publishing Corporation
Google Scholar
Turian, J., Ratinov, L., Bengio, Y.: Word representations: a simple and general method for semi-supervised learning. In: Proceedings of 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, pp. 384–394 (2010)
Google Scholar
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P., Collins, M.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)
MATH Google Scholar
Mnih, A., Hinton, G.: A scalable hierarchical distributed language model. In: NIPS, pp. 1081–1088 (2008)
Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 26, 3111–3119 (2013)
Google Scholar
Mikolov, T., Yih, W.T., Zweig, G.: Linguistic regularities in continuous space word representations. In: Proceedings of NAACL-HLT, Atlanta, Georgia, pp. 746–751 (2013)
Google Scholar
McClosky, D., Charniak, E.: Self-training for biomedical parsing. In: Proceedings of 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies, Columbus, Ohio, pp. 101–104 (2008)
Google Scholar
Miyao, Y., Sagae, K., Saetre, R., Matsuzaki, T., Tsujii, J.: Evaluating contributions of natural language parsers to protein–protein interaction extraction. Bioinformatics 25(3), 394–400 (2009)
Article Google Scholar
Miwa, M., Saetre, R., Kim, J.D., Tsujii, J.: Event extraction with complex event classification using rich features. J. Bioinform. Comput. Biol. 8(1), 131–146 (2010). doi:10.1142/S0219720010004586
Article Google Scholar
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, Berlin (1995)
Book MATH Google Scholar
Kim, J.D., Ohta, T., Pyysalo, S., Kano, Y., Tsujii, J.: Overview of BioNLP’09 shared task on event extraction. In: Proceedings of Workshop on BioNLP: Shared Task, Boulder, Colorado, pp. 1–9 (2009)
Google Scholar
Kim, J.D., Pyysalo, S., Ohta, T., Bossy, R., Nguyen, N., Tsujii, J.: Overview of BioNLP shared task 2011. In: Proceedings of BioNLP Shared Task 2011 Workshop, pp. 1–6. Association for Computational Linguistics, Portland (2011)
Google Scholar

Download references

Acknowledgments

The authors gratefully acknowledge the financial support provided by the National Natural Science Foundation of China under No. 61672126, 61173101.

Author information

Authors and Affiliations

School of Computer Science and Technology, Dalian University of Technology, Dalian, China
Lishuang Li, Meiyue Qin & Degen Huang

Authors

Lishuang Li
View author publications
You can also search for this author in PubMed Google Scholar
Meiyue Qin
View author publications
You can also search for this author in PubMed Google Scholar
Degen Huang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lishuang Li .

Editor information

Editors and Affiliations

Zhejiang University, Zhejiang, China
Huajun Chen
Rensselaer Polytechnic Institute, Troy, New York, USA
Heng Ji
Chinese Academy of Sciences, Beijing, China
Le Sun
Google Research, Mountain View, California, USA
Haixun Wang
Wuhan University, Wuhan, Hubei, China
Tieyun Qian
East China University of Science and Technology, Shanghai, China
Tong Ruan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, L., Qin, M., Huang, D. (2016). Biomedical Event Trigger Detection Based on Hybrid Methods Integrating Word Embeddings. In: Chen, H., Ji, H., Sun, L., Wang, H., Qian, T., Ruan, T. (eds) Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data. CCKS 2016. Communications in Computer and Information Science, vol 650. Springer, Singapore. https://doi.org/10.1007/978-981-10-3168-7_7

Download citation

DOI: https://doi.org/10.1007/978-981-10-3168-7_7
Published: 23 November 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3167-0
Online ISBN: 978-981-10-3168-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics