Abstract
Tremendous amount of knowledge is present in the ever-growing scientific literature. In order to grasp this massive amount knowledge, various computational tasks are proposed for training computers to read and analyze scientific documents. As one of these task, semantic relationship classification aims at automatically analyzing semantic relationships in scientific documents. Conventionally, only a limited number of commonly used knowledge bases such as Wikipedia are used for collecting background information for this task. In this work, we hypothesize that scientific papers also could be utilized as a source of background information for semantic relationship classification. Based on the hypothesis, we propose the model that is capable of extracting background information from unannotated scientific papers. Preliminary experiments on the RANIS dataset [1] proves the effectiveness of the proposed model on relationship classification in scientific articles.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The phrase target entity refers not merely the concept denoted by noun or noun phrase, but it could be an action denoted by a verb or verb phrase and some quality denoted by an adjective, adverb, etc.
- 2.
This example is taken from W13-2242, ACL anthology (http://aclanthology.info).
- 3.
- 4.
We used the Stanford CoreNLP. https://stanfordnlp.github.io/CoreNLP/.
References
Tateisi, Y., Shidahara, Y., Miyao, Y., Aizawa, A.: Annotation of computer science papers for semantic relation extraction. In: LREC, pp. 1423–1429 (2014)
Teufel, S., et al.: Argumentative zoning: information extraction from scientific text. Ph.D. thesis, University of Edinburgh (2000)
Cohen, K.B., Demner-Fushman, D., Ananiadou, S., Tsujii, J.: BioNLP 2017 (2017)
Augenstein, I., Das, M., Riedel, S., Vikraman, L., McCallum, A.: SemEval 2017 Task 10: ScienceIE-extracting keyphrases and relations from scientific publications. arXiv preprint arXiv:1704.02853 (2017)
Kumar, S.: A survey of deep learning methods for relation extraction. arXiv preprint arXiv:1705.03645 (2017)
Zhou, D., Zhong, D., He, Y.: Biomedical relation extraction: from binary to complex. Comput. Math. Methods Med. 2014, 18 p. (2014). https://doi.org/10.1155/2014/298473. Article ID 298473
Boschee, E., Weischedel, R., Zamanian, A.: Automatic information extraction. In: Proceedings of the International Conference on Intelligence Analysis, vol. 71. Citeseer (2005)
Suchanek, F.M., Ifrim, G., Weikum, G.: Combining linguistic and statistical analysis to extract relations from web documents. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 712–717. ACM (2006)
Chan, Y.S., Roth, D.: Exploiting background knowledge for relation extraction. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 152–160. Association for Computational Linguistics (2010)
Zeng, D., Liu, K., Lai, S., Zhou, G., Zhao, J., et al.: Relation classification via convolutional deep neural network. In: COLING, pp. 2335–2344 (2014)
Zhang, D., Wang, D.: Relation classification via recurrent neural network. arXiv preprint arXiv:1508.01006 (2015)
dos Santos, C.N., Xiang, B., Zhou, B.: Classifying relations by ranking with convolutional neural networks. arXiv preprint arXiv:1504.06580 (2015)
Ammar, W., Peters, M., Bhagavatula, C., Power, R.: The AI2 system at SemEval-2017 Task 10 (ScienceIE): semi-supervised end-to-end entity and relation extraction. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 592–596 (2017)
Miwa, M., Bansal, M.: End-to-end relation extraction using LSTMs on sequences and tree structures. arXiv preprint arXiv:1601.00770 (2016)
GuoDong, Z., Jian, S., Jie, Z., Min, Z.: Exploring various knowledge in relation extraction. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 427–434. Association for Computational Linguistics (2005)
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Gu, J., Sun, F., Qian, L., Zhou, G.: Chemical-induced disease relation extraction via convolutional neural network. Database 2017, bax024 (2017). https://doi.org/10.1093/database/bax024
Hahn-Powell, G., Bell, D., Valenzuela-EscĂ¡rcega, M.A., Surdeanu, M.: This before that: causal precedence in the biomedical domain. arXiv preprint arXiv:1606.08089 (2016)
Yin, W., Kann, K., Yu, M., SchĂ¼tze, H.: Comparative study of CNN and RNN for natural language processing. arXiv preprint arXiv:1702.01923 (2017)
Elman, J.L.: Finding structure in time. Cogn. Sci. 14(2), 179–211 (1990)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Galley, M., McKeown, K.: Improving word sense disambiguation in lexical chaining (2003)
Morris, J., Hirst, G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Comput. Linguisti. 17(1), 21–48 (1991)
Mascarell, L.: Lexical chains meet word embeddings in document-level statistical machine translation. In: Proceedings of the Third Workshop on Discourse in Machine Translation, pp. 99–109 (2017)
Wang, T., Li, Y., Bontcheva, K., Cunningham, H., Wang, J.: Automatic extraction of hierarchical relations from text. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 215–229. Springer, Heidelberg (2006). https://doi.org/10.1007/11762256_18
Arora, S., Liang, Y., Ma, T.: A simple but tough-to-beat baseline for sentence embeddings (2016)
Bergstra, J., Breuleux, O., Bastien, F., Lamblin, P., Pascanu, R., Desjardins, G., Turian, J., Warde-Farley, D., Bengio, Y.: Theano: a CPU and GPU math compiler in Python. In: Proceedings of the 9th Python in Science Conference, pp. 1–7 (2010)
Bird, S., Dale, R., Dorr, B.J., Gibson, B.R., Joseph, M.T., Kan, M.Y., Lee, D., Powley, B., Radev, D.R., Tan, Y.F., et al.: The ACL anthology reference corpus: a reference dataset for bibliographic research in computational linguistics. In: LREC (2008)
Acknowledgement
This work was supported by JST CREST Grant Number JPMJCR1513, Japan and KAKENHI Grant Number 16H06614.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Dai, Q., Inoue, N., Reisert, P., Inui, K. (2018). Leveraging Document-Specific Information for Classifying Relations in Scientific Articles. In: Arai, S., Kojima, K., Mineshima, K., Bekki, D., Satoh, K., Ohta, Y. (eds) New Frontiers in Artificial Intelligence. JSAI-isAI 2017. Lecture Notes in Computer Science(), vol 10838. Springer, Cham. https://doi.org/10.1007/978-3-319-93794-6_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-93794-6_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93793-9
Online ISBN: 978-3-319-93794-6
eBook Packages: Computer ScienceComputer Science (R0)