Abstract
Methodological sentence is the smallest unit that depicts how the research method is used in one paper. Researchers can understand a method by reading these sentences. So, extracting methodological sentences automatically is meaningful for them to evaluate and select appropriate methods in their research process. However, previous studies rely too much on manually annotated corpus, in which the quantity is limited. Furthermore, some studies do not perform well when generalized to testing sets. In this paper, we use structured abstracts as training data to alleviate the burden of manually annotation. The label for each sentence is determined by its corresponding title in the abstract. Moreover, in order to extract methodological sentences more precisely, a rule-based method is applied for pruning the prediction result. In experimental results, the P, R, and F1 value after pruning are 65.14%, 57.00% and 60.80% respectively, which are all higher than those are not pruned.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bornmann, L., Mutz, R.: Growth rates of modern science: a bibliometric analysis based on the number of publications and cited references. J. Assoc. Inform. Sci. Technol. 66(11), 2215–2222 (2015)
Noriega-Atala, E., Hein, P.D., Thumsi, S.S., Wong, Z., Wang, X., Morrison, C.T.: Inter-sentence relation extraction for associating biological context with events in biomedical texts. In: 2018 IEEE International Conference on Data Mining Workshops (ICDMW), Singapore, pp. 722–731 (2018)
Wang, H., et al.: Evidence sentence extraction for machine reading comprehension. arXiv preprint arXiv:1902.08852 (2019)
Morid, M.A., Fiszman, M., Raja, K., Jonnalagadda, S.R., Del Fiol, G.: Classification of clinically useful sentences in clinical evidence resources. J. Biomed. Inform. 60, 14–22 (2016)
Venkatesan, R., Li, B.: Convolutional Neural Networks in Visual Computing: A Concise Guide. CRC Press, Florida (2017)
Wakuya, H., Zurada, J.M.: Bi-directional computing architecture for time series prediction. Neural Netw. 14(9), 1307–1321 (2001)
Hadji, I., Wildes, R.P.: What do we understand about convolutional networks? arXiv preprint arXiv:1803.08834 (2018)
Rezaeinia, S.M., Ghodsi, A., Rahmani, R.: Text classification based on multiple block convolutional highways. arXiv preprint arXiv:1807.09602 (2018)
Chengliang, G., Hua, X.U., Kai, G.: Attention-based BiLSTM network with part-of-speech features for chinese text classification. J. Hebei Univ. Sci. Technol. (2018)
Cristianint, N.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Printed in the United Kingdom at the University Press (2000)
Palous, J.: Machine Learning and Data Mining. Horwood Publishing, Chichester (2007)
Guo, G., Wang, H., Bell, D., Bi, Y., Greer, K.: KNN model-based approach in classification. In: Meersman, R., Tari, Z., Schmidt, D.C. (eds.) On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE. OTM 2003. LNCS, vol. 2888, pp. 986–996. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-39964-3_62
Kingma, D., Ba, J.: Adam: a method for stochastic optimization. Comput. Sci. (2014)
Levy, O., Goldberg, Y.: Dependency-based word embeddings. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (2014)
Mikolov, T., et al.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Cohen, J.: A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 20(1), 37–46 (1960)
Landis, J.R., Koch, G.G.: The measurement of interrater agreement for categorical data. Biometrics 33, 159–174 (1977)
Acknowledgment
This work is supported by Major Projects of National Social Science Fund (No. 17ZDA291) and Postgraduate Research & Practice Innovation Program of Jiangsu Province (No. KYCX19_0246).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Wang, R., Zhang, C., Zhang, Y., Zhang, J. (2020). Extracting Methodological Sentences from Unstructured Abstracts of Academic Articles. In: Sundqvist, A., Berget, G., Nolin, J., Skjerdingstad, K. (eds) Sustainable Digital Communities. iConference 2020. Lecture Notes in Computer Science(), vol 12051. Springer, Cham. https://doi.org/10.1007/978-3-030-43687-2_66
Download citation
DOI: https://doi.org/10.1007/978-3-030-43687-2_66
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-43686-5
Online ISBN: 978-3-030-43687-2
eBook Packages: Computer ScienceComputer Science (R0)