Abstract
Biomedical researchers often need to carefully identify and read multiple articles to exclude unproven or controversial biomedical evidence about specific issues. These articles thus need to be highly related to each other. They should share similar core contents, including research goals, methods, and findings. However, given an article r, existing search engines and information retrieval techniques are difficult to retrieve highly related articles for r. We thus present a technique KPC (key passage of citations) that extracts key passages of the citations (out-link references) in each article, and based on the key passages, estimates the similarity between articles. Empirical evaluation on over ten thousand biomedical articles shows that KPC can significantly improve the retrieval of those articles that biomedical experts believe to be highly related to specific articles. The contribution is of practical significance to the writing, reviewing, reading, and analysis of biomedical articles.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Amsler, R.A.: Application of citation-based automatic classification. Technical report, Linguistics Research Center, University of Texas at Austin (1972)
Belew, R.K., Chang, M.: Purposeful retrieval: Applying domain insight for topically-focused groups of biologists. In: Proceedings of the SIGIR 2004 Bio Workshop: Search and Discovery in Bioinformatics (2004)
Calado, P., Cristo, M., Moura, E., Ziviani, N., Ribeiro-Neto, B., Goncalves, M.A.: Combining link-based and content-based methods for web document classification. In: Proc. of the 2003 ACM CIKM International Conference on Information and Knowledge Management (CIKM 2003), New Orleans, Louisiana, USA (2003)
Cambria, E., Hussain, A., Havasi, C., Eckl, C., Munro, J.: Towards crowd validation of the UK national health service. In: Proc. of Web Science Conference, Raleigh, NC, USA (2010)
Couto, T., Cristo, M., Gonc¸alves, M.A., Calado, P., Nivio Ziviani, N., Moura, E., Ribeiro-Neto, B.: A Comparative study of citations and links in document classification. In: Proc. of the 6th ACM/IEEE-CS joint conference on Digital libraries, pp. 75–84 (2006)
Gipp, B., Meuschke, N.: Citation pattern matching algorithms for citation-based plagiarism detection: greedy citation tiling, citation chunking and longest common citation sequence. In: Proc. of 11th ACM Symposium on Document Engineering, Mountain View, CA, USA (2011)
Gipp, B., Beel, J.: Citation proximity analysis (CPA) – a new approach for identifying related work based on Co-citation analysis. In: Proc. of the 12th International Conference on Scientometrics and Informetrics, vol. 2, pp. 571–575 (2009)
Heck, T.: Combining social information for academic networking. In: Proc. of the 16th ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW 2013), San Antonio, Texas, USA
Jeh, G., Widom, J.: SimRank: a measure of structural-context similarity. In: Proc. of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data mining, pp. 538–543 (2002)
Joachims, T.: Optimizing search engines using clickthrough data. In: Proceedings of ACM SIGKDD, Edmonton, Alberta, Canada, pp. 133–142 (2002)
Jung, S., Segev, A.: Analyzing future communities in growing citation networks. Knowledge-Based Systems 69, 34–44 (2014)
Kumar, S., P. Reddy, K., Reddy, V.B., Singh, A.: Similarity analysis of legal judgments. In: Proc. of the Fourth Annual ACM Bangalore Conference (COMPUTE 2011), Bangalore, Karnataka, India
Kessler, M.M.: Bibliographic coupling between scientific papers. American Documentation 14(1), 10–25 (1963)
Small, H.G.: Co-citation in the scientific literature: A new measure of relationship between two documents. Journal of the American Society for Information Science 24(4), 265–269 (1973)
White, H.D., Griffith, B.C.: Author cocitation: A literature measure of intellectual structure. Journal of the American Society for Information Science 32(3), 163–171 (1981)
Yoon, S.-H., Kim, S.-W., Park, S.: A link-based similarity measure for scientific literature. In: Proc. of The 19th International World Wide Web Conference (WWW 2010), North Carolina, USA
Zhang, M., He, Z., Hu, H., Wang, W.: E-Rank: a structural-based similarity measure in social networks. In: Proc. of IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (2012)
Zhao, P., Han, J., Sun, Y.: P-Rank: a comprehensive structural similarity measure over information networks. In: Proc. of the International Conference on Information and Knowledge Management, pp. 553–562 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Liu, RL. (2015). Retrieval of Highly Related Biomedical References by Key Passages of Citations. In: Ali, M., Kwon, Y., Lee, CH., Kim, J., Kim, Y. (eds) Current Approaches in Applied Artificial Intelligence. IEA/AIE 2015. Lecture Notes in Computer Science(), vol 9101. Springer, Cham. https://doi.org/10.1007/978-3-319-19066-2_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-19066-2_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19065-5
Online ISBN: 978-3-319-19066-2
eBook Packages: Computer ScienceComputer Science (R0)