Abstract
Many existing plagiarism detection systems fail to detect plagiarism when the copied programs contain an abundance of garbage. This is because they do not use structural information efficiently. In this paper, we propose a novel plagiarism detection system that uses parse tree kernels. By incorporating parse tree kernels, the system efficiently handles the structural information within source programs. A comparison with existing systems such as SID and JPlag shows that the proposed system detects plagiarism more accurately, owing to its ability to handle structural information.
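The abstract does not spell out how a parse tree kernel is computed. As an illustration only, the following is a minimal sketch of the subtree-counting convolution kernel of Collins and Duffy, applied to two hand-written toy parse trees. It is not the authors' implementation: the nested-tuple tree encoding, the function names `tree_kernel` and `normalized_similarity`, the `decay` parameter, and the toy trees are all assumptions made for this example.

```python
from functools import lru_cache
from math import sqrt

# Assumed encoding: an internal node is a tuple (label, child, ...);
# a terminal (token) is a plain string.

def label(node):
    return node if isinstance(node, str) else node[0]

def children(node):
    return () if isinstance(node, str) else node[1:]

def production(node):
    # Node label plus, for each child, its label and whether it is a terminal.
    return (label(node),
            tuple((label(c), isinstance(c, str)) for c in children(node)))

def internal_nodes(tree):
    # All non-terminal nodes of the tree, in pre-order.
    if isinstance(tree, str):
        return []
    nodes = [tree]
    for child in children(tree):
        nodes.extend(internal_nodes(child))
    return nodes

def tree_kernel(t1, t2, decay=1.0):
    """Count the tree fragments shared by t1 and t2 (Collins-Duffy style)."""

    @lru_cache(maxsize=None)
    def common(n1, n2):
        # Number of common fragments rooted at n1 and n2.
        if production(n1) != production(n2):
            return 0.0
        result = decay
        for c1, c2 in zip(children(n1), children(n2)):
            if not isinstance(c1, str):  # terminals contribute a factor of 1
                result *= 1.0 + common(c1, c2)
        return result

    return sum(common(a, b)
               for a in internal_nodes(t1)
               for b in internal_nodes(t2))

def normalized_similarity(t1, t2, decay=1.0):
    # Scale the kernel so that a tree compared with itself scores 1.
    return tree_kernel(t1, t2, decay) / sqrt(
        tree_kernel(t1, t1, decay) * tree_kernel(t2, t2, decay))

# Toy trees (hypothetical): the second adds one unused "garbage" assignment.
original = ("block",
            ("assign", ("id", "x"), ("num", "1")),
            ("return", ("id", "x")))
padded = ("block",
          ("assign", ("id", "x"), ("num", "1")),
          ("assign", ("id", "junk"), ("num", "0")),
          ("return", ("id", "x")))

print(normalized_similarity(original, padded))
```

In this sketch the inserted statement changes the production at the `block` node, but the fragments below it (the original assignment and the return statement) still match, so the kernel value stays above zero. With `decay=1.0` large fragments dominate the score; a value below 1 down-weights them. How the proposed system sets such parameters is not stated in the abstract.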
References
Parker, A., Hamblen, J.: Computer Algorithms for Plagiarism Detection. IEEE Transactions on Education 32(2), 94–99 (1989)
Halstead, M.: Elements of Software Science. Elsevier, Amsterdam (1977)
Prechelt, L., Malpohl, G., Philippsen, M.: Finding Plagiarisms among a Set of Programs with JPlag. Journal of Universal Computer Science 8(11), 1016–1038 (2002)
Chen, X., Francia, B., Li, M., McKinnon, B., Seker, A.: Shared Information and Program Plagiarism Detection. IEEE Transactions on Information Theory 50(7), 1545–1551 (2004)
Collins, M., Duffy, N.: Convolution Kernels for Natural Language. In: Proceedings of the 14th Neural Information Processing Systems (2001)
Parr, T., Quong, R.: ANTLR: A Predicated-LL(k) Parser Generator. Journal of Software Practice & Experience 25(7) (1995)
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Son, JW., Park, SB., Park, SY. (2006). Program Plagiarism Detection Using Parse Tree Kernels. In: Yang, Q., Webb, G. (eds) PRICAI 2006: Trends in Artificial Intelligence. PRICAI 2006. Lecture Notes in Computer Science, vol 4099. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-36668-3_122
DOI: https://doi.org/10.1007/978-3-540-36668-3_122
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36667-6
Online ISBN: 978-3-540-36668-3
eBook Packages: Computer Science, Computer Science (R0)