Abstract
Document copy detection is a very important tool for protecting author’s copyright. We present a document copy detection system that calculates the similarity between documents based on plagiarism patterns. Experiments were performed using CISI document collection and show that the proposed system produces more precise results than existing systems.
This research was supported by the MIC (Ministry of Information and Communication), Korea, under the Chung-Ang University HNRC-ITRC (Home Network Research Center) support program supervised by the IITA (Institute of Information Technology Assessment).
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Shivakumar, N., Garcia-Monlina, H.: SCAM: A Copy Detection Mechanisms for Digital Documents. In: Proceedings of International Conference on Theory and Practice of Digital Libraries, Libraries, Austin, Texas (June 1995)
Brin, S., Davis, J., Garcia-Molina, H.: Copy Detection Mechanisms for Digital Documents. In: Proceedings of ACM SIGMOD Annual Conference, San Jose, CA (1995)
Si, A., Leong, H., Lau, R.: CHECK: A Document Plagiarism Detection System. In: Proceedings of ACM Symposium for Applied Computing, February 1997, pp. 70–77 (1997)
Jun-Peng, B., Jun-Yi, S., Xiao-Dong, L., Hai-Yan, L., Xiao-Di, Z.: Document Copy Detection Based On Kernel Method. In: Proceedings of 2003 International Conference on Natural Language Processing and Knowledge Engineering (2003)
Fullam, K., Park, J.: Improvements for Scalable and Accurate Plagiarism Detection in Digital Documents (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kang, N., Han, S. (2006). Document Copy Detection System Based on Plagiarism Patterns. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2006. Lecture Notes in Computer Science, vol 3878. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11671299_60
Download citation
DOI: https://doi.org/10.1007/11671299_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32205-4
Online ISBN: 978-3-540-32206-1
eBook Packages: Computer ScienceComputer Science (R0)