Abstract
Nowadays, most of documents are produced in digital format, in which they can be easily accessed and copied. Document copy detection is a very important tool for protecting the author’s copyright. We present PPChecker, a document copy detection system based on plagiarism pattern checking. PPChecker calculates the amount of data copied from the original document to the query document, based on linguistically-motivated plagiarism patterns. Experiments performed on CISI document collection show that PPChecker produces better decision information for document copy detection than existing systems.
This research was supported by the MIC (Ministry of Information and Communication), Korea, under the Chung-Ang University HNRC-ITRC (Home Network Research Center) support program supervised by the IITA (Institute of Information Technology Assessment).
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Shivakumar, N., Garcia-Monlina, H.: SCAM: A Copy Detection Mechanisms for Digital Documents. In: Proceedings of International Conference on Theory and Practice of Digital Libraries, Austin, Texas (June 1995)
Brin, S., Davis, J., Garcia-Molina, H.: Copy Detection Mechanisms for Digital Documents. In: Proceedings of ACM SIGMOD Annual Conference, San Jose, CA (May 1995)
Si, A., Leong, H., Lau, R.: CHECK: A Document Plagiarism Detection System. In: Proceedings of ACM Symposium for Applied Computing, February 1997, pp. 70–77 (1997)
Jun-Peng, B., Jun-Yi, S., Xiao-Dong, L., Hai-Yan, L., Xiao-Di, Z.: Document Copy Detection Based On Kernel Method. In: 2003 International Conference on Natural Language Processing and Knowledge Engineering Proceedings (2003)
Monostori, K., Zaslavsky, A., Schmidt, H.: Document Overlap Detection System for Distributed Digital Libraries. In: Proc. of the 5th ACM conference on DL, pp. 226–227 (2000)
Bloomfield, L.: The Plagiarism Resource Site Charlottesville, Virginia, http://plagiarism.phys.virginia.edu
Fullam, K., Park, J.: Improvements for Scalable and Accurate Plagiarism Detection in Digital Documents (2002)
Shivakumar, N., Garcia-Molina, H.: Building a Scalable and Accurate Copy Detection Mechanism. In: 1st ACM Int. Conference on Digital Libraries (DL 1996), March 1996, pp. 160–168 (1996)
Finkel, R., Zaslavsky, A., Monostori, K., Schmidt, H.: Signature Extraction for Overlap Detection in Documents. In: Proceedings of Australasian Computer Science Conference (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kang, N., Gelbukh, A., Han, S. (2006). PPChecker: Plagiarism Pattern Checker in Document Copy Detection. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science(), vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_83
Download citation
DOI: https://doi.org/10.1007/11846406_83
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39090-9
Online ISBN: 978-3-540-39091-6
eBook Packages: Computer ScienceComputer Science (R0)