Abstract
In this paper we present the accuracy gains that spell correction systems can provide to the plagiarism detection task when the appropriated passages contain spelling mistakes. Such mistakes may have been introduced deliberately to prevent detection systems from finding the appropriations, which is especially likely to succeed when those systems rely on lexical similarity. We describe the components that we have developed for both plagiarism detection and spell correction, and the significant gains that their combination produces.
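The effect described above can be illustrated with a minimal sketch. It is not the authors' system: the word-bigram containment score, the `difflib`-based corrector, the `cutoff` value, and the example sentences are all assumptions chosen for illustration, and the vocabulary is taken from the source sentence only to keep the example self-contained (a real corrector would use a dictionary or a learned error model).

```python
from difflib import get_close_matches

PUNCTUATION = ".,;:!?\"'()"


def tokens(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    stripped = (w.strip(PUNCTUATION).lower() for w in text.split())
    return [w for w in stripped if w]


def bigrams(words):
    """Set of adjacent word pairs."""
    return set(zip(words, words[1:]))


def containment(suspicious, source):
    """Fraction of the suspicious text's word bigrams that also occur in the source."""
    susp = bigrams(suspicious)
    if not susp:
        return 0.0
    return len(susp & bigrams(source)) / len(susp)


def correct(words, vocabulary, cutoff=0.8):
    """Toy corrector (assumption): map each unknown word to its closest vocabulary entry."""
    vocab_list = sorted(vocabulary)
    fixed = []
    for w in words:
        if w in vocabulary:
            fixed.append(w)
            continue
        match = get_close_matches(w, vocab_list, n=1, cutoff=cutoff)
        fixed.append(match[0] if match else w)
    return fixed


# Hypothetical example: the second sentence copies the first with injected misspellings.
source = tokens("Plagiarism detection systems often rely on lexical similarity between documents.")
obfuscated = tokens("Plagiarism detecton systems ofen rely on lexcal similarity betwen documents.")

before = containment(obfuscated, source)
after = containment(correct(obfuscated, set(source)), source)

print(f"similarity without correction: {before:.2f}")
print(f"similarity with correction:    {after:.2f}")
```

In this toy setting, four misspelled words are enough to break most of the shared bigrams, and correcting them before comparison restores the overlap. This is the intuition behind placing a spell correction step in front of a lexical similarity detector.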
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Micol, D., Ferrández, Ó., Muñoz, R. (2012). On the Application of Spell Correction to Improve Plagiarism Detection. In: Bouma, G., Ittoo, A., Métais, E., Wortmann, H. (eds) Natural Language Processing and Information Systems. NLDB 2012. Lecture Notes in Computer Science, vol 7337. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31178-9_36
DOI: https://doi.org/10.1007/978-3-642-31178-9_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31177-2
Online ISBN: 978-3-642-31178-9
eBook Packages: Computer Science (R0)