skip to main content
10.1145/3144826.3145438acmotherconferencesArticle/Chapter ViewAbstractPublication PagesteemConference Proceedingsconference-collections
research-article

Detecting plagiarism in micro-blogging social networks

Published:18 October 2017Publication History

ABSTRACT

Fighting plagiarism can be an exhausting task. In the context of social networks, it is common that some users regularly copy the content prepared by others and exhibit them as own. In the higher education context, it is not different and it may be even more critical. In a write-to-learn approach, if students copying text of others get similar rewards as those doing the effort, it could be a cause of dis-encouragement for those working students. Despite the interest in anti-plagiarism tools, there are no anti-plagiarism libraries that allow a self-hosted solution. Existing APIs are commercial ones, and are not tailored for specific languages. The paper introduces a plagiarism detection tool developed using free resources and that has been integrated into Bolotweet, a social network for teaching support which allows to score student micro-annotations. The tool is prepared to compare micro-annotations and tell the similarity between one and others that were posted in the past. It aids the teacher to identify cheating students and also to provide consistency in the scoring. By identifying similar annotations, and informing which score was assigned, the teacher can generate similar scores to similar micro-annotations.

References

  1. Alexander Budanitsky and Graeme Hirst. 2006. Evaluating WordNet-based Measures of Lexical Semantic Relatedness. Computational Linguistics 32, 1 (2006),13--47. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Evgeniy Gabrilovich and Shaul Markovitch. 2007. Computing semantic relatedness using wikipedia-based explicit semantic analysis. In IJcAI, Vol. 7. 1606--1611 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Jorge J Gomez-Sanz, Álvaro Ortego, and Juan Pavón. 2016. BoloTweet: A MicroBlogging System for Education. In Methodologies and Intelligent Systems for Technology Enhanced Learning. Springer, 53--60.Google ScholarGoogle Scholar
  4. Caichun Gong, Yulan Huang, Xueqi Cheng, and Shuo Bai. 2008. Detecting near-duplicates in large-scale short text databases. Advances in knowledge discovery and data mining (2008), 877--883. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Aitor Gonzalez-Agirre, Egoitz Laparra, and German Rigau. 2012. Multilingual Central Repository version 3.0. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, Istanbul, Turkey, May 23-25,2012. 2525--2529. http://www.lrec-conf.org/proceedings/lrec2012/summaries/293.htmlGoogle ScholarGoogle Scholar
  6. Gabriela Grosseck and Carmen Holotescu. 2011. ACADEMIC RESEARCH IN 140 CHARACTERS OR LESS. eLearning & Software for Education (2011).Google ScholarGoogle Scholar
  7. Graeme Hirst and Olga Feiguina. 2007. Bigrams of Syntactic Labels for Authorship Discrimination of Short Texts. LLC 22, 4 (2007), 405--417.Google ScholarGoogle ScholarCross RefCross Ref
  8. George A Miller. 1995. WordNet: a lexical database for English. Commun. ACM 38, 11 (1995), 39--41. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Carsten Ullrich, Kerstin Borau, Heng Luo, Xiaohong Tan, Liping Shen, and Ruimin Shen. 2008. Why web 2.0 is good for learning and for research: principles and prototypes. In Proceedings of the 17th international conference on World Wide Web.ACM, 705--714. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Daniel Bär1 Torsten Zesch and Iryna Gurevych. 2012. Text reuse detection using a composition of text similarity measures. In Proceedings of COLING, Vol. 1. 167--184.Google ScholarGoogle Scholar

Index Terms

  1. Detecting plagiarism in micro-blogging social networks

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          TEEM 2017: Proceedings of the 5th International Conference on Technological Ecosystems for Enhancing Multiculturality
          October 2017
          723 pages
          ISBN:9781450353861
          DOI:10.1145/3144826

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 18 October 2017

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          TEEM 2017 Paper Acceptance Rate84of109submissions,77%Overall Acceptance Rate496of705submissions,70%
        • Article Metrics

          • Downloads (Last 12 months)4
          • Downloads (Last 6 weeks)2

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader