skip to main content
10.1145/3575882.3575955acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesic3inaConference Proceedingsconference-collections
research-article

SSTI: Semantic Similarity to detect Novelty of Thesis Ideas

Published:27 February 2023Publication History

ABSTRACT

The act of plagiarism in a thesis can decrease the quality of a student’s thesis. The plagiarism is perhaps unintentional. One form of this plagiarism in the thesis is the similarity of ideas in terms of topics, methods and cases used. Detecting the similarity of ideas has its difficulties because it must be able to understand the context of a document. This research implements a similar idea detection framework with a semantic similarity approach. Calculating the similarity value between the thesis proposal and the crawled data is carried out by considering the cluster distance and the hierarchical structure in the knowledge base. Comparative data were obtained from open portal publications. The evaluation results return accuracy is over 100 data testing.

References

  1. Dewi. 2020. The Evaluation of Semantic Mapping. In Journal of Physics: Conference Series, Vol. 1500. IOP Publishing, 012101.Google ScholarGoogle Scholar
  2. Dewi and Yuni Susanti. 2020. Improving Graph-Based Summarization with HTML Tag and Metadata Features.Engineering Letters 28, 2 (2020).Google ScholarGoogle Scholar
  3. Dewi W and Josef Kiing. 2014. Semantic mapping relational to graph model. In 2014 International Conference on Computer, Control, Informatics and Its Applications (IC3INA). IEEE, 160–165.Google ScholarGoogle Scholar
  4. Eddra Adhim and Dewi. 2021. Improving the Result of Question Answering System with Semantic Similarity Method Based on Hierarchy in Ontology. In The 2021 International Conference on Computer, Control, Informatics and Its Applications. 97–103.Google ScholarGoogle Scholar
  5. Yeshwanth Balachander and Teng-Sheng Moh. 2018. Ontology Based Similarity for Information Technology Skills. In Proceedings of the 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (Barcelona, Spain) (ASONAM ’18). IEEE Press, 302–305.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Jun-Peng Bao, Jun-Yi Shen, Xiao-Dong Liu, Hai-Yan Liu, and Xiao-Di Zhang. 2004. Finding plagiarism based on common semantic sequence model. In International Conference on Web-Age Information Management. Springer, 640–645.Google ScholarGoogle ScholarCross RefCross Ref
  7. Fabio Benedetti, Domenico Beneventano, Sonia Bergamaschi, and Giovanni Simonini. 2019. Computing inter-document similarity with context semantic analysis. Information Systems 80(2019), 136–147.Google ScholarGoogle ScholarCross RefCross Ref
  8. Imam Cartealy and Li Liao. 2019. Metabolic Pathway Membership Inference Using an Ontology-Based Similarity Approach. In Proceedings of the 2019 8th International Conference on Bioinformatics and Biomedical Science (Beijing, China) (ICBBS 2019). Association for Computing Machinery, New York, NY, USA, 97–102. https://doi.org/10.1145/3369166.3369174Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Robert Dale. 2018. Text analytics APIs, part 2: The smaller players. Natural Language Engineering 24, 5 (2018), 797–803.Google ScholarGoogle ScholarCross RefCross Ref
  10. Sunil Dhakal, Lu Zhang, and Xuan Lv. 2022. Ontology-based semantic modelling to support knowledge-based document classification on disaster-resilient construction practices. International Journal of Construction Management 22, 11(2022), 2059–2078.Google ScholarGoogle ScholarCross RefCross Ref
  11. Rocky Yefrenes Dillak, Folkes Laumal, and Ledy Jen Kadja. 2016. Sistem Deteksi Dini Plagiarisme Tugas Akhir Mahasiswa Menggunakan Algoritma Ngrams Dan Winnowing. Jurnal Ilmiah Flash 2, 1 (2016), 12–18.Google ScholarGoogle ScholarCross RefCross Ref
  12. Taiseer Abdalla Elfadil Eisa, Naomie Salim, and Abdelzahir Abdelmaboud. 2019. Content-based scientific figure plagiarism detection using semantic mapping. In International Conference of Reliable Information and Communication Technology. Springer, 420–427.Google ScholarGoogle Scholar
  13. Marc Franco-Salvador, Parth Gupta, and Paolo Rosso. 2013. Cross-language plagiarism detection using a multilingual semantic network. In European Conference on Information Retrieval. Springer, 710–713.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Marc Franco-Salvador, Parth Gupta, and Paolo Rosso. 2013. Knowledge graphs as context models: Improving the detection of cross-language plagiarism with paraphrasing. In PROMISE Winter School. Springer, 227–236.Google ScholarGoogle Scholar
  15. Tirthankar Ghosal, Vignesh Edithal, Asif Ekbal, Pushpak Bhattacharyya, George Tsatsaronis, and Srinivasa Satya Sameer Kumar Chivukula. 2018. Novelty goes deep. A deep neural solution to document level novelty detection. In Proceedings of the 27th international conference on Computational Linguistics. 2802–2813.Google ScholarGoogle Scholar
  16. Sébastien Harispe, Sylvie Ranwez, Stefan Janaqi, and Jacky Montmain. 2015. Semantic similarity from natural language and ontology analysis. Synthesis Lectures on Human Language Technologies 8, 1(2015), 1–254.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Rebecca Moore Howard. 2000. The ethics of plagiarism. The ethics of writing instruction: Issues in theory and practice 4 (2000), 79–89.Google ScholarGoogle Scholar
  18. Niraj Kumar. 2014. A graph based automatic plagiarism detection technique to handle artificial word reordering and paraphrasing. In International Conference on Intelligent Text Processing and Computational Linguistics. Springer, 481–494.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Sushil Kumar and Komal Kumar Bhatia. 2020. Semantic similarity and text summarization based novelty detection. SN Applied Sciences 2, 3 (2020), 1–15.Google ScholarGoogle Scholar
  20. Yinglong Ma, Peng Zhang, and Jiangang Ma. 2018. An ontology driven knowledge block summarization approach for Chinese judgment document classification. IEEE Access 6(2018), 71327–71338.Google ScholarGoogle ScholarCross RefCross Ref
  21. Tereza Novotná. 2020. Document similarity of czech supreme court decisions. Masaryk UJL & Tech. 14(2020), 105.Google ScholarGoogle ScholarCross RefCross Ref
  22. RVSVS Ram, Efstathios Stamatatos, and Sobha Lalitha Devi. 2014. Identification of plagiarism using syntactic and semantic filters. In International Conference on Intelligent Text Processing and Computational Linguistics. Springer, 495–506.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Arup Sarkar, Ujjal Marjit, and Utpal Biswas. 2014. A conceptual model to develop an advanced plagiarism checking tool based on semantic matching. In 2014 2nd International Conference on Business and Information Management (ICBIM). IEEE, 104–108.Google ScholarGoogle ScholarCross RefCross Ref
  24. Niloofer Shanavas, Hui Wang, Zhiwei Lin, and Glenn Hawe. 2020. Ontology-based enriched concept graphs for medical document classification. Information Sciences 525(2020), 172–181.Google ScholarGoogle ScholarCross RefCross Ref
  25. Manjula K Shenoy, KC Shet, and U Dinesh Acharya. 2012. Semantic plagiarism detection system using ontology mapping. Advanced Computing 3, 3 (2012), 59.Google ScholarGoogle Scholar
  26. Yasunao Takano, Yusuke Iijima, Kou Kobayashi, Hiroshi Sakuta, Hiroki Sakaji, Masaki Kohana, and Akio Kobayashi. 2019. Improving document similarity calculation using cosine-similarity graphs. In International Conference on Advanced Information Networking and Applications. Springer, 512–522.Google ScholarGoogle Scholar

Index Terms

  1. SSTI: Semantic Similarity to detect Novelty of Thesis Ideas

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      IC3INA '22: Proceedings of the 2022 International Conference on Computer, Control, Informatics and Its Applications
      November 2022
      415 pages
      ISBN:9781450397902
      DOI:10.1145/3575882

      Copyright © 2022 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 February 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)12
      • Downloads (Last 6 weeks)1

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format