Abstract
Identifying plagiarism of the document is a mandatory task in the academic domain. Generally online available tools are used to check plagiarism. These tools calculate similarity between the documents using a sequence of the tokens/words present in the documents which are to be compared. A semantic relationship between the words for eg., word and its synonym are treated as different, while calculating the similarity between the documents. Few tools may be available for checking the similarity of English documents. But checking the plagiarism of Marathi documents is comparatively untouched field. Information present in the Marathi language is growing due to multilingual processing. The existing MaPla (Marathi Plagiarism checker) proved that Document synset matrix for Marathi (DSMM) similarity results are near to readings observed using cognitive ability of humans and it was performed on 4 documents. To further confirm robustness of MaPla, we experimented with 24 documents to calculate the similarity between all pairs of documents using cosine measure. Thus two, 24 × 24 matrices are formulated using DSMM and manual readings. Paired t-test, which was not carried out in MaPla, proves that there is no significant difference between two matrices and hence proves the robustness of the proposed technique.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Naik, R.R., Landge, M.B., Mahender, C.N.: Development of marathi text corpus for plagiarism detection in the marathi language. Corpus 6, 340 (2011)
Lamba, H., Govilkar, S.: A survey on plagiarism detection techniques for indian regional languages. Int. J. Comput. Appl. 975, 8887 (2017)
Shenoy, N., Potey, M.A.: Semantic similarity search model for obfuscated plagiarism detection in Marathi language using Fuzzy and Naïve Bayes approaches IOSR. J. Comput. Eng. 18(3), 83–88 (2016)
Bafna P.B., Saini J.R.: MaPla: a marathi plagiarism checker using document synset matrix. Int. J. Adv. Sci. Technol. (2020). in press
Bafna P.B., Saini J.R.: Marathi text analysis using unsupervised learning and word cloud. Int. J. Eng. Adv. Technol. 9(3) (2020)
Naik, R.R., Landge, M.B.: Plagiarism detection in marathi language using semantic analysis. In: Scholarly Ethics and Publishing: Breakthroughs in Research and Practice, pp. 473–482. IGI Global (2019)
Al-Ayyoub, M., Nuseir, A., Alsmearat, K., Jararweh, Y., Gupta, B.: Deep learning for Arabic NLP: a survey. J. Comput. Sci. 26, 522–531 (2018)
Gupta, N., Mathur, P.: Spell Checking Techniques in NLP: A Survey (2012)
Khan, W., Daud, A., Nasir, J.A., Amjad, T.: A survey on the state-of-the-art machine learning models in the context of NLP. Kuwait J. Sci. 43(4) (2016)
Ranjan, N., Mundada, K., Phaltane, K., Ahmad, S.: A survey on techniques in NLP. Int. J. Comput. Appl. 134(8), 6–9 (2016). odelling, pa
Naik, R.R., Landge, M.B., Mahender, C.N.: Word level plagiarism detection of marathi text using N-Gram approach. In: International Conference on Recent Trends in Image Processing and Pattern Recognition, pp. 14–23. Springer, Singapore (2018)
Srivastava, S., Govilkar, S.: Paraphrase identification of marathi sentences. In: International Conference on Intelligent Data Communication Technologies and Internet of Things, pp. 534–544. Springer, Cham (2018); Intelligent Computing: Theory and Applications, pp. 797–806. Springer, Singapore (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Saini, J.R., Bafna, P.B. (2022). RoMaPla: Using t-Test for Evaluating Robustness of Marathi Plagiarism. In: Bhateja, V., Tang, J., Satapathy, S.C., Peer, P., Das, R. (eds) Evolution in Computational Intelligence. Smart Innovation, Systems and Technologies, vol 267. Springer, Singapore. https://doi.org/10.1007/978-981-16-6616-2_5
Download citation
DOI: https://doi.org/10.1007/978-981-16-6616-2_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-6615-5
Online ISBN: 978-981-16-6616-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)