Reference Hub

This research has been cited in:

Chapter
Deep Neural Network Models for Paraphrased Text Classification in the Arabic LanguageNatural Language Processing and Information Systems10.1007/978-3-030-23281-8_1
Article
BLSTM-API: Bi-LSTM Recurrent Neural Network-Based Approach for Arabic Paraphrase IdentificationArabian Journal for Science and Engineering10.1007/s13369-020-05320-w
Article
Heterogeneous Information Network-Based Content Caching in the Internet of VehiclesIEEE Transactions on Vehicular Technology10.1109/TVT.2019.2936792
Conference
HIN-VMReSys: Heterogeneous Information Network based Vehicle Music Recommendation System2019 IEEE International Conference on Signal, Information and Data Processing (ICSIDP)10.1109/ICSIDP47821.2019.9173063
Article
An external plagiarism detection system based on part-of-speech (POS) tag n-grams and word embeddingExpert Systems with Applications10.1016/j.eswa.2022.116677
Article
Heterogeneous information network-based music recommendation system in mobile networksComputer Communications10.1016/j.comcom.2019.12.002
Article
A Word to the Wise Analyzing the Impact of Textual Strategies in Determining House PricingJournal of Housing Research10.1080/10527001.2021.2013058
Conference
Arabic Semantic Textual Similarity Identification based on Convolutional Gated Recurrent Units2021 International Conference on INnovations in Intelligent SysTems and Applications (INISTA)10.1109/INISTA52262.2021.9548576
Article
Research on customer opinion summarization using topic mining and deep neural networkMathematics and Computers in Simulation10.1016/j.matcom.2020.12.009
Article
Semantic-Based Integrated Plagiarism Detection Approach for English DocumentsIETE Journal of Research10.1080/03772063.2021.2004383
Article
Automatic plagiarism detection in obfuscated textPattern Analysis and Applications10.1007/s10044-020-00882-9
Conference
Research on Paper Intelligent Plagiarism Detection Method Based on Idea TendencyProceedings of the 2019 2nd International Conference on Algorithms, Computing and Artificial Intelligence10.1145/3377713.3377786
Article
An effective text plagiarism detection system based on feature selection and SVM techniquesMultimedia Tools and Applications10.1007/s11042-023-15703-4

Latent Dirichlet Allocation and POS Tags Based Method for External Plagiarism Detection: LDA and POS Tags Based Plagiarism Detection

Ali Daud, Jamal Ahmad Khan, Jamal Abdul Nasir, Rabeeh Ayaz Abbasi, Naif Radi Aljohani, Jalal S. Alowibdi

Source Title: International Journal on Semantic Web and Information Systems (IJSWIS)14(3)

ISSN: 1552-6283|EISSN: 1552-6291|EISBN13: 9781522542926|DOI: 10.4018/IJSWIS.2018070103

Cite Article Cite Article

MLA

Daud, Ali, et al. "Latent Dirichlet Allocation and POS Tags Based Method for External Plagiarism Detection: LDA and POS Tags Based Plagiarism Detection." IJSWIS vol.14, no.3 2018: pp.53-69. http://doi.org/10.4018/IJSWIS.2018070103

APA

Daud, A., Khan, J. A., Nasir, J. A., Abbasi, R. A., Aljohani, N. R., & Alowibdi, J. S. (2018). Latent Dirichlet Allocation and POS Tags Based Method for External Plagiarism Detection: LDA and POS Tags Based Plagiarism Detection. International Journal on Semantic Web and Information Systems (IJSWIS), 14(3), 53-69. http://doi.org/10.4018/IJSWIS.2018070103

Chicago

Daud, Ali, et al. "Latent Dirichlet Allocation and POS Tags Based Method for External Plagiarism Detection: LDA and POS Tags Based Plagiarism Detection," International Journal on Semantic Web and Information Systems (IJSWIS) 14, no.3: 53-69. http://doi.org/10.4018/IJSWIS.2018070103

Export Reference

Favorite Full-Issue Download

View Full Text HTML

View Full Text PDF

Abstract

In this article we present a new semantic and syntactic-based method for external plagiarism detection. In the proposed approach, latent dirichlet allocation (LDA) and parts of speech (POS) tags are used together to detect plagiarism between the sample and a number of source documents. The basic hypothesis is that considering semantic and syntactic information between two text documents may improve the performance of the plagiarism detection task. Our method is based on two steps, naming, which is a pre-processing where we detect the topics from the sentences in documents using the LDA and convert each sentence in POS tags array; then a post processing step where the suspicious cases are verified purely on the basis of semantic rules. For two types of external plagiarism (copy and random obfuscation), we empirically compare our approach to the state-of-the-art N-gram based and stop-word N-gram based methods and observe significant improvements.

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.

Username or email:

Password:

Forgot individual login password?

Create individual account

Latent Dirichlet Allocation and POS Tags Based Method for External Plagiarism Detection: LDA and POS Tags Based Plagiarism Detection

MLA

APA

Chicago

Export Reference

Abstract

Request Access