Skip to main content

Classifying Political Arabic Articles Using Support Vector Machine with Different Feature Extraction

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1174))

Abstract

In the recent years, the number of web logs, and the amount of opinionated data on the World Wide Web, have been grown substantially. The ability to determine the political orientation of an article automatically can be beneficial in many areas from academia to security. However, the sentiment classification of web log posts (political web log posts in particular), is apparently more complex than the sentiment classification of conventional text. In this paper, a supervised machine learning with two feature extraction techniques Term Frequency (TF) and Term Frequency-Inverse Document Frequency (TF-IDF) are used for the classification process. For investigation, SVM with four kernels for supervised machine learning have been employed. Subsequent to testing, the results reveal that the linear with TF achieved the results in accuracy of 91.935% also with TF-IDF achieved the 95.161%. The linear kernel was deemed the most suitable for our model.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends® Inf. Retr. 2(1–2), 1–135 (2008)

    Article  Google Scholar 

  2. Liu, B.: Sentiment analysis and subjectivity. In: Handbook of Natural Language Processing, vol. 2, no. 2010, pp. 627–666 (2010)

    Google Scholar 

  3. Liu, B.: Sentiment analysis and opinion mining. In: Synthesis Lectures on Human Language Technologies, vol. 5, no. 1, pp. 1–167 (2012)

    Article  Google Scholar 

  4. Tan, C., Lee, L., Tang, J., Jiang, L., Zhou, M., Li, P.: User-level sentiment analysis incorporating social networks. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1397–1405. ACM (2011)

    Google Scholar 

  5. Xia, R., Zong, C., Li, S.: Ensemble of feature sets and classification algorithms for sentiment classification. Inf. Sci. 181(6), 1138–1152 (2011)

    Article  Google Scholar 

  6. Prabowo, R., Thelwall, M.: Sentiment analysis: a combined approach. J. Inform. 3(2), 143–157 (2009)

    Article  Google Scholar 

  7. Brahimi, B., Touahria, M., Tari, A.: Data and text mining techniques for classifying Arabic tweet polarity. J. Dig. Inf. Manag. 14(1), 15–25 (2016)

    Google Scholar 

  8. Catal, C., Nangir, M.: A sentiment classification model based on multiple classifiers. Appl. Soft Comput. 50, 135–141 (2017)

    Article  Google Scholar 

  9. Balazs, J.A., Velásquez, J.D.: Opinion mining and information fusion: a survey. Inf. Fusion 27, 95–110 (2016)

    Article  Google Scholar 

  10. Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5(4), 1093–1113 (2014)

    Article  Google Scholar 

  11. Ravi, K., Ravi, V.: A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl.-Based Syst. 89, 14–46 (2015)

    Article  Google Scholar 

  12. Zhou, S., Chen, Q., Wang, X.: Fuzzy deep belief networks for semi-supervised sentiment classification. Neurocomputing 131, 312–322 (2014)

    Article  Google Scholar 

  13. Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424. Association for Computational Linguistics (2002)

    Google Scholar 

  14. Sindhwani, V., Melville, P.: Document-word co-regularization for semi-supervised sentiment analysis. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 1025–1030. IEEE (2008)

    Google Scholar 

  15. Nasukawa, T., Yi, J.: Sentiment analysis: capturing favorability using natural language processing. In: Proceedings of the 2nd International Conference on Knowledge Capture, pp. 70–77. ACM (2003)

    Google Scholar 

  16. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)

    Google Scholar 

  17. Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd annual meeting on Association for Computational Linguistics, p. 271. Association for Computational Linguistics (2004)

    Google Scholar 

  18. Hearst, M.A.: Direction-based text interpretation as an information access refinement. In: Text-Based Intelligent Systems: Current Research and Practice in Information Extraction and Retrieval, pp. 257–274 (1992)

    Google Scholar 

  19. Sack, W.: On the computation of point of view. In: AAAI, p. 1488 (1994)

    Google Scholar 

  20. Huettner, A., Subasic, P.: Fuzzy typing for document management. In: ACL 2000 Companion Volume: Tutorial Abstracts and Demonstration Notes, pp. 26–27 (2000)

    Google Scholar 

  21. Das, S., Chen, M.: Yahoo! for Amazon: extracting market sentiment from stock message boards. In: Proceedings of the Asia Pacific Finance Association Annual Conference (APFA), Bangkok, Thailand, vol. 35, p. 43 (2001)

    Google Scholar 

  22. Wang, Z., Tong, V.J.C., Ruan, P., Li, F.: Lexicon knowledge extraction with sentiment polarity computation. In: 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW), pp. 978–983. IEEE (2016)

    Google Scholar 

  23. Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, pp. 174–181. Association for Computational Linguistics (1997)

    Google Scholar 

  24. Turney, P.D., Littman, M.L.: Unsupervised learning of semantic orientation from a hundred-billion-word corpus. arXiv preprint cs/0212012 (2002)

    Google Scholar 

  25. Oussous, A., Lahcen, A.A., Belfkih, S.: Impact of text pre-processing and ensemble learning on Arabic sentiment analysis. In: Proceedings of the 2nd International Conference on Networking, Information Systems & Security, p. 65. ACM (2019)

    Google Scholar 

  26. Hardeniya, N., Perkins, J., Chopra, D., Joshi, N., Mathur, I.: Natural Language Processing: Python and NLTK. Packt Publishing Ltd., Birmingham (2016)

    Google Scholar 

  27. Mustafa, M., Eldeen, A.S., Bani-Ahmad, S., Elfaki, A.O.: A comparative survey on arabic stemming: approaches and challenges. Intell. Inf. Manag. 9(02), 39 (2017)

    Google Scholar 

  28. Abooraig, R., Al-Zu’bi, S., Kanan, T., Hawashin, B., Al Ayoub, M., Hmeidi, I.: Automatic categorization of Arabic articles based on their political orientation. Digit. Investig. 25, 24–41 (2018)

    Article  Google Scholar 

  29. Taghva, K., Elkhoury, R., Coombs, J.: Arabic stemming without a root dictionary. In: International Conference on Information Technology: Coding and Computing (ITCC 2005)-Volume II, vol. 1, pp. 152–157. IEEE (2005)

    Google Scholar 

  30. Aggarwal, C.C., Zhai, C.: A survey of text classification algorithms. In: Aggarwal, C., Zhai, C. (eds.) Mining Text Data, pp. 163–222. Springer, Boston (2012). https://doi.org/10.1007/978-1-4614-3223-4_6

    Chapter  Google Scholar 

  31. Alowaidi, S., Saleh, M., Abulnaja, O.: Semantic sentiment analysis of arabic texts. Int. J. Adv. Comput. Sci. Appl. 8(2), 256–262 (2017)

    Google Scholar 

  32. Abd, D.H., Sadiq, A.T., Abbas, A.R.: A new framework for automatic extraction polarity and target of articles (2019)

    Google Scholar 

  33. Deng, N., Tian, Y.: Support Vector Machines: A New Method in Data Mining. Science Press, Beijing (2004)

    Google Scholar 

  34. Al-Mejibli, I.S., Abd, D.H., Alwan, J.K., Rabash, A.J.: Performance evaluation of kernels in support vector machine. In: 2018 1st Annual International Conference on Information and Sciences (AiCIS), pp. 96–101. IEEE (2018)

    Google Scholar 

  35. Khalaf, M., et al.: An application of using support vector machine based on classification technique for predicting medical data sets. In: Huang, D.-S., Jo, K.-H., Huang, Z.-K. (eds.) ICIC 2019. LNCS, vol. 11644, pp. 580–591. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-26969-2_55

    Chapter  Google Scholar 

  36. Khalaf, M., et al.: Recurrent neural network architectures for analysing biomedical data sets. In: 2017 10th International Conference on Developments in eSystems Engineering (DeSE), pp. 232–237. IEEE (2017)

    Google Scholar 

  37. Abd, D.H., Alwan, J.K., Ibrahim, M., Naeem, M.B.: The utilisation of machine learning approaches for medical data classification and personal care system management for sickle cell disease. In: Annual Conference on New Trends in Information & Communications Technology 2017 (2017)

    Google Scholar 

  38. Mayoraz, E., Alpaydin, E.: Support vector machines for multi-class classification. In: Mira, J., Sánchez-Andrés, Juan V. (eds.) IWANN 1999. LNCS, vol. 1607, pp. 833–842. Springer, Heidelberg (1999). https://doi.org/10.1007/BFb0100551

    Chapter  Google Scholar 

Download references

Acknowledgments

The authors would like to thank Al-Maarif University College and Dr. Falah Mubark Bardan for supporting this research.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Dhafar Hamed Abd , Ahmed T. Sadiq or Ayad R. Abbas .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Abd, D.H., Sadiq, A.T., Abbas, A.R. (2020). Classifying Political Arabic Articles Using Support Vector Machine with Different Feature Extraction. In: Khalaf, M., Al-Jumeily, D., Lisitsa, A. (eds) Applied Computing to Support Industry: Innovation and Technology. ACRIT 2019. Communications in Computer and Information Science, vol 1174. Springer, Cham. https://doi.org/10.1007/978-3-030-38752-5_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-38752-5_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-38751-8

  • Online ISBN: 978-3-030-38752-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics