skip to main content
research-article

An Intelligent Unsupervised Approach for Handling Context-Dependent Words in Urdu Sentiment Analysis

Published:29 April 2022Publication History
Skip Abstract Section

Abstract

The characteristic of context dependency in Urdu words needs to be handled carefully while performing Urdu sentiment analysis. In this research, an already constructed Urdu sentiment lexicon of positive and negative words is further expanded by the addition of context-dependent words. These context-dependent words are used with or without conjunctions. Rules are formulated for assigning polarities to those context-dependent words that are surrounded by the positive or negative words. These rules were incorporated in the Urdu sentiment analyzer. Fusion of these rules for handling context-dependent words and the expanded Urdu sentiment lexicon resulted in increasing the accuracy of the Urdu sentiment analyzer from 83.43% to 89.03% with 0.8655 precision, 0.9053 recall, and 0.8799 F-measure, which is a statistically significant improvement.

REFERENCES

  1. [1] Cambria E.. 2016. Affective computing and sentiment analysis. IEEE Intelligent Systems 31, 2 (2016), 102107.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Schouten K. and Frasincar F.. 2016. Survey on aspect-level sentiment analysis. IEEE Transactions on Knowledge and Data Engineering 28, 3 (2016), 813829.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. [3] Liu B.. 2012. Sentiment Analysis and Opinion Mining. Morgan & Claypool.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Poria S., Cambria E., and Gelbukh A.. 2016. Aspect extraction for opinion mining with a deep convolutional neural network. Knowledge-Based Systems 108 (2016), 4249.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Yu Y., Lin H., Meng J., and Zhao Z.. 2016. Visual and textual sentiment analysis of a microblog using deep convolutional neural networks. Algorithms 9, 2 (2016), 41.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Farra N., Challita E., Assi R. A., and Hajj H.. 2010. Sentence-level and document-level sentiment mining for arabic texts. In Proceedings of the 2010 IEEE International Conference on Data Mining Workshops (ICDMW’10).Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Turney P. D.. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL’02). 417424.Google ScholarGoogle Scholar
  8. [8] Khattak A., Asghar M. Z., Saeed A., Hameed I. A., Hassan S. A., and Ahmad S.. 2021. A survey on sentiment analysis in Urdu: A resource-poor language. Egyptian Informatics Journal 22, 1 (2021), 5374.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Jan A. U., Khan M. A., and Mukhtar. N. 2020. Opinion mining and summarization: A comprehensive review. Journal of Communication Technologies and Robotics Applications 11, 1 (2020), 7696.Google ScholarGoogle Scholar
  10. [10] Boia M., Musat C. C., and Faltings B.. 2014. Acquiring commonsense knowledge for sentiment analysis through human computation. In Proceedings of the 23rd International Conference on World Wide Web (WWW’14). 225226.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Zhang Z., D. Miao, and Yuan B.. 2014. Context-Dependent Sentiment Classification Using Antonym Pairs and Double Expansion. Springer International.Google ScholarGoogle Scholar
  12. [12] Kansal H. and Toshniwal D.. 2014. Aspect based summarization of context dependent opinion words. Procedia Computer Science 35 (2014), 166175.Google ScholarGoogle Scholar
  13. [13] Wilson T., Wiebe J., and Hoffmann. P. 2009. Recognizing contextual polarity: An exploration of features for phrase-level sentiment analysis. Computational Linguistics 35, 3 (2009), 399433.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Garg S. and Sharma. D. K. 2016. Sentiment classification of context dependent words. In Proceedings of the International Conference on ICT for Sustainable Development.Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Liu B.. 2010. Sentiment analysis and subjectivity. In Handbook of Natural Language Processing (2nd ed.), Nitin Indurkhya and Fred J. Damerau (Eds.). CRC Press, Boca Raton, FL, 627666.Google ScholarGoogle Scholar
  16. [16] Pang B. and Lee L.. 2008. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval 2, 1–2 (2008), 1135.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] Mukhtar N., Khan M. A., and Chiragh N.. 2018. Lexicon-based approach outperforms supervised machine learning approach for Urdu sentiment analysis in multiple domains. Telematics and Informatics 35, 8 (2018), 21732183.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Mukhtar N., Khan M. A., Chiragh N., and Nazir S.. Identification and handling of intensifiers for enhancing accuracy of Urdu sentiment analysis. Expert Systems 35, 6 (2018), 112.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Mukhtar N. and Khan M. A. 2018. Urdu sentiment analysis using supervised machine learning approach. International Journal of Pattern Recognition and Artificial Intelligence 32, 1 (2018), 18510011851015.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Yi J., Nasukawa T., Bunescu R., and Niblack W.. 2003. Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In Proceedings of the 3rd IEEE International Conference on Data Mining. IEEE, Los Alamitos, CA.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Wilson T., Wiebe J., and Hoffmann P.. 2005. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT’05).Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] Ding X., Liu B., and Yu P. S. 2008. A holistic lexicon-based approach to opinion mining. In Proceedings of the 2008 International Conference on Web Search and Data Mining (WSDM ’08).Google ScholarGoogle Scholar
  23. [23] Yang S.-C. and Liu M.-J.. 2010. YSC-DSAA: An approach to disambiguate sentiment ambiguous adjectives based on SAAOL. In Proceedings of the 5th International Workshop on Semantic Evaluation. 440443.Google ScholarGoogle Scholar
  24. [24] Xu R., Xu J. and Kit C.. 2010. HITSZ_CITYU: Combine collocation, context words and neighboring sentence sentiment in sentiment adjectives disambiguation. In Proceedings of the 5th International Workshop on Semantic Evaluation. 448451.Google ScholarGoogle Scholar
  25. [25] Xia Y., Cambria E., Hussain A., and Zhao H.. 2015. Word polarity disambiguation using Bayesian model and opinion-level features. Cognitive Computation 7 (2015), 369380.Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Martineau J. and Finin T.. 2009. Delta TFIDF: An improved feature space for sentiment analysis. In Proceedings of the 3rd AAAI International Conference on Weblogs and Social Media.Google ScholarGoogle Scholar
  27. [27] Lin Y., Zhang J., Wang X., and Zhou A.. 2012. An information theoretic approach to sentiment polarity classification. In Proceedings of the 2nd Joint WICOW/AIRWeb Workshop on Web Quality (WebQuality’12). ACM, New York, NY, 3540.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Saad F. and Mathiak B.. 2013. Revised mutual information approach for German text sentiment classification. In Proceedings of the 22nd International Conference on World Wide Web Companion.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Chaoticity.com. 2016. Urdu Sentiment Lexicon. Retrieved July 5, 2016 from http://chaoticity.com/urdusentimentlexicon/.Google ScholarGoogle Scholar
  30. [30] Data Science Lab. 2016. Multilingualsentiment. Retrieved July 15, 2016 from https://sites.google.com/site/datascienceslab/projects/multilingualsentiment.Google ScholarGoogle Scholar
  31. [31] Urdu lughat. 2016. XXX. Retrieved May 20, 2016 from http://urdulughat.info/.Google ScholarGoogle Scholar
  32. [32] Center for Language Engineering. 2016. News. Retrieved July 8, 2016 from http://www.cle.org.pk/.Google ScholarGoogle Scholar
  33. [33] Center for Language Engineering. 2016. POS tagset. Retrieved June 2016 from http://www.cle.org.pk/software/langproc/POStagset.htm/Google ScholarGoogle Scholar
  34. [34] XXX. XXX. XXX. Retrieved XXX from https://urdusentidata.wordpress.com//2019.Google ScholarGoogle Scholar
  35. [35] Dietterich T. G.. 1998. Approximate statistical tests for comparing supervised classification learning algorithms. Neural Computation 10 (1998), 18951923.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. [36] Bostanci B. and Bostanci E.. 2013. An evaluation of classification algorithms using Mc Nemar's test. Advances in Intelligent Systems and Computing 201 (2013), 1526.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Hassan M. and Shoaib M.. 2018. Opinion within opinion: Segmentation approach for Urdu sentiment analysis. International Arab Journal of Information Technology 15, 1 (2018), 2128.Google ScholarGoogle Scholar
  38. [38] Asghar M. Z., Khan A., Ahmad S., Qasim M., and Khan I. A.. 2017. Lexicon-enhanced sentiment analysis framework using rule-based classification scheme. PLoS One 12, 2 (2017), 122.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] Syed A. Z., Muhammad A., and Enríquez A. M. M. 2011. Adjectival phrases as the sentiment carriers in Urdu. Journal of American Science 7, 3 (2011), 644652.Google ScholarGoogle Scholar
  40. [40] Daud M., Khan R., and Duad A. R. 2014. Urdu opinion mining system (RUOMiS). Computer Science & Engineering: An International Journal 4, 6 (2014), 19.Google ScholarGoogle ScholarCross RefCross Ref
  41. [41] Syed A. Z., Muhammad A., and Enríquez A. M. M. 2014. Associating targets with SentiUnits: A step forward in sentiment analysis of Urdu text. Artificial Intelligence Review 41, 4 (2014), 535561.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. [42] Rehman Z. U. and Bajwa I. S.. 2016. Lexicon-based sentiment analysis for Urdu language. In Proceedings of the 6th International Conference on Innovative Computing Technology (INTECH’16).Google ScholarGoogle ScholarCross RefCross Ref
  43. [43] Asghar M. Z., Sattar A., Khan A., Ali A., Kundi F. M., and Ahmad S.. 2019. Creating sentiment lexicon for sentiment analysis in Urdu: The case of a resource-poor language. Expert Systems 36, 3 (2019), e12397.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. An Intelligent Unsupervised Approach for Handling Context-Dependent Words in Urdu Sentiment Analysis

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 21, Issue 5
      September 2022
      486 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3533669
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 April 2022
      • Online AM: 21 March 2022
      • Accepted: 1 January 2022
      • Revised: 1 May 2021
      • Received: 1 July 2020
      Published in tallip Volume 21, Issue 5

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format