DOI:10.1145/3038884.3038904
research-article

Pashto Sentiment Analysis Using Lexical Features

Published: 22 November 2016

Abstract

Individuals use various platforms to express their opinions on products, services, political situations, and other events. Knowing these opinions is important for the individuals and organizations concerned, who can use them to devise future strategies that reflect what people want. This study focuses on extracting opinion from digital-born Pashto text. Multiple state-of-the-art classifiers were built by adapting the methodology of the message-level task in sentiment analysis of tweets. In addition, word-sentiment lexicons were generated by tokenizing sentences and by translating existing English lexicons. The findings show that Pashto sentiment analysis based on lexical features extracts sentiment with high accuracy.
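As a rough illustration of what "lexical features" can mean in this setting, the sketch below turns a tokenized sentence into a small feature dictionary using a word-sentiment lexicon. It is a minimal sketch under stated assumptions, not the authors' feature set: the two-entry `TOY_LEXICON`, its scores, the whitespace tokenizer, and the feature names are all hypothetical and chosen only for the example.

```python
from typing import Dict, List

# Toy word-sentiment lexicon. The entries and scores are illustrative
# assumptions, not taken from the paper's lexicons.
TOY_LEXICON: Dict[str, float] = {
    "ښه": 1.0,   # "good"
    "بد": -1.0,  # "bad"
}


def tokenize(sentence: str) -> List[str]:
    """Split on whitespace (an assumption; real Pashto text may need more care)."""
    return sentence.split()


def lexical_features(sentence: str, lexicon: Dict[str, float]) -> Dict[str, float]:
    """Summarize the lexicon scores of the tokens found in the sentence."""
    tokens = tokenize(sentence)
    scores = [lexicon[t] for t in tokens if t in lexicon]
    return {
        "n_tokens": float(len(tokens)),
        "n_positive": float(sum(1 for s in scores if s > 0)),
        "n_negative": float(sum(1 for s in scores if s < 0)),
        "total_score": float(sum(scores)),
        "max_score": max(scores, default=0.0),
        "min_score": min(scores, default=0.0),
    }


if __name__ == "__main__":
    print(lexical_features("ښه بد ښه", TOY_LEXICON))
```

A feature dictionary like this could then be converted to a numeric vector and passed to any standard classifier; which features and classifiers work best for Pashto is an empirical question that this sketch does not answer.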


Published In

MedPRAI-2016: Proceedings of the Mediterranean Conference on Pattern Recognition and Artificial Intelligence
November 2016
163 pages
ISBN:9781450348768
DOI:10.1145/3038884
General Chairs: Chawki Djeddi, Imran Siddiqi, Akram Bennour
Program Chairs: Youcef Chibani, Haikal El Abed
Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Pashto text
  2. lexical features
  3. sentiment analysis

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

MedPRAI-2016

Cited By

  • (2024) A Roman Urdu Corpus for sentiment analysis. The Computer Journal 67:9 (2864-2876). DOI:10.1093/comjnl/bxae052. Online publication date: 18-Jun-2024
  • (2024) Aspect-based sentiment analysis in Urdu language: resource creation and evaluation. Neural Computing and Applications. DOI:10.1007/s00521-024-10145-x. Online publication date: 28-Aug-2024
  • (2023) Analysis of Cursive Text Recognition Systems: A Systematic Literature Review. ACM Transactions on Asian and Low-Resource Language Information Processing 22:7 (1-30). DOI:10.1145/3592600. Online publication date: 13-Apr-2023
  • (2019) Sentiment Analysis in E-commerce Using SVM on Roman Urdu Text. Emerging Technologies in Computing (213-222). DOI:10.1007/978-3-030-23943-5_16. Online publication date: 14-Jul-2019
