skip to main content
10.1145/3128128.3128155acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicsdeConference Proceedingsconference-collections
research-article

Topic and sentiment model applied to the colloquial Arabic: a case study of Maghrebi Arabic

Published: 21 July 2017 Publication History

Abstract

Recently, the multiplication of communication and sharing platforms such as social networks, personal blogs, forums, etc., has facilitated the expression of views and opinions about products, personalities, and public policy. However, gathering these points of view is a complex task that requires resolution of many problems in different disciplines, especially issues related to our language. Among the research areas, topic modeling and sentiment analysis stimulates interest and curiosity of the scientific community. Lately, the current economic, geo-political and geostrategic trends have made researchers specifically more interested in Arabic language, except that the majority of these studies focus on the classical Arabic; nevertheless it is a language of the elites which is different from what is mainly used on the Web. Our paper focuses on Maghrebi colloquial Arabic since the little research that exists in this area is limited to East colloquial Arabic. On a corpus extracted from different Facebook pages we implemented a supervised approach to extract the sentiments, and an unsupervised approach to extract topic, then we proposed a new, semi-supervised, approach in the Arabic language that combines the topic and the sentiment in a single model, in order to join each topic to a specific sentiment.

References

[1]
Pang, B., & Lee, L.2008. Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval, (2008), 1--135.
[2]
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., & Stede, M. 2011. Lexicon-based methods for sentiment analysis. Computational linguistics, (2011). 267--307.
[3]
Rushdi Saleh, M., Martín Valdivia, M. T., Ureña López, L. A., & Perea Ortega, J. M. 2011 . OCA: Opinion corpus for Arabic. Journal of the American Society for Information Science and Technology, (2011), 2045--2054.
[4]
Farra, N., Challita, E., Assi, R. A., & Hajj, H. 2010. Sentence-level and document-level sentiment mining for arabic texts. In Data Mining Workshops (ICDMW), 2010 IEEE International Conference, (December 2010). 1114--1119.
[5]
Refaee, Eshrag, and Verena Rieser.2014. An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis.
[6]
Mourad, Ahmed, and Kareem Darwish. 2013. Subjectivity and sentiment analysis of modern standard Arabic and Arabic microblogs. Proceedings of the 4th workshop on computational approaches to subjectivity, sentiment and social media analysis. (2013)
[7]
Shoukry, Amira, and Ahmed Rafea. 2012. Sentence-level Arabic sentiment analysis. "Collaboration Technologies and Systems (CTS), 2012 International Conference on. IEEE, (2012).
[8]
Abdul-Mageed, Muhammad, Mona Diab, and Sandra Kübler.2014. SAMAR: Subjectivity and sentiment analysis for Arabic social media. Computer Speech & Language 28.1 (2014): 20--37.
[9]
Abdul-Mageed, Muhammad, Mona T. Diab, and Mohammed Korayem. 2011. Subjectivity and sentiment analysis of modern standard arabic." Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers-Volume 2. Association for Computational Linguistics, (2011).
[10]
Bautin, Mikhail, Lohit Vijayarenu, and Steven Skiena. 2008. International Sentiment Analysis for News and Blogs." ICWSM. (2008)
[11]
Hoffmann, T.1999. Probabilistic latent semantic indexing. In Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, (August 1999). 50--57.
[12]
Brahmi, A., Ech-Cherif, A., & Benyettou, A. 2012. Arabic texts analysis for topic modeling evaluation. Information retrieval, (2012), 33--53.
[13]
Brants, T., Chen, F., & Farahat, A. 2002. Arabic document topic analysis. In Proceedings of the Post Workshop of LREC. (June 2002)
[14]
Brahmi, A., Ech-Cherif, A., & Benyettou, A. 2013. An arabic lemma-based stemmer for latent topic modeling. Int. Arab J. Inf. Technol., (2013), 160--168.
[15]
Zrigui, M., Ayadi, R., Mars, M., & Maraoui, M. 2012. Arabic text classification framework based on latent dirichlet allocation. CIT. Journal of Computing and Information Technology, (2012), 125--140.
[16]
Porteous, I., Newman, D., Ihler, A., Asuncion, A., Smyth, P., & Welling, M. 2008. Fast collapsed gibbs sampling for latent dirichlet allocation. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, (August 2008), 569--577
[17]
Pang, B., & Lee, L. 2004. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd annual meeting on Association for Computational Linguistics, (July 2004), 271.
[18]
Vapnik, V. N., & Vapnik, V. 1998. Statistical learning theory (Vol. 1). New York: Wiley.(1998)
[19]
Melville, P., Gryc, W., & Lawrence, R. D. 2009. Sentiment analysis of blogs by combining lexical knowledge with text classification. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, (June 2009), 1275--1284.
[20]
Jo, Y., & Oh, A. H. 2011. Aspect and sentiment unification model for online review analysis. In Proceedings of the fourth ACM international conference on Web search and data mining, (February 2011), 815--824
[21]
Al-Kabi, M. N., Abdulla, N. A., & Al-Ayyoub, M. 2013. An analytical study of arabic sentiments: Maktoob case study. In : Internet Technology and Secured Transactions (ICITST), 2013 8th International Conference for. IEEE, 2013. p. 89--94.

Cited By

View all
  • (2022)Opinion Mining using Sentiment Analysis and Topic Modelling for an Entertainment Event2022 Fifth National Conference of Saudi Computers Colleges (NCCC)10.1109/NCCC57165.2022.10067654(1-6)Online publication date: 17-Dec-2022
  • (2022)TunTap: A Tunisian Dataset for Topic and Polarity Extraction in Social MediaComputational Collective Intelligence10.1007/978-3-031-16014-1_40(507-519)Online publication date: 28-Sep-2022
  • (2022)Arabic Topic Modeling-Based Sentiment Analysis on COVID-19 Feedback CommentsAdvances in Information, Communication and Cybersecurity10.1007/978-3-030-91738-8_9(87-95)Online publication date: 12-Jan-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICSDE '17: Proceedings of the 2017 International Conference on Smart Digital Environment
July 2017
245 pages
ISBN:9781450352819
DOI:10.1145/3128128
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 July 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Maghrebi Arabic
  2. colloquial Arabic
  3. latent dirichlet allocution
  4. naive Bayes
  5. sentiment analysis
  6. topic modeling

Qualifiers

  • Research-article

Conference

ICSDE '17

Acceptance Rates

ICSDE '17 Paper Acceptance Rate 36 of 139 submissions, 26%;
Overall Acceptance Rate 68 of 219 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Opinion Mining using Sentiment Analysis and Topic Modelling for an Entertainment Event2022 Fifth National Conference of Saudi Computers Colleges (NCCC)10.1109/NCCC57165.2022.10067654(1-6)Online publication date: 17-Dec-2022
  • (2022)TunTap: A Tunisian Dataset for Topic and Polarity Extraction in Social MediaComputational Collective Intelligence10.1007/978-3-031-16014-1_40(507-519)Online publication date: 28-Sep-2022
  • (2022)Arabic Topic Modeling-Based Sentiment Analysis on COVID-19 Feedback CommentsAdvances in Information, Communication and Cybersecurity10.1007/978-3-030-91738-8_9(87-95)Online publication date: 12-Jan-2022
  • (2021)Generate a list of Stop Words in Moroccan Dialect from Social Network Data Using Word Embedding2021 International Conference on Digital Age & Technological Advances for Sustainable Development (ICDATA)10.1109/ICDATA52997.2021.00022(66-73)Online publication date: Jun-2021
  • (2020)Language resources for Maghrebi Arabic dialects’ NLP: a surveyLanguage Resources and Evaluation10.1007/s10579-020-09490-9Online publication date: 25-Apr-2020
  • (2020)Sentiment Analysis in Google Play Store: Algerian Reviews CaseModelling and Implementation of Complex Systems10.1007/978-3-030-58861-8_8(107-121)Online publication date: 6-Sep-2020
  • (2019)A Survey of Opinion Mining in ArabicACM Transactions on Asian and Low-Resource Language Information Processing10.1145/329566218:3(1-52)Online publication date: 7-May-2019
  • (2019)A review on Arabic Sentiment Analysis: State-of-the-Art, Taxonomy and Open Research ChallengesIEEE Access10.1109/ACCESS.2019.2951530(1-1)Online publication date: 2019
  • (2018)Topic Detection Approaches in Identifying Topics and Events from Arabic CorporaProcedia Computer Science10.1016/j.procs.2018.10.492142(270-277)Online publication date: 2018
  • (2018)Opinion Mining in Social Networks for Algerian DialectInformation Processing and Management of Uncertainty in Knowledge-Based Systems. Applications10.1007/978-3-319-91479-4_52(629-641)Online publication date: 18-May-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media