Skip to main content
Log in

An adaptable scheme to enhance the sentiment classification of Telugu language

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Nowadays, the big data is ruling the entire digital world with its applications and facilities. Thus, to run the online services in better way, some of the machine learning models are utilized, also the machine learning strategy is became a trending field in big data; hence the success of online services or business is based upon the customer reviews. Almost the review contains neutral, positive, and negative sentiment value; Manual classification of sentiment value is a difficult task so that the natural language processing (NLP) scheme is used which is processed using machine learning strategy. Moreover, the part of speech specification for different languages is difficult. To overcome this issue, the current research aims to develop a novel less error pruning-shortest description length (LEP-SDL) for error pruning and ant lion boosting model (ALBM) for opinion specification purpose. Here, the Telugu news review dataset adopted to process the sentiment analysis in NLP. Furthermore, the fitness function of ant lion model in boosting approach improves the accuracy and precision of opinion specification also makes the classification process easier. Thus, to evaluate the competence of the projected model, it is evaluated with recent existing works in terms of accuracy, precision, etc., and achieved better results by obtaining high accuracy and precision of opinion specification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Bloh J, Broekel T, Özgun B, Sternberg R (2019) New (s) data for entrepreneurship research? An innovative approach to use big data on media coverage. Small Bus Econ,pp 1–22. https://doi.org/10.1007/s11187-019-00209-x

  • Choudhary N, Singh R, Bindlish I, Shrivastava M (2018) Sentiment analysis of code-mixed languages leveraging resource rich languages. arXiv preprint.

  • Cunningham H (2002) GATE: A framework and graphical development environment for robust NLP tools and applications. In: Proceedings of 40th annual meeting of the association for computational linguistics, pp 168–175.

  • Dadvar M, Hauff C, Jong F (2011) Scope of negation detection in sentiment analysis. In: Proceedings of the Dutch-Belgian Information Retrieval Workshop (DIR 2011), University of Amsterdam, pp 16–20.

  • Edgcomb JB, Zima B (2019) Machine Learning, Natural Language Processing, and the Electronic Health Record: Innovations in Mental Health Services Research. Psychiatr Serv 70:346–349. https://doi.org/10.1176/appi.ps.201800401

    Article  Google Scholar 

  • Gangula RRR, Mamidi R (2018) Resource creation towards automated sentiment analysis in telugu (a low resource language) and integrating multiple domain sources to enhance sentiment prediction. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018). https://www.aclweb.org/anthology/L18-1100.pdf.

  • Hammou BA, Lahcen AA, Mouline S (2019) A distributed eensemble of deep convolutional neural networks with random forest for big data sentiment analysis. International Conference on Mobile, Secure, and Programmable Networkin. Springer, Cham, pp 153–162. https://doi.org/10.1007/978-3-030-22885-9_14.

  • Kagan V, Stevens A, Subrahmanian VS (2015) Using twitter sentiment to forecast the 2013 pakistani election and the 2014 indian election. IEEE Intell Syst 30(1):2–5. https://doi.org/10.1109/MIS.2015.16

    Article  Google Scholar 

  • Kim C, Zhu V, Obeid J, Lenert L (2019) Natural language processing and machine learning algorithm to identify brain MRI reports with acute ischemic stroke. PLoS ONE 14(2). https://doi.org/10.1371/journal.pone.0212778

  • Krishna NS, Murthy HA (2004) A new prosodic phrasing model for Indian language Telugu. EIn; ighth International Conference on Spoken Language Processing.

  • Lavergne T, Urvoy T, Yvon F (2011) Filtering artificial texts with statistical machine learning techniques. Lang Resour Eval 45(1):25–43. https://doi.org/10.1007/s10579-009-9113-0

    Article  Google Scholar 

  • Liu B, Blasch E, Chen Y, Shen D, Chen G (2013) Scalable sentiment classification for big data analysis using naive bayes classifier. In: 2013 IEEE international conference on big data, pp 99–104. https://doi.org/10.1109/BigData.2013.6691740.

  • Liu B, Zhang L (2012) A survey of opinion mining and sentiment analysis. Mining text data. Springer, Boston, pp 415–463. https://doi.org/10.1007/978-1-4614-3223-4_13.

  • Melville P, Gryc W, Lawrence RD (2009) Sentiment analysis of blogs by combining lexical knowledge with text classification. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 1275–1284. https://doi.org/10.1145/1557019.1557156.

  • Mishra M, Barman SK, Maity D, Maiti DK (2019) Ant lion optimisation algorithm for structural damage detection using vibration data. J Civ Struct Heal Monit 9(1):117–136. https://doi.org/10.1007/s13349-018-0318-z

    Article  Google Scholar 

  • Mukku SS, Oota SR, Mamidi R (2017) Tag me a label with multi-arm: Active learning for telugu sentiment analysis. In: International Conference on Big Data Analytics and Knowledge Discovery, Springer, Cham, pp 355–367. https://doi.org/10.1007/978-3-319-64283-3_26.

  • Nakagawa T, Inui K, Kurohashi S (2010) Dependency tree-based sentiment classification using CRFs with hidden variables. Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp 786–794.

  • Naidu R, Bharti SK, Babu KS, Mohapatra RK (2019) Text summarization with automatic keyword extraction in telugu e-newspapers. Smart Computing and Informatics, Springer, Singapore, pp. 555–564. https://doi.org/10.1007/978-981-10-5544-7_54.

  • Negi K, Pavuri A, Patel L, Jain C (2019) A novel method for drug-adverse event extraction using machine learning. Informatics in Medicine Unlocked 100190. https://doi.org/10.1016/j.imu.2019.100190

  • Pedhazur EJ, Kerlinger FN (1982) Multiple regression in behavioral research. Holt, Rinehart, and Winston. New York

  • Pham BT, Prakash I, Singh SK, Shirzadi A, Shahabi H (2019) Landslide susceptibility modeling using reduced error pruning trees and different ensemble techniques: Hybrid machine learning approaches. CATENA 175:203–218. https://doi.org/10.1016/j.catena.2018.12.018

    Article  Google Scholar 

  • Qiu C, Wang G, Bu J, Liu K, Chen C (2008) Incorporate the syntactic knowledge in opinion mining in user-generated content. In: Proceedings of WWW 2008 Workshop NLP Challenges in the Information Explosion Era.

  • Rajalakshmi S, Asha S, Pazhaniraja N (2017) A comprehensive survey on sentiment analysis. 2017 Fourth International Conference on Signal Processing, Communication and Networking (ICSCN), pp 1–5. https://doi.org/10.1109/ICSCN.2017.8085673.

  • Schintler LA, Kulkarni R (2014) Big data for policy analysis: the good, the bad, and the ugly. Review of Policy Research 31(4). https://doi.org/10.1111/ropr.12079.

  • Bandari S, Bulusu VV (2020) Survey on ontology-based sentiment analysis of customer reviews for products and services. Data Engineering and Communication Technology, Springer, Singapore, pp 91–101. https://doi.org/10.1007/978-981-15-1097-7_8

  • Tromp E, Pechenizkiy M, Gaber MM (2017) Expressive modeling for trusted big data analytics: techniques and applications in sentiment analysis. Big Data Analytics 2(1):5. https://doi.org/10.1186/s41044-016-0018-9

    Article  Google Scholar 

  • Trupthi M, Pabboju S, Gugulotu N (2019) Deep Sentiments Extraction for Consumer Products Using NLP-Based Technique. Soft Computing and Signal Processing, Springer, Singapore, pp 191–201. https://doi.org/10.1007/978-981-13-3393-4_20.

  • Wang L, Zhan J, Luo C, Zhu Y, Yang Q, He Y, Zheng C (2014) Bigdatabench: A big data benchmark suite from internet services. In: 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA), pp 488–499. https://doi.org/10.1109/HPCA.2014.6835958.

  • Yang HC, Dasdan A, Hsiao RL, Parker DS (2007) Map-reduce-merge: simplified relational data processing on large clusters. In:Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp 1029–1040. https://doi.org/10.1145/1247480.1247602.

  • Yang H, Luo L, Chueng LP, Ling D, Chin F (2019) Deep Learning and Its Applications to Natural Language Processing. Deep Learning: Fundamentals, Theory and Applications. Springer, Cham, pp 89–109. https://doi.org/10.1007/978-3-030-06073-2_4.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Midde.Venkateswarlu Naik.

Ethics declarations

Conflict of interest

The authors declare that they have no potential conflict of interest.

Human and Animal Rights

All applicable institutional and/or national guidelines for the care and use of animals were followed.

Informed Consent

For this type of study formal consent is not required.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Naik, M., Vasumathi, D. & Kumar, A.P.S. An adaptable scheme to enhance the sentiment classification of Telugu language. Soc. Netw. Anal. Min. 11, 60 (2021). https://doi.org/10.1007/s13278-021-00764-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-021-00764-w

Keywords

Navigation