skip to main content
research-article

Semantic and Context Understanding for Sentiment Analysis in Hindi Handwritten Character Recognition Using a Multiresolution Technique

Published: 15 January 2024 Publication History

Abstract

The rapid growth of Web 2.0, which enables people to generate, communicate, and share information, has resulted in an increase in the total number of users. In developing countries, online users’ sentiment influences decision-making, social views, individual consumption decisions, and entity quality monitoring. As a result, more accurate sentiment analysis, particularly in their native language such as Hindi, is preferred over crude binary categorization. This is because of the abundance of web-based data in Indian languages such as Hindi, Marathi, Kannada, Tamil, and so on. Analyzing this data and recovering valuable and relevant information from handwritten text has become extremely important. Despite years of research and development, no optical writing recognition (OCR) system has ever been certified as completely reliable. The first step in any pattern recognition system is feature selection. In many fields, feature selection is studied as a combinatorial optimization problem. The primary goal of feature selection is to reduce the number of redundant and ineffective traits in the recognition system. This feature selection is used to maintain or improve the performance of the classifier used by the recognition system: A support vector machine (SVM) technique could be used to solve this character recognition problem. The Hindi character recognition system recognizes Hindi characters by employing morphological operations, edge detection, HOG feature extraction, and an SVM-based classifier. The proposed model outperformed the current state-of-the-art method, achieving an accuracy of 96.77%.

References

[1]
Nur Atiqah Sia Abdullah and Nur Ida Aniza Rusli. 2021. Multilingual sentiment analysis: A systematic literature review. Pertanika J. Sci. Technol. 29, 1 (2021).
[2]
Divya Acharya, Anosh Billimoria, Neishka Srivastava, Shivani Goel, and Arpit Bhardwaj. 2020. Emotion recognition using Fourier transform and genetic programming. Appl. Acoust. 164, 107260 (2020), 107260.
[3]
Md. Shad Akhtar, Tarun Garg, and Asif Ekbal. 2020. Multi-task learning for aspect term extraction and aspect sentiment classification. Neurocomputing 398 (2020), 247–256.
[4]
Saqib Ali, Guojun Wang, and Shazia Riaz. 2020. Aspect based sentiment analysis of ridesharing platform reviews for Kansei engineering. IEEE Access 8 (2020), 173186–173196.
[5]
Abhijit Bera, Mrinal Kanti Ghose, and Dibyendu Kumar Pal. 2021. Sentiment analysis of multilingual tweets based on natural language processing (NLP). Int. J. Syst. Dyn. Appl. 10, 4 (2021), 1–12.
[6]
Santosh Kumar Bharti, Reddy Naidu, and Korra Sathya Babu. 2020. Hyperbolic feature-based sarcasm detection in Telugu conversation sentences. J. Intell. Syst. 30, 1 (2020), 73–89.
[7]
Kareem Darwish, Nizar Habash, Mourad Abbas, Hend Al-Khalifa, Huseein T. Al-Natsheh, Houda Bouamor, Karim Bouzoubaa, Violetta Cavalli-Sforza, Samhaa R., Wassim El-Hajj, Mustafa Jarrar, and Hamdy Mubarak. 2021. A panoramic survey of natural language processing in the Arab world. Commun. ACM 64, 4 (2021), 72–81.
[8]
Sunil Dhankhar, Mukesh Kumar Gupta, Fida Hussain Memon, Surbhi Bhatia, Pankaj Dadheech, and Arwa Mashat. 2022. Support vector machine based handwritten Hindi character recognition and summarization. Comput. Syst. Sci. Eng. 43, 1 (2022), 397–412.
[9]
Matthias Függer, Thomas Nowak, and Manfred Schwarz. 2021. Tight bounds for asymptotic and approximate consensus. J. ACM 68, 6 (2021), 1–35.
[10]
Kanika Garg and D. K. Lobiyal. 2020. Hindi EmotionNet: A scalable emotion lexicon for sentiment classification of Hindi text. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 19, 4 (2020), 1–35.
[11]
Akanksha Gaur and Sunita Yadav. 2015. Handwritten Hindi character recognition using k-means clustering and SVM. In 2015 4th International Symposium on Emerging Trends and Technologies in Libraries and Information Services. IEEE.
[12]
Vedika Gupta, Nikita Jain, Shubham Shubham, Agam Madan, Ankit Chaudhary, and Qin Xin. 2021. Toward integrated CNN-based sentiment analysis of tweets for scarce-resource language—Hindi. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 20, 5 (2021), 1–23.
[13]
Deepak Jain, Akshi Kumar, and Geetanjali Garg. 2020. Sarcasm detection in mash-up language using soft-attention based bi-directional LSTM and feature-rich CNN. Appl. Soft Comput. 91, 106198 (2020), 106198.
[14]
Anupam Jamatia, Steve Durairaj Swamy, Björn Gambäck, Amitava Das, and Swapan Debbarma. 2020. Deep learning based sentiment analysis in a code-mixed English-Hindi and English-Bengali social media corpus. Int. J. Artif. Intell. Tools 29, 05 (2020), 2050014.
[15]
R. Jayadevan, Satish R. Kolhe, Pradeep M. Patil, and Umapada Pal. 2011. Offline recognition of Devanagari script: A survey. IEEE Trans. Syst. Man Cybern. C Appl. Rev. 41, 6 (2011), 782–796.
[16]
Gauri Katiyar and Shabana Mehfuz. 2015. MLPNN based handwritten character recognition using combined feature extraction. In International Conference on Computing, Communication & Automation. IEEE.
[17]
Dhanashree S. Kulkarni and Sunil S. Rodd. 2022. Sentiment analysis in Hindi—A survey on the state-of-the-art techniques. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 21, 1 (2022), 1–46.
[18]
Akshi Kumar and Victor Hugo C. Albuquerque. 2021. Sentiment analysis using XLM-R transformer and zero-shot transfer learning on resource-poor Indian language. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 20, 5 (2021), 1–13.
[19]
Ankit Kumar, Abhishek Kumar, Ali Kashif Bashir, Mamoon Rashid, V. D. Ambeth Kumar, and Rupak Kharel. 2021. Distance based pattern driven mining for outlier detection in high dimensional big dataset. ACM Trans. Manage. Inf. Syst. 13, 1, Article 8 (Oct. 2021), 17 pages.
[20]
Ankit Kumar, Kamred Udham Singh, Chetan Swarup, Teekam Singh, Linesh Raja, and Abhishek Kumar. 2022. Detection of copy-move forgery using Euclidean distance and texture features. Trait. du Signal 39, 3 (2022), 781–788.
[21]
Chinmayee Ojha, Manju Venugopalan, and Deepa Gupta. 2021. A rule based approach for aspect extraction in Hindi reviews. J. Intell. Fuzzy Syst. 41, 5 (2021), 5477–5485.
[22]
Suraj Pandey, Md. Shad Akhtar, and Tanmoy Chakraborty. 2021. Syntactically coherent text augmentation for sequence classification. IEEE Trans. Comput. Soc. Syst. 8, 6 (2021), 1323–1332.
[23]
Abhilash Pathak, Sudhanshu Kumar, Partha Roy, and Byung-Gyu Kim. 2021. Aspect-based sentiment analysis in Hindi language by ensembling pre-trained mBERT models. Electronics (Basel) 10, 21 (2021), 2641.
[24]
Rupali S. Patil and Satish R. Kolhe. 2022. Supervised classifiers with TF-IDF features for sentiment analysis of Marathi tweets. Soc. Netw. Anal. Min. 12, 1 (2022).
[25]
Heidi Pauwels. 2021. Cultivating emotion and the rise of the vernacular: The role of affect in “early Hindi-Urdu” song. South Asian Hist. Cult. 12, 2-3 (2021), 146–165.
[26]
Muhammad Aasim Qureshi, Muhammad Asif, Mohd Fadzil Hassan, Adnan Abid, Asad Kamal, Sohail Safdar, and Rehan Akber. 2022. Sentiment analysis of reviews in natural language: Roman Urdu as a case study. IEEE Access 10 (2022), 24945–24954.
[27]
Sujata Rani and Parteek Kumar. 2022. Aspect-based sentiment analysis using dependency parsing. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 21, 3 (2022), 1–19.
[28]
Kamal Sarkar. 2020. Heterogeneous classifier ensemble for sentiment analysis of Bengali and Hindi tweets. Sadhana 45, 1 (2020).
[29]
Hewan Shrestha, Chandramohan Dhasarathan, Shanmugam Munisamy, and Amudhavel Jayavel. 2020. Natural language processing based sentimental analysis of Hindi (SAH) script an optimization approach. Int. J. Speech Technol. 23, 4 (2020), 757–766.
[30]
Kush Shrivastava and Shishir Kumar. 2020. A sentiment analysis system for the Hindi language by integrating Gated Recurrent Unit with Genetic Algorithm. Int. Arab J. Inf. Technol. 17, 6 (2020), 954–964.
[31]
Xun Zhu, Yinxia Lou, Hongtao Deng, and Donghong Ji. 2022. Leveraging bilingual-view parallel translation for code-switched emotion detection with adversarial dual-channel encoder. Knowl. Based Syst. 235, 107436 (2022), 107436.

Cited By

View all
  • (2024)Decoding the Devanagari - Handwritten Hindi Recognition Using Deep Learning Methodology2024 International Conference on Advances in Computing Research on Science Engineering and Technology (ACROSET)10.1109/ACROSET62108.2024.10743214(1-6)Online publication date: 27-Sep-2024
  • (2023)Reading Scene Text with Aggregated Temporal Convolutional EncoderACM Transactions on Asian and Low-Resource Language Information Processing10.1145/362582222:11(1-16)Online publication date: 12-Oct-2023
  • (2023)Deep Learning: The Future of Medical Image Processing2023 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES)10.1109/CISES58720.2023.10183477(699-704)Online publication date: 28-Apr-2023

Index Terms

  1. Semantic and Context Understanding for Sentiment Analysis in Hindi Handwritten Character Recognition Using a Multiresolution Technique

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Transactions on Asian and Low-Resource Language Information Processing
          ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 23, Issue 1
          January 2024
          385 pages
          EISSN:2375-4702
          DOI:10.1145/3613498
          Issue’s Table of Contents

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 15 January 2024
          Online AM: 06 October 2022
          Accepted: 21 July 2022
          Revised: 30 June 2022
          Received: 30 May 2022
          Published in TALLIP Volume 23, Issue 1

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Datasets
          2. neural networks
          3. gaze detection
          4. text tagging

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)150
          • Downloads (Last 6 weeks)20
          Reflects downloads up to 02 Mar 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)Decoding the Devanagari - Handwritten Hindi Recognition Using Deep Learning Methodology2024 International Conference on Advances in Computing Research on Science Engineering and Technology (ACROSET)10.1109/ACROSET62108.2024.10743214(1-6)Online publication date: 27-Sep-2024
          • (2023)Reading Scene Text with Aggregated Temporal Convolutional EncoderACM Transactions on Asian and Low-Resource Language Information Processing10.1145/362582222:11(1-16)Online publication date: 12-Oct-2023
          • (2023)Deep Learning: The Future of Medical Image Processing2023 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES)10.1109/CISES58720.2023.10183477(699-704)Online publication date: 28-Apr-2023

          View Options

          Login options

          Full Access

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Full Text

          View this article in Full Text.

          Full Text

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media