research-article

Ensemble Classifier for Hindi Hostile Content Detection

Authors:

Angana Chakraborty,

Subhankar Joardar,

Arif Ahmed SekhAuthors Info & Claims

ACM Transactions on Asian and Low-Resource Language Information Processing, Volume 23, Issue 1

Article No.: 13, Pages 1 - 17

https://doi.org/10.1145/3591353

Published: 15 January 2024 Publication History

Get Access

Abstract

Detection of hostile content from social media posts (Facebook, Twitter, etc.) is a demanding task in the field of Natural Language Processing. The increase of hostile content in different electronic media has opened up new challenges in language understanding. It becomes more difficult in regional languages. AI-based solutions are required to identify hostile content on a large scale. Although a satisfactory amount of research has been carried out in the English language, finding hostile content in regional languages is still under development due to the unavailability of suitable datasets and tools. In terms of the number of speakers, Hindi ranks third in the world and first on the Indian subcontinent. The objective of this article is to design a hostile content detection system in Hindi using coarse-grained (binary) classification and fine-grained (multi-class, multi-label) classification. We note that different baseline learning methods with different pre-trained language models perform differently. Using the Constraint 2021 Hindi Dataset, this research proposes a Bidirectional Encoder Representations from Transformers–(BERT) based contextual embedding technique with a concatenation of emoji2vec embeddings to classify social media posts in Hindi Devanagari script as hostile or non-hostile. Additionally, for the fine-grained tasks where hostile posts are sub-categorized as defamation, fake, hate, and offensive, we develop an ensemble classifier varying different learning methods and embedding models. With an F1-Score of 0.9721, it is found that our proposed Indic-BERT+emoji model outperforms the baseline model and other existing models for the coarse-grained task. We have also observed that our proposed ensemble method provides better results than the existing models and the baseline model for the fine-grained tasks with F1-Scores of 0.43, 0.82, 0.58, and 0.62 for the defamation, fake, hate, and offensive classes, respectively. The code and the data are available at https://github.com/skarifahmed/hostile.

Supplementary Material

3613498.supp (3613498.supp.pdf)

Supplementary material

Download
292.76 KB

References

[1]

2015. Coarse-grained vs. Fine-grained Sentiment Analysis. Retrieved May 25, 2015 from https://www.linkedin.com/pulse/coarse-grained-vs-fine-grained-sentiment-analysis-wei-li.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

An unsupervised Hindi stemmer with heuristic improvements

Interpretable and High-Performance Hate and Offensive Speech Detection

Hostility Detection in Online Hindi-English Code-Mixed Conversations

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Full Text

Share

Share this Publication link

Share on social media

Affiliations