NPABT: Naming Pattern Analysis Of Bengali Text To Detect Various Community Using Machine Learning Approach | IEEE Conference Publication | IEEE Xplore

NPABT: Naming Pattern Analysis Of Bengali Text To Detect Various Community Using Machine Learning Approach


Abstract:

Natural Language processing is an important part of Artificial Intelligence which enriches languages. It creates a bridge between humans and machines to communicate. In t...Show More

Abstract:

Natural Language processing is an important part of Artificial Intelligence which enriches languages. It creates a bridge between humans and machines to communicate. In this paper, we have proposed a method that can automatically predict a person's community where he may belong by using their name. Based on the research available on online literature, we are the first to do research on it. We have collected more than 8,000 names of the four major communities of Bangladesh, including both males and females. Then data has been preprocessed to clean those texts by following the natural language processing approach. The six most popular machine learning classifiers have been used to train and test this data. Usually, human names have some prefix or suffix of their community. From that prefix or suffix, this method determines the community of that person's or their predecessors' community. The used approaches worked really well for these data and give higher accuracy for the models. The total performance of those classifiers is up to 65.12%-78.25%. Random Forest (RF) classifiers achieved the highest accuracy which is 78.25%.
Date of Conference: 06-08 July 2021
Date Added to IEEE Xplore: 03 November 2021
ISBN Information:
Conference Location: Kharagpur, India

Contact IEEE to Subscribe

References

References is not available for this document.