Skip to main content

Bengali Named Entity Recognition Using Margin Infused Relaxed Algorithm

  • Conference paper
Text, Speech and Dialogue (TSD 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8655))

Included in the following conference series:

Abstract

The present work describes the automatic recognition of named entities based on language independent and dependent features. Margin Infused Relaxed Algorithm is applied for the first time in order to learn named entities for Bengali language. We used openly available annotated corpora with twelve different tagset defined in IJCNLP-08 NERSSEAL shared task and obtained 91.23%, 87.29% and 89.69% precision, recall and F-measure respectively. The proposed work outperforms the existing models with satisfactory margin.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bandyopadhyay, S.: Multilingual Named Entity Recognition. In: Proceedings of the IJCNLP 2008 Workshop on NER for South and South East Asian Languages, Hyderabad, India (2008)

    Google Scholar 

  2. Ralph, G.: The New York University System MUC-6 or Where’s the syntax? In: Proceedings of Message Understanding Conference (1995)

    Google Scholar 

  3. McDonald, D.: Internal and external evidence in the identification and semantic categorization of proper names. In: Boguraev, B., Pustejovsky, J. (eds.) Corpus Processing for Lexical Acquisition, pp. 21–39 (1996)

    Google Scholar 

  4. Takahiro, W., Gaizauskas, R., Wilks, Y.: Evaluation of an algorithm for the recognition and classification of proper names. In: Proceedings of COLING (1996)

    Google Scholar 

  5. Hewavitharana, S., Vogel, S.: Extracting parallel phrases from comparable data. In: Proceedings of the Workshop on Building and Using Comparable Corpora, ACL, Portland, Oregon, pp. 61–68 (2011)

    Google Scholar 

  6. Bikel, D.M., Scott, M., Richard, S., Ralph, S.: Nymble: A High Performance Learning Name-finder. In: Proceedings of Applied Natural Language Processing, Hyderabad, India, pp. 194–201 (1997)

    Google Scholar 

  7. Wei, L., Andrew, M.: Rapid Development of Hindi Named Entity Recognition using Conditional Random Fields and Feature Induction. ACM Transactions on Computational Logic (2004)

    Google Scholar 

  8. Hiroyasu, Y., Kudo, T., Matsumoto, Y.: Japanese Named Entity Extraction using Support Vector Machine. Transactions of IPSJ 43(1), 44–53 (2002)

    Google Scholar 

  9. Andrew, B.: A Maximum Entropy Approach to Named Entity Recognition. Ph.D. Thesis, New York University (1999)

    Google Scholar 

  10. Saha, S.K., Chatterji, S., Dantapat, S., Sarkar, S., Mitra, P.: A Hybrid Approach for Named Entity Recognition in Indian Languages. In: NERSSEAL-IJCNLP 2008, Hyderabad, India, pp. 17–24 (2008)

    Google Scholar 

  11. Sharma, P., Sharma, U., Kalita, J.: Named Entity Recognition: A Survey for the Indian Languages. In: Parsing in Indian Languages, pp. 35–39 (2011)

    Google Scholar 

  12. Ekbal, A., Haque, R., Das, A., Bandyopadhyay, S.: Language Independent Named Entity Recognition in Indian Languages. In: Proceedings of the NERSSEAL-IJCNLP 2008, Hyderabad, India, pp. 33–40 (2008)

    Google Scholar 

  13. Ekbal, A., Saha, S.: Weighted Vote Based Classifier Ensemble Selection Using Genetic Algorithm for Named Entity Recognition. In: Hopfe, C.J., Rezgui, Y., Métais, E., Preece, A., Li, H. (eds.) NLDB 2010. LNCS, vol. 6177, pp. 256–267. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  14. Ekbal, A., Saha, S.: Classifier Ensemble using Multiobjective Optimization for Named Entity Recognition. In: European Conference on Artificial Intelligence (ECAI 2010), Lisbon, Portugal, pp. 783–788 (2010)

    Google Scholar 

  15. Ekbal, A., Saha, S.: Maximum Entropy Classifier Ensembling using Genetic Algorithm for NER in Bengali. In: International Conference on Language Resources and Evaluation (LREC 2010), Malta (2010)

    Google Scholar 

  16. Ekbal, A., Bandyopadhyay, S.: Maximum Entropy Approach for Named Entity Recognition in Bengali. In: Proceedings of International Symposium on Natural Language Processing (SNLP 2007), Thailand, pp. 1–6 (2007)

    Google Scholar 

  17. Ekbal, A., Bandyopadhyay, S.: Bengali Named Entity Recognition using Support Vector Machine. In: NERSSEAL-IJCNLP 2008, Hyderabad, India, pp. 51–58 (2008)

    Google Scholar 

  18. Ekbal, A., Bandyopadhyay, S.: Voted NER System using Appropriate Unlabeled Data. In: Named Entities Workshop: Shared Task on Transliteration (NEWS 2009), ACL-IJCNLP, Singapore, pp. 202–210 (2009)

    Google Scholar 

  19. Chaudhuri, B., Bhattacharya, S.: An Experiment on Automatic Detection of Named Entities in Bangla. In: NERSSEAL-IJCNLP 2008, Hyderabad, India, pp. 75–82 (2008)

    Google Scholar 

  20. Gali, K., Surana, H., Vaidya, A., Shishtla, P., Sharma, D.M.: Aggregating Machine Learning and Rule Based Heuristics for Named Entity Recognition. In: NERSSEAL-IJCNLP 2008, Hyderabad, India, pp. 25–32 (2008)

    Google Scholar 

  21. Ganchev, K., Pereira, F., Mandel, M., Carroll, S., WhiteCrammer, P., Singer, Y.: Semi-automated named entity annotation. In: Proceedings of the Linguistic Annotation Workshop, pp. 53–56. ACL (2007)

    Google Scholar 

  22. Crammer, K., Singer, Y.: Ultraconservative Online Algorithms for Multiclass Problems. Journal of Machine Learning Research, 951–991 (2003)

    Google Scholar 

  23. Singh, A.K.: Named Entity Recognition for South and South East Asian Languages: Taking Stock. In: NERSSEAL-IJCNLP 2008, Hyderabad, India (2008)

    Google Scholar 

  24. Ekbal, A., Bandyopadhyay, S.: Named entity recognition using support vector machine: A language independent approach. International Journal of Electrical, Computer, and Systems Engineering 4(2), 155–170 (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Banerjee, S., Naskar, S.K., Bandyopadhyay, S. (2014). Bengali Named Entity Recognition Using Margin Infused Relaxed Algorithm. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science(), vol 8655. Springer, Cham. https://doi.org/10.1007/978-3-319-10816-2_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-10816-2_16

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-10815-5

  • Online ISBN: 978-3-319-10816-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics