Skip to main content

Arabic Named Entity Recognition—A Survey and Analysis

  • Conference paper
  • First Online:
Intelligent Interactive Multimedia Systems and Services 2016

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 55))

Abstract

As Arabic digital data has been increasing in abundance; the need for processing this information is growing. Named entity recognition (NER) is an information extraction technique that is vital to the processes of natural language processing (NLP). The ambiguous characteristics of the Arabic language make tasks related to NER and NLP very challenging. In addition to that, work related to Arabic NER is rather limited and under-studied. In this study, we survey previous works and methodologies and provide an analysis and discussion on the feature sets used, evaluation tools and advantages and disadvantages of each technique.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Al-Kouz, A., Awajan, A., Jeet, M., Al-Zaqqa, A.: Extracting Arabic semantic graph from Aljazeera. net. In: 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT), (pp. 1–6). IEEE, Dec 2013

    Google Scholar 

  2. Attia, M., Rashwan, M.A., Al-Badrashiny, M.A.S.A.A.: Fassieh, a semi-automatic visual interactive tool for morphological, PoS-Tags, phonetic, and semantic annotation of Arabic Text Corpora. IEEE Trans. Audio Speech Lang. Process. 17(5), 916–925 (2009)

    Article  Google Scholar 

  3. Benajiba, Y., Diab, M., Rosso, P.: Arabic named entity recognition: A feature-driven study. IEEE Trans. Audio Speech Lang. Process. 17(5), 926–934 (2009)

    Article  Google Scholar 

  4. Chen, C., Ng, V.: Combining the best of two worlds: a hybrid approach to multilingual coreference resolution. In: Joint Conference on EMNLP and CoNLL-Shared Task, pp. 56–63. Association for Computational Linguistics, July 2012

    Google Scholar 

  5. Diab, M.: Second generation AMIRA tools for Arabic processing: fast and robust tokenization, POS tagging, and base phrase chunking. In: 2nd International Conference on Arabic Language Resources and Tools (2009)

    Google Scholar 

  6. Habash, N., Roth, R., Rambow, O., Eskander, R., Tomeh, N.: Morphological analysis and disambiguation for dialectal Arabic. In: HLT-NAACL, pp. 426–432 (2013)

    Google Scholar 

  7. Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30(1), 3–26 (2007)

    Article  Google Scholar 

  8. Pasha, A., Al-Badrashiny, M., Diab, M., El Kholy, A., Eskander, R., Habash, N., Roth, R.M.: (2014). Madamira: a fast, comprehensive tool for morphological analysis and disambiguation of Arabic. In: Proceedings of the Language Resources and Evaluation Conference (LREC). Reykjavik, Iceland

    Google Scholar 

  9. Zaraket, F.A., Jaber, A.: MATAr: Morphology-based Tagger for Arabic. In Computer Systems and Applications (AICCSA), 2013 ACS International Conference on, pp. 1–4. IEEE, May 2013

    Google Scholar 

  10. Zitouni, I., Benajiba, Y.: Aligned-parallel-corpora based semi-supervised learning for Arabic mention detection. IEEE/ACM Trans. Audio Speech Lang. Process. 22(2), 314–324 (2014)

    Article  Google Scholar 

  11. Zitouni, I., Luo, X., Florian, R.: A cascaded approach to mention detection and chaining in arabic. IEEE Trans. Audio Speech Lang. Process. 17(5), 935–944 (2009)

    Article  Google Scholar 

Download references

Acknowledgements

This publication was made possible by GSRA grant # 1-1-1202-13026 from the Qatar National Research Fund (a member of Qatar Foundation). The findings achieved herein are solely the responsibility of the author(s).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Amal Dandashi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Dandashi, A., Al Jaam, J., Foufou, S. (2016). Arabic Named Entity Recognition—A Survey and Analysis. In: Pietro, G., Gallo, L., Howlett, R., Jain, L. (eds) Intelligent Interactive Multimedia Systems and Services 2016. Smart Innovation, Systems and Technologies, vol 55. Springer, Cham. https://doi.org/10.1007/978-3-319-39345-2_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-39345-2_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-39344-5

  • Online ISBN: 978-3-319-39345-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics