Abstract
As Arabic digital data has been increasing in abundance; the need for processing this information is growing. Named entity recognition (NER) is an information extraction technique that is vital to the processes of natural language processing (NLP). The ambiguous characteristics of the Arabic language make tasks related to NER and NLP very challenging. In addition to that, work related to Arabic NER is rather limited and under-studied. In this study, we survey previous works and methodologies and provide an analysis and discussion on the feature sets used, evaluation tools and advantages and disadvantages of each technique.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Al-Kouz, A., Awajan, A., Jeet, M., Al-Zaqqa, A.: Extracting Arabic semantic graph from Aljazeera. net. In: 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT), (pp. 1–6). IEEE, Dec 2013
Attia, M., Rashwan, M.A., Al-Badrashiny, M.A.S.A.A.: Fassieh, a semi-automatic visual interactive tool for morphological, PoS-Tags, phonetic, and semantic annotation of Arabic Text Corpora. IEEE Trans. Audio Speech Lang. Process. 17(5), 916–925 (2009)
Benajiba, Y., Diab, M., Rosso, P.: Arabic named entity recognition: A feature-driven study. IEEE Trans. Audio Speech Lang. Process. 17(5), 926–934 (2009)
Chen, C., Ng, V.: Combining the best of two worlds: a hybrid approach to multilingual coreference resolution. In: Joint Conference on EMNLP and CoNLL-Shared Task, pp. 56–63. Association for Computational Linguistics, July 2012
Diab, M.: Second generation AMIRA tools for Arabic processing: fast and robust tokenization, POS tagging, and base phrase chunking. In: 2nd International Conference on Arabic Language Resources and Tools (2009)
Habash, N., Roth, R., Rambow, O., Eskander, R., Tomeh, N.: Morphological analysis and disambiguation for dialectal Arabic. In: HLT-NAACL, pp. 426–432 (2013)
Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30(1), 3–26 (2007)
Pasha, A., Al-Badrashiny, M., Diab, M., El Kholy, A., Eskander, R., Habash, N., Roth, R.M.: (2014). Madamira: a fast, comprehensive tool for morphological analysis and disambiguation of Arabic. In: Proceedings of the Language Resources and Evaluation Conference (LREC). Reykjavik, Iceland
Zaraket, F.A., Jaber, A.: MATAr: Morphology-based Tagger for Arabic. In Computer Systems and Applications (AICCSA), 2013 ACS International Conference on, pp. 1–4. IEEE, May 2013
Zitouni, I., Benajiba, Y.: Aligned-parallel-corpora based semi-supervised learning for Arabic mention detection. IEEE/ACM Trans. Audio Speech Lang. Process. 22(2), 314–324 (2014)
Zitouni, I., Luo, X., Florian, R.: A cascaded approach to mention detection and chaining in arabic. IEEE Trans. Audio Speech Lang. Process. 17(5), 935–944 (2009)
Acknowledgements
This publication was made possible by GSRA grant # 1-1-1202-13026 from the Qatar National Research Fund (a member of Qatar Foundation). The findings achieved herein are solely the responsibility of the author(s).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Dandashi, A., Al Jaam, J., Foufou, S. (2016). Arabic Named Entity Recognition—A Survey and Analysis. In: Pietro, G., Gallo, L., Howlett, R., Jain, L. (eds) Intelligent Interactive Multimedia Systems and Services 2016. Smart Innovation, Systems and Technologies, vol 55. Springer, Cham. https://doi.org/10.1007/978-3-319-39345-2_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-39345-2_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-39344-5
Online ISBN: 978-3-319-39345-2
eBook Packages: EngineeringEngineering (R0)