Enhancing Performance of Hybrid Named Entity Recognition for Amazighe Language

Talha, Meryem; Boulaknadel, Siham; Aboutajdine, Driss

doi:10.1007/978-3-030-02357-7_10

Meryem Talha³,
Siham Boulaknadel⁴ &
Driss Aboutajdine³

Part of the book series: Studies in Computational Intelligence ((SCI,volume 801))

1651 Accesses

Abstract

Named Entity Recognition (NER) involves the identification and classification of named entities in texts. This is an important subtask in most high level NLP applications and semantic Web technologies. Besides, various studies have been done on NER for most of the languages and in particular for English. However the studies for Amazighe have lagged behind these for a long while. Recently, Amazighe NER have caught more attention due to the increasing flow of Amazighe texts available on the Web and the need to discover named entities occurring in these texts, considering the fact that a difference in language impose new challenges. Some systems using different approaches have been proposed in terms of extracting Amazighe named entities, however the recent system proposed based on a hybrid approach, the only existing hybrid system, reports a drop in F-Measure from 93 to 73% when compared to the rule based approach. In this paper, we present our enhancement of the previously proposed method by adding a new set of handcrafted lexical resources and a new set of features. The system is able to identify seven different kinds of entities such as “Person”, “Location”, “Organization”, “Numbers”, “Percent”, “Money”, “Date/Time”, it was tested on our Amazighe corpus “AMCorp” with satisfactory results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Hardcover Book: USD 179.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://www.hcp.ma/Presentation-des-premiers-resultats-du-RGPH-2014_a1605.html.
2.
https://gate.ac.uk/.
3.
JAPE is a pattern specification language, which enables the implementation of grammatical based on regular expression.
4.
The articles were collected from: http://www.mapamazighe.ma/am/.

References

Amrouch, M., Rachidi, A., El Yassa, M., Mammass, D.: Handwritten Amazigh character recognition based on Hidden Markov models. Int. J. Gr. Vis. Image Process. 10(5), 11–18 (2010)
Google Scholar
Es, S.Y., Rachidi, A., El Yassa, M., Mammas, D.: Printed Amazigh character recognition by a syntactic approach using finite automata. Int. J. Gr. Vis. Image Process. 10(2), 1–8 (2010)
Google Scholar
Fakir, M., Bouikhalene, B., Moro, K.: Skeletonization methods evaluation for the recognition of printed tifinaghe characters. In: Proceedings of the 1er Symposium International sur le Traitement Automatique de la Culture Amazighe. Agadir, Morocco, pp. 33–47 (2009)
Google Scholar
Boulaknadel, S., Ataa allah, F.: Building a standard Amazigh corpus. In: Proceedings of the International Conference on Intelligent Human Computer Interaction, Prague, Tchec (2011)
Google Scholar
Boulaknadel, S., Ataa Allah, F.: Online Amazigh concordancer. In: Proceedings of International Symposium on Image Video Communications and Mobile Networks, Rabat, Maroc (2010)
Google Scholar
Ataa Allah, F., Boulaknadel, S.: Pseudo-racinisation de la langue amazighe. In: Proceeding of Traitement Automatique des Langues Naturelles, Montréal, Canada (2010)
Google Scholar
Nejme, F., Boulaknadel, S., Aboutajdine, D.: Analyse Automatique de la Morphologie Nominale Amazighe. Actes de la conférence du Traitement Automatique du Langage Naturel (TALN), Les Sables d’Olonne, France (2013)
Google Scholar
Nejme, F., Boulaknadel, S., Aboutajdine, D.: Finite state morphology for Amazigh language. In: Proceeding of International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Samos, Greece (2013)
Google Scholar
Chinchor, N.A., Marsh, E.: Muc-7 information extraction task definition. In: Proceeding of the Seventh Message Understanding Conference (MUC-7), Appendices (1998)
Google Scholar
Voorhees, E.M., Harman, D.K. (eds.): TREC: Experiment and Evaluation in Information Retrieval, vol. 1. MIT Press, Cambridge (2005)
Google Scholar
Molla, D., Zaanen, M., Smith, D.: Named entity recognition for question answering. In: Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), pp. 51–58 (2006)
Google Scholar
Babych, B., Hartley, A.: Improving machine translation quality with automatic named entity recognition. In: Proceedings of the 7th International EAMT Workshop on MT and Other Language Technology Tools, Improving MT Through Other Language Technology Tools: Resources and Tools for Building MT, pp. 1–8. Association for Computational Linguistics (2003)
Google Scholar
Chen, Z., Ji, H.: Collaborative ranking: a case study on entity linking. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 771–781 (2011)
Google Scholar
Darvinder, K., Gupta, V.: A survey of named entity recognition in English and other Indian languages. IJCSI Int. J. Comput. Sci. Iss. 7(6), 1694-0814 (2010)
Google Scholar
Andoni, A., Montse, C., Seán, G.: NERC-fr: supervised named entity recognition for French. In: International Conference on Text, Speech, and Dialogue, pp. 158–165. Springer, Cham (2014)
Google Scholar
Galicia-Haro, S.N., Gelbukh, A., Bolshakov, I.A.: Recognition of named entities in Spanish texts. In: MICAI 2004: Advances in Artificial Intelligence, pp. 420–429 (2004)
Google Scholar
Bai, S., et al.: System for Chinese tokenization and named entity recognition. U.S. Patent No. 6,311,152, 30 Oct 2001
Google Scholar
Sasano, R., Kurohashi, S.: Japanese named entity recognition using structural natural language processing. In: Proceedings of IJCNLP, pp. 607–612 (2008)
Google Scholar
Doğan, K., Arici, N., Dilek, K.: Named entity recognition in Turkish: approaches and issues. In: International Conference on Applications of Natural Language to Information Systems, pp. 176–181. Springer, Cham (2017)
Google Scholar
Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30(1), 3–26 (2007)
Article Google Scholar
Talha, M., Boulaknadel, S., Aboutajdine, D.: NERAM: named entity recognition for Amazighe language. In: 21st International Conference of TALN, pp. 517–524. Aix Marseille University, Marseille (2014)
Google Scholar
Boulaknadel, S., Talha, M., Aboutajdine, D.: Amazighe named entity recognition using a rule based approach. In: 11th ACS/IEEE International Conference on Computer Systems and Applications. Doha, Qatar (2014)
Google Scholar
Talha, M., Boulaknadel, S., Aboutajdine, D.: L’apport d’une approche symbolique pour le repérage des entités nommées en langue amazighe. In: EGC, pp. 29–34, Luxembourg (2015)
Google Scholar
Talha, M., Boulaknadel, S., Aboutajdine, D.: Development of Amazighe named entity recognition system using hybrid method. J. Res. Comput. Sci. 90, 151–161 (2015)
Google Scholar
Küçük, D., Yazıcı, A.: Named entity recognition experiments on Turkish texts. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds.) Flexible Query Answering Systems. FQAS 2009. Lecture Notes in Computer Science, vol. 5822, pp. 524–535. Springer, Berlin, Heidelberg (2009)
Google Scholar
Shaalan, K., Raza, H.: NERA: named entity recognition for Arabic. J. Am. Soc. Inform. Sci. Technol. 60(8), 1652–1663 (2009)
Article Google Scholar
Sharnagat, R.: Named Entity Recognition: A Literature Survey (2014)
Google Scholar
Bikel, D.M., Miller, S., Schwartz, R., Weischedel, R.: Nymble: a high-performance learning name-finder. In: Proceedings of the Fifth Conference on Applied Natural Language Processing, pp. 194–201. Association for Computational Linguistics (1997)
Google Scholar
Zhou, G., Su, J.: Named entity recognition using an HMM-based chunk tagger. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 473–480. Association for Computational Linguistics (2002)
Google Scholar
Zhou, G., Su, J.: Named entity recognition using an HMM-based chunk tagger. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 473–480. Association for Computational Linguistics (2002)
Google Scholar
Borthwick, A., Sterling, J., Agichtein, E., Grishman, R.: NYU: description of the MENE named entity system as used in MUC-7. In: Proceedings of the Seventh Message Understanding Conference (MUC-7) (1998)
Google Scholar
McCallum, A., Li, W.: Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003, vol. 4, pp. 188–191. Association for Computational Linguistics (2003)
Google Scholar
Benajiba, Y.: Arabic named entity recognition. Ph.D. thesis, Techninal University of Valencia (2009)
Google Scholar
Abdallah, S., Shaalan, K., Shoaib, M.: Integrating rule-based system with classification for Arabic named entity recognition. In: Gelbukh, A. (ed.) Computational Linguistics and Intelligent Text Processing. Lecture Notes in Computer Science, vol. 7181, pp. 311–322. Springer, Berlin, Heidelberg (2012)
Chapter Google Scholar
Greenberg, J.: The Languages of Africa. The Hague (1966)
Google Scholar
Ouakrim, O.: Fonética y fonología del Bereber. Survey at the University of Autònoma de Barcelona (1995)
Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995). ISBN 0-387-94559-8
Book Google Scholar
Vapnik, V.: Statistical Learning Theory. Springer, New York (1998)
MATH Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. In: Machine Learning, pp. 273–297 (1995)
Google Scholar
Hsu, C.W., Lin, C.J.: A comparison of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 13(2), 415–425 (2002)
Article Google Scholar
Kreßel, U.H.G.: Pairwise classification and support vector machines. In: Advances in Kernel Methods, pp. 255–268. MIT Press (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

LRIT, Unité Associée au CNRST (URAC 29), Faculty of Science, Mohammed V University, Agdal, Rabat, Morocco
Meryem Talha & Driss Aboutajdine
Royal Institut of Amazighe Culture Allal El Fassi Avenue, Madinat al Irfane, Rabat-Instituts, Rabat, Morocco
Siham Boulaknadel

Authors

Meryem Talha
View author publications
You can also search for this author in PubMed Google Scholar
Siham Boulaknadel
View author publications
You can also search for this author in PubMed Google Scholar
Driss Aboutajdine
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Meryem Talha .

Editor information

Editors and Affiliations

Faculty of Computers and Information, Cairo University, Giza, Egypt
Aboul Ella Hassanien

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Talha, M., Boulaknadel, S., Aboutajdine, D. (2019). Enhancing Performance of Hybrid Named Entity Recognition for Amazighe Language. In: Hassanien, A. (eds) Machine Learning Paradigms: Theory and Application. Studies in Computational Intelligence, vol 801. Springer, Cham. https://doi.org/10.1007/978-3-030-02357-7_10

Download citation

DOI: https://doi.org/10.1007/978-3-030-02357-7_10
Published: 08 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02356-0
Online ISBN: 978-3-030-02357-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics