Abstract
Named Entity Recognition (NER) involves the identification and classification of named entities in texts. This is an important subtask in most high level NLP applications and semantic Web technologies. Besides, various studies have been done on NER for most of the languages and in particular for English. However the studies for Amazighe have lagged behind these for a long while. Recently, Amazighe NER have caught more attention due to the increasing flow of Amazighe texts available on the Web and the need to discover named entities occurring in these texts, considering the fact that a difference in language impose new challenges. Some systems using different approaches have been proposed in terms of extracting Amazighe named entities, however the recent system proposed based on a hybrid approach, the only existing hybrid system, reports a drop in F-Measure from 93 to 73% when compared to the rule based approach. In this paper, we present our enhancement of the previously proposed method by adding a new set of handcrafted lexical resources and a new set of features. The system is able to identify seven different kinds of entities such as “Person”, “Location”, “Organization”, “Numbers”, “Percent”, “Money”, “Date/Time”, it was tested on our Amazighe corpus “AMCorp” with satisfactory results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
JAPE is a pattern specification language, which enables the implementation of grammatical based on regular expression.
- 4.
The articles were collected from: http://www.mapamazighe.ma/am/.
References
Amrouch, M., Rachidi, A., El Yassa, M., Mammass, D.: Handwritten Amazigh character recognition based on Hidden Markov models. Int. J. Gr. Vis. Image Process. 10(5), 11–18 (2010)
Es, S.Y., Rachidi, A., El Yassa, M., Mammas, D.: Printed Amazigh character recognition by a syntactic approach using finite automata. Int. J. Gr. Vis. Image Process. 10(2), 1–8 (2010)
Fakir, M., Bouikhalene, B., Moro, K.: Skeletonization methods evaluation for the recognition of printed tifinaghe characters. In: Proceedings of the 1er Symposium International sur le Traitement Automatique de la Culture Amazighe. Agadir, Morocco, pp. 33–47 (2009)
Boulaknadel, S., Ataa allah, F.: Building a standard Amazigh corpus. In: Proceedings of the International Conference on Intelligent Human Computer Interaction, Prague, Tchec (2011)
Boulaknadel, S., Ataa Allah, F.: Online Amazigh concordancer. In: Proceedings of International Symposium on Image Video Communications and Mobile Networks, Rabat, Maroc (2010)
Ataa Allah, F., Boulaknadel, S.: Pseudo-racinisation de la langue amazighe. In: Proceeding of Traitement Automatique des Langues Naturelles, Montréal, Canada (2010)
Nejme, F., Boulaknadel, S., Aboutajdine, D.: Analyse Automatique de la Morphologie Nominale Amazighe. Actes de la conférence du Traitement Automatique du Langage Naturel (TALN), Les Sables d’Olonne, France (2013)
Nejme, F., Boulaknadel, S., Aboutajdine, D.: Finite state morphology for Amazigh language. In: Proceeding of International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Samos, Greece (2013)
Chinchor, N.A., Marsh, E.: Muc-7 information extraction task definition. In: Proceeding of the Seventh Message Understanding Conference (MUC-7), Appendices (1998)
Voorhees, E.M., Harman, D.K. (eds.): TREC: Experiment and Evaluation in Information Retrieval, vol. 1. MIT Press, Cambridge (2005)
Molla, D., Zaanen, M., Smith, D.: Named entity recognition for question answering. In: Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), pp. 51–58 (2006)
Babych, B., Hartley, A.: Improving machine translation quality with automatic named entity recognition. In: Proceedings of the 7th International EAMT Workshop on MT and Other Language Technology Tools, Improving MT Through Other Language Technology Tools: Resources and Tools for Building MT, pp. 1–8. Association for Computational Linguistics (2003)
Chen, Z., Ji, H.: Collaborative ranking: a case study on entity linking. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 771–781 (2011)
Darvinder, K., Gupta, V.: A survey of named entity recognition in English and other Indian languages. IJCSI Int. J. Comput. Sci. Iss. 7(6), 1694-0814 (2010)
Andoni, A., Montse, C., Seán, G.: NERC-fr: supervised named entity recognition for French. In: International Conference on Text, Speech, and Dialogue, pp. 158–165. Springer, Cham (2014)
Galicia-Haro, S.N., Gelbukh, A., Bolshakov, I.A.: Recognition of named entities in Spanish texts. In: MICAI 2004: Advances in Artificial Intelligence, pp. 420–429 (2004)
Bai, S., et al.: System for Chinese tokenization and named entity recognition. U.S. Patent No. 6,311,152, 30 Oct 2001
Sasano, R., Kurohashi, S.: Japanese named entity recognition using structural natural language processing. In: Proceedings of IJCNLP, pp. 607–612 (2008)
Doğan, K., Arici, N., Dilek, K.: Named entity recognition in Turkish: approaches and issues. In: International Conference on Applications of Natural Language to Information Systems, pp. 176–181. Springer, Cham (2017)
Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30(1), 3–26 (2007)
Talha, M., Boulaknadel, S., Aboutajdine, D.: NERAM: named entity recognition for Amazighe language. In: 21st International Conference of TALN, pp. 517–524. Aix Marseille University, Marseille (2014)
Boulaknadel, S., Talha, M., Aboutajdine, D.: Amazighe named entity recognition using a rule based approach. In: 11th ACS/IEEE International Conference on Computer Systems and Applications. Doha, Qatar (2014)
Talha, M., Boulaknadel, S., Aboutajdine, D.: L’apport d’une approche symbolique pour le repérage des entités nommées en langue amazighe. In: EGC, pp. 29–34, Luxembourg (2015)
Talha, M., Boulaknadel, S., Aboutajdine, D.: Development of Amazighe named entity recognition system using hybrid method. J. Res. Comput. Sci. 90, 151–161 (2015)
Küçük, D., Yazıcı, A.: Named entity recognition experiments on Turkish texts. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds.) Flexible Query Answering Systems. FQAS 2009. Lecture Notes in Computer Science, vol. 5822, pp. 524–535. Springer, Berlin, Heidelberg (2009)
Shaalan, K., Raza, H.: NERA: named entity recognition for Arabic. J. Am. Soc. Inform. Sci. Technol. 60(8), 1652–1663 (2009)
Sharnagat, R.: Named Entity Recognition: A Literature Survey (2014)
Bikel, D.M., Miller, S., Schwartz, R., Weischedel, R.: Nymble: a high-performance learning name-finder. In: Proceedings of the Fifth Conference on Applied Natural Language Processing, pp. 194–201. Association for Computational Linguistics (1997)
Zhou, G., Su, J.: Named entity recognition using an HMM-based chunk tagger. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 473–480. Association for Computational Linguistics (2002)
Zhou, G., Su, J.: Named entity recognition using an HMM-based chunk tagger. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 473–480. Association for Computational Linguistics (2002)
Borthwick, A., Sterling, J., Agichtein, E., Grishman, R.: NYU: description of the MENE named entity system as used in MUC-7. In: Proceedings of the Seventh Message Understanding Conference (MUC-7) (1998)
McCallum, A., Li, W.: Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003, vol. 4, pp. 188–191. Association for Computational Linguistics (2003)
Benajiba, Y.: Arabic named entity recognition. Ph.D. thesis, Techninal University of Valencia (2009)
Abdallah, S., Shaalan, K., Shoaib, M.: Integrating rule-based system with classification for Arabic named entity recognition. In: Gelbukh, A. (ed.) Computational Linguistics and Intelligent Text Processing. Lecture Notes in Computer Science, vol. 7181, pp. 311–322. Springer, Berlin, Heidelberg (2012)
Greenberg, J.: The Languages of Africa. The Hague (1966)
Ouakrim, O.: Fonética y fonología del Bereber. Survey at the University of Autònoma de Barcelona (1995)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995). ISBN 0-387-94559-8
Vapnik, V.: Statistical Learning Theory. Springer, New York (1998)
Cortes, C., Vapnik, V.: Support-vector networks. In: Machine Learning, pp. 273–297 (1995)
Hsu, C.W., Lin, C.J.: A comparison of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 13(2), 415–425 (2002)
Kreßel, U.H.G.: Pairwise classification and support vector machines. In: Advances in Kernel Methods, pp. 255–268. MIT Press (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Talha, M., Boulaknadel, S., Aboutajdine, D. (2019). Enhancing Performance of Hybrid Named Entity Recognition for Amazighe Language. In: Hassanien, A. (eds) Machine Learning Paradigms: Theory and Application. Studies in Computational Intelligence, vol 801. Springer, Cham. https://doi.org/10.1007/978-3-030-02357-7_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-02357-7_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02356-0
Online ISBN: 978-3-030-02357-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)