Abstract
In this paper, we report our work on a Finite State Transducer-based entity extractor, which applies named-entity extraction techniques to identify useful entities from prophetic narrations texts. A Finite State Transducer has been implemented in order to capture different types of named entities. For development and testing purposes, we collected a set of prophetic narrations texts from “Sahîh Al-Bukhari” corpus. Preliminary evaluation results demonstrated that our approach is feasible. Our system achieved encouraging precision and recall rates, the overall precision and recall are 71% and 39% respectively. Our future work includes conducting larger-scale evaluation studies and enhancing the system to capture named entities from chains of transmitters (Salasil Al-Assanid) and biographical texts of narrators (Tarajims).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abney, S.: Partial parsing via finite-state cascades. In: Workshop on Robust Parsing, 8th European Summer School in Logic, Language and Information, Prague, Czech Republic, pp. 8–15 (1996)
Ait-Mokhtar, S., Chanod, J.: Incremental finite state parsing. In: ANLP 1997 (1997)
Attia, M., Toral, A., Tounsi, L., Monachini, M., Genabith, J.V.: An automatically built Named Entity lexicon for Arabic, pp. 3614–3621 (2010)
Benajiba, Y., Rosso, P., BenedíRuiz, J.M.: ANERsys: An Arabic Named Entity Recognition System Based on Maximum Entropy. In: Gelbukh, A. (ed.) CICLing 2007. LNCS, vol. 4394, pp. 143–153. Springer, Heidelberg (2007)
Ben-Dov, M., Feldman, R.: Text Mining And Information Extraction. In: Data Mining And Mvowledge Discovery Handbook, ch. 38, pp. 801–831. Mdx University, London (2006)
Demiros, I., Boutsis, S., Giouli, V., Liakata, M., Papageorgiou, H., Piperidis, S.: Named Entity Recognition in Greek Texts (2000)
Gala-Pavia, N.: Using the Incremental Finite-State Architecture to create a Spanish Shallow Parser. In: Proceedings of XV Congres of SEPLN, Lleida, Spain (1999)
Grishman, R.: The NYU system for MUC-6 or where’s the syntax. In: Proceedings of Sixth Message Understanding Conference (1995)
Elsebai, A., Meziane, F., Belkredim, F.Z.: A Rule Based Persons Names Arabic Extraction System. Communications of the IBIMA 11, 53–59 (2009)
Harrag, F.: A text mining approach based on topic classification and segmentation Application to the corpus of Prophetic Traditions (Hadith), PhD thesis, Computer Science Dept., Faculty of Sciences, Farhat Abbas University, Setif, Algeria (2011)
Hobbs, J.R., Appelt, D.E., Bear, J., Israel, D., Kameyama, M., Stickel, M., Tyson, M.: FASTUS: A cascaded finite-state transducer for extracting information from natural-language text. In: Finite-State Devices for Natural Language Processing. MIT Press, Cambridge (1996)
Kokkinakis, D., Johansson-Kokkinakis, S.: A Cascaded Finite-State Parser for Syntactic Analysis of Swedish. In: Proceedings of the 9th EACL, Bergen, Norway (1999)
Neumann, G., Backofen, R., Baur, J., Becker, M., Braun, C.: An information extraction core system for real world German text processing. In: ACL (1997)
Padro, M., Padro, L.: A Named Entity Recognition System based on a Finite Automata Acquisition Algorithm. TALP Research Center, Universitat Politecnica de Catalunya (2005)
Pazienza, M.T. (ed.): SCIE 1997. LNCS (LNAI), vol. 1299. Springer, Heidelberg (1997)
Shaalan, K., Raza, H.: Arabic Named Entity Recognition from Diverse Text Types. In: Nordström, B., Ranta, A. (eds.) GoTAL 2008. LNCS (LNAI), vol. 5221, pp. 440–451. Springer, Heidelberg (2008)
Traboulsi, H.: Arabic Named Entity Extraction: A Local Grammar-Based Approach. In: Proceedings of the International Multiconference on Computer Science and Information Technology, vol. 4, pp. 139–143 (2009)
Wikipedia, Sahih Bukhari, http://fr.wikipedia.org/wiki/Sahih_al-Bukhari (last Visited May 30, 2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Harrag, F., El-Qawasmeh, E., Salman Al-Salman, A.M. (2011). Extracting Named Entities from Prophetic Narration Texts (Hadith). In: Zain, J.M., Wan Mohd, W.M.b., El-Qawasmeh, E. (eds) Software Engineering and Computer Systems. ICSECS 2011. Communications in Computer and Information Science, vol 180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22191-0_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-22191-0_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22190-3
Online ISBN: 978-3-642-22191-0
eBook Packages: Computer ScienceComputer Science (R0)