ABSTRACT
Question answering systems have evolved into a whole new domain. Question answering systems are of emense importance in the present scenario as well as for many purposes such as humanlike conversation, machine translation, summarization, etc. For the present work, the data is collected from the Arabic language textbook of the 5th standard of Yemen. The text is pre-processed for removing punctuation marks, stop words. The paragraphs are segmented first sentence-level and then word level. For the generation of questions, a rule based approach is used, which had a pre-requirement of properly tagged words. Thus pos tagger and NER relevant to the present domain is also developed considering linguistic aspect and rule based approach. A clear explanation of how wh-type questions are developed is given in detail. The main focus on which the work is done is types of nouns. The types of nouns have been used to generate wh-questions from the Arabic text.
- Albared, M., Omar, N., & Ab Aziz, M. (2011). Developing competitive HMM Arabic POS tagger using small training corpora. Asian Conference on Intelligent Information and Database Systems. pp.288--296.Google ScholarCross Ref
- Aliwy1, A., & Al_Raza2, D. (2018). part of speech Tagging in Arabic long sentence. International Journal of Engineering & Technology, 7 (3.27) 125--128.Google Scholar
- Attia, M., & Rashwan M. (2004). A large-scale Arabic POS tagger based on a compact Arabic POS tags set, and application on the statistical inference of syntactic diacritics of Arabic text words. Proceedings of the Arabic Language Technologies and Resources Int'l Conference.Google Scholar
- Zirikly, A., & Diab, M. Named Entity Recognition for Dialectal Arabic. (2014). Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP). pages 78--86.Google Scholar
- Bert, F., Green, J., Alice, K., Carol C., & Kenneth L. Baseball: An Automatic Question-Answerer. (1961). In Proceedings of Western Computing Conference, Vol. 19. pp. 219--224.Google Scholar
- David N., Peter T., & Stan M. (2006). Unsupervised named-entity recognition: Generating gazetteers and resolving ambiguity. Conference of the Canadian Society for Computational Studies of Intelligence. pp 266--277.Google Scholar
- Deepali, K., Gaikwad, C., & Namrata, M., (2018). Question Generation System for Marathi Text. International Journal of Scientific Research in Computer Science, Engineering and Information Technology. Volume 3. Issue 3.Google Scholar
- Fadl D., Ameur, T., & Hassan, M. First Order Hidden Markov Model for Automatic Arabic Name Entity Recognition. (2015). International Journal of Computer Applications. (0975 - 8887). Volume 123 - No. 7.Google Scholar
- Frank, A., Krieger, Hans-Ulrich., Xu, Feiyu., Uszkoreit, Hans., Crysmann, Berthold., Jörg, Brigitte. and Ulrich, S., (2007). Question answering from structured knowledge sources. Journal of Applied Logic 5. 20 - 48. DOI:10.1016/j.jal.2005.12.006. Elsevier.Google ScholarCross Ref
- Gaebel, M., Kupriyanova, V., Morais, R., Colucci, E. (2014). E-learning in European higher education institutions: Results of a mapping survey conducted in October-December 2013. Tech. rep.: EuropeanUniversity Association.Google Scholar
- Goldbach, R., & Hamza-Lup, F. (2017). Survey on e-learning implementation in Eastern-Europespotlight on Romania. In: the Ninth International Conference on Mobile, Hybrid, and On-LineLearning.Google Scholar
- Kamaldeep, K., and Vishal, G. (2012). Name Entity Recognition for Punjabi Language, IRACST - International Journal of Computer Science and Information Technology & Security (IJCSITS), Vol. 2, No.3.Google Scholar
- Poonam G., and Vishal G. Survey of Text Question Answering Techniques. (2012). International Journal of Computer Applications (0975-8887) Volume 53-No.4.Google Scholar
- Qayyum, A., & Zawacki-Richter, O. (2018). Distance education in Australia, Europe and the Americas. Springer, Berlin.Google ScholarCross Ref
- Ray, s., and Shaalan, K. (2016). A Review and Future Perspectives of Arabic Question Answering Systems. IEEE Transactions on Knowledge and Data Engineering PP(99):1--1Google ScholarDigital Library
- Samir AbdelRahman, Mohamed Elarnaoty, Marwa Magdy and Aly Fahmy, "Integrated Machine Learning Techniques for Arabic Named Entity Recognition", IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 4, No 3, July 2010.Google Scholar
- Suhad al-shoukry and nazlia omar, "Arabic named entity recognition for crime documents using classifiers combination", International Review on Computers and Software (2015).Google Scholar
- Kamaldeep Kaur and Vishal Gupta, "Name Entity Recognition for Punjabi Language," IRACST - International Journal of Computer Science and Information Technology & Security (IJCSITS), ISSN: 2249--9555.Vol. 2, No.3, June 2012.Google Scholar
- Rosso, Paolo & Benajiba, Yassine & Lyhyaoui, Abdelouahid. (2006). Towards an Arabic Question Answering system.Google Scholar
- Thalheimer, W. (2003). The learning benefits of questions. Tech. rep., Work Learning Research. http://www.learningadvantage.co.za/pdfs/questionmark/LearningBenefitsOfQuestions.pdf.Google Scholar
Index Terms
- Rule Based Question Generation for Arabic Text: Question Answering System
Recommendations
A novel Arabic lemmatization algorithm
AND '08: Proceedings of the second workshop on Analytics for noisy unstructured text dataTokenization is a fundamental step in processing textual data preceding the tasks of information retrieval, text mining, and natural language processing. Tokenization is a language-dependent approach, including normalization, stop words removal, ...
Toward enhanced Arabic speech recognition using part of speech tagging
One major source of suboptimal performance in automatic continuous speech recognition systems is misrecognition of small words. In general, errors resulting from small words are much more than errors resulting from long words. Therefore, compounding ...
Developing and performance evaluation of a new Arabic heavy/light stemmer
BDCA'17: Proceedings of the 2nd international Conference on Big Data, Cloud and ApplicationsStemming is the main step used for handling the morphologically rich languages such as Arabic. It is usually used in several fields such as Natural Language Processing, Information Retrieval (IR), and Text Mining. The goal of stemming is reducing ...
Comments