research-article

Rule Based Question Generation for Arabic Text: Question Answering System

Authors:
Samah Ali Alazani

Dept. of Computer Science &IT, Dr. Babasaheb Ambedkar Marathawada University, Aurangabad, Maharashtra, India

Dept. of Computer Science &IT, Dr. Babasaheb Ambedkar Marathawada University, Aurangabad, Maharashtra, India
View Profile

,
C. Namarta Mahender

Dept. of Computer Science &IT, Dr. Babasaheb Ambedkar Marathawada University, Aurangabad, Maharashtra, India

Dept. of Computer Science &IT, Dr. Babasaheb Ambedkar Marathawada University, Aurangabad, Maharashtra, India
View Profile

DSMLAI '21': Proceedings of the International Conference on Data Science, Machine Learning and Artificial IntelligenceAugust 2021Pages 7–12https://doi.org/10.1145/3484824.3484882

Published:13 January 2022Publication History

DSMLAI '21': Proceedings of the International Conference on Data Science, Machine Learning and Artificial Intelligence

Pages 7–12

ABSTRACT

Question answering systems have evolved into a whole new domain. Question answering systems are of emense importance in the present scenario as well as for many purposes such as humanlike conversation, machine translation, summarization, etc. For the present work, the data is collected from the Arabic language textbook of the 5th standard of Yemen. The text is pre-processed for removing punctuation marks, stop words. The paragraphs are segmented first sentence-level and then word level. For the generation of questions, a rule based approach is used, which had a pre-requirement of properly tagged words. Thus pos tagger and NER relevant to the present domain is also developed considering linguistic aspect and rule based approach. A clear explanation of how wh-type questions are developed is given in detail. The main focus on which the work is done is types of nouns. The types of nouns have been used to generate wh-questions from the Arabic text.

References

Albared, M., Omar, N., & Ab Aziz, M. (2011). Developing competitive HMM Arabic POS tagger using small training corpora. Asian Conference on Intelligent Information and Database Systems. pp.288--296.Google ScholarCross Ref
Aliwy1, A., & Al_Raza2, D. (2018). part of speech Tagging in Arabic long sentence. International Journal of Engineering & Technology, 7 (3.27) 125--128.Google Scholar
Attia, M., & Rashwan M. (2004). A large-scale Arabic POS tagger based on a compact Arabic POS tags set, and application on the statistical inference of syntactic diacritics of Arabic text words. Proceedings of the Arabic Language Technologies and Resources Int'l Conference.Google Scholar
Zirikly, A., & Diab, M. Named Entity Recognition for Dialectal Arabic. (2014). Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP). pages 78--86.Google Scholar
Bert, F., Green, J., Alice, K., Carol C., & Kenneth L. Baseball: An Automatic Question-Answerer. (1961). In Proceedings of Western Computing Conference, Vol. 19. pp. 219--224.Google Scholar
David N., Peter T., & Stan M. (2006). Unsupervised named-entity recognition: Generating gazetteers and resolving ambiguity. Conference of the Canadian Society for Computational Studies of Intelligence. pp 266--277.Google Scholar
Deepali, K., Gaikwad, C., & Namrata, M., (2018). Question Generation System for Marathi Text. International Journal of Scientific Research in Computer Science, Engineering and Information Technology. Volume 3. Issue 3.Google Scholar
Fadl D., Ameur, T., & Hassan, M. First Order Hidden Markov Model for Automatic Arabic Name Entity Recognition. (2015). International Journal of Computer Applications. (0975 - 8887). Volume 123 - No. 7.Google Scholar
Frank, A., Krieger, Hans-Ulrich., Xu, Feiyu., Uszkoreit, Hans., Crysmann, Berthold., Jörg, Brigitte. and Ulrich, S., (2007). Question answering from structured knowledge sources. Journal of Applied Logic 5. 20 - 48. DOI:10.1016/j.jal.2005.12.006. Elsevier.Google ScholarCross Ref
Gaebel, M., Kupriyanova, V., Morais, R., Colucci, E. (2014). E-learning in European higher education institutions: Results of a mapping survey conducted in October-December 2013. Tech. rep.: EuropeanUniversity Association.Google Scholar
Goldbach, R., & Hamza-Lup, F. (2017). Survey on e-learning implementation in Eastern-Europespotlight on Romania. In: the Ninth International Conference on Mobile, Hybrid, and On-LineLearning.Google Scholar
Kamaldeep, K., and Vishal, G. (2012). Name Entity Recognition for Punjabi Language, IRACST - International Journal of Computer Science and Information Technology & Security (IJCSITS), Vol. 2, No.3.Google Scholar
Poonam G., and Vishal G. Survey of Text Question Answering Techniques. (2012). International Journal of Computer Applications (0975-8887) Volume 53-No.4.Google Scholar
Qayyum, A., & Zawacki-Richter, O. (2018). Distance education in Australia, Europe and the Americas. Springer, Berlin.Google ScholarCross Ref
Ray, s., and Shaalan, K. (2016). A Review and Future Perspectives of Arabic Question Answering Systems. IEEE Transactions on Knowledge and Data Engineering PP(99):1--1Google ScholarDigital Library
Samir AbdelRahman, Mohamed Elarnaoty, Marwa Magdy and Aly Fahmy, "Integrated Machine Learning Techniques for Arabic Named Entity Recognition", IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 4, No 3, July 2010.Google Scholar
Suhad al-shoukry and nazlia omar, "Arabic named entity recognition for crime documents using classifiers combination", International Review on Computers and Software (2015).Google Scholar
Kamaldeep Kaur and Vishal Gupta, "Name Entity Recognition for Punjabi Language," IRACST - International Journal of Computer Science and Information Technology & Security (IJCSITS), ISSN: 2249--9555.Vol. 2, No.3, June 2012.Google Scholar
Rosso, Paolo & Benajiba, Yassine & Lyhyaoui, Abdelouahid. (2006). Towards an Arabic Question Answering system.Google Scholar
Thalheimer, W. (2003). The learning benefits of questions. Tech. rep., Work Learning Research. http://www.learningadvantage.co.za/pdfs/questionmark/LearningBenefitsOfQuestions.pdf.Google Scholar

Index Terms

Rule Based Question Generation for Arabic Text: Question Answering System

Recommendations

A novel Arabic lemmatization algorithm
AND '08: Proceedings of the second workshop on Analytics for noisy unstructured text data

Tokenization is a fundamental step in processing textual data preceding the tasks of information retrieval, text mining, and natural language processing. Tokenization is a language-dependent approach, including normalization, stop words removal, ...
Read More
Toward enhanced Arabic speech recognition using part of speech tagging

One major source of suboptimal performance in automatic continuous speech recognition systems is misrecognition of small words. In general, errors resulting from small words are much more than errors resulting from long words. Therefore, compounding ...
Read More
Developing and performance evaluation of a new Arabic heavy/light stemmer
BDCA'17: Proceedings of the 2nd international Conference on Big Data, Cloud and Applications

Stemming is the main step used for handling the morphologically rich languages such as Arabic. It is usually used in several fields such as Natural Language Processing, Information Retrieval (IR), and Text Mining. The goal of stemming is reducing ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
DSMLAI '21': Proceedings of the International Conference on Data Science, Machine Learning and Artificial Intelligence
August 2021
415 pages
ISBN:9781450387637
DOI:10.1145/3484824
Editors:
Dharm Singh Jat
Namibia University of Science and Technology
,
Colin Stanley
Namibia University of Science and Technology
,
José Quenum
Namibia University of Science and Technology
,
Nilanjan Dey
JIS University, Kolkata
,
Arpit Jain
Namibia University of Science and Technology
Copyright © 2021 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 January 2022
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
NER
Part of Speech Tagging
Question Answering
Stemmer
Tokenization
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 69
  Total Downloads
- Downloads (Last 12 months)18
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Rule Based Question Generation for Arabic Text: Question Answering System

DSMLAI '21': Proceedings of the International Conference on Data Science, Machine Learning and Artificial Intelligence

ABSTRACT

References

Cited By

Index Terms

Recommendations

A novel Arabic lemmatization algorithm

Toward enhanced Arabic speech recognition using part of speech tagging

Developing and performance evaluation of a new Arabic heavy/light stemmer

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Rule Based Question Generation for Arabic Text: Question Answering System

DSMLAI '21': Proceedings of the International Conference on Data Science, Machine Learning and Artificial Intelligence

ABSTRACT

References

Cited By

Index Terms

Recommendations

A novel Arabic lemmatization algorithm

Toward enhanced Arabic speech recognition using part of speech tagging

Developing and performance evaluation of a new Arabic heavy/light stemmer

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media