
Towards a New Standard Arabic Test Collection for Mono- and Cross-Language Information Retrieval

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2014)

Abstract

We propose a new standard Arabic test collection for mono- and cross-language Information Retrieval (CLIR). To build it, we exploit the “Hadith” texts and provide a portal for sampling and evaluating Hadiths’ results listed in both Arabic and English versions. The new collection, called “Kunuz”, is intended to promote and restart the development of Arabic monolingual retrieval and CLIR systems, which has been stalled since the TREC-2001 and TREC-2002 editions.




Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Ben Khiroun, O., Ayed, R., Elayeb, B., Bounhas, I., Ben Saoud, N.B., Evrard, F. (2014). Towards a New Standard Arabic Test Collection for Mono- and Cross-Language Information Retrieval. In: Métais, E., Roche, M., Teisseire, M. (eds) Natural Language Processing and Information Systems. NLDB 2014. Lecture Notes in Computer Science, vol 8455. Springer, Cham. https://doi.org/10.1007/978-3-319-07983-7_23


  • DOI: https://doi.org/10.1007/978-3-319-07983-7_23

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07982-0

  • Online ISBN: 978-3-319-07983-7

  • eBook Packages: Computer Science, Computer Science (R0)
