
SemApp: A Semantic Approach to Enhance Information Retrieval

  • Conference paper
Computational Science and Its Applications – ICCSA 2021 (ICCSA 2021)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 12951)


Abstract

This work proposes a semantic retrieval approach that addresses the semantic ambiguity of indexed terms and the uncertainty and imprecision inherent in the information retrieval process. The approach consists of three phases. In the first phase, the meaning of the query is discovered by formulating a set of candidate queries from its possible contexts. A score is calculated for each candidate based on its semantic tree and the inherent dispersion between its concepts; this score assesses the overall meaning of the candidate query. The phase ends by selecting the candidate with the highest score as the best representative of the original query. In the second phase, a semantic index is built by exploiting the classic and semantic characteristics of the document concepts, and each concept is assigned a weight that estimates its relative importance. The third phase proposes a ranking model that uses the semantic similarities and relations between concepts to calculate query-document relevance. This ranking model is based on a query likelihood language model and a conceptual weighting model. The approach was evaluated through performance comparisons with related benchmarks, measured in terms of standard IR performance metrics. It outperformed the compared baselines and improved the measured metrics. A statistical significance test was conducted to verify that the obtained improvements are true enhancements and not the result of random variation between the compared systems. The test supported the hypothesis that the improvements were significant.
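
The abstract names a query likelihood language model as one component of the ranking model, but this preview gives no formulas. For orientation only, the following is a minimal sketch of the textbook query likelihood scorer with Dirichlet smoothing. It is not the authors' model: the conceptual weighting and the semantic similarities between concepts are not represented, and the function names, the toy documents, and the smoothing parameter `mu` are all assumptions made for illustration.

```python
import math
from collections import Counter


def query_likelihood(query, doc, collection, mu=2000.0):
    """Return log P(query | doc) under a unigram model with Dirichlet smoothing.

    query and doc are lists of tokens; collection is a Counter of term
    frequencies over the whole corpus. mu is an assumed smoothing value.
    """
    doc_tf = Counter(doc)
    doc_len = len(doc)
    coll_len = sum(collection.values())
    score = 0.0
    for term in query:
        # Background probability of the term in the whole collection.
        p_coll = collection.get(term, 0) / coll_len
        # Dirichlet-smoothed probability of the term given the document.
        p = (doc_tf[term] + mu * p_coll) / (doc_len + mu)
        if p == 0.0:
            # The term occurs nowhere in the collection.
            return float("-inf")
        score += math.log(p)
    return score


def rank(query, docs, mu=2000.0):
    """Order document ids by decreasing query likelihood."""
    collection = Counter(t for tokens in docs.values() for t in tokens)
    scores = {
        doc_id: query_likelihood(query, tokens, collection, mu)
        for doc_id, tokens in docs.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical toy corpus, not taken from the paper.
docs = {
    "d1": ["semantic", "retrieval", "semantic"],
    "d2": ["image", "processing", "pipeline"],
}
ranking = rank(["semantic", "retrieval"], docs)
```

In this toy corpus `d1` contains both query terms and `d2` contains neither, so `d1` is ranked first. A concept-based variant in the spirit of the abstract would score concepts rather than raw tokens and would combine this likelihood with per-concept weights.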



Author information


Corresponding author

Correspondence to Sameh Neji.


Copyright information

© 2021 Springer Nature Switzerland AG

About this paper


Cite this paper

Neji, S., Chenaina, T., Shoeb, A.M., Ayed, L.B. (2021). SemApp: A Semantic Approach to Enhance Information Retrieval. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2021. ICCSA 2021. Lecture Notes in Computer Science, vol 12951. Springer, Cham. https://doi.org/10.1007/978-3-030-86970-0_6


  • DOI: https://doi.org/10.1007/978-3-030-86970-0_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-86969-4

  • Online ISBN: 978-3-030-86970-0

  • eBook Packages: Computer Science, Computer Science (R0)
