Abstract
The present work proposed a semantic retrieval approach to treat the issues of semantic ambiguity of indexed terms, the uncertainty, and imprecision that is inherent in the information retrieval process. The proposed approach constitutes of three different phases. The query meaning was discovered in the first phase by formulating a set of candidate queries from possible contexts. A score for each alternative was calculated based on its semantic tree and inherent dispersion between its concepts. This score assesses the overall meaning of the alternative query. This phase was finished by selecting the candidate query that attains the highest score to be the best representative to the original query. A semantic index was built in the second phase exploiting the classic and semantic characteristics of the document concepts to finally assign a weight for each concept to estimate its relative importance. The third phase proposed a ranking model that utilizes the semantic similarities and relations between concepts to calculate the query-document relevance. This ranking model is based on a query likelihood language model and a conceptual weighting model. The validity of the proposed approach was evaluated through performance comparisons with the related benchmarks measured in terms of the standard IR performance metrics. The proposed approach outperformed the compared baselines and improved the measured metrics. A statistical significance test was conducted to guarantee that the obtained improvements are true enhancements and are not a cause of random variation of the compared systems. The statistical test supported the hypothesis that the obtained improvements were significant.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Nidelkou, E., Papastathis, V., Papadogiorgaki, M.: User Profile Modeling and Learning. in Encyclopedia of Information Science and Technology, 2nd edn, pp. 3934–3939. IGI Global, Hershey (2009)
Yibing, S., Qinglong, M.: Research of literature information retrieval method based on ontology. In: 2014 International Conference on Multisensor Fusion and Information Integration for Intelligent Systems (MFI), pp. 1–6 (2014)
Kumar, S., Singh, M., De, A.: OWL-based ontology indexing and retrieving algorithms for semantic search engine. In: 2012 7th International Conference on Computing and Convergence Technology (ICCCT), pp. 1135–1140 (2012)
Chauhan, R., Goudar, R., Sharma, R., Chauhan, A.: Domain ontology based semantic search for efficient information retrieval through automatic query expansion. In: 2013 International Conference on Intelligent Systems and Signal Processing (ISSP), pp. 397–402 (2013)
Ali, A., Bari, P., Ahmad, I.: Concept-based information retrieval approaches on the web: a brief survey. IJAIR 3, 14–18 (2011)
Zhou, D., Lawless, S., Liu, J., Zhang, S., Xu, Y.: Query expansion for personalized cross-language information retrieval. In: 2015 10th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP), pp. 1–5 (2015)
Carpineto, C., Romano, G.: A survey of automatic query expansion in information retrieval. ACM Comput. Surv. 44, 1–50 (2012)
Segura, A., Vidal-Castro, C., Ferreira-Satler, M., Salvador-Sánchez: Domain ontology-based query expansion: relationships types-centered analysis using gene ontology. In: Castillo, L., Cristancho, M., Isaza, G., Pinzón, A., Rodríguez, J. (eds.) Advances in Computational Biology. AISC, vol. 232, pp. 183–188. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-01568-2_27
Laatar, R., Aloulou, C., Belguith, L.H.: Disambiguating Arabic words according to their historical appearance in the document based on recurrent neural networks. ACM Trans. Asian Low-Resource Lang. Inf. Process, 19, 1–16 (2020)
Boughareb, D., Farah, N.: A query expansion approach using the context of the search. In: van Berlo, A., Hallenborg, K., Rodríguez, J., Tapia, D., Novais, P. (eds.) Ambient Intelligence - Software and Applications. AISC, vol. 219, pp. 57–63. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-319-00566-9_8
Kumar, N., Carterette, B.: Time based feedback and query expansion for twitter search. In: Serdyukov, P., et al. (eds.) ECIR 2013. LNCS, vol. 7814, pp. 734–737. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-36973-5_72
Nagaraj, R., Thiagarasu, V., Vijayakumar, P.: A novel semantic level text classification by combining NLP and thesaurus concepts. IOSR J. Comput. Eng. 16, 14–26 (2014)
Luo, Q., Chen, E., Xiong, H.: A semantic term weighting scheme for text categorization. Expert Syst. Appl. 38, 12708–12716 (2011)
Boubekeur, F., Azzoug, W., Chiout, S., Boughanem, M.: Indexation sémantique de documents textuels. In: 14e Colloque International sur le Document Electronique (CIDE14), Rabat, Maroc (2011)
Boughanem, M., Mallak, I., Prade, H.: A new factor for computing the relevance of a document to a query. In: International Conference on Fuzzy Systems, pp. 1–6 (2010)
Boubekeur, F., Boughanem, M., Tamine, L., Daoud, M.: Using WordNet for concept-based document indexing in information retrieval. In: Fourth International Conference on Semantic Processing (SEMAPRO), Florence, Italy (2010)
Al-Zoghby, A.: A new semantic distance measure for the VSM-based information retrieval systems. In: Shaalan, K., Hassanien, A.E., Tolba, F. (eds.) Intelligent Natural Language Processing: Trends and Applications. SCI, vol. 740, pp. 229–250. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-67056-0_12
Luo, C., Liu, Y., Zhang, M., Ma, S.: Query ambiguity identification based on user behavior information. In: Jaafar, A., et al. (eds.) AIRS 2014. LNCS, vol. 8870, pp. 36–47. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-12844-3_4
Hawalah, A., Fasli, M.: Dynamic user profiles for web personalisation. Expert Syst. Appl. 42, 2547–2569 (2015)
Safi, H., Jaoua, M., Belguith, L.H.: Intégration du profil utilisateur basé sur les ontologies dans la reformulation des requêtes Arabes. In: ACTES DU COLLOQUE, p. 40 (2015)
Codocedo, V., Lykourentzou, I., Napoli, A.: A semantic approach to concept lattice-based information retrieval. Ann. Math. Artif. Intell. 72(1–2), 169–195 (2014). https://doi.org/10.1007/s10472-014-9403-0
Formica, A.: Semantic web search based on rough sets and fuzzy formal concept analysis. Knowl.-Based Syst. 26, 40–47 (2012)
Poelmans, J., Ignatov, D.I., Kuznetsov, S.O., Dedene, G.: Fuzzy and rough formal concept analysis: a survey. Int. J. Gen. Syst. 43, 105–134 (2014)
Kumar, C.A., Mouliswaran, S.C., Amriteya, P., Arun, S.R.: Fuzzy formal concept analysis approach for information retrieval. In: Ravi, V., Panigrahi, B., Das, S., Suganthan, P. (eds.) FANCCO - 2015. AISC, vol. 415, pp. 255–271. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-27212-2_20
Raj, T.F.M., Ravichandran, K.S.: A novel approach for intelligent information retrieval in semantic web using ontology. World Appl. Sci. J. 29, 149–154 (2014)
Martín, A., León, C., López, A.: Enhancing semantic interoperability in digital library by applying intelligent techniques. In: 2015 SAI Intelligent Systems Conference (IntelliSys), pp. 904–911 (2015)
Damaševičius, R.: Automatic generation of concept taxonomies from web search data using support vector machine. In: Proceedings of 5th International Conference on Web Information Systems and Technologies, pp. 673–680 (2009)
Damaševičius, R.: Automatic generation of part-whole hierarchies for domain ontologies using web search data. In: 32nd International Convention Proceedings: Computers in Technical Systems and Intelligent Systems, vol. 3, pp. 215–220 (2009)
Tu, X., He, T., Chen, L., Luo, J., Zhang, M.: Wikipedia-based semantic smoothing for the language modeling approach to information retrieval. In: Gurrin, C., et al. (eds.) ECIR 2010. LNCS, vol. 5993, pp. 370–381. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-12275-0_33
Chenaina, T., Neji, S., Shoeb, A.: Query sense discovery approach to realize the user search intent. Int. J. Inf. Retr. Res. 12 (2022)
Neji, S., Jemni Ben Ayed, L., Chenaina, T., Shoeb, A.: A novel conceptual weighting model for semantic information retrieval. 07-Inf. Sci. Lett. 10, (2021)
Neji, S., Chenaina, T., Shoeb, A., Ben Ayed, L.: HyRa: an effective hybrid ranking model. In: 25th International Conference on Knowledge Based and Intelligent information and Engineering Systems, vol. 10 (2021)
Neji, S., Chenaina, T., Shoeb, A., Ben Ayed, L.: HIR: a hybrid IR ranking model. In: International Computer Software and Applications Conference (2021)
Billami, M.B., Gala, N.: Approches d’analyse distributionnelle pour améliorer la désambiguïsation sémantique. Journées internationales d’Analyse statistique des Données Textuelles (JADT) (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Neji, S., Chenaina, T., Shoeb, A.M., Ayed, L.B. (2021). SemApp: A Semantic Approach to Enhance Information Retrieval. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2021. ICCSA 2021. Lecture Notes in Computer Science(), vol 12951. Springer, Cham. https://doi.org/10.1007/978-3-030-86970-0_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-86970-0_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86969-4
Online ISBN: 978-3-030-86970-0
eBook Packages: Computer ScienceComputer Science (R0)