Abstract
Traditional Information Retrieval systems (keyword-based search) suffer several problems. For instance, synonyms or hyponym are not taken into consideration when retrieving documents that are important for a user’s query. This study adopts an ontology of computer science and proposes an ontology indexing weight based on Wu and Palmer’s edge counting measure for solving this problem. This paper used the N-grams method for computing a family of word similarity. The study also compares the subsumption weight between Hliaoutakis and Nicola’s weight and query keywords (Decision Making, Genetic Algorithm, Machine Learning, Heuristic). A probability value (p-values) from the t-test (p = 0.105) is higher 0.05 and indicates no significant evidence, of not differences between both weights methods. The experimental results show that the document similarity score between a user’s query and the paper suggests that the new measures were effectively ranked.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lai, L.-F., Wu, C.-C., Lin, P.-Y.: Developing a Fuzzy Search Engine Based on Fuzzy Ontology and Semantic Search. In: IEEE International Conference on Fuzzy, pp. 2684–2689. IEEE Press, Taipei (2011)
Hliaoutakis, A., Varelas, G., Voutsakis, E., Petrakis, E.G.M., Milios, E.: Information Retrieval by Semantic Similarity. International Journal on Semantic Web and Information Systems (IJSWIS) 2(3) (2006)
Varelas, G., Voutsakis, E., Raftopoulou, P., Petrakis, E.G.M., Milios, E.: Semantic Similarity Methods in Wordnet and their Application to Information Retrieval on the web. In: ACM International Workshop on Web Information and Data Management, pp. 10–130. ACM, Bremen (2005)
Shenoy, K.M., Shet, K.C., Acharya, U.D.: A New Similarity Measure for Taxonomy based on Edge Counting. International Journal of Web & Semantic Technology (JJWesT) 3(4), 23–30 (2012)
Schwering, A., Kuhn, W.: A Hybrid Semantic Similarity Measure for Spatial Information Retrieval. An Interdisciplinary Journal of Spatial Cognition & Computation 9(1), 30–63 (2009)
Fernandez, M., Cantador, I., Lopez, V., Vallet, D., Castells, P., Motta, E.: Semantically enhanced Information Retrieval: An ontology-based approach. Journal of Web Semantics: Science, Services and Agents on the World Wide Web 9, 434–452 (2010)
Weng, S.-S., Tsai, H.-J., Hsu, C.-H.: Ontology construction for information classification. Journal of Expert Systems with Applications 31(1), 1–12 (2006)
John, T.: What is Semantic Search and how it works with Google search, http://www.techulator.com/resources/5933-What-Semantic-Search.aspx
Batet, M., Sanchez, D., Valls, A.: An Ontology-based measure to compute semantic similarity in biomedicine. Journal of Biomedical Informatics 44, 118–125 (2011)
Wu, Z., Palmer, M.: Verb semantics and lexical selection. In: Proceeding of the 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, vol. 13, pp. 133–138 (1994)
Kondrak, G.: N-Gram Similarity and Distance. In: Consens, M.P., Navarro, G. (eds.) SPIRE 2005. LNCS, vol. 3772, pp. 115–126. Springer, Heidelberg (2005)
Sembok, T.M., Bakar, Z.A.: Effectiveness of Stemming and N-grams String Similarity Matching on Malay Documents. International Journal of Applied Mathematics and Informatics 5(3), 208–215 (2011)
Stoke, N.: Applications of Lexical Cohesion Analysis in the Topic Detection and Tracking Domain. A thesis submitted for the degree of Doctor of Philosophy in Computer Science Department of Computer Science Faculty of Science National University of Ireland, Dublin (2004)
Watthananon, J., Mingkhwan, A.: A Comparative Efficiency of Correlation Plot Data Classification. The Journal of KMUTNB 22(1) (2012)
Lertmahakrit, W., Mingkhoan, A.: The Innovation of Multiple Relations Information Retrieval. The Journal of KMUTNB 20(3) (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Boonyoung, T., Mingkhwan, A. (2014). Semantic Search Using Computer Science Ontology Based on Edge Counting and N-Grams. In: Boonkrong, S., Unger, H., Meesad, P. (eds) Recent Advances in Information and Communication Technology. Advances in Intelligent Systems and Computing, vol 265. Springer, Cham. https://doi.org/10.1007/978-3-319-06538-0_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-06538-0_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06537-3
Online ISBN: 978-3-319-06538-0
eBook Packages: EngineeringEngineering (R0)