Abstract
The importance of web document ranking algorithms for search engines keeps increasing along with the growth of web. Traditional ranking algorithms rank web documents according to hyperlinks in them. These algorithms are prone to problem of topic drifting, and it is difficult to ensure the accuracy of the sorting result. In recent years, researchers began to study the semantic understanding of web content. Methods such as ontology based ranking were proposed to sort web documents better. In this paper, the existing methods of ontology construction and web document ranking algorithm are comprehensively reviewed. Then a multi-domain ontologies construction method for massive unstructured text, singular value decomposition ontology construct (SVDOC) is proposed. Based on multi-domain ontology, a ranking algorithm that can exploit the semantic meaning of the text, multiply domain ontology PageRank (MDOPR) is also presented. We use SVDOC in the experiments to construct multi-domain ontologies in the field of shipping industry (ship, port, route and other related fields). The results show that SVDOC can effectively construct multi-domain ontologies, and MDOPR has higher accuracy and feasibility than other ranking methods.
Similar content being viewed by others
References
Aleman-Meza B, Arpinar IB, Nural MV, Sheth AP (2010) Ranking documents semantically using ontological relationships. In: Semantic computing (ICSC), 2010 IEEE fourth international conference, pp 299–304
Bouramoul A, Kholladi MK, Doan BL (2011) How ontology can be used to improve semantic information retrieval: the AnimSe finder tool. Int J Comput Appl (IJCA) ISSN 97:5–8887
Bouramoul A, Kholladi MK, Doan BL (2012) An ontology-based approach for semantics ranking of the web search engines results. Int Conf Multimed Comput Syst 248:797–802
Du Y, Hai Y (2013) Semantic ranking of web documents based on formal concept analysis. J Syst Softw 86(1):187–197
Duhan N, Sharma AK, Bhatia KK (2009) Page ranking algorithms: a survey. In: Advance computing conference, 2009. IACC 2009. IEEE international, pp 1530–1537
Fernández-López M, Gómez-Pérez A, Juristo N (1997) Methontology: from ontological art towards ontological engineering. In: Proceedings of the AAAI97 spring symposium series on ontological engineering, Stanford, USA, pp 33–40
Fu Z, Huang F, Ren K, Weng J, Wang C (2017a) Privacy-preserving smart semantic search based on conceptual graphs over encrypted outsourced data. IEEE Trans Inf Forensics Secur 12(8):1874–1884
Fu Z, Wu X, Guan C, Sun X, Ren K (2017b) Toward efficient multi-keyword fuzzy search over encrypted outsourced data with accuracy improvement. IEEE Trans Inf Forensics Secur 11(12):2706–2716
Grüninger M, Fox MS (1995) Methodology for the design and evaluation of ontologies
Haveliwala T (1999) Efficient computation of PageRank. Stanford Db Group Technical Report
Hirankitti V, Mai TX (2012) A meta-logical approach for reasoning with an OWL 2 ontology. In: IEEE Rivf international conference on computing and communication technologies, research, innovation, and vision for the future vol.3, pp 1–6
Islam MN, Najmul Islam AKM (2016) Ontology mapping and semantics of web interface signs. Hum Cent Comput Inf Sci 6(1):20
Jindal V, Bawa S, Batra S (2014) A review of ranking approaches for semantic search on Web. Inf Process Manag 50(2):416–425
Kayed A, El-Qawasmeh E, Qawaqneh Z (2010) Ranking web sites using domain ontology concepts. Inf Manag 47(7):350–355
Lee J, Min JK, Oh A, Chung CW (2014) Effective ranking and search techniques for Web resources considering semantic relationships. Inf Process Manag 50(1):132–155
Mcguinness DL, Harmelen F (2004) Owl web ontology language overview. Febr 63(45):990–996
Mingjun X, Liusheng H, Yonglong L (2006) SHITS: a Web page sorting method based on hyperlink and content. Small Microcomput Syst 27(12):2177–2182
Noy NF, Mcguinness DL (2001) Ontology development 101: a guide to creating your first ontology. And Stanford Medical Informatics
Nyein SS (2011) Mining contents in Web page using cosine similarity. In: Computer research and development (ICCRD) 2011 3rd international conference on vol 2, pp 472–475
Page L (1999) The PageRank citation ranking: bringing order to the web, online manuscript. Stanf Digit Libr Work Pap 9.1:1–14
Qu C, Liu F, Tao M, Deng D (2016) An owl-s based specification model of dynamic entity services for internet of things. J Ambient Intell Hum Comput 7(1):73–82
Roman Y, Shtykh, Jin Qun (2011) A human-centric integrated approach to web information search and sharing. Hum Centr Comput Inf Sci 1(2):2
Schreiber G, Wielinga B, Jansweijer W (1995) The KACTUS view on the ‘O’word. IJCAI workshop on basic ontological issues in knowledge sharing, pp 159–168
Sharma DK, Sharma AK (2010) A comparative analysis of Web document ranking algorithms. Int J Comput Sci Eng 2(08):2670–2676
Toti D, Longhi A (2017) Semanto: a graphical ontology management system for knowledge discovery. J Ambient Intell Hum Comput 259:1–10
Uschold M, Gruninger M (1996) Ontologies: principles, methods and applications. Knowl Eng Rev 11(02):93–136
Vijayarajan V, Dinakaran M, Tejaswin P, Lohani M (2016) A generic framework for ontology-based information retrieval and image retrieval in web data. Hum Centr Comput Inf Sci 6(1):18
Yifei S (2009) Research on semi-automatic ontology construction method doctoral dissertation, Jilin University
Zhoubin Z (2008) Research on topic-related website sorting algorithm based on content and link analysis. Master’s thesis, Zhejiang University
Acknowledgements
This work was supported by Shanghai Maritime University research fund project (20130469), and by Shanghai Science & Technology Innovation Plan Fund (14511107400), and by State Oceanic Administration China research fund project (201305026). Professor Jin Wang is the corresponding author.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Liu, J., Zhou, M., Lin, L. et al. Rank web documents based on multi-domain ontology. J Ambient Intell Human Comput 15, 1573–1582 (2024). https://doi.org/10.1007/s12652-017-0566-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-017-0566-5