Skip to main content
Log in

Rank web documents based on multi-domain ontology

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

The importance of web document ranking algorithms for search engines keeps increasing along with the growth of web. Traditional ranking algorithms rank web documents according to hyperlinks in them. These algorithms are prone to problem of topic drifting, and it is difficult to ensure the accuracy of the sorting result. In recent years, researchers began to study the semantic understanding of web content. Methods such as ontology based ranking were proposed to sort web documents better. In this paper, the existing methods of ontology construction and web document ranking algorithm are comprehensively reviewed. Then a multi-domain ontologies construction method for massive unstructured text, singular value decomposition ontology construct (SVDOC) is proposed. Based on multi-domain ontology, a ranking algorithm that can exploit the semantic meaning of the text, multiply domain ontology PageRank (MDOPR) is also presented. We use SVDOC in the experiments to construct multi-domain ontologies in the field of shipping industry (ship, port, route and other related fields). The results show that SVDOC can effectively construct multi-domain ontologies, and MDOPR has higher accuracy and feasibility than other ranking methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. https://lucene.apache.org/core/4_6_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html.

  2. https://spark.apache.org/.

  3. http://cn.bing.com.

  4. http://www.baidu.com.

  5. http://info.shippingchina.com.

  6. https://protege.stanford.edu/.

References

  • Aleman-Meza B, Arpinar IB, Nural MV, Sheth AP (2010) Ranking documents semantically using ontological relationships. In: Semantic computing (ICSC), 2010 IEEE fourth international conference, pp 299–304

  • Bouramoul A, Kholladi MK, Doan BL (2011) How ontology can be used to improve semantic information retrieval: the AnimSe finder tool. Int J Comput Appl (IJCA) ISSN 97:5–8887

    Google Scholar 

  • Bouramoul A, Kholladi MK, Doan BL (2012) An ontology-based approach for semantics ranking of the web search engines results. Int Conf Multimed Comput Syst 248:797–802

    Google Scholar 

  • Du Y, Hai Y (2013) Semantic ranking of web documents based on formal concept analysis. J Syst Softw 86(1):187–197

    Article  Google Scholar 

  • Duhan N, Sharma AK, Bhatia KK (2009) Page ranking algorithms: a survey. In: Advance computing conference, 2009. IACC 2009. IEEE international, pp 1530–1537

  • Fernández-López M, Gómez-Pérez A, Juristo N (1997) Methontology: from ontological art towards ontological engineering. In: Proceedings of the AAAI97 spring symposium series on ontological engineering, Stanford, USA, pp 33–40

  • Fu Z, Huang F, Ren K, Weng J, Wang C (2017a) Privacy-preserving smart semantic search based on conceptual graphs over encrypted outsourced data. IEEE Trans Inf Forensics Secur 12(8):1874–1884

    Article  Google Scholar 

  • Fu Z, Wu X, Guan C, Sun X, Ren K (2017b) Toward efficient multi-keyword fuzzy search over encrypted outsourced data with accuracy improvement. IEEE Trans Inf Forensics Secur 11(12):2706–2716

    Article  Google Scholar 

  • Grüninger M, Fox MS (1995) Methodology for the design and evaluation of ontologies

  • Haveliwala T (1999) Efficient computation of PageRank. Stanford Db Group Technical Report

  • Hirankitti V, Mai TX (2012) A meta-logical approach for reasoning with an OWL 2 ontology. In: IEEE Rivf international conference on computing and communication technologies, research, innovation, and vision for the future vol.3, pp 1–6

  • Islam MN, Najmul Islam AKM (2016) Ontology mapping and semantics of web interface signs. Hum Cent Comput Inf Sci 6(1):20

    Article  Google Scholar 

  • Jindal V, Bawa S, Batra S (2014) A review of ranking approaches for semantic search on Web. Inf Process Manag 50(2):416–425

    Article  Google Scholar 

  • Kayed A, El-Qawasmeh E, Qawaqneh Z (2010) Ranking web sites using domain ontology concepts. Inf Manag 47(7):350–355

    Article  Google Scholar 

  • Lee J, Min JK, Oh A, Chung CW (2014) Effective ranking and search techniques for Web resources considering semantic relationships. Inf Process Manag 50(1):132–155

    Article  Google Scholar 

  • Mcguinness DL, Harmelen F (2004) Owl web ontology language overview. Febr 63(45):990–996

    Google Scholar 

  • Mingjun X, Liusheng H, Yonglong L (2006) SHITS: a Web page sorting method based on hyperlink and content. Small Microcomput Syst 27(12):2177–2182

    Google Scholar 

  • Noy NF, Mcguinness DL (2001) Ontology development 101: a guide to creating your first ontology. And Stanford Medical Informatics

  • Nyein SS (2011) Mining contents in Web page using cosine similarity. In: Computer research and development (ICCRD) 2011 3rd international conference on vol 2, pp 472–475

  • Page L (1999) The PageRank citation ranking: bringing order to the web, online manuscript. Stanf Digit Libr Work Pap 9.1:1–14

    Google Scholar 

  • Qu C, Liu F, Tao M, Deng D (2016) An owl-s based specification model of dynamic entity services for internet of things. J Ambient Intell Hum Comput 7(1):73–82

    Article  Google Scholar 

  • Roman Y, Shtykh, Jin Qun (2011) A human-centric integrated approach to web information search and sharing. Hum Centr Comput Inf Sci 1(2):2

    Google Scholar 

  • Schreiber G, Wielinga B, Jansweijer W (1995) The KACTUS view on the ‘O’word. IJCAI workshop on basic ontological issues in knowledge sharing, pp 159–168

  • Sharma DK, Sharma AK (2010) A comparative analysis of Web document ranking algorithms. Int J Comput Sci Eng 2(08):2670–2676

    Google Scholar 

  • Toti D, Longhi A (2017) Semanto: a graphical ontology management system for knowledge discovery. J Ambient Intell Hum Comput 259:1–10

    Google Scholar 

  • Uschold M, Gruninger M (1996) Ontologies: principles, methods and applications. Knowl Eng Rev 11(02):93–136

    Article  Google Scholar 

  • Vijayarajan V, Dinakaran M, Tejaswin P, Lohani M (2016) A generic framework for ontology-based information retrieval and image retrieval in web data. Hum Centr Comput Inf Sci 6(1):18

    Article  Google Scholar 

  • Yifei S (2009) Research on semi-automatic ontology construction method doctoral dissertation, Jilin University

  • Zhoubin Z (2008) Research on topic-related website sorting algorithm based on content and link analysis. Master’s thesis, Zhejiang University

Download references

Acknowledgements

This work was supported by Shanghai Maritime University research fund project (20130469), and by Shanghai Science & Technology Innovation Plan Fund (14511107400), and by State Oceanic Administration China research fund project (201305026). Professor Jin Wang is the corresponding author.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jin Wang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, J., Zhou, M., Lin, L. et al. Rank web documents based on multi-domain ontology. J Ambient Intell Human Comput 15, 1573–1582 (2024). https://doi.org/10.1007/s12652-017-0566-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-017-0566-5

Keywords

Navigation