Skip to main content
Log in

LOD search engine: A semantic search over linked data

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

In the last few years, there has been a significant growth in the amount of data published in RDF and adoption of Linked Data principles. Every day, a large number of people and communities contribute to the publication of datasets as Linked Data on Linked Open Data (LOD) cloud. Due to a large size of LOD cloud on the Web and the RDF representation of linked dataset, searching and retrieving relevant data on the Web is a major challenge. Because the data is published in RDF triple format, i.e. an interlinked structure, traditional search engines are unable to perform searches on Linked Data. This article introduces LOD search engine, a novel semantic search engine that searches on Semantic Web documents (such as Linked Data or triples) to retrieve a set of relevant information based on user queries. For searching over triples, we proposed two semantic search methods: Forward Search and Backward Search. To improve search results, two new ranking methods have also been introduced: Domain Ranking and Triple Ranking. The proposed LOD search engine produced remarkable results and outperformed other semantic search engines. In the best-case scenario, the proposed LOD search engine outperforms the swoogle and falcons by 22.35%, 43.38% and 33.18% in terms of precision, recall, and F-Measure respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Data Availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

Notes

  1. http://ws.nju.edu.cn/falcons

  2. http://swoogle.umbc.edu/

  3. http://watson.kmi.open.ac.uk/WatsonWUI/

  4. http://sindice.com/

  5. http://data.dws.informatik.uni-mannheim.de/lodcloud/2014/ISWC-RDB/

  6. https://github.com/ldspider/ldspider

  7. http://fire.irsi.res.in/

References

  • Chapter seven (2015). distribution and query federation. In O. Curé G. Blin (Eds.) RDF Database systems. https://doi.org/10.1016/B978-0-12-799957-9.00007-9(pp. 169–190). Boston: Morgan Kaufmann.

  • Azad, H.K., & Deepak, A. (2019a). A new approach for query expansion using wikipedia and wordnet. Information Sciences, 492, 147–163.

    Article  Google Scholar 

  • Azad, H.K., & Deepak, A. (2019b). Query expansion techniques for information retrieval: a survey. Information Processing & Management, 56 (5), 1698–1735.

    Article  Google Scholar 

  • Azad, H. K., Deepak, A., & Abhishek, K. (2016). Linked open data search engine. In Proceedings of the Second International Conference on Information and Communication Technology for Competitive Strategies (p. 17). ACM.

  • Baeza-Yates, R., & Davis, E. (2004). Web page ranking using link attributes. In Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters, pp. 328–329. ACM.

  • Balmin, A., Hristidis, V., & Papakonstantinou, Y. (2004). Objectrank: Authority-based keyword search in databases. In Proceedings of the Thirtieth international conference on Very large data bases-Volume 30 (pp. 564–575). VLDB Endowment.

  • Berkani, N., Bellatreche, L., Khouri, S., & Ordonez, C. (2020). The contribution of linked open data to augment a traditional data warehouse. Journal of Intelligent Information Systems, 55(3), 397–421.

    Article  Google Scholar 

  • Berners-Lee, T. (2006). Linked data http://www.w3.org/designissues. LinkedData. html.

  • Bizer, C. (2009). The emerging web of linked data. IEEE Intelligent Systems, 24(5), 87–92.

    Article  Google Scholar 

  • Bizer, C., Cyganiak, R., Heath, T., & et al. (2007). How to publish linked data on the web 2007.

  • Brill, E. (1992). Penn treebank tagger. Copyright by MIT and University of Pennsylvania.

  • Cheng, G., Ge, W., & Qu, Y. (2008). Falcons: searching and browsing entities on the semantic web. In Proceedings of the 17th international conference on World Wide Web (pp. 1101–1102). ACM.

  • Cheng, G., & Qu, Y. (2009). Searching linked objects with falcons: Approach implementation and evaluation.

  • d’Aquin, M., Sabou, M., Dzbor, M., Baldassarre, C., Gridinoc, L., Angeletou, S., & Motta, E. (2007). Watson: A gateway for the semantic Web.

  • Data and Web Science Group. (2018). University of mannheim. https://dws.informatik.uni-mannheim.de/.

  • Dimitrakis, E., Sgontzos, K., & Tzitzikas, Y. (2020). A survey on question answering systems over linked data and documents. Journal of Intelligent Information Systems, 55(2), 233–259.

    Article  Google Scholar 

  • Ding, L., Finin, T., Joshi, A., Pan, R., Cost, R. S., Peng, Y., Reddivari, P., Doshi, V., & Sachs, J. (2004). Swoogle: a search and metadata engine for the semantic web. In Proceedings of the thirteenth ACM international conference on Information and knowledge management (pp. 652–659). ACM.

  • El-Gayar, M., Mekky, N.E., Atwan, A., & Soliman, H. (2019). Enhanced search engine using proposed framework and ranking algorithm based on semantic relations. IEEE Access, 7, 139,337–139,349.

    Article  Google Scholar 

  • Fernández, J. D., Beek, W., Martínez-prieto, M.A., & Arias, M. (2017). Lod-a-lot. In International semantic web conference (pp. 75–83). Berlin: Springer.

  • Fernández, J.D., Martínez-prieto, M.A., Gutié, rrez C., Polleres, A., & Arias, M. (2013). Binary rdf representation for publication and exchange (hdt). Journal of Web Semantics, 19, 22–41.

    Article  Google Scholar 

  • Hartig, O., & Pirrò, G. (2015). A context-based semantics for sparql property paths over the Web. In European semantic web conference (pp. 71–87). Berlin: Springer.

  • Hogan, A., Harth, A., Umbrich, J., Kinsella, S., Polleres, A., & Decker, S. (2011). Searching and browsing linked data with swse: the semantic web search engine. Web Semantics: Science, Services and Agents on the World Wide Web, 9 (4), 365–401.

    Article  Google Scholar 

  • Linked Data contributors. (2018). Linked open data. https://lod-cloud.net/.

  • Manola, F., Miller, E., McBride, B., & et al. (2004). Rdf primer. W3C Recommendation, 10(1-107), 6.

    Google Scholar 

  • McGuinness, D.L., Van Harmelen, F., & et al. (2004). Owl web ontology language overview. W3C Recommendation, 10(10), 2004.

    Google Scholar 

  • Millard, I., Glaser, H., Salvadores, M., & Shadbolt, N. (2010). Consuming multiple linked data sources: Challenges and experiences.

  • Oğuz, D., Ergenc, B., Yin, S., Dikenelli, O., & Hameurlain, A. (2015). Federated query processing on linked data: a qualitative survey and open challenges.

  • Oren, E., Delbru, R., Catasta, M., Cyganiak, R., Stenzhorn, H., & Tummarello, G. (2008). Sindice. com: a document-oriented lookup index for open linked data. International Journal of Metadata, Semantics and Ontologies, 3(1), 37–52.

    Article  Google Scholar 

  • Pérez, J., Arenas, M., & Gutierrez, C. (2009). Semantics and complexity of sparql. ACM Transactions on Database Systems (TODS), 34(3), 1–45.

    Article  Google Scholar 

  • Schmachtenberg, M., Bizer, C., & Paulheim, H. (2014). State of the lod cloud 2014. University of Mannheim, Data and Web Science Group, August 30.

  • Thomas, E., Alani, H., Sleeman, D., & Brewster, C. (2005). Searching and ranking ontologies on the semantic web.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hiteshwar kumar Azad.

Ethics declarations

Conflict of Interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Azad, H.k., Deepak, A. & Azad, A. LOD search engine: A semantic search over linked data. J Intell Inf Syst 59, 71–91 (2022). https://doi.org/10.1007/s10844-021-00687-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-021-00687-0

Keywords

Navigation