Abstract
In the last few years, there has been a significant growth in the amount of data published in RDF and adoption of Linked Data principles. Every day, a large number of people and communities contribute to the publication of datasets as Linked Data on Linked Open Data (LOD) cloud. Due to a large size of LOD cloud on the Web and the RDF representation of linked dataset, searching and retrieving relevant data on the Web is a major challenge. Because the data is published in RDF triple format, i.e. an interlinked structure, traditional search engines are unable to perform searches on Linked Data. This article introduces LOD search engine, a novel semantic search engine that searches on Semantic Web documents (such as Linked Data or triples) to retrieve a set of relevant information based on user queries. For searching over triples, we proposed two semantic search methods: Forward Search and Backward Search. To improve search results, two new ranking methods have also been introduced: Domain Ranking and Triple Ranking. The proposed LOD search engine produced remarkable results and outperformed other semantic search engines. In the best-case scenario, the proposed LOD search engine outperforms the swoogle and falcons by 22.35%, 43.38% and 33.18% in terms of precision, recall, and F-Measure respectively.
Similar content being viewed by others
Data Availability
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.
References
Chapter seven (2015). distribution and query federation. In O. Curé G. Blin (Eds.) RDF Database systems. https://doi.org/10.1016/B978-0-12-799957-9.00007-9(pp. 169–190). Boston: Morgan Kaufmann.
Azad, H.K., & Deepak, A. (2019a). A new approach for query expansion using wikipedia and wordnet. Information Sciences, 492, 147–163.
Azad, H.K., & Deepak, A. (2019b). Query expansion techniques for information retrieval: a survey. Information Processing & Management, 56 (5), 1698–1735.
Azad, H. K., Deepak, A., & Abhishek, K. (2016). Linked open data search engine. In Proceedings of the Second International Conference on Information and Communication Technology for Competitive Strategies (p. 17). ACM.
Baeza-Yates, R., & Davis, E. (2004). Web page ranking using link attributes. In Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters, pp. 328–329. ACM.
Balmin, A., Hristidis, V., & Papakonstantinou, Y. (2004). Objectrank: Authority-based keyword search in databases. In Proceedings of the Thirtieth international conference on Very large data bases-Volume 30 (pp. 564–575). VLDB Endowment.
Berkani, N., Bellatreche, L., Khouri, S., & Ordonez, C. (2020). The contribution of linked open data to augment a traditional data warehouse. Journal of Intelligent Information Systems, 55(3), 397–421.
Berners-Lee, T. (2006). Linked data http://www.w3.org/designissues. LinkedData. html.
Bizer, C. (2009). The emerging web of linked data. IEEE Intelligent Systems, 24(5), 87–92.
Bizer, C., Cyganiak, R., Heath, T., & et al. (2007). How to publish linked data on the web 2007.
Brill, E. (1992). Penn treebank tagger. Copyright by MIT and University of Pennsylvania.
Cheng, G., Ge, W., & Qu, Y. (2008). Falcons: searching and browsing entities on the semantic web. In Proceedings of the 17th international conference on World Wide Web (pp. 1101–1102). ACM.
Cheng, G., & Qu, Y. (2009). Searching linked objects with falcons: Approach implementation and evaluation.
d’Aquin, M., Sabou, M., Dzbor, M., Baldassarre, C., Gridinoc, L., Angeletou, S., & Motta, E. (2007). Watson: A gateway for the semantic Web.
Data and Web Science Group. (2018). University of mannheim. https://dws.informatik.uni-mannheim.de/.
Dimitrakis, E., Sgontzos, K., & Tzitzikas, Y. (2020). A survey on question answering systems over linked data and documents. Journal of Intelligent Information Systems, 55(2), 233–259.
Ding, L., Finin, T., Joshi, A., Pan, R., Cost, R. S., Peng, Y., Reddivari, P., Doshi, V., & Sachs, J. (2004). Swoogle: a search and metadata engine for the semantic web. In Proceedings of the thirteenth ACM international conference on Information and knowledge management (pp. 652–659). ACM.
El-Gayar, M., Mekky, N.E., Atwan, A., & Soliman, H. (2019). Enhanced search engine using proposed framework and ranking algorithm based on semantic relations. IEEE Access, 7, 139,337–139,349.
Fernández, J. D., Beek, W., Martínez-prieto, M.A., & Arias, M. (2017). Lod-a-lot. In International semantic web conference (pp. 75–83). Berlin: Springer.
Fernández, J.D., Martínez-prieto, M.A., Gutié, rrez C., Polleres, A., & Arias, M. (2013). Binary rdf representation for publication and exchange (hdt). Journal of Web Semantics, 19, 22–41.
Hartig, O., & Pirrò, G. (2015). A context-based semantics for sparql property paths over the Web. In European semantic web conference (pp. 71–87). Berlin: Springer.
Hogan, A., Harth, A., Umbrich, J., Kinsella, S., Polleres, A., & Decker, S. (2011). Searching and browsing linked data with swse: the semantic web search engine. Web Semantics: Science, Services and Agents on the World Wide Web, 9 (4), 365–401.
Linked Data contributors. (2018). Linked open data. https://lod-cloud.net/.
Manola, F., Miller, E., McBride, B., & et al. (2004). Rdf primer. W3C Recommendation, 10(1-107), 6.
McGuinness, D.L., Van Harmelen, F., & et al. (2004). Owl web ontology language overview. W3C Recommendation, 10(10), 2004.
Millard, I., Glaser, H., Salvadores, M., & Shadbolt, N. (2010). Consuming multiple linked data sources: Challenges and experiences.
Oğuz, D., Ergenc, B., Yin, S., Dikenelli, O., & Hameurlain, A. (2015). Federated query processing on linked data: a qualitative survey and open challenges.
Oren, E., Delbru, R., Catasta, M., Cyganiak, R., Stenzhorn, H., & Tummarello, G. (2008). Sindice. com: a document-oriented lookup index for open linked data. International Journal of Metadata, Semantics and Ontologies, 3(1), 37–52.
Pérez, J., Arenas, M., & Gutierrez, C. (2009). Semantics and complexity of sparql. ACM Transactions on Database Systems (TODS), 34(3), 1–45.
Schmachtenberg, M., Bizer, C., & Paulheim, H. (2014). State of the lod cloud 2014. University of Mannheim, Data and Web Science Group, August 30.
Thomas, E., Alani, H., Sleeman, D., & Brewster, C. (2005). Searching and ranking ontologies on the semantic web.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Azad, H.k., Deepak, A. & Azad, A. LOD search engine: A semantic search over linked data. J Intell Inf Syst 59, 71–91 (2022). https://doi.org/10.1007/s10844-021-00687-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10844-021-00687-0