Skip to main content

Efficient Semantic Search Over Structured Web Data: A GPU Approach

  • Conference paper
  • First Online:
Computational Linguistics and Intelligent Text Processing (CICLing 2017)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10762))

Abstract

Semantic search is an advanced topic in information retrieval which has attracted increasing attention in recent years. The growing availability of structured semantic data offers opportunities for semantic search engines, which can support more expressive queries able to address complex information needs. However, due to the fact that many new concepts (mined from the Web or learned through crowd-sourcing) are continuously integrated into knowledge bases, those search engines face the challenging performance issue of scalability. In this paper, we present a parallel method, termed gSparql, which utilizes the massive computation power of general-purpose GPUs to accelerate the performance of query processing and inference. Our method is based on the backward-chaining approach which makes inferences at query time. Experimental results show that gSparql outperforms the state-of-the-art algorithm and efficiently answers structured queries on large datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.w3.org/TR/rdf-primer/.

  2. 2.

    https://www.w3.org/TR/rdf-sparql-query/.

References

  1. Abadi, D.J., Marcus, A., Madden, S.R., Hollenbach, K.: Scalable semantic web data management using vertical partitioning. Proc. VLDB Endow. 1(1), 411–422 (2007)

    Google Scholar 

  2. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., et al. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-76298-0_52

    Chapter  Google Scholar 

  3. Bishop, B., Kiryakov, A., Ognyanoff, D., Peikov, I., Tashev, Z., Velkov, R.: Owlim: a family of scalable semantic repositories. Semant. Web 2(1), 33–42 (2011)

    Google Scholar 

  4. Broekstra, J., Kampman, A., van Harmelen, F.: Sesame: a generic architecture for storing and querying RDF and RDF schema. In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, pp. 54–68. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-48005-6_7

    Chapter  Google Scholar 

  5. Cambria, E., Hussain, A.: Sentic Computing: A Common-Sense-Based Framework for Concept-Level Sentiment Analysis. Springer, Cham, Switzerland (2015)

    Book  Google Scholar 

  6. Cambria, E., Poria, S., Bajpai, R., Schuller, B.: SenticNet 4: a semantic resource for sentiment analysis based on conceptual primitives. In: the 26th International Conference on Computational Linguistics, pp. 2666–2677 (2016)

    Google Scholar 

  7. Cambria, E., Wang, H., White, B.: Guest editorial: Big social data analysis. Knowl. Based Syst. 69, 1–2 (2014)

    Article  Google Scholar 

  8. Carroll, J.J., Dickinson, I., Dollin, C., Reynolds, D., Seaborne, A., Wilkinson, K.: Jena: implementing the semantic web recommendations. In: World Wide Web Conference, pp. 74–83. ACM (2004)

    Google Scholar 

  9. Green, O., McColl, R., Bader, D.A.: Gpu merge path: a gpu merging algorithm. In: Proceedings of the 26th ACM International Conference on Supercomputing, pp. 331–340. ACM (2012)

    Google Scholar 

  10. Guo, Y., Pan, Z., Heflin, J.: Lubm: a benchmark for owl knowledge base systems. Web Semant. Sci. Serv. Agents World Wide Web 3(2), 158–182 (2005)

    Article  Google Scholar 

  11. Harris, M., Sengupta, S., Owens, J.D.: Gpu gems 3, chapter parallel prefix sum (scan) with cuda (2007)

    Google Scholar 

  12. Heino, N., Pan, J.Z.: RDFS reasoning on massively parallel hardware. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012. LNCS, vol. 7649, pp. 133–148. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35176-1_9

    Chapter  Google Scholar 

  13. Neumann, T., Weikum, G.: Rdf-3x: a risc-style engine for rdf. Proc. VLDB Endow. 1(1), 647–659 (2008)

    Article  Google Scholar 

  14. Paul, J., He, J., He, B.: Gpl: a gpu-based pipelined query processing engine. In: Proceedings of the 2016 International Conference on Management of Data, pp. 1935–1950. ACM (2016)

    Google Scholar 

  15. Peters, M., Brink, C., Sachweh, S., Zündorf, A.: Scaling parallel rule-based reasoning. In: Presutti, V., d’Amato, C., Gandon, F., d’Aquin, M., Staab, S., Tordai, A. (eds.) ESWC 2014. LNCS, vol. 8465, pp. 270–285. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07443-6_19

    Chapter  Google Scholar 

  16. Rajagopal, D., Cambria, E., Olsher, D., Kwok, D.: A graph-based approach to commonsense concept extraction and semantic similarity detection. In: WWW, pp. 565–570. Rio De Janeiro (2013)

    Google Scholar 

  17. Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: Proceedings of the 16th International Conference on World Wide Web, pp. 697–706. ACM (2007)

    Google Scholar 

  18. Tran, H.-N., Cambria, E.: GpSense: A GPU-friendly method for common-sense subgraph matching in massively parallel architectures. In: CICLing, Konya (2016)

    Google Scholar 

  19. Tran, H.-N., Cambria, E., Hussain, A.: Towards gpu-based common-sense reasoning: using fast subgraph matching. Cognit. Comput. 8(6), 1074–1086 (2016)

    Article  Google Scholar 

  20. Tran, Ha-Nguyen, Kim, Jung-jae, He, Bingsheng: Fast subgraph matching on large graphs using graphics processors. In: Renz, Matthias, Shahabi, Cyrus, Zhou, Xiaofang, Cheema, Muhammad Aamir (eds.) DASFAA 2015. LNCS, vol. 9049, pp. 299–315. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-18120-2_18

    Chapter  Google Scholar 

  21. Urbani, Jacopo, van Harmelen, Frank, Schlobach, Stefan, Bal, Henri: QueryPIE: backward reasoning for OWL horst over very large knowledge bases. In: Aroyo, L., et al. (eds.) ISWC 2011. LNCS, vol. 7031, pp. 730–745. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-25073-6_46

    Chapter  Google Scholar 

  22. Weiss, C., Karras, P., Bernstein, A.: Hexastore: sextuple indexing for semantic web data management. Proc. VLDB Endow. 1(1), 1008–1019 (2008)

    Article  Google Scholar 

  23. Zheng, V., Cavallari, S., Cai, H., Chang, K., Cambria, E.: From node embedding to community embedding (2017). https://arxiv.org/abs/1610.09950

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ha-Nguyen Tran .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tran, HN., Cambria, E., Giang Do, H. (2018). Efficient Semantic Search Over Structured Web Data: A GPU Approach. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2017. Lecture Notes in Computer Science(), vol 10762. Springer, Cham. https://doi.org/10.1007/978-3-319-77116-8_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-77116-8_41

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-77115-1

  • Online ISBN: 978-3-319-77116-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics