Efficient Index Maintenance for Frequently Updated Semantic Data

Liang, Yan; Wang, Haofen; Liu, Qiaoling; Tran, Thanh; Penin, Thomas; Yu, Yong

doi:10.1007/978-3-540-89704-0_13

Yan Liang³,
Haofen Wang³,
Qiaoling Liu³,
Thanh Tran⁴,
Thomas Penin³ &
…
Yong Yu³

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5367))

Included in the following conference series:

Asian Semantic Web Conference

780 Accesses
1 Citations

Abstract

Nowadays, the demand on querying and searching the Semantic Web is increasing. Some systems have adopted IR (Information Retrieval) approaches to index and search the Semantic Web data due to its capability to handle the Web-scale data and efficiency on query answering. Additionally, the huge volumes of data on the Semantic Web are frequently updated. Thus, it further requires effective update mechanisms for these systems to handle the data change. However, the existing update approaches only focus on document. It still remains a big challenge to update IR index specially designed for semantic data in the form of finer grained structured objects rather than unstructured documents. In this paper, we present a well-designed update mechanism on the IR index for triples. Our approach provides a flexible and effective update mechanism by dividing the index into blocks. It reduces the number of update operations during the insertion of triples. At the same time, it preserves the efficiency on query processing and the capability to handle large scale semantic data. Experimental results show that the index update time is a fraction of that by complete reconstruction w.r.t. the portion of the inserted triples. Moreover, the query response time is not notably affected. Thus, it is capable to make newly arrived semantic data immediately searchable for users.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ding, L., Finin, T., Joshi, A., Pan, R., Cost, R.S., Peng, Y., Reddivari, P., Doshi, V.C., Sachs, J.: Swoogle: A Search and Metadata Engine for the Semantic Web. In: Proceedings of the Thirteenth ACM Conference on Information and Knowledge Management. ACM Press, New York (2004)
Google Scholar
d’Aquin, M., Baldassarre, C., Gridinoc, L., Sabou, M., Angeletou, S.: Watson: Supporting next generation semantic web applications. In: WWW 2007 (2007)
Google Scholar
Tomasic, A., García-Molina, H., Shoens, K.: Incremental updates of inverted lists for text document retrieval, pp. 289–300 (1994)
Google Scholar
Brown, E., Callan, J., Croft, W.: Fast incremental indexing for full-text information retrieval. In: Proceedings of the 20th International Conference on Very Large Databases (VLDB), Santiago, Chille, pp. 192–202 (1994)
Google Scholar
Büttcher, S., Clarke, C.L.A., Lushman, B.: Hybrid index maintenance for growing text collections. In: Proceedings of SIGIR 2006, New York, NY, USA, pp. 356–363. ACM, New York (2006)
Chapter Google Scholar
Lester, N., Zobel, J., Williams, H.: Efficient online index maintenance for contiguous inverted lists. Inf. Process. Manage. 42(4), 916–933 (2006)
Article Google Scholar
Lim, L., Wang, M., Padmanabhan, S., Vitter, J.S., Agarwal, R.C.: Efficient update of indexes for dynamically changing web documents. In: World Wide Web, pp. 37–69 (2007)
Google Scholar
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems 30(1–7), 107–117 (1998)
Article Google Scholar
Zhang, L., Liu, Q., Zhang, J., Wang, H., Pan, Y., Yu, Y.: Semplore: An ir approach to scalable hybrid query of semantic web data. In: Proceedings of ISWC/ASWC 2007, pp. 652–665 (2007)
Google Scholar
Lempel, R., Mass, Y., Ofek-Koifman, S., Sheinwald, D., Petruschka, Y., Sivan, R.: Just in time indexing for up to the second search. In: CIKM, pp. 97–106 (2007)
Google Scholar
Jang, H., Kim, Y., Shin, D.: An effective mechanism for index update in structured documents. In: CIKM, pp. 383–390 (1999)
Google Scholar
Rocha, C., Schwabe, D., Aragao, M.P.: A hybrid approach for searching in the semantic web. In: Proceedings of the 13th international conference on World Wide Web, pp. 374–383. ACM Press, New York (2004)
Chapter Google Scholar
Bast, H., Chitea, A., Suchanek, F.M., Weber, I.: ESTER: efficient search on Text, Entities, and Relations. In: Proceedings of SIGIR 2007, Amsterdam, Netherlands, pp. 671–678. ACM, New York (2007)
Chapter Google Scholar
Tummarello, G., Oren, E., Delbru, R.: Sindice.com: Weaving the open linked data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 552–565. Springer, Heidelberg (2007)
Chapter Google Scholar
Chu-Carroll, J., Prager, J.M., Czuba, K., Ferrucci, D.A., Duboué, P.A.: Semantic search via XML fragments: a high-precision approach to ir. In: Proceddings of SIGIR, pp. 445–452 (2006)
Google Scholar
Li, Q., Moon, B.: Indexing and querying XML data for regular path expressions. In: The VLDB Journal, pp. 361–370 (2001)
Google Scholar
Tatarinov, I., Viglas, S., Beyer, K.S., Shanmugasundaram, J., Shekita, E.J., Zhang, C.: Storing and querying ordered XML using a relational database system. In: SIGMOD Conference (2002)
Google Scholar
Horrocks, I., Tessaris, S.: Querying the semantic web: A formal approach. In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, pp. 177–191. Springer, Heidelberg (2002)
Chapter Google Scholar
Guo, Y., Pan, Z., Heflin, J.: Lubm: A benchmark for owl knowledge base systems. J. Web Sem. 3(2-3), 158–182 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science & Engineering, Shanghai Jiao Tong University, Shanghai, China
Yan Liang, Haofen Wang, Qiaoling Liu, Thomas Penin & Yong Yu
Institute AIFB, Universität Karlsruhe, Germany
Thanh Tran

Authors

Yan Liang
View author publications
You can also search for this author in PubMed Google Scholar
Haofen Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qiaoling Liu
View author publications
You can also search for this author in PubMed Google Scholar
Thanh Tran
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Penin
View author publications
You can also search for this author in PubMed Google Scholar
Yong Yu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

The Open University Knowledge Media Institute, Walton Hall, MK6 7AA, Milton Keynes, United Kingdom
John Domingue
Shinawatra University 99 Moo 10 Bangtoey, Samkok, 12160, Pathum Thani, Thailand
Chutiporn Anutariya

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liang, Y., Wang, H., Liu, Q., Tran, T., Penin, T., Yu, Y. (2008). Efficient Index Maintenance for Frequently Updated Semantic Data. In: Domingue, J., Anutariya, C. (eds) The Semantic Web. ASWC 2008. Lecture Notes in Computer Science, vol 5367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89704-0_13

Download citation

DOI: https://doi.org/10.1007/978-3-540-89704-0_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89703-3
Online ISBN: 978-3-540-89704-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics