skip to main content
10.1145/2491845.2491877acmotherconferencesArticle/Chapter ViewAbstractPublication PagespciConference Proceedingsconference-collections
research-article

Improving the real-time performance of heterogeneous extremely large datasets

Published: 19 September 2013 Publication History

Abstract

The POWDER protocol is a Semantic Web technology --and W3C Recommendation- that takes advantage of natural groupings of URIs, as identifiers as well as navigational paths, to annotate all the resources in a regular expression-delineated sub-space of the URI space. POWDER was designed as a mechanism for accreditation, trustmarking and resource discovery, emphasizing the publishing of attributed metadata by third parties and trusted authorities. However, its versatility allows the application of POWDER in different use cases such as repository compression. In this paper, we present the POWDER protocol, briefly discuss current implementations and use cases and present how POWDER can be implemented over existing well-tested and robust semantic storage systems. Furthermore, we discuss a novel solution for the scalable storing data summaries in the form of metadata for the purposes of source selection and source schema coordination in large-scale, heterogeneous federations of semantic querying endpoints. Our solution takes advantage of POWDER's ability to exploit naming conventions and other natural groupings of URIs in order to compress instance-level metadata about the nodes of a data service federation, especially in situations where URI hashing cannot be used to efficiently resolve the sources that hold statements regarding a given URI resource.

References

[1]
Archer, P., Smith, K., Perego, A, 2009. Protocol for Web Description Resources (POWDER): Description Resources, W3C Recommendation, 1 September 2009, http://www.w3.org/TR/powder-dr.
[2]
Gibbins, N., Shadbolt, N., 2010. Resource Description Framework (RDF), Encyclopedia of Library and Information Sciences, 3rd Edition, CRC Press.
[3]
Guttman, A., 1984. R-Trees, A dynamic index structure for spatial indexing, in Proceedings of the Annual Meeting of ACM SIG on the Management of Data (SIGMOD '84), Boston, MA, USA.
[4]
Harth, A., Hose, K., Karnstedt, M., Polleres, A., Sattler, K.-U., Umbrich, J., 2010. Data summaries for on-demand queries over Linked Data, in Proceedings of the 19th International World Wide Web Conference (WWW 2010), Raleigh, NC, USA.
[5]
Hayes, P., 2004. RDF Semantics, W3C Recommendation, 1-February 2004, http://www.w3.org/TR/rdf-mt
[6]
Forman, G. 2003. An extensive empirical study of feature selection metrics for text classification. J. Mach. Learn. Res. 3 (Mar. 2003), 1289--1305.
[7]
Hose, K., Klan, D., Sattler, K.-U., 2006. Distributed data summaries for approximate query processing in PDMS, in Proceedings of the 10th International Database Engineering and Applications Symposium (IDEAS '06), Delhi, India.
[8]
Huang, J., D. J. Abadi, D. J., Ren, K., 2011. Scalable SPARQL querying of large RDF graphs, in Proceedings of the VLDB Endowment, Vol. 4(11)
[9]
Ioannidis, Y, 2003. The history of histograms (abridged), in Proceedings of the 29th International Conference on Very Large Databases (VLDB 2003), Berlin, Germany, Ten-Year Best Paper Award.
[10]
Konstantopoulos, S., Archer, P., 2009. Protocol for Web Description Resources (POWDER): Formal Semantics, W3C Recommendation, 1 September 2009, http://www.w3.org/TR/powder-formal.
[11]
Konstantopoulos, S., Archer, P., 2011. POWDER and the multi-million triple store, in Proceedings of the 3rd International Workshop on Semantic Web Information Management, ACM SIGMOD/PODS, Athens, Greece.
[12]
Konstantopoulos, S, Archer, P., Karampiperis, P., Karkaletsis, V., 2012. The POWDER protocol as infrastructure for serving and compressing semantic data, International Journal of Metadata, Semantics, and Ontologies. Accepted to appear.
[13]
Stuckenschmidt, H., Vdovjak, R., Houben, G.-J., Broekstra, J., 2004. Index structures and algorithms for querying distributed RDF repositories, in Proceedings of the 13th International World Wide Web Conference (WWW 2004), New York, USA (2004

Cited By

View all
  • (2015)Discovering, Indexing and Interlinking Information ResourcesF1000Research10.12688/f1000research.6848.14(432)Online publication date: 30-Jul-2015

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
PCI '13: Proceedings of the 17th Panhellenic Conference on Informatics
September 2013
359 pages
ISBN:9781450319690
DOI:10.1145/2491845
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

  • University of Macedonia
  • Aristotle University of Thessaloniki
  • The University of Sheffield: The University of Sheffield
  • Alexander TEI of Thessaloniki

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 September 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. metadata publishing
  2. resource discovery
  3. semantic web
  4. triple store compression

Qualifiers

  • Research-article

Funding Sources

Conference

PCI 2013
Sponsor:
  • The University of Sheffield
PCI 2013: 17th Panhellenic Conference on Informatics
September 19 - 21, 2013
Thessaloniki, Greece

Acceptance Rates

Overall Acceptance Rate 190 of 390 submissions, 49%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2015)Discovering, Indexing and Interlinking Information ResourcesF1000Research10.12688/f1000research.6848.14(432)Online publication date: 30-Jul-2015

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media