research-article

Exploiting context and quality for linked data source selection

Published: 08 April 2019

Abstract

The traditional Web is evolving into the Web of Data, which gathers huge collections of structured data across distributed, heterogeneous data sources. Live queries are needed to retrieve current information from this global data space. In live query processing, source selection deserves special attention because it identifies the sources most likely to contain relevant content. Due to the semantic heterogeneity of the Web of Data, however, relevance is not always easy to assess. Context information might help in interpreting user needs. Besides relevance, another important criterion for selecting sources is the quality of the data they contain. In this paper, we discuss how context and quality metadata can be exploited to improve source selection.
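As a rough illustration of the idea in the abstract, the sketch below ranks candidate sources by blending a relevance estimate with quality scores weighted according to the user's context. It is not the method proposed in the paper: the `Source` class, the quality dimension names, the weights, the `alpha` blending parameter, and all example values are hypothetical and chosen only to make the example concrete.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Source:
    """A candidate linked data source with made-up, illustrative metadata."""
    name: str
    relevance: float            # assumed relevance estimate for the query, in [0, 1]
    quality: Dict[str, float]   # assumed quality dimension -> score in [0, 1]


def score(source: Source, context_weights: Dict[str, float], alpha: float = 0.5) -> float:
    """Blend relevance with a context-weighted average of quality scores.

    context_weights says how much each quality dimension matters in the
    current context; alpha balances relevance against quality.
    """
    total = sum(context_weights.values())
    if total == 0:
        quality = 0.0  # no quality preference expressed in this context
    else:
        quality = sum(
            weight * source.quality.get(dimension, 0.0)
            for dimension, weight in context_weights.items()
        ) / total
    return alpha * source.relevance + (1 - alpha) * quality


def select_sources(
    sources: List[Source],
    context_weights: Dict[str, float],
    k: int = 2,
    alpha: float = 0.5,
) -> List[Source]:
    """Return the k highest-scoring sources, best first."""
    ranked = sorted(sources, key=lambda s: score(s, context_weights, alpha), reverse=True)
    return ranked[:k]


# Hypothetical sources and a context in which freshness matters most.
SOURCES = [
    Source("A", 0.9, {"freshness": 0.2, "completeness": 0.8}),
    Source("B", 0.7, {"freshness": 0.9, "completeness": 0.6}),
    Source("C", 0.4, {"freshness": 0.5, "completeness": 0.5}),
]

if __name__ == "__main__":
    for chosen in select_sources(SOURCES, {"freshness": 3.0, "completeness": 1.0}):
        print(chosen.name)
```

With these made-up numbers, source B is ranked ahead of A even though A has the higher relevance estimate, because the assumed context puts most of the weight on freshness. A linear blend is only one possible design; a threshold on quality followed by a relevance ranking would be another.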

Cited By

  • (2024) LinkedDataOps: quality oriented end-to-end geospatial linked data production governance. Semantic Web 15:2 (555-581). DOI: 10.3233/SW-233293. Online publication date: 30-Apr-2024
  • (2024) Use of Context in Data Quality Management: A Systematic Literature Review. Journal of Data and Information Quality 16:3 (1-41). DOI: 10.1145/3672082. Online publication date: 4-Oct-2024
  • (2023) IndeGx. Web Semantics: Science, Services and Agents on the World Wide Web 76:C. DOI: 10.1016/j.websem.2023.100775. Online publication date: 1-Apr-2023

Published In

SAC '19: Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing
April 2019
2682 pages
ISBN: 9781450359337
DOI: 10.1145/3297280
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. context
  2. data quality
  3. linked data
  4. live queries
  5. source selection
