skip to main content
research-article

Tracking Data Provenance of Archaeological Temporal Information in Presence of Uncertainty

Published: 07 April 2022 Publication History

Abstract

The interpretation process is one of the main tasks performed by archaeologists who, starting from ground data about evidences and findings, incrementally derive knowledge about ancient objects or events. Very often more than one archaeologist contributes in different time instants to discover details about the same finding and thus, it is important to keep track of history and provenance of the overall knowledge discovery process. To this aim, we propose a model and a set of derivation rules for tracking and refining data provenance during the archaeological interpretation process. In particular, among all the possible interpretation activities, we concentrate on the one concerning the dating that archaeologists perform to assign one or more time intervals to a finding to define its lifespan on the temporal axis. In this context, we propose a framework to represent and derive updated provenance data about temporal information after the mentioned derivation process. Archaeological data, and in particular their temporal dimension, are typically vague, since many different interpretations can coexist, thus, we will use Fuzzy Logic to assign a degree of confidence to values and Fuzzy Temporal Constraint Networks to model relationships between dating of different findings represented as a graph-based dataset. The derivation rules used to infer more precise temporal intervals are enriched to manage also provenance information and their following updates after a derivation step. A MapReduce version of the path consistency algorithm is also proposed to improve the efficiency of the refining process on big graph-based datasets.

References

[1]
2013. World Wide Web Consortium - PROV-DM: The PROV Data Model. (2013). Retrieved from https://www.w3.org/TR/prov-dm/.
[2]
J. F. Allen. 1983. Maintaining knowledge about temporal intervals. Commun. ACM 26, 11 (1983), 832–843.
[3]
S. Badaloni, M. Falda, and M. Giacomin. 2004. Integrating quantitative and qualitative fuzzy temporal constraints. AI Commun. 17, 4 (2004), 187–200.
[4]
S. Badaloni and M. Giacomin. 2006. The algebra IA\(^{fuz}\): A framework for qualitative fuzzy temporal reasoning. Artif. Intell. 170, 10 (2006), 872–908.
[5]
J. A. Barceló. 2010. Computational intelligence in archaeology. State of the art. In Proceedings of the 37th International Conference on Computer Applications & Qualitative Methods in Archaeology (CAA). 11–21.
[6]
Juan Antonio Barceló. 2019. Computing archaeological stratigraphies. A state-of-the-art. In Book of Abstracts of the 47th Computer Applications and Quantitative Methods in Archaeology (CAA’19). Retrieved from https://2019.caaconference.org/wp-content/uploads/sites/25/2019/04/CAA2019_programabstracts_v20190423.pdf.
[7]
P. Basso, P. Grossi, B. Bruno, A. Belussi, and S. Migliorini. 2017. From Rome, to Verona, to the Agro areas: Roundtrip. An experimentation of interoperability between SITAR, SITAVR and SITAIS. Archeologia e Calcolatori 2017 (2017), 157–170.
[8]
M. J. Baxter. 2009. Archaeological data analysis and fuzzy clustering. Archaeometry 51, 6 (2009), 1035–1054.
[9]
A. Belussi and S. Migliorini. 2014. A framework for managing temporal dimensions in archaeological data. In Proceedings of 21st International Symposium on Temporal Representation and Reasoning (TIME). 81–90. DOI:DOI:.
[10]
A. Belussi and S. Migliorini. 2014. Modeling Time in Archaeological Data: The Verona Case Study. Technical Report RR 93/2014. Department of Computer Science, University of Verona. Retrieved from http://www.di.univr.it/report.
[11]
A. Belussi and S. Migliorini. 2017. A spatio-temporal framework for managing archeological data. Ann. Math. Artif. Intell. 80, 3 (Aug. 2017), 175–218. DOI:DOI:.
[12]
Alberto Belussi, Sara Migliorini, and Piergiovanna Grossi. 2015. Managing time dimension in the archaeological urban information system of the historical heritage of Rome and Verona. In Proceedings of the 21st Century Archaeology: Concepts, Methods and Tools. Proceedings of the 42nd Annual Conference on Computer Applications and Quantitative Methods in Archaeology (CAA’14). 235–244. Retrieved from https://caa2014.sciencesconf.org/45964/document.
[13]
A. Belussi, S. Migliorini, and P. Grossi. 2016. Mapping of the SITAR and NIOBE data models towards the standard CIDOC-CRM\(_{archeo}\) of the ARIADNE project with data transformation into RDF format. Retrieved from https://www.di.univr.it/?ent=progetto&id=4578&lang=en.
[14]
Igor Bogdanović, Vasiliki Andreaki, Joan A. Barceló, Raquel Piqué, Xavier Terradas, Antoni Palomo, Núria Morera, and Oriol López. 2019. Discovering the time of La Draga. In Book of Abstracts of the 47th Computer Applications and Quantitative Methods in Archaeology (CAA’19). Retrieved from https://2019.caaconference.org/wp-content/uploads/sites/25/2019/04/CAA2019_programabstracts_v20190423.pdf.
[15]
P. Buneman, S. Khanna, and W. C. Tan. 2001. Why and where: A characterization of data provenance. In Proceedings of the 8th International Conference on Database Theory. 316–330. DOI:DOI:.
[16]
P. Buneman and W. C. Tan. 2018. Data provenance: What next?SIGMOD Rec. 47, 3 (2018), 5–16. DOI:DOI:.
[17]
Enrico R. Crema and Anne Kandler. 2019. An R package for inferring patterns of social learning from archaeological frequency data. In Book of Abstracts of the 47th Computer Applications and Quantitative Methods in Archaeology (CAA’19). Retrieved from https://2019.caaconference.org/wp-content/uploads/sites/25/2019/04/CAA2019_programabstracts_v20190423.pdf.
[18]
Jeffrey Dean and Sanjay Ghemawat. 2004. MapReduce: Simplified data processing on large clusters. In Proceedings of the 6th Symposium on Operating System Design and Implementation. 137–150.
[19]
R. Dechter, I. Meiri, and J. Pearl. 1991. Temporal constraint networks. Artif. Intell. 49, 1–3 (1991), 61–95.
[20]
Peter Demján. 2019. Analysing settlement dynamics using statistics based on archaeological theory. In Book of Abstracts of the 47th Computer Applications and Quantitative Methods in Archaeology (CAA’19). Retrieved from https://2019.caaconference.org/wp-content/uploads/sites/25/2019/04/CAA2019_programabstracts_v20190423.pdf.
[21]
M. J. Egenhofer and R. Franzosa. 1991. Point-set topological spatial relations. Int. J. Geog. Inf. Syst. 2, 5 (1991), 161–174.
[22]
A. Felicetti, A. Masur, A. Kritsotaki, G. Hiebel, K. May, M. Theodoridou, M. Doerrand, P. Ronzino, S. Hermon, and W. Schmidle. 2016. Definition of the CRM\(_{archaeo}\). An extension of CIDOC CRM to support archaeological excavation process. Retrieved from http://www.cidoc-crm.org/crmarchaeo/ModelVersion/version-1.4.1.
[23]
Gabriele Gattiglia, Nevio Dubbini, and Francesca Anichini. 2019. Spatio-temporal network analysis applied to Roman Terra Sigillata data. In Book of Abstracts of the 47th Computer Applications and Quantitative Methods in Archaeology (CAA’19). 77–78. Retrieved from https://2019.caaconference.org/wp-content/uploads/sites/25/2019/04/CAA2019_programabstracts_v20190423.pdf.
[24]
Cesar Gonzalez-Perez. 2018. Temporality. Springer International Publishing, 143–155. DOI:DOI:.
[25]
Edward C. Harris. 1989. Principles of Archaeological Stratigraphy, 2nd ed.Academic Press.
[26]
David Holland, Uri Braun, Diana Maclean, Kiran-Kumar Muniswamy-Reddy, and Margo Seltzer. 2008. Choosing a data model and query language for provenance. In Proceedings of the 2nd International Provenance and Annotation Workshop (IPAW’08). Springer.
[27]
ICOM/CIDOC CRM Special Interest Group. 2020. Definition of the CIDOC Conceptual Reference Model, version 6.2.2. (2020). Retrieved from http://www.cidoc-crm.org/.
[28]
ISO 2002. ISO 19108 Geographic Information – Temporal Schema. ISO. Retrieved from https://www.iso.org/standard/26013.html.
[29]
ISO 2019. ISO 19107 Geographic Information – Spatial Schema. ISO. Retrieved from https://www.iso.org/standard/66175.html.
[30]
M. Katsianis, S. Tsipidis, K. Kotsakis, and A. Kousoulakou. 2008. A 3D digital workflow for archaeological intra-site research using GIS. J. Archaeol. Sci. 35, 3 (2008), 655–667.
[31]
C. C. Kolb. 2014. Provenance Studies in Archaeology. Springer New York, New York, NY, 6172–6181. DOI:DOI:.
[32]
David P. Lanter. 1991. Design of a lineage-based meta-data base for GIS. Cartog. Geog. Inf. Syst. 18, 4 (1991), 255–261. DOI:DOI:.
[33]
Zhiguo Long, Michael Sioutis, and Sanjiang Li. 2016. Efficient path consistency algorithm for large qualitative constraint networks. In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI’16). AAAI Press, 1202–1208.
[34]
Alan K. Mackworth. 1977. Consistency in networks of relations. Artif. Intell. 8, 1 (1977), 99–118. DOI:DOI:.
[35]
Keith May, James Stuart Taylor, and Steve Roskams. 2019. When Harris met Allen in the matrix: How can the conceptual modelling of stratigraphic relationships facilitate deeper understanding of archaeological space and time? In Book of Abstracts of the 47th Computer Applications and Quantitative Methods in Archaeology (CAA’19). Retrieved from https://2019.caaconference.org/wp-content/uploads/sites/25/2019/04/CAA2019_programabstracts_v20190423.pdf.
[36]
Adam Mertel and David Zbíral. 2019. Early Christian baptisteries: Geocoding, exploring and analysing a spatiotemporal dataset. In Book of Abstracts of the 47th Computer Applications and Quantitative Methods in Archaeology (CAA’19). 49–50. Retrieved from https://2019.caaconference.org/wp-content/uploads/sites/25/2019/04/CAA2019_programabstracts_v20190423.pdf.
[37]
S. Migliorini. 2019. Enhancing CIDOC-CRM models for GeoSPARQL processing with MapReduce. In Proceedings of the 2nd Workshop on Computing Techniques for Spatio-temporal Data in Archaeology and Cultural Heritage. CEUR-WS, 51–65. Retrieved from http://ceur-ws.org/Vol-2230/.
[38]
Sara Migliorini, Alberto Belussi, and Elisa Quintarelli. 2020. Promoting data provenance tracking in the archaeological interpretation process. In Proceedings of the Workshops of the EDBT/ICDT 2020 Joint Conference (CEUR Workshop Proceedings), Vol. 2578. CEUR-WS.org. Retrieved from http://ceur-ws.org/Vol-2578/PIE5.pdf.
[39]
S. Migliorini and P. Grossi. 2018. Towards the extraction of semantics from incomplete archaeological records. In Proceedings of Workshops and Posters at the 13th International Conference on Spatial Information Theory (COSIT’17). Springer International Publishing, 349–358. DOI:DOI:.
[40]
Sara Migliorini, Piergiovanna Grossi, and Alberto Belussi. 2017. An interoperable spatio-temporal model for archaeological data based on ISO standard 19100. J. Comput. Cult. Herit. 11, 1 (2017). DOI:DOI:.
[41]
P. Missier and K. Belhajjame. 2012. A PROV encoding for provenance analysis using deductive rules. In Proceedings of the 4th International Provenance and Annotation Workshop—Provenance and Annotation of Data and Processes. 67–81. DOI:DOI:.
[42]
Paolo Missier, Khalid Belhajjame, and James Cheney. 2013. The W3C PROV family of specifications for modelling provenance metadata. In Proceedings of the Joint 2013 EDBT/ICDT Conferences. ACM, 773–776. DOI:DOI:.
[43]
F. Mörchen. 2007. Unsupervised pattern mining from symbolic temporal data. SIGKDD Explor. Newslett. 9, 1 (2007), 41–55.
[44]
Guillem Santos, Joan Masó, Alaitz Zabala Torres, Lluís Pesquer, and Xavier Pons. 2019. A provenance metadata model integrating ISO geospatial lineage and the OGC WPS: Conceptual model and implementation. Trans. GIS 23 (07 2019). DOI:DOI:.
[45]
Eddie Schwalb and Lluís Vila. 1998. Temporal constraints: A survey. Constraints Int. J. 3, 2/3 (1998), 129–149. DOI:DOI:.
[46]
Yogesh Simmhan, Beth Plale, and Dennis Gannon. 2005. A survey of data provenance in e-science. SIGMOD Rec. 34, 3 (2005), 31–36. DOI:DOI:.
[47]
Richard T. Snodgrass, Michael H. Boehlen, Christian S. Jensen, and Andreas Steiner. 1996. Adding Transaction Time to SQL/Temporal. change proposal, ANSI X3H2-96-502r2, ISO/IEC JTC1/SC21/ WG3 DBL MAD-147r2.
[48]
Richard T. Snodgrass, Michael H. Boehlen, Christian S. Jensen, and Andreas Steiner. 1996. Adding Valid Time to SQL/Temporal. change proposal, ANSI X3H2-96-501r2, ISO/IEC JTC1/SC21/ WG3 DBL MAD-146r2.
[49]
Stephen Stead, Martin Doerr, Christian-Emil Ore, Athina Kritsotaki, et al. 2019. CRM\(_{inf}\): the Argumentation Model. An Extension of CIDOC-CRM to support argumentation. Retrieved from http://www.cidoc-crm.org/crminf/ModelVersion/version-10.1.
[50]
Lluis V. and Lluis G.1994. On fuzzy temporal constraint networks. Math. Soft Comput. 3 (1994), 315–334.

Cited By

View all
  • (2024)Comprehensive Review and Future Research Directions on ICT StandardisationInformation10.3390/info1511069115:11(691)Online publication date: 2-Nov-2024
  • (2023)Fuzzy Typological (Re)arrangement: a Prototype of Rethinking the Typology of Roman Tablewares from Sagalassos, Southwest AnatoliaJournal of Archaeological Method and Theory10.1007/s10816-023-09614-931:3(707-760)Online publication date: 21-Aug-2023
  • (2023)Tracking social provenance in chains of retweetsKnowledge and Information Systems10.1007/s10115-023-01878-765:10(3967-3994)Online publication date: 9-May-2023

Index Terms

  1. Tracking Data Provenance of Archaeological Temporal Information in Presence of Uncertainty

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Journal on Computing and Cultural Heritage
    Journal on Computing and Cultural Heritage   Volume 15, Issue 2
    June 2022
    403 pages
    ISSN:1556-4673
    EISSN:1556-4711
    DOI:10.1145/3514179
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 April 2022
    Accepted: 01 August 2021
    Revised: 01 July 2021
    Received: 01 October 2020
    Published in JOCCH Volume 15, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Provenance
    2. temporal constraints
    3. information discovery

    Qualifiers

    • Research-article
    • Refereed

    Funding Sources

    • Italian National Group for Scientific Computation (GNCS-INDAM)
    • “Progetto di Eccellenza” of the Computer Science Dept., Univ. of Verona, Italy

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)51
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Comprehensive Review and Future Research Directions on ICT StandardisationInformation10.3390/info1511069115:11(691)Online publication date: 2-Nov-2024
    • (2023)Fuzzy Typological (Re)arrangement: a Prototype of Rethinking the Typology of Roman Tablewares from Sagalassos, Southwest AnatoliaJournal of Archaeological Method and Theory10.1007/s10816-023-09614-931:3(707-760)Online publication date: 21-Aug-2023
    • (2023)Tracking social provenance in chains of retweetsKnowledge and Information Systems10.1007/s10115-023-01878-765:10(3967-3994)Online publication date: 9-May-2023

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media