ABSTRACT
Data and metadata suffer many different kinds of change: values are inserted, deleted or updated; entities appear and disappear; properties are added or re-purposed, etc. Explicitly recognizing, exploring, and evaluating such change can alert to changes in data ingestion procedures, can help assess data quality, and can improve the general understanding of the dataset and its behavior over time. We propose a data model-independent framework to formalize such change. Our change-cube enables exploration and discovery of such changes to reveal dataset behavior over time.
- Ziawasch Abedjan, Lukasz Golab, and Felix Naumann. 2015. Profiling relational data: a survey. VLDB Journal 24, 4 (2015), 557--581. Google ScholarDigital Library
- Charu C. Aggarwal. 2007. Data streams: models and algorithms. Vol. 31. Springer Science & Business Media. Google ScholarDigital Library
- Juan M. Ale and Gustavo H. Rossi. 2000. An approach to discovering temporal association rules. In Proc. of SAC. 294--300. Google ScholarDigital Library
- Peter Buneman, Sanjeev Khanna, and Tan Wang-Chiew. 2001. Why and where: A characterization of data provenance. In Proc. of the International Conference on Database Theory (ICDT). 316--330. Google ScholarDigital Library
- Tamraparni Dasu, Theodore Johnson, and Amit Marathe. 2006. Database Exploration Using Database Dynamics. IEEE Data Engineering Bulletin 29, 2 (2006), 43--59.Google Scholar
- Tamraparni Dasu, Theodore Johnson, S. Muthukrishnan, and Vladislav Shkapenyuk. 2002. Mining Database Structure; Or, How to Build a Data Quality Browser. In Proc. of SIGMOD. 240--251. Google ScholarDigital Library
- Stratos Idreos, Olga Papaemmanouil, and Surajit Chaudhuri. 2015. Overview of Data Exploration Techniques. In Proc. of SIGMOD. 277--281. Google ScholarDigital Library
- Yannis Roussakis, Ioannis Chrysakis, Kostas Stefanidis, Giorgos Flouris, and Yannis Stavrakas. 2015. A flexible framework for understanding the dynamics of evolving RDF datasets. In Proc. of ISWC. 495--512. Google ScholarDigital Library
- Praveen Seshadri, Miron Livny, and Raghu Ramakrishnan. 1995. SEQ: A Model for Sequence Databases. In Proc. of ICDE. 232--239. Google ScholarDigital Library
- Richard T. Snodgrass. 2000. Developing Time-Oriented Database Applications in SQL. Morgan Kaufmann. Google ScholarDigital Library
- Michael Stillger, Guy M. Lohman, Volker Markl, and Mokhtar Kandil. 2001. LEO -- DB2's LEarning Optimizer. In Proc. of VLDB. 19--28. Google ScholarDigital Library
- Jürgen Umbrich, Boris Villazón-Terrazas, and Michael Hausenblas. 2010. Dataset Dynamics Compendium: A Comparative Study. In Proc. of the International Workshop on Consuming Linked Data (COLD). Google ScholarDigital Library
Recommendations
Enabling discovery through visual exploration: an introduction to data visualization & its applications
Visual metaphors have assisted human understanding since early days of mankind; the modern scientific and social-scientific evolution especially benefits greatly from the visual medium. With increasing size and complexity of contemporary data, ...
A Model and Framework for Visualization Exploration
Visualization exploration is the process of extracting insight from data via interaction with visual depictions of that data. Visualization exploration is more than presentation; the interaction with both the data and its depiction is as important as ...
Value and Relation Display for Interactive Exploration of High Dimensional Datasets
INFOVIS '04: Proceedings of the IEEE Symposium on Information VisualizationTraditional multi-dimensional visualization techniques, such as glyphs, parallel coordinates and scatterplot matrices, suffer from clutter at the display level and difficult user navigation among dimensions when visualizing high dimensional datasets. In ...
Comments