Abstract
Data is a modern commodity. Yet the pricing models in use on electronic data markets either focus on the usage of computing resources, or are proprietary, opaque, most likely ad hoc, and not conducive of a healthy commodity market dynamics. In this paper we propose a generic data pricing model that is based on minimal provenance, i.e. minimal sets of tuples contributing to the result of a query. We show that the proposed model fulfills desirable properties such as contribution monotonicity, bounded-price and contribution arbitrage-freedom. We present a baseline algorithm to compute the exact price of a query based on our pricing model. We show that the problem is NP-hard. We therefore devise, present and compare several heuristics. We conduct a comprehensive experimental study to show their effectiveness and efficiency.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Andrei, S., Robert, V.: The limits of arbitrage. Journal of Finance (1997)
Balazinska, M., Howe, B., Suciu, D.: Data markets in the cloud: An opportunity for the database community. PVLDB 4(12), 1482–1485 (2011)
Brynjolfsson, E., Hitt, L.M., Kim, H.: Strength in numbers: How does data-driven decision-making affect firm performance? In: ICIS (2011)
Buneman, P., Khanna, S., Tan, W.-C.: Why and Where: A Characterization of Data Provenance. In: ICDT, pp. 316–330 (2001)
Cheney, J., Chiticariu, L., Tan, W.C.: Provenance in Databases: Why, How, and Where. Foundations and Trends in Databases 1(4), 379–474 (2009)
Cui, Y., Widom, J.: Practical Lineage Tracing in Data Warehouses. In: ICDE, pp. 367–378 (2000)
Durkee, D.: Why cloud computing will never be free. Commun. ACM (2010)
Hey, T., Tansley, S., Tolle, K.M. (eds.): The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research (2009)
Green, T.J., Karvounarakis, G., Tannen, V.: Provenance Semirings. In: PODS (2007)
Khanna, S., Sudan, M., Trevisan, L., Williamson, D.P.: The approximability of constraint satisfaction problems. SIAM J. Comput. 30(6), 1863–1920 (2000)
Koutris, P., Upadhyaya, P., Balazinska, M., Howe, B., Suciu, D.: Query-based data pricing. In: PODS (2012)
Li, X.Y.: Optimization algorithms for the minimum-cost satisfiability problem
A. LLC. AggData, http://www.aggdata.com/
Microsoft. Windows Azure Marketplace, https://datamarket.azure.com/
Muschalle, A., Stahl, F., Loser, A., Vossen, G.: Pricing approaches for data markets. In: BIRTE (2012)
Prugovečki, E.: Quantum Mechanics in Hilbert Space, 2nd edn. Dover Books on Physics Series. Dover Publications (2006)
Tang, R., Wu, H., Bao, Z., Bressan, S., Valduriez, P.: Technical report: The price is right – a model and algorithms for the price of data, http://www.comp.nus.edu.sg/~tang1987/tech-reports/pricing-provenance.pdf
Wu, S., Banker, R.D.: Best pricing strategy for information services. J. AIS 11(6) (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tang, R., Wu, H., Bao, Z., Bressan, S., Valduriez, P. (2013). The Price Is Right. In: Decker, H., Lhotská, L., Link, S., Basl, J., Tjoa, A.M. (eds) Database and Expert Systems Applications. DEXA 2013. Lecture Notes in Computer Science, vol 8056. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40173-2_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-40173-2_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40172-5
Online ISBN: 978-3-642-40173-2
eBook Packages: Computer ScienceComputer Science (R0)