skip to main content
10.1145/1739041.1739050acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Correlation aware synchronization for near real time decision support systems

Published: 22 March 2010 Publication History

Abstract

Many large companies, especially those in financial and insurance service sectors, approach the market with a decentralized management structure, such as by line of business or geographical market segments. However, these companies require access to distributed and possibly heterogeneous data sources for corporate level decision making. In this paper, we focus on challenges of supporting a decision support system (DSS) based on a hybrid approach (i.e. a federation system with replication of frequently accessed remote data sources) for time-sensitive agile business intelligence applications. The response time requirement (and a realistic goal) for such a DSS is near real time (i.e. 2 ~ 3 minutes to 20 ~ 30 minutes). The users of a DSS care about not only the response time but also the time stamp of the business operation reports since out-dated reports introduce uncertainty and risks to decision-making. Thus, the information value of a report decreases as time passes. We present a framework of correlation aware synchronization of replicas used in DSS to optimize information values of business reports as a whole. The framework exploits correlation of usage and synchronization latency of replicas in a single query and a workload of queries for an optimal synchronization schedule. We have conducted extensive evaluations based on both TPC-H and synthetic workload. The proposed correlation aware synchronization effectively improves up to 50% of information value comparing with fixed synchronization plans on average.

References

[1]
A. Land and A. Doig. An Automatic Method of Solving Discrete Programming Problems. Econometrica, (28):497--520, 1960.
[2]
S. Acharya, M. Franklin, and S. Zdonik. Balancing push and pull for data broadcast. In Proceedings of the ACM SIGMOD International Conference on Management of Data, 1997.
[3]
Philip A. Bernstein, Alan Fekete, Hongfei Guo, Raghu Ramakrishnan, and Pradeep Tamma. Relaxed-currency serializability for middle-tier caching and replication. In Proc. of the ACM SIGMOD Int'l Conference on Management of Data, pages 599--610, Chicago, Illinois, USA, 2006.
[4]
P. Deolasee, A. Katkar, A. Panchbudhe, K. Ramamritham, and P. Shenoy. Adaptive Push-Pull: Dissemination of Dynamic Web Data. In the Proceedings of the 10th WWW Conference, Hong Kong, China, May 2001.
[5]
Lukasz Golab, Theodore Johnson, and Vladislav Shkapenyuk. Scheduling updates in a real-time stream warehouse. In ICDE, pages 1207--1210, 2009.
[6]
D. Goldberg. Genetic Algorithms in Searth, Optimization, and Machine Learning. Kluwer Academic, 1989.
[7]
R. Graham. Bounds for certain multiprocessing anomalies. Bell Systems Technical Journal, (45):1563--1581, 1966.
[8]
Hongfei Guo, Per-Åke Larson, Raghu Ramakrishnan, and Jonathan Goldstein. Relaxed currency and consistency: How to say "good enough" in SQL. In Proc. of the ACM SIGMOD Int'l Conference on Management of Data, pages 815--826, Paris, France, June 2004.
[9]
J. Holland. Adaptation in Natural and Artificial Systems. MIT Press, 1992.
[10]
http://javasim.ncl.ac.uk/. JavaSim User's Guide.
[11]
http://www.tpc.org/hspec.html. TPC-H Benchmark Specification.
[12]
Haifeng Jiang, Dengfeng Gao, and Wen-Syan Li. Exploiting Correlation and Parallelism for Materialized-View Recommendation in Distributed Data Warehouses. In ICDE, 2007.
[13]
L. Costa and P. Oliveira. Evolutionary algorithms approach to the solution of mixed integer nonlinear programming problems. In Comput. Chem. Eng. 25, 2001.
[14]
Paul Larson, Jonathan Goldstein, and Jingren Zhou. MTCache: Transparent Mid-Tier Database Caching in SQL Server. In ICDE, 2004.
[15]
W. Lehner, B. Cochrane, H. Pirahesh, and M. Zaharioudakis. Applying Mass Query Optimization to Speed up Automatic Summary Table Refresh. In ICDE, 2001.
[16]
Hennadiy Leontyev, Theodore Johnson, and James H. Andersonl. Predicting maximum data staleness in real-timewarehouses. In Proceedings of the 30th IEEE Real-Time Systems Symposium, 2007.
[17]
Wen-Syan Li, Kemal Altintas, and Murat Kantarcioglu. On demand synchronization and load distribution for database grid-based Web applications. Data and Knowledge Engineering, 51(3), 2004.
[18]
Wen-Syan Li, Daniel Zilio, Vishal S. Batra, Mahadevan Subramanian, Calisto Zuzarte, and Inderpal Narang. Load Balancing for Multi-tiered Database Systems through Autonomic Placement of Materialized Views. In Proceedings of the International Conference on Data Engineering, 2006.
[19]
Felix Naumann. Quality-Driven Query Answering for Integrated Information Systems, volume 2261 of Lecture Notes in Computer Science. Springer, 2002.
[20]
Felix Naumann, Ulf Leser, and Johann Christoph Freytag. Quality-driven integration of heterogenous information systems. In VLDB, pages 447--458. Morgan Kaufmann, 1999.
[21]
Chris Olston, Boon Thau Loo, and Jennifer Widom. Adaptive precision setting for cached approximate values. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 355--366, 2001.
[22]
Chris Olston and Jennifer Widom. Best-effort cache synchronization with source cooperation. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 73--84, 2002.
[23]
Thomas Phan, Kavitha Ranganathan, and Radu Sion. Evolving toward the perfect schedule: Co-scheduling job assignments and data replication in wide-area systems using a genetic algorithm. In JSSPP, pages 173--193, 2005.
[24]
R. Lima and G. Francois and B. Srinivasan and R. Salcedo. Dynamic optimization of batch emulsion polymerization using MSIMPSA, a simulated-annealing-based algorithm. Ind. Eng. Chem. Res., 43(24), 2004.
[25]
R. Oliveira and R. Salcedo. Benchmark testing of simulated annealing, adaptive random search and genetic algorithms for the global optimization of bioprocesses. In International Conference on Adaptive and Natural Computing Algorithms, 2005.
[26]
S. Russell and P. Norvig. Artificial Intelligence: A Modern Approach. Prentice Hall, 2002.
[27]
Srikumar Venugopal, Rajkumar Buyya, and Kotagiri Ramamohanarao. A taxonomy of data grids for distributed data sharing, management, and processing. ACM Comput. Surv., 38(1), 2006.
[28]
W. Goffe and G. Ferrier and J. Rogers. Global optimization of statistical functions with simulated annealing. J. Econometrica, (60):65--99, 1994.
[29]
Ying Yan, Wen-Syan Li, and Jian Xu. Information value-driven near real-time decision support systems. In ICDCS, 2009.

Index Terms

  1. Correlation aware synchronization for near real time decision support systems

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    EDBT '10: Proceedings of the 13th International Conference on Extending Database Technology
    March 2010
    741 pages
    ISBN:9781605589459
    DOI:10.1145/1739041
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 March 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article

    Conference

    EDBT/ICDT '10
    EDBT/ICDT '10: EDBT/ICDT '10 joint conference
    March 22 - 26, 2010
    Lausanne, Switzerland

    Acceptance Rates

    Overall Acceptance Rate 7 of 10 submissions, 70%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 63
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Feb 2025

    Other Metrics

    Citations

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media