Abstract
Data object replication onto distributed servers can potentially alleviate bottlenecks, reduce network traffic, increase scalability, add robustness, and decrease user perceived access time. The decision of selecting data object and server pairs requires solving a constraint optimization problem that in general is NP-complete. In this paper, we abstract the distributed database system as an agent-based model, wherein agents continuously compete for allocation and reallocation of data objects. Each agent aims to replicate objects onto its server such that the communication cost is minimized. However, these agents do not have a global view of the system. Thereby, the optimization process becomes highly localized. Such localized optimization may severely affect the overall system performance. To cope with such localized optimization, we propose a “semi-distributed” axiomatic game theoretical mechanism. The mechanism’s control is unique in its decision making process, wherein all the heavy processing is done on the servers of the distributed system and the central body is only required to take a binary decision: (0) not to replicate or (1) to replicate. The cost model used by the agents in the mechanism for the purpose of identifying beneficial data objects is tailored made so that even though the agents take decisions based on their local knowledge domain, the effect is translated into a system-wide performance enhancement. The proposed mechanism is extensively compared against seven well-known conventional and three game theoretical replica allocation methods, namely, branch and bound, greedy, genetic, data-aware replication, tree inspired bottom-up procedure, tree inspired min-max procedure, Benders’ decomposition based procedure, game theoretical English auction, game theoretical Dutch auction, and game theoretical selfish replication procedure. The experimental setup incorporates GT-ITM, Inet network topology generators, Soccer World Cup 1998 access logs, and NASA Kennedy Space Center access logs to closely mimic the Web in its infrastructure and user access patterns. The experimental results reveal that the proposed technique despite its non-cooperative nature improves the solution quality and reduces the execution time compared to other techniques.
Similar content being viewed by others
References
Ahmad, I., Ghafoor, A.: Semi-distributed load balancing for massively parallel multicomputer systems. IEEE Trans. Softw. Eng. 17(10), 987–1004 (1991)
Arlitt, M., Jin, T.: Workload characterization of the 1998 World Cup Web Site. Tech. Report, Hewlett Packard Lab, Palo Alto, HPL-1999-35(R.1) (1999)
Awerbuch, B., Bartal, Y., Fiat, A.: Competitive distributed file allocation. In: Proc. 25th ACM Symposium on Theory of Computation, pp. 164–173 (1993)
Bekta, T., Cordeaua, J.-F., Erkutc, E., Laportea, G.: Exact algorithms for the joint object placement and request routing problem in content distribution networks. Comput. Oper. Res. 35, 3860–3884 (2008)
Briest, P., Krysta, P., Vöcking, B.: Approximation techniques for utilitarian mechanism design. In: Proc. of 37th ACM Symposium on Theory of Computation, pp. 39–48 (2005)
Campbell, D.: Resource Allocation Mechanisms. Cambridge University Press, Cambridge (1987)
Casey, R.: Allocation of copies of a file in an information network. In: Proc. Spring Joint Computer Conf., IFIPS, pp. 617–625 (1972)
Chandy, J.A.: A generalized replica placement strategy to optimize latency in a wide area distributed storage system. In: International Workshop on Data-Aware Distributed Computing, pp. 49–54 (2008)
Chandy, K., Hewes, J.: File allocation in distributed systems. In: Proc. of the International Symposium on Computer Performance Modeling, Measurement and Evaluation, pp. 10–13 (1976)
Chang, H., Govindan, R., Jamin, S., Shenker, S.: Towards capturing representative AS-level Internet topologies. Comput. Netw. J. 44(6), 737–755 (2004)
Chu, W.: Optimal file allocation in a multiple computer system. IEEE Trans. Comput. C-18(10), 885–889 (1969)
Chun, B.-G., Chaudhuri, K., Wee, H., Barreno, M., Papadimitriou, C., Kubiatowicz, J.: Selfish caching in distributed systems: a game-theoretic analysis. In: Proc. of 23rd ACM Symposium on Principles of Distributed Computing, pp. 21–30 (2004)
Cidon, I., Kutten, S., Soffer, R.: Optimal allocation of electronic content. In: Proc. of IEEE INFOCOM, pp. 1773–1780 (2001)
Cook, S., Pachl, J., Pressman, I.: The optimal location of replicas in a network using a READ-ONE-WRITE-ALL policy. Distrib. Comput. 15(1), 57–66 (2002)
Dowdy, L., Foster, D.: Comparative models of the file assignment problem. ACM Comput. Surv. 14(2), 287–313 (1982)
Eswaran, K.P: Placement of records in a file and file allocation in a computer. In: Proc. of IFIP Congress, pp. 304–307 (1974)
Green, J., Laffont, J.: Characterization of satisfactory mechanisms for the revelation of preferences for public goods. Econometrica 45(2), 427–438 (1977)
Groves, T.: Incentives in teams. Econometrica 41, 617–631 (1973)
Habegger, P., Bieri, H.: Modeling the topology of the Internet: an assessment. Tech. Report, Institut für Informatik und angewandte Mathematik, Universität Bern, IM-02-2002
Hakimi, S.: Optimum location of switching centers and the absolute centers and medians of a graph. Oper. Res. 12, 450–459 (1964)
Hara, T.: Effective replica allocation in ad hoc networks for improving data accessibility. In: Proc. of INFOCOM, pp. 1568–1576 (2001)
Heddaya, A., Mirdad, S.: WebWave: globally load balanced fully distributed caching of hot published documents. In: Proc. 17th International Conference on Distributed Computing Systems, pp. 160–168 (1997)
Jamin, S., Jin, C., Jin, Y., Riaz, D., Shavitt, Y., Zhang, L.: On the placement of Internet instrumentation. In: Proc. of the IEEE INFOCOM, pp. 295–304 (2000)
Jamin, S., Jin, C., Kurc, T., Raz, D., Shavitt, Y.: Constrained mirror placement on the Internet. In: Proc. of the IEEE INFOCOM, pp. 31–40 (2001)
Kalpakis, K., Dasgupta, K., Wolfson, O.: Optimal placement of replicas in trees with read, write, and storage costs. IEEE Trans. Parallel Distrib. Syst. 12(6), 628–637 (2001)
Kangasharju, J., Roberts, J., Ross, K.: Object replication strategies in content distribution networks. In: Proc. of Web Caching and Content Distribution Workshop, pp. 455–456 (2001)
Karlsson, M., Mahalingam, M.: Do we need replica placement algorithms in content delivery networks? In: Proc. of Web Caching and Content Distribution Workshop, pp. 117–128 (2002)
Khan, S., Ahmad, I.: Internet content replication: a solution from game theory. Technical Report, University of Texas at Arlington, CSE-2004-5 (2004)
Khan, S., Ahmad, I.: Heuristic-based replication schemas for fast information retrieval over the Internet. In: Proc. of 17th International Conference on Parallel and Distributed Computing Systems, pp. 278–283 (2004)
Khan, S., Ahmad, I.: A powerful direct mechanism for optimal WWW content replication. In: Proc. of 19th IEEE International Parallel and Distributed Processing Symposium (2005)
Khan, S.U., Ahmad, I.: RAMM: a game theoretical replica allocation and management mechanism. In: 8th International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN), Las Vegas, NV, USA, pp. 160–165, December 2005
Khan, S.U., Ahmad, I.: A pure nash equilibrium guaranteeing game theoretical replica allocation method for reducing web access time. In: 12th International Conference on Parallel and Distributed Systems (ICPADS), Minneapolis, MN, USA, pp. 169–176, July 2006
Khan, S.U., Ahmad, I.: Replicating data objects in large-scale distributed computing systems using extended Vickery auction. Int. J. Comput. Intell. 3(1), 14–22 (2006)
Khan, S.U., Ahmad, I.: Discriminatory algorithmic mechanism design based WWW content replication. Informatica 31(1), 105–119 (2007)
Khan, S.U., Ahmad, I.: A cooperative game theoretical replica placement technique. In: 13th International Conference on Parallel and Distributed Systems (ICPADS), Hsinchu, Taiwan, December 2007
Korupolu, M., Plaxton, C.: Analysis of a local search heuristic for facility location problems. J. Algorithms 37(1), 146–188 (2000)
Krick, C., Racke, H., Westermann, M.: Approximation algorithms for data management in networks. In: Proc. of the Symposium on Parallel Algorithms and Architecture, pp. 237–246 (2001)
Krishna, V.: Auction Theory. Academic Press, San Diego (2002)
Krishnan, P., Raz, D., Shavitt, Y.: The cache location problem. IEEE/ACM Trans. Netw. 8(5), 568–582 (2000)
Kurose, J., Simha, R.: A microeconomic approach to optimal resource allocation in distributed computer systems. IEEE Trans. Comput. 38(5), 705–717 (1989)
Laoutaris, N., Telelis, O., Zissimopoulos, V.: Distributed selfish replication. IEEE Trans. Parallel Distrib. Syst. 17(12), 1401–1413 (2006)
Li, B., Golin, M., Italiano, G., Deng, X.: On the optimal placement of web proxies in the Internet. In: Proc. of the IEEE INFOCOM, pp. 1282–1290 (2000)
Lin, Y.-F., Liu, P., Wu, J.-J.: Optimal placement of replicas in data grid environments with locality assurance. In: 12th International Conference on Parallel and Distributed Systems (ICPADS), pp. 465–474 (2006)
Loukopoulos, T., Papadias, D., Ahmad, I.: An overview of data replication on the Internet. In: Proc. of International Symposium on Parallel Architectures, Algorithms and Networks, pp. 31–38 (2002)
Loukopoulos, T., Ahmad, I.: Static and adaptive distributed data replication using genetic algorithms. J. Parallel Distrib. Comput. 64(11), 1270–1285 (2004)
Mahmoud, S., Riordon, J.: Optimal allocation of resources in distributed information networks. ACM Trans. Database Syst. 1(1), 66–78 (1976)
Mas-Collel, A., Whinston, W., Green, J.: Microeconomic Theory. Oxford University Press, London (1995)
Medina, A., Matta, I., Byers, J.: On the origin of power laws in Internet topologies. ACM Comput. Commun. Rev. 30(2), 18–28 (2000)
Narebdran, B., Rangarajan, S., Yajnik, S.: Data distribution algorithms for load balancing fault-tolerant web access. In: Proc. of the 16th Symposium on Reliable Distributed Systems, pp. 97–106 (1997)
NASA Kennedy Space Center access log. Available at: http://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html
Nisan, N., Ronen, A.: Algorithmic mechanism design. In: Proc. of 31st ACM Symposium on Theory of Computation, pp. 129–140 (1999)
Osborne, M., Rubinstein, A.: A Course in Game Theory. MIT Press, Cambridge (1994)
Pautet, L., Tardieu, S.: GLADE: a framework for building large object-oriented real-time distributed systems. In: 3rd International Symposium on Object-Oriented Real-Time Distributed Systems, pp. 244–251 (2000)
Qiu, L., Padmanabhan, V., Voelker, G.: On the placement of web server replicas. In: Proc. of the IEEE INFOCOM, pp. 1587–1596 (2001)
Rabinovich, M.: Issues in Web content replication. Data Eng. Bull. 21(4), 21–29 (1998)
Radoslavov, P., Govindan, R., Estrin, D.: Topology-informed Internet replica placement. Comput. Commun. 25(4), 384–392 (2002)
Saurabh, S., Parkes, D.: Hard-to-manipulate VCG-based auctions. Available at: http://www.eecs.harvard.edu/econcs/pubs/hard_to_manipulate.pdf
Venkataramanj, A., Weidmann, P., Dahlin, M.: Bandwidth constrained placement in a WAN. In: Proc. ACM Symposium on Principles of Distributed Computing, pp. 134–143 (2001)
Waxman, B.: Routing of multipoint connections. IEEE J. Sel. Areas Commun. 6(9), 1617–1622 (1988)
Wolfson, O., Jajodia, S., Hang, Y.: An adaptive data replication algorithm. ACM Trans. Database Syst. 22(4), 255–314 (1997)
Zegura, E., Calvert, K., Donahoo, M.: A quantitative comparison of graph-based models for Internet topology. IEEE/ACM Trans. Netw. 5(6), 770–783 (1997)
Zipf, G.: Human Behavior and the Principle of Least-Effort. Addison-Wesley, Reading (1949)
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Ashfaq Khokhar.
Rights and permissions
About this article
Cite this article
Khan, S.U., Ahmad, I. Replicating data objects in large distributed database systems: an axiomatic game theoretic mechanism design approach. Distrib Parallel Databases 28, 187–218 (2010). https://doi.org/10.1007/s10619-010-7069-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10619-010-7069-0