Abstract
In the Time-Warp synchronization model, the processes must occasionally interrupt execution in order to reclaim memory space used by state and event histories that are no longer needed (fossil-collection). Traditionally, fossil-collection techniques have required the processes to reach a consensus on the Global Virtual-Time (GVT) — the global progress time. Events with time-stamps less than GVT are guaranteed to have been processed correctly; their histories can be safely collected. This paper presents Optimistic Fossil-Collection (OFC), a new fossil-collection algorithm that is fully distributed. OFC uses a local decision function to estimate the fossilized portion of the histories (and optimistically collects them). Because a global property is estimated using local information only, an erroneous estimate is possible. Accordingly, OFC must also include a recovery mechanism to be feasible. An uncoordinated distributed checkpointing algorithm for Time-Warp that is domino-effect free and lightweight is used. We show that, in addition to eliminating the overhead for GVT estimation, OFC has several desireable memory-management properties.
This work was partially supported by the Advanced Research Projects Agency and monitored by the Department of Justice under contract number J-FBI-93-116.
Preview
Unable to display preview. Download preview PDF.
References
Chandy, K. M., and Lamport, L. Distributed snapshots: Determining global states of distributed systems. ACM Transactions on Computer Systems 3, 1 (Feb. 1985), 63–75.
Das, S., Fujimoto, R., Panesar, K., Allison, D., and Hybinette, M. GTW: a time warp system for shared memory multiprocessors. In Proceedings of the 1994 Winter Simulation Conference (Dec. 1994), J. D. Tew, S. Manivannan, D. A. Sadowski, and A. F. Seila, Eds., pp. 1332–1339.
D'Souza, L. M., Fan, X., and Wilsey, P. A. pGVT: An algorithm for accurate GVT estimation. In Proc. of the 8th Workshop on Parallel and Distributed Simulation (PADS 94) (July 1994), Society for Computer Simulation, pp. 102–109.
Elnozahy, E., Johnson, D., and Wang, Y. A survey of rollback-recovery protocols in message-passing systems. Tech. Rep. Tech. Rept. CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Oct. 1996.
Fleischmann, J., and Wilsey, P. A. Comparative analysis of periodic state saving techniques in time warp simulators. In Proc. of the 9th Workshop on Parallel and Distributed Simulation (PADS 95) (June 1995), pp. 50–58.
Fujimoto, R. Parallel discrete event simulation. Communications of the ACM 33, 10 (Oct. 1990), 30–53.
Jefferson, D. Virtual time. ACM Transactions on Programming Languages and Systems 7, 3 (July 1985), 405–425.
Lai, T., and Yang, J. On distributed snapshots. Information Processing Letters 25 (May 1987), 153–158.
Lin, Y.-B. Memory management algorithms for optimistic parallel simulation. In 6th Workshop on Parallel and Distributed Simulation (Jan. 1992), Society for Computer Simulation, pp. 43–52.
Martin, D. E., McBrayer, T. J., and Wilsey, P. A. warped: A time warp simulation kernel for analysis and application development. In 29th Hawaii International Conference on System Sciences (HICSS-29) (Jan. 1996), H. El-Rewini and B. D. Shriver, Eds., vol. Volume I, pp. 383–386.
Mattern, F. Virtual time and global states in distributed systems. In Proc. Workshop on Parallel and Distributed Algorithms (Oct. 1989), M. Cosnard et al, Ed., pp. 215–226.
Mattern, F. Efficient algorithms for distributed snapshots and global virtual time approximation. Journal of Parallel and Distributed Computing 18, 4 (Aug. 1993), 423–434.
Randell, B. System structure for software fault tolerance. IEEE Trans. on Software Engineering SE-1, 2 (June 1975), 220–232.
Ross, S. M.Introduction to Probability Models, 4 ed. Academic Press, San Diego, CA, 1989.
Spezialetti, M., and Kearns, P. Efficient distributed snapshots. In Proc. IEEE International Conference on Distributed Computing Systems (1986), pp. 382–388.
Tomlinson, A. I., and Garg, V. K. An algorithm for minimally latent global virtual time. In Proc of the 7th Workshop on Parallel and Distributed Simulation (PADS) (July 1993), Society for Computer Simulation, pp. 35–42.
Young, C. Methods for Optimistic Reclamation of Fossils in Time Warp Simulation. PhD thesis, University of Cincinnati, June 1997. (Ph.D. proposal).
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Young, C.H., Abu-Ghazaleh, N.B., Wilsey, P.A. (1998). OFC: A distributed fossil-collection algorithm for Time-Warp. In: Kutten, S. (eds) Distributed Computing. DISC 1998. Lecture Notes in Computer Science, vol 1499. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0056498
Download citation
DOI: https://doi.org/10.1007/BFb0056498
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65066-9
Online ISBN: 978-3-540-49693-9
eBook Packages: Springer Book Archive