QoS-based Checkpoint Protocol for Multimedia Network Systems

Osada, Shinji; Higaki, Hiroaki

doi:10.1007/3-540-45453-5_74

Conference paper
First Online: 20 November 2001

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2195)

Abstract

Advances in computer and network technologies have led to the development of computer networks. Here, an application is realized by multiple processes located on multiple computers connected to a communication network such as the Internet. Each process computes and communicates with other processes by exchanging messages through communication channels. Mission-critical applications are required to execute fault-tolerantly. That is, even if some processes fail, execution of the application must continue. One of the important methods for realizing fault-tolerant networks is checkpoint-recovery [2,4,6,7,10–12,16,19–21]. During failure-free execution, each process takes local checkpoints by storing state information in stable storage [14]. If a process fails, the processes restart from the checkpoints by restoring the state information from stable storage. To restart applications correctly in conventional data communication networks, the set of local checkpoints taken by all the processes, from which the processes restart, should form a consistent global checkpoint [3]. A global checkpoint is defined to be consistent if there is neither an orphan message nor a lost message. However, in a multimedia communication network, applications require transmission of large multimedia messages and low-overhead failure-free execution rather than complete consistency. Hence, this paper proposes novel criteria for consistent global checkpoints based on the properties of multimedia communication networks and applications.
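The conventional consistency criterion mentioned in the abstract can be illustrated with a small sketch. It assumes the usual definitions: a message is an orphan if its receipt is recorded in the receiver's checkpoint but its sending is not recorded in the sender's checkpoint, and it is lost if its sending is recorded but its receipt is not. The names `Message`, `classify`, and `is_consistent`, and the use of per-process local event indices, are hypothetical choices made for illustration. This is not the QoS-based criterion the paper proposes, only the baseline it relaxes.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class Message:
    """A message with the local event index of its send and receive events.

    Indices are local to each process (hypothetical model): send_index counts
    events on the sender, recv_index counts events on the receiver.
    """
    sender: str
    receiver: str
    send_index: int
    recv_index: int


def classify(messages: Iterable[Message],
             checkpoints: Dict[str, int]) -> Tuple[List[Message], List[Message]]:
    """Return (orphans, lost) for a candidate global checkpoint.

    checkpoints maps each process to the local event index of its checkpoint;
    an event is recorded in the checkpoint if its index is <= that value.
    """
    orphans: List[Message] = []
    lost: List[Message] = []
    for m in messages:
        sent_recorded = m.send_index <= checkpoints[m.sender]
        recv_recorded = m.recv_index <= checkpoints[m.receiver]
        if recv_recorded and not sent_recorded:
            orphans.append(m)   # received in the past of a send that "never happened"
        elif sent_recorded and not recv_recorded:
            lost.append(m)      # sent, but the receiver restarts before receiving it
    return orphans, lost


def is_consistent(messages: Iterable[Message],
                  checkpoints: Dict[str, int]) -> bool:
    """Conventional criterion: no orphan message and no lost message."""
    orphans, lost = classify(messages, checkpoints)
    return not orphans and not lost


# One message from p1 (its 2nd event) to p2 (its 3rd event).
msgs = [Message("p1", "p2", send_index=2, recv_index=3)]
both_recorded = is_consistent(msgs, {"p1": 3, "p2": 4})    # True
orphan_case = is_consistent(msgs, {"p1": 1, "p2": 4})      # False: orphan message
lost_case = is_consistent(msgs, {"p1": 3, "p2": 2})        # False: lost message
```

In this model, a checkpoint set that records neither the send nor the receipt of a message is also consistent, since the message is simply replayed after restart.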

References

Bernstein, P.A. and Goodman, N., “An Algorithm for Concurrency Control and Recovery in Replicated Distributed Databases,” ACM Trans. on Database Systems, Vol. 9, No. 4, pp. 1197–1207 (1984).
Bhargava, B. and Liao, S.R., “Independent Checkpointing and Concurrent Rollback for Recovery in Distributed Systems,” The 7th International Symposium on Reliable Distributed Systems, pp. 3–12 (1988).
Chandy, K.M. and Lamport, L., “Distributed Snapshots: Determining Global States of Distributed Systems,” ACM Trans. on Computer Systems, Vol. 3, No. 1, pp. 63–75 (1985).
Cristian, F. and Jahanian, F., “A Timestamp-Based Checkpointing Protocol for Long Lived Distributed Computations,” Reliable Distributed Software and Database Systems, pp. 12–20 (1991).
Comer, D.E., “Internetworking with TCP/IP,” Prentice-Hall (1991).
Elnozahy, E.N., Johnson, D.B. and Wang, Y.M., “A Survey of Rollback-Recovery Protocols in Message-Passing Systems,” Technical Note of Carnegie Mellon University, CMU-CS-96-181 (1996).
Elnozahy, E.N., Johnson, D.B. and Zwaenepoel, W., “The performance of consistent checkpointing,” The 11th International Symposium on Reliable Distributed Systems, pp. 39–47 (1992).
Gifford, D.K., “Weighted Voting for Replicated Data,” The 7th ACM Symposium on Operating Systems, pp. 150–162 (1979).
Higaki, H., Nemoto, N., Tanaka, K. and Takizawa, M., “Protocol for Groups of Pseudo-Active Replication Objects,” International Workshop on Object Oriented Realtime Distributed Systems, pp. 35–41 (1999).
Juang, T.T.Y. and Venkatesan, S., “Efficient Algorithms for Crash Recovery in Distributed Systems,” The 10th Conference on Foundations of Software Technology and Theoretical Computer Science, pp. 349–361 (1990).
Johnson, D.B., “Efficient Transparent Optimistic Rollback Recovery for Distributed Application Programs,” The 12th International Symposium on Reliable Distributed Systems, pp. 86–95 (1993).
Koo, R. and Toueg, S., “Checkpointing and Rollback-Recovery for Distributed Systems,” IEEE Trans. on Software Engineering, Vol. SE-13, No. 1, pp. 23–31 (1987).
Kumar, A., “Hierarchical Quorum Consensus: A New Algorithm for Managing Replicated Data,” IEEE Trans. on Computers, Vol. 40, No. 9, pp. 996–1004 (1991).
Lampson, B.W., Paul, M. and Siegert, H.J., “Distributed Systems-Architecture and Implementation,” Springer-Verlag, pp. 246–265 (1981).
Mathew, E. H. and Russell, M. S., “MULTIMEDIA COMPUTING-Case Studies from MIT Project Athena,” Addison-Wesley (1993).
Pankaj, J., “Fault Tolerance in Distributed Systems,” Prentice Hall, pp. 185–213 (1994).
Pu, C.A., Noe, D.D. and Proudfoot, A., “Regeneration of Replicated Objects: A Technique and its Eden Implementation,” IEEE Trans. on Software Engineering, Vol. 14, No. 7, pp. 936–945 (1988).
Shimamura, K., Tanaka, K. and Takizawa, M., “Group Protocol for Exchanging Multimedia Objects in a Group,” 2000 ICDCS Workshop on Group Computation and Communications, pp. 33–40 (2000).
Silva, L.M. and Silva, J.G., “Global Checkpointing for Distributed Programs,” The 11th International Symposium on Reliable Distributed Systems, pp. 155–162 (1992).
Venkatesh, K., Radhakrishnan, T. and Li, H.F., “Optimal and Local Recording for Domino-Free Rollback Recovery,” Information Processing Letters, Vol. 25, pp. 295–303 (1987).
Wood, W.G., “A Decentralized Recovery Protocol,” The 11th International Symposium on Fault Tolerant Computing Systems, pp. 159–164 (1981).

Author information

Authors and Affiliations

Department of Computers and Systems Engineering, Tokyo Denki University, Japan
Shinji Osada

Editor information

Editors and Affiliations

Microsoft Research China, 5/F Beijing Sigma Center 49 Zhichung Road, Haidian District, Beijing, 100080, China
Heung-Yeung Shum
Institute of Information Science, Academia Sinica, Taiwan
Mark Liao
Department of Electrical Engineering, Columbia University, New York, NY, 10027, USA
Shih-Fu Chang

About this paper

Cite this paper

Osada, S., Higaki, H. (2001). QoS-based Checkpoint Protocol for Multimedia Network Systems. In: Shum, HY., Liao, M., Chang, SF. (eds) Advances in Multimedia Information Processing - PCM 2001. PCM 2001. Lecture Notes in Computer Science, vol 2195. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45453-5_74

DOI: https://doi.org/10.1007/3-540-45453-5_74
Published: 20 November 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42680-6
Online ISBN: 978-3-540-45453-3
eBook Packages: Springer Book Archive
