Skip to main content
Log in

Fault tolerance in partitioned manufacturing networks

  • Published:
Journal of Systems Integration

Abstract

Fault tolerance is especially important for computer systems that require a high degree of confidence. Computer Integrated Manufacturing (CIM) is an area where computer systems must not be disturbed by uncontrolled failures. This article deals with two problems that are related to fault tolerance and network partitions in automated manufacturing systems.

The first problem relates to the distribution of information in partitioned data networks in CIM systems. We indicate how to overcome this problem by using the material network as a redundant data network:

The second problem relates to fault detection and diagnosis in manufacturing systems. The problem is whether the indication of a fault means that a production unit itself has actually broken down, or that the indication is instead due to disturbances in the transmission of material. That is, the production unit continues to operate propcrly despite indications to the contrary. We describe how the material network can be used for detection and diagnosis.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. J.C. Adams, K.V.S. Ramarao, “Distributed diagnosis of Byzantine processors and links,” inProceedings of the 9th International Conference on Distributed Computing Systems, Newport Beach, CA, June 1989, pp. 562–569.

  2. A. Adlemo, S.-A. Andréasson, T. Andréasson, C. Carlsson, “Achieving fault tolerance in factory automation systems by dynamic configuration,” inProceedings of the 1st International Conference on Systems Integration, Morristown, NJ, April 1990, pp. 396–402.

  3. A. Adlemo, S.-A. Andréasson, C. Carlsson, “A model for network partitionings with caching in manufacturing systems,” inProceedings of the 10th SCCC International Conference on Computer Science, Santiago de Chile, Chile, July 1990, pp. 195–206.

  4. A. Adlemo, S.-A. Andréasson, “Fault tolerant information distribution in a partitioned manufacturing system,” inProceedings of the 1991 IEEE International Conference on Robotics and Automation, Sacramento, CA, April 1991, pp. 2238–2245.

  5. A. Adlemo, S.-A. Andréasson, “Models for fault tolerance in manufacturing systems,”Journal of Intelligent Manufacturing, vol. 3, no. 1, pp. 1–10, February 1992.

    Article  Google Scholar 

  6. A. Adlemo, S.-A. Andréasson, “Fault detection in manufacturing systems with data network partitions,” inProceedings of the 1992 IEEE International Conference on Robotics and Automation, Nice, France, May 1992, pp. 975–980.

  7. A. Adlemo, S.-A. Andréasson, M.I. Johansson, “Fault tolerance strategies in an existing FMS installation,” inPreprints of the 7th IFAC/IFIP/IFORS/IMACS/ISPE Symposium on Information Control Problems in Manufacturing Technology INCOM'92, Toronto, Canada, May 1992, pp. 366–373.

  8. R. Akella, Y. Choong, S.B. Gershwin, “Performance of hierarchical production scheduling policy,”IEEE Transactions on Components, Hybrids and Manufacturing Technology, vol. 7, no. 3, pp. 215–217, September 1984.

    Google Scholar 

  9. T. Andréasson, “Towards a hierarchical operating system for supporting fault tolerance in flexible manufacturing systems,” inProceedings of the 13th International Conference on Fault-Tolerant Systems and Diagnostics, Varna, Bulgaria, June 1990, pp. 368–374.

  10. D. Barbara, H. Garcia-Molina, A. Spauster, “Policies for dynamic vote reassignment,” in:Proceedings of the 6th International Conference on Distributed Computing Systems, Cambridge, MA, May 1986, pp. 37–44.

  11. D. Barbara, H. Garcia-Molina, B. Kogan, “Maintaining availability of replicated data in a dynamic failure environment,” inProceedings of the 6th Symposium on Reliability in Distributed Software and Database Systems, Williamsburg, PA, March 1987, pp177–187.

  12. W. Barfield, S.-L. Hwang, T.-C. Chang, “Human-computer supervisory performance in the operation and control of flexible manufacturing systems,”Flexible Manufacturing Systems: Methods and Studies, A. Kusiak (ed.), North-Holland, Amsterdam, 1986, pp. 377–408.

    Google Scholar 

  13. J. Bartlett, J. Gray, B. Horst, “Fault tolerance in Tandem computer systems,”Symposium on the Evolution of Fault Tolerant Computing, Baden, Austria, June 1986, pp. 55–76.

  14. G.R. Bitran, A.C. Hax, “On the design of hierarchical production planning systems,”Decision Sciences, vol. 8, pp. 28–55, 1977.

    Google Scholar 

  15. A. Borg, W. Blau, W. Graetsch, F. Hermann, W. Oberle, “Fault tolerance under UNIX,”ACM Transactions on Computer Systems, vol. 7, no. 1, pp. 1–24, February 1989.

    Article  Google Scholar 

  16. P.R. Chintamaneni, P. Jalote, Y.-B. Shieh, S.K. Tripathi, “On fault tolerance in manufacturing systems,”IEEE Network, vol. 2, no. 3, pp. 32–39, May 1988.

    Article  Google Scholar 

  17. F. Cristian, “Agreeing on who is present and who is absent in a synchronous distribution system,” inProceedings of the 18th International Conference on Fault Tolerant Computing, Tokyo, Japan, June 1988, pp. 206–211.

  18. F. Cristian, “Exception handling,”Dependability of Resilient Computers. T. Anderson (ed.), BSP Professional Books, Blackwell Scientific Publications. Oxford, 1989, pp. 68–97.

    Google Scholar 

  19. F. Cristian, “Understanding fault tolerant distributed systems,”Communications of the ACM, vol. 34, no. 2, pp. 56–78, February 1991.

    Article  Google Scholar 

  20. S.B. Davidson, H. Garcia-Molina, D. Skeen, “Consistency in partitioned networks,”Computing Surverys, vol. 17, no. 3, pp. 341–370, September 1986.

    Article  Google Scholar 

  21. W.J. Davis, “Evolving coordination schemes in real-time production scheduling,”Engineering Costs and Production Economics, vol. 17, no. 1–4, pp. 111–124, August, 1989.

    Article  Google Scholar 

  22. P.D. Ezhilchelvan, S.K. Shrivastava, “A characterization of faults in systems,” inProceedings of the 5th Symposium on Reliability in Distributed Software and Database Systems, Los angeles, CA, January 1986, pp. 215–222.

  23. H. Garcia-Molina, “Reliability issues for fully replicated distributed databases,”IEEE Computer, vol. 15, no. 9, pp. 34–42, September 1982.

    Google Scholar 

  24. D.K. Gifford, “Weighted voting for replicated data,” inProceedings of the 7th ACM Symposium on Operating Systems Principles, Pacific Grove, CA, December 1979, pp. 150–162.

  25. J. Gray, “Why do computers stop and what can be done about it?,” inProceedings of the 5th Symposium on Reliability in Distributed Software and Database Systems, Los Angeles, CA, January 1986, pp. 3–12.

  26. J. Gray, D.P. Siewiorek, “High-availability computer systems,”IEEE Computer, vol. 24, no. 9, pp. 39–48, September 1991.

    Google Scholar 

  27. B. Hennock, “A strategic model for reliability and availability in automatic manufacturing,”International Journal of Advanced Manufacturing Technology, vol. 3, no. 5, pp. 99–121, November 1988.

    Google Scholar 

  28. Y.J. Kang, W. Lovegrove, J.D. Spragins, H. Jafari, “An architecture for factory automation and information management systems,” inProceedings of the Pacific Computer Communications Symposium, Seoul, South Korea, October 1985, pp. 520–526.

  29. H. Kopetz, A. Damm, C. Koza, M. Mulazzani, W. Schwabl, C. Senft, R. Zainlinger, “Distributed fault tolerant real-time systems: The Mars approach,”IEEE Micro, vol. 9, no. 1, pp. 25–40, February 1989.

    Article  Google Scholar 

  30. N. Kronenberg, H. Levy, W. Strecker, “VAXclusters: A closely-coupled distributed system,”ACM Transactions on Computer Systems, vol. 4, no. 2, pp. 130–146, May 1986.

    Article  Google Scholar 

  31. A. Kusiak, S.S. Heragu, “Computer integrated manufacturing: A structural perspective,”IEEE Netwrk, vol. 2, no. 3, pp. 14–22, May 1988.

    Article  Google Scholar 

  32. L. Lamport, R. Shostak, M. Pease, “The Byzantine generals problem,”ACM Transactions on Programming Languages and Systems, vol. 4, no. 3, pp. 382–401, July 1982.

    Article  Google Scholar 

  33. L. Lilien, “Quasi-partitioning: A new paradigm for transaction execution in partitioned distributed database systems,” inProceedings of the 5th International Conference on Data Engineering Los Angeles, CA, February 1989, pp. 546–553.

  34. L.J. McGuffin, L.O. Reid, S.R. Sparks, “MAP/TOP in CIM Distributed Computing,”IEEE Network, vol. 2, no. 3, pp. 23–31, May 1988.

    Article  Google Scholar 

  35. G. Messina, G. Tricomi, “Fault and safety management in intelligent robot networks,” inProceedings of TENCON 87: 1987 IEEE Region 10 Conference “Computers and Communications Technology Toward 2000,” Seoul, South Korea, vol. 3, August 1987, pp. 1091–1096.

  36. D.L. Palumbo, R.W. Butler, “Measurement of SIFT operating system overhead,” NASA Technical Memo 86322, 1985.

  37. J.-F. Paris, D.D.E. Long, “Efficient dynamic voting algorithms,” inProceedings of the 4th International Conference on Data Engineering, Los Angeles, CA, February 1988, pp 268–275.

  38. R.V. Rogers, “An interactive graphical aided scheduling system,”Computers & Industrial Engineering, vol. 17, no. 1-4, pp. 113–118. 1989.

    Google Scholar 

  39. J. Tang, N. Natarajan, “A scheme for maintaining consistency and availability of replicated files in a partitioned distributed system,” inProceedings of the 5th International Conference on Data Engineering, Los Angeles, CA, February 1989, pp. 530–537.

  40. D. Taylor, G. Wilson, “The Stratus system architecture,”Dependability of Resilient Computers, T. Anderson (ed.), BSP Professional Books, Blackwell Scientific Publications, Oxford, 1989, pp. 222–256.

    Google Scholar 

  41. B.W. Wah, “File placement on distributed computer systems,”IEEE Computer, vol. 17, no. 1, pp. 23–32, January 1984.

    Google Scholar 

  42. H. Wang, “An experimental analysis of the flexible manufacturing system (FMS),”Flexible Manufacturing Systems: Methods and Studies. A Kusiak (ed.), North-Holland, Amsterdam, 1986, pp. 319–339.

    Google Scholar 

  43. H.P. Wiendahl, U. Winkelhake, “Strategy for availability improvement,”International Journal of Advanced Manufacturing Technology, vol. 1, no. 4, pp. 69–78, November 1986.

    Google Scholar 

  44. S.-Y.D. Wu, R.A. Wysk, “An inference structure for the control and scheduling of manufacturing systems,”Computers & Industrial Engineering, vol. 18, no. 3, pp. 247–262, 1990.

    Google Scholar 

  45. M. Yamamoto, S.Y. Nof, “Scheduling/rescheduling in the manufacturing operating system environment,”International Journal of Production Research, vol. 23, no. 4, pp. 705–722, July–August 1985.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Adlemo, A., Andréasson, SA. Fault tolerance in partitioned manufacturing networks. Journal of Systems Integration 3, 63–84 (1993). https://doi.org/10.1007/BF01974172

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01974172

Key Words

Navigation