Stochastics and Statistics
Optimal preventive maintenance for equipment with two quality states and general failure time distributions

https://doi.org/10.1016/j.ejor.2006.04.014Get rights and content

Abstract

We present an economic model for the optimization of preventive maintenance in a production process with two quality states. The equipment starts its operation in the in-control state but it may shift to the out-of-control state before failure or scheduled preventive maintenance. The time of shift and the time of failure are generally distributed random variables. The two states are characterized by different failure rates and revenues. We first derive the structure of the optimal maintenance policy, which is defined by two critical values of the equipment age that determine when to perform preventive maintenance depending on the actual (observable) state of the process. We then provide properties of the optimal solution and show how to determine the optimal values of the two critical maintenance times accurately and efficiently. The proposed model and, in particular, the behavior of the optimal solution as the model parameters and the shift and failure time distributions change are illustrated through numerical examples.

Introduction

It is commonly agreed nowadays that preventive maintenance policies can be very successful in improving equipment reliability while minimizing maintenance-related costs. During the past decades a plethora of preventive maintenance models have been developed and contributed to a good understanding of the properties and effectiveness of preventive maintenance policies under various conditions.

The purpose of this paper is to investigate the structure and properties of optimal preventive maintenance policies in production processes (equipment) that are subject to both quality deterioration and failure. More specifically, we develop, analyze and optimize an economic model for preventive maintenance in the following setting:

  • There are two possible quality states during operation (an in-control state and an out-of-control state).

  • The failure rate depends on both the quality state of the process and the equipment age.

  • The time to shift to the out-of-control state and the times to failure are generally distributed continuous random variables.

  • Operation in either one of the quality states generates income, which is higher in the in-control state.

  • Downtimes are not negligible.

Production processes, commercial equipment and even consumer products with the above operation and failure characteristics are commonly encountered in practice. For example, consider the production process of forged spare parts, where forging presses are used to form specific shapes on metal sheets. A misplacement or inadequate heating of the metal may cause a slight tilt in the positioning of the hydraulic piston; the press will still be working but the output quality will be significantly lower and the probability of a piston failure due to its improper operation will increase. Another example is the operation of a commercial cooler and freezer. A typical problem is an impeller malfunction; the fan loses its balancing and the result is poor cooling, higher electricity consumption, and higher proneness to failure for the compressor. Note that the same type of malfunction with similar consequences is also common in the operation of personal computers and other electronic devices.

When a production process is characterized by multiple operating states and a failure state, the most important and distinctive feature of the associated model is the definition of the various states and the assumed transition mechanism between them. In order to clearly explain the contribution of this paper and its positioning with respect to existing publications we provide below a brief literature review of related research with emphasis on the aspect of the process deterioration mechanism.

The earlier works on preventive maintenance models for production processes with more than one operating states and a failure state are by Derman, 1962, Derman, 1963, who studies a process that deteriorates moving through a finite number of states according to a Markov chain. Derman assumes that the state of the process is known with accuracy at all discrete points in time and shows that the optimal replacement policy is a control limit policy; that is, the equipment should be replaced as soon as it is observed to operate in a state worse than some critical state. The same result is also obtained by Kolesar (1966) for a similar model but with a more general cost function. More specifically, Kolesar introduces an additional “occupancy” cost associated with being in each operating state.

Another approach to model the process deterioration mechanism has been suggested by Kao (1973). Kao uses a discrete time finite-state semi-Markov process to formulate the problem. This approach can account both for changes in the process state and for the ageing process of the equipment. Thus, Kao examines state-dependent policies as well as state-age-dependent policies and proves that under reasonable conditions the optimal policy is of the control limit type. More recent examples of maintenance models for semi-Markovian deteriorating systems can be found in So, 1992, Lam and Yeh, 1994, Yeh, 1997.

The process operating states can alternatively be expressed by the magnitude of the cumulative damage or wear of the equipment. The process is assumed to be subject to exterior shocks that damage or cause wear to the equipment, thereby increasing its probability of failure. Shock models have been introduced by Taylor (1975) and they have been studied by several researchers, including Feldman, 1976, Bergman, 1978, Gottlieb, 1982, Aven and Gaarder, 1987, Murthy and Iskadar, 1991a, Murthy and Iskadar, 1991b, Qian et al., 2003, Sheu and Chien, 2004. An extensive literature review of shock models up to 1989 is provided in the survey by Valdez-Flores and Feldman (1989). Shock models differ in the assumed stochastic process that governs the time distribution between two successive shocks (state transitions). Shocks may occur according to a Poisson process (e.g., Taylor, 1975), according to a semi-Markov process (e.g., Gottlieb, 1982) or arbitrarily (e.g., Feldman, 1976, Bergman, 1978).

A usual assumption in shock models is that the equipment can only fail at shock times with a generally different failure probability that depends on the accumulated damage or wear. However, Bergman, 1978, Murthy and Iskadar, 1991a, Murthy and Iskadar, 1991b do not restrict failures to shock times but allow them to occur at any point in time. In Bergman, 1978, Murthy and Iskadar, 1991a, Murthy and Iskadar, 1991b assume that the shock process is stationary Poisson but the failure rate, in addition to experiencing a jump whenever a shock occurs, also increases continuously with age while operating in any particular state.

The optimal preventive maintenance policy in most shock models is shown to be of the control limit type in one or two dimensions. In other words, the maintenance policy is either state-dependent or state-age-dependent. In the former case the equipment is preventively maintained as soon as it reaches or exceeds some threshold of the accumulated damage (state). In the latter case, which is common when the semi-Markovian assumption for the shock process is adopted, the equipment is preventively maintained as soon as the time since the last shock reaches some critical value, which is a function of the accumulated damage.

Apart from the maintenance literature, deterioration of the process condition is also a standard feature of the statistical process control models in the quality field. The process is assumed to operate in the “good” quality state (in-control) until it shifts to an inferior quality state (out-of-control) as a consequence of the occurrence of some assignable causes. The time until the transition to an out-of-control state (quality shift) is usually assumed to be exponentially distributed (Poisson process) but other distributions have been considered as well (Banerjee and Rahim, 1988). In a model of a process with non-exponential transition times Rahim and Banerjee (1993) introduce the use of preventive maintenance actions to protect the equipment against quality shifts. Their work has been further extended by Rahim and Ben-Daya (1998), who develop an integrated model dealing with the determination of the optimal production quantity as well. These quality control models, however, typically assume that the process is not subject to failures that would result in stoppage of operation. The possibility of equipment failure in the quality control context has been investigated by Tagaras (1988) and by Makis and Fung, 1995, Makis and Fung, 1998.

In particular, Tagaras (1988) uses a Markovian approach similar to that of Derman, 1962, Derman, 1963 to describe the evolution of production processes characterized by several quality states (a single in-control state and multiple out-of-control states) and a single failure state. He simultaneously considers quality control and maintenance procedures and derives not only the optimal preventive maintenance time but also the optimal quality control parameters that minimize the expected total cost. Makis and Fung, 1995, Makis and Fung, 1998 determine the optimal lot size and the optimal number of periodic inspections in a production process subject both to random failures and to shifts to an inferior quality state. In the 1998 paper the maintenance policy consists solely of corrective actions, while the 1995 paper accounts for preventive maintenance (replacement) actions too. In the latter case the preventive replacement time is also optimized. In both papers Makis and Fung, 1995, Makis and Fung, 1998 assume that the time to quality deterioration is exponentially distributed while the time to failure is generally distributed but with the same distribution regardless of the quality state of the process.

This paper shares common features with previous work cited above, but it encompasses and examines in a unifying model some of the characteristics that have been previously studied in isolation. More specifically, the similarities and differences from those publications that are closer to our work are as follows:

  • The production process is characterized by two quality states (an in-control and an out-of-control state) and a failure state as in Makis and Fung, 1995, Makis and Fung, 1998. However, we use a general distribution for the time of transition to the out-of-control state and we allow for different failure time distributions in each quality state. On the other hand, Makis and Fung consider lot sizing and inspection decisions.

  • The time to quality shift (state transition) is arbitrarily distributed as in some shock models but, unlike most shock models, we allow failures to occur at any point in time. In that sense, our model resembles the one suggested by Bergman (1978), but the two models differ in the assumed failure rate pattern. The failure rate in the model of Bergman (1978) remains constant as long as the equipment operates within the same state, implying an exponential distribution for the time to failure, while we make no assumption about the failure rate in each state other than it is non-decreasing in time.

  • The failure mechanism adopted in our model is similar to that used by Murthy and Iskadar, 1991a, Murthy and Iskadar, 1991b; the failure rate increases not only whenever a state transition occurs, but also with the equipment age. However, we use a general distribution for the transition times to the out-of-control state in contrast to the exponential distribution for the transition times used by Murthy and Iskadar, 1991a, Murthy and Iskadar, 1991b. In addition, Murthy and Iskadar, 1991a, Murthy and Iskadar, 1991b restrict their analysis to two fixed types of preventive maintenance policies, while this paper contains structural results and properties of the optimal solution.

Furthermore, in contrast to Bergman, 1978, Murthy and Iskadar, 1991a, Murthy and Iskadar, 1991b, we allow non-negligible downtimes and a more general function that permits different operating revenues for each state. On the other hand, though, the models of Bergman, 1978, Murthy and Iskadar, 1991a, Murthy and Iskadar, 1991b consider more than two operating states.

In short, the most important contribution of this paper is that it does not require any of the distributions of the times to quality shift and failure to be exponential, thus significantly enhancing the domain of its applicability.

Our analysis indicates that the optimal preventive maintenance (PM) policy is a two-dimensional control limit policy using both state and age information. That is, the optimal PM policy is characterized by a critical equipment age for each quality state. It is important to note that this type of policy is different from the usual state-age-dependent policies because the latter depend on the current state of the process and the sojourn time in that state rather than the actual total age of the equipment.

The following section describes in more detail the problem and introduces the necessary notation. Section 3 describes the structure of the optimal policy and presents the development of the mathematical model. Some useful properties of the optimal solution are outlined in Section 4. Section 5 provides a numerical example for illustration purposes. Section 6 contains a numerical investigation of the effect of shift and failure time distributions on the optimal solution. In the last section the basic results are summarized and some future research directions are suggested.

Section snippets

Problem description, assumptions and notation

We consider a production process that is subject to quality shifts and failures. The process can be viewed as a succession of independent, stochastically identical cycles. Each cycle begins with the equipment in the as-good-as-new condition producing items of acceptable quality (“in-control” state, or state 0). At some random point in time the process may shift to an out-of-control state (state 1), characterized by a lower (or, at best, equal) net revenue and higher (or, at best, equal)

Model development/structure of the optimal policy

Before proceeding to the model development, we present an important property that characterizes the structure of the optimal policy and at the same time simplifies the model.

Proposition 1

For any given value of tm0, there is a critical time tm1, (0  tm1  tm0), such that the optimal values of tm1(t) as a function of the quality shift time t are as follows:

  • (a)

    tm1(t) = tm1 for all t  tm1,

  • (b)

    tm1(t) = t for all t > tm1.

Proof

See Appendix A. 

Note that the critical value of tm1 is unique as long as the failure rate in the

Properties of the optimal solution

The domain of the expected profit function EPT(tm0, tm1) can be graphically presented in two dimensions, as shown in Fig. 1. The allowable values of the two decision variables 0  tm1  tm0 are the points (tm0, tm1) in the area bounded by the horizontal axis and line tm0 = tm1.

In order to identify the optimal combination (tm0, tm1) that maximizes EPT(tm0, tm1) we start by deriving the first partial derivatives of EPT(tm0, tm1). As shown in Appendix B (see, in particular, expressions (B11), (B12)), these

Numerical illustration

Consider a process where the failure mechanism in both quality states (i = 0, 1) is expressed by Weibull distributions of the failure age. The Weibull density function associated with state i isφi(t)=λicitci-1e-λitci,t>0,ci>0,λi>0,i=0,1,where λi is the scale parameter and ci is the shape parameter of the distribution. In order to ensure that the failure rate when operating in the out-of-control state with equipment age t is larger than or equal to the failure rate in the in-control state with the

The effect of shift and failure time distributions on the optimal solution

As already stated in the introductory section, the main contribution of this paper is that it does not require any of the distributions of the times to quality shift and failure to be exponential. A natural question that arises is how significantly the actual types of these distributions affect the optimal solution. To answer this question we undertook a numerical investigation using the numerical example of the previous section as a basis; we tried several combinations of various distributions

Conclusions

We have developed a model for the optimization of preventive maintenance procedures in a production process that may operate in one of two different quality states and is also subject to failure. The model allows for maintenance decisions dependent on the quality state of the process since the two quality states are characterized by different failure rates and revenues. In its general form, the model also allows for maintenance times, while in the out-of-control state, that can be a general

References (23)

  • R.M. Feldman

    Optimal replacement with semi-Markov shock models

    Journal of Applied Probability

    (1976)
  • Cited by (81)

    • Integrated Quality, Maintenance and Production model for multivariate processes: A Bayesian Approach

      2022, Journal of Manufacturing Systems
      Citation Excerpt :

      Chiang and Yuan [12] studied continuously deteriorating systems characterised by multiple operating states, for which the optimal maintenance policy is associated with a periodic process inspection. Moreover, Panagiotidou and Tagaras [5,13], Xiang [14] and Le and Tan [15] developed integrated SPM and CBM models for processes with multiple operational states. Wan et al. [16] proposed an integrated maintenance and SPM model considering minor and catastrophic failures.

    • Integrated optimization of quality and maintenance: A literature review

      2021, Computers and Industrial Engineering
      Citation Excerpt :

      The present paper has also listed the papers in which decision variables only are related to one aspect of the three main tasks (quality, maintenance, and production), and these decision variables are optimized to the interaction of other tasks. The present paper reviews the papers (Rahim and Banerjee, 1993; Rahim, 1994; Rahim and Ben-Daya, 1998; Chiu and Huang, 1995; Chiu and Huang, 1996; Ben-Daya, 1999; Ben-Daya and Rahim, 2000; Cassady et al., 2000; Rahim and Ben-Daya, 2001; Linderman et al., 2005; Wang and Sheu, 2003; Kuo, 2006; Makis and Fung, 1995; Ben-Daya and Makhdoum, 1998; Lee and Rahim, 2001; Yeung et al., 2007; Makis and Fung, 1998; Alfares et al., 2005; Wu and Makis, 2008; Radhoui et al., 2009; Lin et al., 2011; Wu and Wang, 2011; Wang, 2011; Mehrafrooz and Noorossana, 2011; Ho and Quinino, 2012; Wang, 2012; Kim and Makis, 2012; Chen, 2013; Xiang, 2013; Liu et al., 2013; Zhong et al., 2016; Yin et al., 2015; Yin et al., 2015; Lu et al., 2016; Nourelfath et al., 2016; Salmasnia et al., 2017; Fakher et al., 2018; Zhong and Ma, 2017; Lesage and Dehombreux, 2012; Li et al., 2017; Liu et al., 2017; Li et al., 2017; Zhou et al., 2017; Li et al., 2018; Bahria et al., 2018; Salmasnia et al., 2018; Salmasnia et al., 2018; Tasias and Nenes, 2018; Wan et al., 2018; Salmasnia et al., 2018; Salmasnia et al., 2019; Tagaras, 1988; Wang and Chen, 1995; Chiang and Yuan, 2001; Zhou and Zhu, 2008; Wang et al., 2009; Panagiotidou and Tagaras, 2010; Charongrattanasakul and Pongpullponsak, 2011; Pandey et al., 2011; Pandey et al., 2012; Panagiotidou and Tagaras, 2012 ; Tambe and Kulkarni, 2014 ; Zhang et al., 2015; Makis, 2015; Tambe and Kulkarni, 2015; Ardakan et al., 2016; Jafari and Makis, 2016; Jafari and Makis, 2016; Bouslah et al., 2016; Khrueasom and Pongpullponsak, 2017; Tambe and Kulkarni, 2016; Jain and Lad, 2017; Bouslah et al., 2016; Bouslah et al., 2018; Rasay et al., 2018; Latrous et al., 2018; Farahani et al., 2019; Li et al., 2019; Hsu and Kuo, 1995; Chiu and Huang, 1996; Yerel et al., 2007; Panagiotidou and Tagaras, 2007; Panagiotidou and Tagaras, 2008; Chang et al., 2009; Panagiotidou and Nenes, 2009; Nenes and Panagiotidou, 2011; Pandey et al., 2010; Mehdi et al., 2010; Engin, 2010; Radhoui et al., 2010; Rahim and Shakil, 2011; Colledani and Tolio, 2012; Morales, 2013; Dhouib et al., 2012; Rivera-Gómez et al., 2013; Shrivastava et al., 2016; Dan et al., 2016; Rasay et al., 2018; Yin and Makis, 2010; Azadeh et al., 2017; Cheng et al., 2018; Rasay et al., 2018; Wan et al., 2018; Pasha and Moghadam, 2018; Duan et al., 2019; Wang et al., 2019; Beheshti Fakher et al., 2017; Kouki et al., 2014; Deloux et al., 2009; Nguyen et al., 2019; Pan et al., 2012; Maillart et al., 2009; Zhou and Liu, 2016; Lampreia et al., 2018; Lesage and Dehombreux, 2012; Rivera-Gomez et al., 2013; Alsyouf et al., 2016; Yang and Zeng, 2018 Apr; Zhang et al., 2018; Bahria et al., 2019; He et al., 2019; Lin, 2004; Le and Tan, 2013; Ivy and Nembhard, 2005; Ji-Wen et al., 2010; Chan, 2003; Chan and Wu, 2009; Azizi, 2015; Gupta et al., 2009; Katter et al., 1997; Alsyouf et al., 2015; Yeong et al., 2013; Ali et al., 2020). Table 1 provides information on journals that have published papers in the field of optimization, either maintenance and quality, or maintenance, quality, and production.

    • Production and replacement policies for a deteriorating manufacturing system under random demand and quality

      2018, European Journal of Operational Research
      Citation Excerpt :

      In the same direction, Colledani and Tolio (2011) presented a new approximate analytical method to evaluate the performance of production systems by simultaneous tackling of quality and production issues. Panagiotidou and Tagaras (2007), Xiang (2013) and Panagiotidou (2014) presented an economic model for the optimization of preventive maintenance in a production process with two possible quality states characterized by different failure rates. Furthermore, Radhoui, Rezg, and Chelbi (2010) developed a mathematical model for one machine system producing lots of products to satisfy a constant demand rate.

    View all citing articles on Scopus
    View full text