Abstract
Early detection of unnatural control chart patterns (CCP) is desirable for any industrial process. Most of recent CCP recognition works are on statistical feature extraction and artificial neural network (ANN)-based recognizers. In this paper, a two-stage hybrid detection system has been proposed using support vector machine (SVM) with self-organized maps. Direct Cosine transform of the CCP data is taken as input. Simulation results show significant improvement over conventional recognizers, with reduced detection window length. An analogous recognition system consisting of statistical feature vector input to the SVM classifier is further developed for comparison.
Similar content being viewed by others
References
Montgomery DC (2008) Introduction to statistical quality control, 6th edn. Wiley, New York
Al-Ghanning AM, Kamat SJ (1995) Unnatural pattern recognition on control charts using correlation analysis techniques. Comput Ind Eng 29(1–4):43–47
Pham DT, Wani MA (1997) Feature-based control chart pattern recognition. Int J Prod Res 35:1875–1890
Gauri SK, Chakraborty S (2007) A study on the various features for effective control chart pattern recognition. Int J Adv Manuf Technol 34(3–4):385–398
Pham DT, Oztemel E (1992) Control chart pattern recognition using neural networks. J Syst Eng 2:256–262
Guh R-S, Tannock JDT (1999) A neural network approach to characterize pattern parameters in process control charts. J Intell Manuf 10(5):449–462
Perry MB, Spoerre JK, Velasco T (2001) Control chart pattern recognition using back propagation artificial neural networks. Int J Prod Res 39:3399–3418
Guh R-S (2005) A hybrid learning-based model for on-line detection and analysis of control chart patterns. Comput Ind Eng 49(1):35–62
Guh R-S, Shiue Y-R (2005) On-line identification of control chart patterns using self organizing approaches. Int J Prod Res 43:1225–1254
Pham DT, Otri S, Ghanbarzadeh A, Koc E (2006) Application of the Bees algorithm to the training of learning vector quantisation networks for control chart pattern recognition. Inf Commun Technol 1:1624–1629
Cheng CS (1997) A neural network approach for the analysis of control chart patterns. Int J Prod Res 35:667–697
Hassan A (2008) Ensemble ANN-based recognizers to improve classification of X-bar control chart patterns. Ind Eng Eng Manage 1996–2000
Shao-xiong Wu, Huan En-zhou (2007) Control chart pattern recognition based on support vector machine. Comput Appl 27(1):61–64
Xiaoh W (2008) Hybrid abnormal patterns recognition of control chart using support vector machine. Int Conf Comput Intell Secur 2:238–241
Wani MA, Rashid S (2005) Fourth international conference on machine learning and applications (ICMLA’05), IEEE 2005
Guh R-S (2004) Optimizing feed forward neural networks for control chart pattern recognition through genetic algorithms. Int J Pattern Recognit Artif Intell 18(2):75–99
Assaleh K, Al-assaf Y (2005) Features extraction and analysis for classifying causable patterns in control charts. Comput Ind Eng 49(1):168–181
Shaoxiong Wu, Wu Biying (2006) Wavelet neural network-based control chart patterns recognition. Int Cont Automat 2:9718–9721
Vapnik V (1979) Estimation of dependences based on empirical data [in Russian]. Nauka, Moscow (English trans: Springer, New York, 1982)
Burges C (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Disc 2:121–167
Ritter H, Martinez T, Schulten K (1992) Neural Computation and Self-organizing Maps. Addison-Wesley Pub Co., MA
Alspector J, Meir R, Yuhas B, Jayakumar A, Lippe D (1993) A parallel gradient descent method for learning in analog VLSI neural networks. In: Hanson SJ, Cohen JD, Giles CL (eds) Advances in neural information processing systems 5:836–844, San Mateo, CA, Morgan Kaufmann
Kohonen T (1990) The self-organized map. Proc IEEE 78(9):1464–1480
Chang C-C, Lin C-J (2001) LIBSVM: a library for support vector machines. (http://www.csie.ntu.edu.tw/~cjlin/libsvm)
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1: Formulation of features
Following are the mathematical formulations of the features used in this work, all of which have been taken from [4] with some minor modifications, in case of SRANGE, RVE and ABDPE.
-
(a)
AASBP: It is the average of absolute slope of straight lines passing through the consecutive points in terms of process standard deviation.
$$ {\text{AASBP}} = {{\left( {\sum\limits_{{i = 1}}^{{N - 1}}{\left| {{\frac{{y_{{i + 1}} - y_{i} }}{{t_{{i + 1}} - t_{i} }}}}\right|} } \right)} \mathord{\left/ {\vphantom {{\left({\sum\limits_{{i = 1}}^{{N - 1}} {\left| {{\frac{{y_{{i + 1}} -y_{i} }}{{t_{{i + 1}} - t_{i} }}}} \right|} } \right)} {\left( {N -1} \right)\sigma }}} \right. \kern-\nulldelimiterspace} {\left( {N -1} \right)\sigma }} $$(15)Here, N is the number of points in the CCP run window, i.e., the length of run window. Y i is the observation for ith sample point. Since in the CCP run window, the x-axis represents the sample numbers that are consecutive positive integers, difference between two consecutive time samples \( \left({t_{i + 1} \;\rm {and}\;t_{i}} \right) \) is taken as 1. This applies to all of the following expressions. \( \sigma \) is the standard deviation (note that it was used to generate the CCP, hence it is not the estimated process standard deviation \( \hat{\sigma} \)).
-
(b)
RVE: Ratio between variance of the observations and mean sum of squares of errors of the least-square (LS) line representing an overall pattern. In this study, instead of corrected sum of squares [4], uncorrected sums of squares of the observations and sample points are used.
$$ {\text{RVE}} = {{\left( {{\frac{{s_{{y^{2} }} }}{{N - 1}}}}\right)} \mathord{\left/ {\vphantom {{\left( {{\frac{{s_{{y^{2} }}}}{{N - 1}}}} \right)} {\left( {{\frac{1}{{N - 2}}}\left( {s_{{y^{2}}} - {\frac{{s_{{yt}}^{2} }}{{s_{{t^{2} }} }}}} \right)} \right)}}}\right. \kern-\nulldelimiterspace} {\left( {{\frac{1}{{N -2}}}\left( {s_{{y^{2} }} - {\frac{{s_{{yt}}^{2} }}{{s_{{t^{2} }}}}}} \right)} \right)}}$$(16)where,
$$ S_{{y^{2}}} = \sum\limits_{i}^{N} {\left({y_{i}} \right)^{2}} $$$$ S_{yt} = \sum\limits_{i}^{N} {y_{i} t_{i}} $$$$ S_{{t^{2}}} = \sum\limits_{i}^{N} {\left({t_{i}} \right)^{2}} $$ -
(c)
ALSPI: It is found from the area between the pattern and LS line per interval in terms of estimated process SD.
$$ {\text{ALSPI}} = (area\;between\;pattern\;and\;LS\;line)/\left({N - 1} \right)\hat{\sigma} $$(17)It is calculated by summing the triangles and trapeziums that are formed by the LS line and overall pattern.
-
(d)
ASL: Average slope of the straight lines passing through six pair-wise combinations of midpoints in four equal segments. The midpoint of each segment can be calculated by,
$$ y_{{mid}} = {{\left( {\sum\limits_{{i = i_{{sg}} }}^{{i_{{sg}}+ 7}} {y_{i} } } \right)} \mathord{\left/ {\vphantom {{\left({\sum\limits_{{i = i_{{sg}} }}^{{i_{{sg}} + 7}} {y_{i} } } \right)}{\left( {{\frac{N}{2}}} \right)}}} \right.\kern-\nulldelimiterspace} {\left( {{\frac{N}{4}}} \right)}} $$(18)and,
$$ t_{{mid}} = {{\left( {\sum\limits_{{i = i_{{sg}} }}^{{i_{{sg}}+ 7}} {t_{i} } } \right)} \mathord{\left/ {\vphantom {{\left({\sum\limits_{{i = i_{{sg}} }}^{{i_{{sg}} + 7}} {t_{i} } } \right)}{\left( {{\frac{N}{2}}} \right)}}} \right.\kern-\nulldelimiterspace} {\left( {{\frac{N}{4}}} \right)}} $$(19)where, i sg = starting sample number of the segment. Since there will be four such segments, 6 \( \left( {C_{2}^{4} } \right) \) straight lines can be generated.
$${\text{ASL}} = {{\left( {\sum\nolimits_{\begin{subarray}{l}{i,j} \\ {i < j} \end{subarray} } {s_{{i,j}} } } \right)}\mathord{\left/ {\vphantom {{\left({\sum\nolimits_{\begin{subarray}{l} {i,j} \\ i < j\end{subarray} } {s_{{i,j}} } } \right)} 6}} \right.\kern-\nulldelimiterspace} 6}$$(20)where s ij is the slope of the line joining the midpoint of ith and jth segment.
-
(e)
SRANGE: Range of slopes of straight lines passing through six pair-wise combinations of midpoints found previously.
$$ {\text{SRANGE}} = {\text{maximum}}\left({S_{ij}} \right)-{\text{minimum}}\left({S_{ij}} \right) $$(21)where, i < j and i = 1, 2, 3 and j = 2, 3, 4. Here, SRANGE is modified by scaling it with estimated process SD \( \left( {\hat{\sigma }} \right) \) i.e.,
$$ {\text{SRANGE}}^{/} = {\text{SRANGE}}/\hat{\sigma} $$(22) -
(f)
REPEPE: The CCP run window is divided into two segments in such a way that the two LS lines fitted on the segments lead to minimum pooled mean square error (PMSE). In this way, the optimal portioning point is found (i opt ), such that
$$ N/4 \le i_{opt} \le 3 N/4 $$(23)If the equations of the two LS lines are
$$ y_{ls1} \left(i \right) = s_{1} i + c_{1} $$(24)$$ y_{ls2} \left(i \right) = s_{2} i + c_{2} $$(25)then, minimum
$$ {\text{PMSE}} = {\frac{{\left({\left({i_{opt} - 1} \right)\sum\nolimits_{i = 1}^{{i_{opt}}} {\left({y_{ls1}\left(i \right) - y_{i}} \right)^{2} + \left({N - i_{opt} - 1} \right)\sum\nolimits_{{i = i_{opt} + 1}}^{N} {\left({y_{ls2} \left(i \right) - y_{i}} \right)^{2}}}} \right)}}{N - 2}} $$(26)Similarly, the mean square error (MSE) of the LS line (slope s) fitted with the overall pattern is calculated. So,
$$ {\text{REPEPE}} = {\text{MSE}}/{\text{PMSE}} $$(27) -
(g)
ABDPE: From partitioning as above, the average of slopes of the two LS line fitted to the two partitions is calculated and absolute slope difference with LS line fitted to overall pattern is found.
$$ {\text{ABDPE}} = {\frac{{\left| {s - \left({s_{1} + s_{2}} \right)} \right|}}{{2\hat{\sigma}}}} $$(28)
Appendix 2: Simulation parameter values
Rights and permissions
About this article
Cite this article
Das, P., Banerjee, I. An hybrid detection system of control chart patterns using cascaded SVM and neural network–based detector. Neural Comput & Applic 20, 287–296 (2011). https://doi.org/10.1007/s00521-010-0443-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-010-0443-z