Improvement of causal analysis using multivariate statistical process control

Chang, Ching-Pao; Chu, Chih-Ping

doi:10.1007/s11219-007-9042-3

Improvement of causal analysis using multivariate statistical process control

Published: 23 January 2008

Volume 16, pages 377–409, (2008)
Cite this article

Software Quality Journal Aims and scope Submit manuscript

Ching-Pao Chang¹ &
Chih-Ping Chu¹

250 Accesses
6 Citations
Explore all metrics

Abstract

Statistical process control (SPC) is a conventional means of monitoring software processes and detecting related problems, where the causes of detected problems can be identified using causal analysis. Determining the actual causes of reported problems requires significant effort due to the large number of possible causes. This study presents an approach to detect problems and identify the causes of problems using multivariate SPC. This proposed method can be applied to monitor multiple measures of software process simultaneously. The measures which are detected as the major impacts to the out-of-control signals can be used to identify the causes where the partial least squares (PLS) and statistical hypothesis testing are utilized to validate the identified causes of problems in this study. The main advantage of the proposed approach is that the correlated indices can be monitored simultaneously to facilitate the causal analysis of a software process.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Recent Developments in Software Reliability Modeling and its Applications

Monitoring and Diagnosis of Causal Relationships Among Variables

On statistical models for predicting software quality/reliability: generalized linear and linear mixed modeling

Article 01 March 2017

Notes

The range of variable X can be treated as a variable and expressed as W = w/σ_w (standardized range), in which σ_w denotes the standard deviation of W. The E(W^k) represents the k-th moments of W, and the value of E(W^k) depends on n and k, in which the n denotes the subgroup size. The d ₂ is defined as the first moment (or expectation) of W. Therefore, the value of d ₂ depends on n, and can be treated as a control chart constant for given n.
The d ₃ is defined as the standard deviation of the range and can be expressed as \(E[(\hbox{W}-\upeta_{\rm w})^{2}]\) , where \(\upeta_{{\rm w}}\) is the mean of W. The value of d ₃ can be calculated using the second moment of W and d ₂, as follows: \(d_{3}=E[(\hbox{W}-\upeta_{\rm w})^{2}]=E[\hbox{W}^{2}-2\hbox{W}\upeta_{{\rm w}}+\upeta_{{\rm w}}^{2}]=E(\hbox{W}^{2})-2E(\hbox{W})\upeta_{w}+\upeta_{w}^{2}= E(\hbox{W}^{2})-2\upeta_{{\rm w}}^{2}+\upeta_{{\rm w}}^{2}= E(\hbox{W}^{2})-\upeta_{{\rm w}}^{2}=E(\hbox{W}^{2})-d_{2}^{2}.\) The values of d ₂ and d ₃ are tabulated for different n.
This is because \(\sigma_{\overline{\rm X}}=(1.6555-1.5189)/3=0.0455,\) and the z value of the sixth point is (1.67 − 1.5189)/0.0455≈3.32. Therefore, the probability of P(Z < 3.32) is about 0.9995.
The \(\hbox{UCL}_{T^{2}}\) is calculated using α = 0.02, m = 18 and k = 2, and obtains B _0.99(1,7.5) = 0.4588. Therefore, the \(\hbox{UCL}_{T^{2}}\) is about 16.0556 × 0.4583 = 7.37 and \(\hbox{LCL}_{T^{2}}\) is about 0.02.

References

Antoniol, G., Gradara, S., & Venturi, G. (2004). Methodological issues in a CMM level 4 implementation. Software Process: Improvement and Practice, 9(1), 33–50.
Article Google Scholar
Baldassarre, M. T., Boffoli, N., Caivano, D., & Visaggio, G. (2005). Improving dynamic calibration through statistical process control. International Conference on Software Maintenance (ICSM’2005), Budapest, Hungary, pp. 273–282.
Basili, V. R., & Rombach, H. D. (1988). The TAME project: towards improvement-oriented software environments. IEEE Transactions on Software Engineering, 14(6), 758–773.
Article Google Scholar
Benbasat, I., Goldstein, D. K., & Mead, M. (1987). The case research strategy in studies of information systems. MIS Quarterly, 11(3), 369–386.
Article Google Scholar
Boehm, B. (1981). Software engineering economics. New Jersey: Prentice-Hill.
MATH Google Scholar
Boehm, B. (1993). Value-based software engineering. ACM SIGSOFT, 28(2):1–12.
Google Scholar
Box, G. E. P., & Cox, D. R. (1964). An analysis of transformations. Journal of the Royal Statistical Society: Series B (Methodological), 26(2), 211–252.
MATH MathSciNet Google Scholar
Briand, L. C., Morasca, S., & Basili, V. R. (2002). An operational process for goal-driven definition of measures. IEEE Transactions on Software Engineering, 28(12), 1106–1125.
Article Google Scholar
Brodman, J. G., & Johnson, D. L. (1996). Return on investment from software process improvement as measured by U.S. industry, Crosstalk, STSC, Hill Air Force Base, Utah pp. 23–29.
Card, D. (1993). Defect-causal analysis drives down error rates. IEEE Software, 10(4), 98–99.
Article Google Scholar
Chillarege, R., Bhandari, I. S., Chaar, J. K., Halliday, M. J., Moebus, D. S., Ray, B. K., & Wong, M.-Y. (1992). Orthogonal defect classification—a concept for in-process measurements. IEEE Transactions on Software Engineering, 18(11), 943–956.
Article Google Scholar
Chrissis, M. B., Konrad, M., & Shrum, S. (2003). CMMI guidelines for process integration and product improvement (pp. 143–155). Addison-Wesley, MA.
Christensen, D. S. (1993). Determining an accurate estimate at completion. National Contract Management Journal, 25, 17–25.
MathSciNet Google Scholar
Cohen, J. (1988). Statistical power analysis for the behavioural sciences, 2nd ed. Hillsdale, NJ: Erlbaum.
Google Scholar
Cohen, J. (1996). Exploring psychological statistics. Pacific Grove, CA: Brooks/Cole.
Google Scholar
Crosier, R. B. (1988). Multivariate generations of cumulative sum quality-control schema. Technometrics, 30(3), 291–303.
Article MATH MathSciNet Google Scholar
D’Agostino, R. B., & Stephens, M. A. (1986). Goodness-of-fit techniques. NY, USA: Marcel Dekker.
MATH Google Scholar
Doppke, J. C., Heimbigner, D., & Wolf, A. L. (1998). Software process modeling and execution within virtual environments. ACM Transactions on Software Engineering and Methodology, 7(1), 1–40.
Article Google Scholar
Fenton, N., Krause, P., & Neil, M. (2002). Software measurement: Uncertainty and causal modeling. IEEE Software, 19(4), 116–122.
Article Google Scholar
Fenton, N., & Ohlsson, N. (2000). Quantitative analysis of faults and failure in a complex software system. IEEE Transactions on Software Engineering, 26(8), 797–814.
Article Google Scholar
Fenton, N., & Pfleeger, S. L. (1996). Software metrics: A rigorous and practical approach. International Thomson Publishing Company.
Fleming, Q. W. (1988). Cost/schedule control system criteria: The management guide to C/SCSC. Chicago: Probus.
Google Scholar
Fleming, Q. W., & Koppelman, J. M. (1998). Earned value project management: A powerful tool for software projects, CROSSTALK. The Journal of Defense Software Engineering, 10(7), 19–23.
Google Scholar
Florac, W. A., & Carleton, A. D. (1999). Measuring the software process. Boston: Addison Wesley.
Google Scholar
Geladi, P., & Kowalski, B. (1986). Partial least-squares regression: A tutorial. Analytica Chemica Acta, 185, 1–17.
Article Google Scholar
Harter, H. L. (1960). Tables of range and studentized range. The Annals of Mathematical Statistics, 31(4), 1122–1147.
Article Google Scholar
Helland, I. S. (1990). PLS regression and statistical models. Scandivian Journal of Statistics, 17, 94–114.
MathSciNet Google Scholar
Henderson, K. (2003). Earned schedule: A breakthrough extension to earned value theory? A retrospective analysis of real project data. The Measurable News, Summer 2003.
Hihn, J., & Habib-Agahi, H. (1991). Cost estimation of software intensive projects: A survey of current practices. International Conference on Software Engineering. Los Alamitor, CA, USA: IEEE Computer Society Press, pp. 276–287.
Hoskuldsson, A. (1988). PLS regression methods. Journal of Chemometrics, 2, 211–228.
Article Google Scholar
Hotelling, H. (1931). The generalization of student’s ratio. The Annals of Mathematical Statistics, 2(3), 360–378.
Article MATH Google Scholar
Hotelling, H. (1947). Multivariate quality control, illustrated by the air testing of sample bomb-sights, techniques of statistical analysis (pp. 111–184). New York: McGraw-Hill.
Google Scholar
Hunter, J. S. (1986). Exponentially Weighted Moving Average. Journal Quality Technology, 18, 203–210.
Google Scholar
Jacob, A. L., & Pillai, S. K. (2003). Statistical process control to improve coding and code review. IEEE Software, 20(3), 50–55.
Article Google Scholar
Jalote, P. (2000). CMM in practice: Processes for executing software projects at Infosys. Boston: Addison-Wesley.
Google Scholar
Kan, S. H. (2003). Metrics and models in software quality engineering. Boston: Addison-Wesley.
Google Scholar
Khoshgoftaar, T. M., Allen, E. B., Jones, W. D., & Hudepohl, J. P. (2000). Classification-tree models of software-quality over multiple releases. IEEE Transactions on Reliability, 49(1), 4–11.
Article Google Scholar
Kourti, T., & MacGregor, J. F. (1995). Process analysis, monitoring and diagnosis using multivariate projection methods. Chemometrics and Intelligent Laboratory Systems, 28, 3–21.
Article Google Scholar
Lane, J. A., & Zubrow, D. (1997). Integrating measurement with improvement: An action-oriented approach experience report. In Proc. of the ICSE 97. Boston, MA.
Lavazza, L. (2000). Providing automated support for the GQM measurement process. IEEE Software, 17(3), 56–62.
Article Google Scholar
Lawler, J., & Kitchenham, B. (2003). Measurement modeling technology. IEEE Software, 12(3), 68–75.
Article Google Scholar
Leszak, M., Perry, D. E., & Stoll, D. (2002). Classification and evaluation of defect in a project retrospective. The Journal of System and Software, 61(3), 173–187.
Article Google Scholar
Lipke, W. (2003). Schedule is different The Measurable News, Mar., 10–15.
Lipsey, M. W. (1990). Design sensitivity: Statistical power for experimental research. Newbury Park, CA: SAGE.
Google Scholar
Lowry, C. A., Woodall, W. H., Champ, C. W., & Rigdon, S. E. (1992). A multivariate exponentially weighted moving average control chart. Technometrics, 34(1), 46–53.
Article MATH Google Scholar
MacGregor, J. F. (1990). A different view of the funnel experiment. Journal of Quality Technology, 22(4), 255–259.
Google Scholar
Mahalanobis, P. C. (1936). On the generalized distance in statistics. In Proceedings of National Institute of Science India 12, Sec. 7, pp. 49–55.
Mason, R. L., Tracy, N. D., & Young, J. C. (1997). A practical approach for interpreting multivariate T² control chart signals. Journal of Quality Technology, 29, 396–406.
Google Scholar
McGarry, J., Card, D., Jones, C., Layman, B., Clark, E., Dean, J., & Hall, F. (2001). Practical software measurement: Objective information for decision makers. Addison-Wesley Pub Co.
Meyer, A. D., Loch, C. H., & Pich, M. T. (2002). Managing project uncertainty: From variation to chaos. Operations Management and Research, 43(2), 60–67.
Google Scholar
Mohapatra, S., & Mohanty, B. (2001). Defect prevention through defect prediction: A case study at Infosys. In Proc. of 2001 International Conference on Software Maintenance (ICSM 2001) (pp. 260–272). Florence, Italy.
Nijhuis, A., de Jong, S., & Vandeginste, B. G. M. (1997). Multivariate statistical process control in chromatography. Chemometrics and Intelligent Laboratory Systems, 38, 51–62.
Article Google Scholar
Nomikos, P., & MacGregor, J. F. (1995). Multivariate SPC charts for monitoring batch process. Technometrics, 37(1), 41–59.
Article MATH Google Scholar
Ortiz-Estarelles, O., Martín-Biosca, Y., Medina-Hernández, M. J., & Sagrado, S. (2001). On the internal multivariate quality control of analytical laboratories. A case study: The quality of drinking water. Chemometrics and Intelligent Laboratory System, 56, 93–103.
Article Google Scholar
Page, E. S. (1954). Continuous inspection schemes. Biometrika, 41(1/2), 100–115.
Article MATH MathSciNet Google Scholar
Pressman, R. S. (2001). Software engineering: A practitioner’s approach. New York: McGraw-Hill.
Google Scholar
Rencher, A. C. (1998). Multivariate statistical inference and applications. New York, NY: John Wiley.
MATH Google Scholar
Rubin, H. (1993). Software process maturity: Measuring its impact on productivity and quality. IEEE International Software Metrics Symposium (ICSE 93’). Baltimore, Maryland, United States, pp. 468–476.
Shaffer, J. P. (1995). Multiple hypothesis testing. Annual Review of Psychology, 46, 561–584.
Article Google Scholar
Shewhart, W. A. (1939). Statistical method from the viewpoint of quality control. New York, USA: Dover Publications, ISBN 0486652327.
Tayntor, C. B. (2002). Six sigma software development. Auerbach Publications.
Tracy, N. D., Young, J. C., & Mason, R. L. (1992). Multivariate control charts for individual observations. Journal of Qualify Technology, 24, 88–95.
Google Scholar
Walpole, R. E., Myers, R. H., Myers, S. L., & Ye, K. (2002). Probability and statistics for engineers and scientists. New Jersey: Prentice Hall.
Google Scholar
Weller, E. F. (2000). Practical applications of statistical process control. IEEE Software, 17(3), 48–55.
Article Google Scholar
Wilks, S. S. (1962). Mathematical statistics. New York: Wiley.
MATH Google Scholar
Wold, H. (1966). Estimation of principal components and related models by iterative least squares. In Proceedings of International Symposium in Dayton (pp. 391–402). Academic Press.
Yin, R. K. (2002). Case study research: Design and methods, 3 ed. London: Sage Publications.
Google Scholar

Download references

Acknowledgements

This work is partially supported by the National Science Council of Taiwan, R.O.C., under grant NSC-92-2213-E-309-005, and partially sponsored by the Ministry of Economic Affairs of Taiwan, under grant 93-EC-17-A-02-S1-029.

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Cheng-Kung University, Taiwan No.1, Ta-Hsueh Road, Tainan, 701, Taiwan
Ching-Pao Chang & Chih-Ping Chu

Authors

Ching-Pao Chang
View author publications
You can also search for this author in PubMed Google Scholar
Chih-Ping Chu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ching-Pao Chang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chang, CP., Chu, CP. Improvement of causal analysis using multivariate statistical process control. Software Qual J 16, 377–409 (2008). https://doi.org/10.1007/s11219-007-9042-3

Download citation

Published: 23 January 2008
Issue Date: September 2008
DOI: https://doi.org/10.1007/s11219-007-9042-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improvement of causal analysis using multivariate statistical process control

Abstract

Access this article

Similar content being viewed by others

Recent Developments in Software Reliability Modeling and its Applications

Monitoring and Diagnosis of Causal Relationships Among Variables

On statistical models for predicting software quality/reliability: generalized linear and linear mixed modeling

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Improvement of causal analysis using multivariate statistical process control

Abstract

Access this article

Similar content being viewed by others

Recent Developments in Software Reliability Modeling and its Applications

Monitoring and Diagnosis of Causal Relationships Among Variables

On statistical models for predicting software quality/reliability: generalized linear and linear mixed modeling

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation