Skip to main content
Log in

Improvement of causal analysis using multivariate statistical process control

  • Published:
Software Quality Journal Aims and scope Submit manuscript

Abstract

Statistical process control (SPC) is a conventional means of monitoring software processes and detecting related problems, where the causes of detected problems can be identified using causal analysis. Determining the actual causes of reported problems requires significant effort due to the large number of possible causes. This study presents an approach to detect problems and identify the causes of problems using multivariate SPC. This proposed method can be applied to monitor multiple measures of software process simultaneously. The measures which are detected as the major impacts to the out-of-control signals can be used to identify the causes where the partial least squares (PLS) and statistical hypothesis testing are utilized to validate the identified causes of problems in this study. The main advantage of the proposed approach is that the correlated indices can be monitored simultaneously to facilitate the causal analysis of a software process.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

Notes

  1. The range of variable X can be treated as a variable and expressed as W = w w (standardized range), in which σ w denotes the standard deviation of W. The E(Wk) represents the k-th moments of W, and the value of E(Wk) depends on n and k, in which the n denotes the subgroup size. The d 2 is defined as the first moment (or expectation) of W. Therefore, the value of d 2 depends on n, and can be treated as a control chart constant for given n.

  2. The d 3 is defined as the standard deviation of the range and can be expressed as \(E[(\hbox{W}-\upeta_{\rm w})^{2}]\) , where \(\upeta_{{\rm w}}\) is the mean of W. The value of d 3 can be calculated using the second moment of W and d 2, as follows: \(d_{3}=E[(\hbox{W}-\upeta_{\rm w})^{2}]=E[\hbox{W}^{2}-2\hbox{W}\upeta_{{\rm w}}+\upeta_{{\rm w}}^{2}]=E(\hbox{W}^{2})-2E(\hbox{W})\upeta_{w}+\upeta_{w}^{2}= E(\hbox{W}^{2})-2\upeta_{{\rm w}}^{2}+\upeta_{{\rm w}}^{2}= E(\hbox{W}^{2})-\upeta_{{\rm w}}^{2}=E(\hbox{W}^{2})-d_{2}^{2}.\) The values of d 2 and d 3 are tabulated for different n.

  3. This is because \(\sigma_{\overline{\rm X}}=(1.6555-1.5189)/3=0.0455,\) and the z value of the sixth point is (1.67 − 1.5189)/0.0455≈3.32. Therefore, the probability of P(Z < 3.32) is about 0.9995.

  4. The \(\hbox{UCL}_{T^{2}}\) is calculated using α = 0.02, m = 18 and k = 2, and obtains B 0.99(1,7.5) = 0.4588. Therefore, the \(\hbox{UCL}_{T^{2}}\) is about 16.0556 × 0.4583 = 7.37 and \(\hbox{LCL}_{T^{2}}\) is about 0.02.

References

  • Antoniol, G., Gradara, S., & Venturi, G. (2004). Methodological issues in a CMM level 4 implementation. Software Process: Improvement and Practice, 9(1), 33–50.

    Article  Google Scholar 

  • Baldassarre, M. T., Boffoli, N., Caivano, D., & Visaggio, G. (2005). Improving dynamic calibration through statistical process control. International Conference on Software Maintenance (ICSM’2005), Budapest, Hungary, pp. 273–282.

  • Basili, V. R., & Rombach, H. D. (1988). The TAME project: towards improvement-oriented software environments. IEEE Transactions on Software Engineering, 14(6), 758–773.

    Article  Google Scholar 

  • Benbasat, I., Goldstein, D. K., & Mead, M. (1987). The case research strategy in studies of information systems. MIS Quarterly, 11(3), 369–386.

    Article  Google Scholar 

  • Boehm, B. (1981). Software engineering economics. New Jersey: Prentice-Hill.

    MATH  Google Scholar 

  • Boehm, B. (1993). Value-based software engineering. ACM SIGSOFT, 28(2):1–12.

    Google Scholar 

  • Box, G. E. P., & Cox, D. R. (1964). An analysis of transformations. Journal of the Royal Statistical Society: Series B (Methodological), 26(2), 211–252.

    MATH  MathSciNet  Google Scholar 

  • Briand, L. C., Morasca, S., & Basili, V. R. (2002). An operational process for goal-driven definition of measures. IEEE Transactions on Software Engineering, 28(12), 1106–1125.

    Article  Google Scholar 

  • Brodman, J. G., & Johnson, D. L. (1996). Return on investment from software process improvement as measured by U.S. industry, Crosstalk, STSC, Hill Air Force Base, Utah pp. 23–29.

  • Card, D. (1993). Defect-causal analysis drives down error rates. IEEE Software, 10(4), 98–99.

    Article  Google Scholar 

  • Chillarege, R., Bhandari, I. S., Chaar, J. K., Halliday, M. J., Moebus, D. S., Ray, B. K., & Wong, M.-Y. (1992). Orthogonal defect classification—a concept for in-process measurements. IEEE Transactions on Software Engineering, 18(11), 943–956.

    Article  Google Scholar 

  • Chrissis, M. B., Konrad, M., & Shrum, S. (2003). CMMI guidelines for process integration and product improvement (pp. 143–155). Addison-Wesley, MA.

  • Christensen, D. S. (1993). Determining an accurate estimate at completion. National Contract Management Journal, 25, 17–25.

    MathSciNet  Google Scholar 

  • Cohen, J. (1988). Statistical power analysis for the behavioural sciences, 2nd ed. Hillsdale, NJ: Erlbaum.

    Google Scholar 

  • Cohen, J. (1996). Exploring psychological statistics. Pacific Grove, CA: Brooks/Cole.

    Google Scholar 

  • Crosier, R. B. (1988). Multivariate generations of cumulative sum quality-control schema. Technometrics, 30(3), 291–303.

    Article  MATH  MathSciNet  Google Scholar 

  • D’Agostino, R. B., & Stephens, M. A. (1986). Goodness-of-fit techniques. NY, USA: Marcel Dekker.

    MATH  Google Scholar 

  • Doppke, J. C., Heimbigner, D., & Wolf, A. L. (1998). Software process modeling and execution within virtual environments. ACM Transactions on Software Engineering and Methodology, 7(1), 1–40.

    Article  Google Scholar 

  • Fenton, N., Krause, P., & Neil, M. (2002). Software measurement: Uncertainty and causal modeling. IEEE Software, 19(4), 116–122.

    Article  Google Scholar 

  • Fenton, N., & Ohlsson, N. (2000). Quantitative analysis of faults and failure in a complex software system. IEEE Transactions on Software Engineering, 26(8), 797–814.

    Article  Google Scholar 

  • Fenton, N., & Pfleeger, S. L. (1996). Software metrics: A rigorous and practical approach. International Thomson Publishing Company.

  • Fleming, Q. W. (1988). Cost/schedule control system criteria: The management guide to C/SCSC. Chicago: Probus.

    Google Scholar 

  • Fleming, Q. W., & Koppelman, J. M. (1998). Earned value project management: A powerful tool for software projects, CROSSTALK. The Journal of Defense Software Engineering, 10(7), 19–23.

    Google Scholar 

  • Florac, W. A., & Carleton, A. D. (1999). Measuring the software process. Boston: Addison Wesley.

    Google Scholar 

  • Geladi, P., & Kowalski, B. (1986). Partial least-squares regression: A tutorial. Analytica Chemica Acta, 185, 1–17.

    Article  Google Scholar 

  • Harter, H. L. (1960). Tables of range and studentized range. The Annals of Mathematical Statistics, 31(4), 1122–1147.

    Article  Google Scholar 

  • Helland, I. S. (1990). PLS regression and statistical models. Scandivian Journal of Statistics, 17, 94–114.

    MathSciNet  Google Scholar 

  • Henderson, K. (2003). Earned schedule: A breakthrough extension to earned value theory? A retrospective analysis of real project data. The Measurable News, Summer 2003.

  • Hihn, J., & Habib-Agahi, H. (1991). Cost estimation of software intensive projects: A survey of current practices. International Conference on Software Engineering. Los Alamitor, CA, USA: IEEE Computer Society Press, pp. 276–287.

  • Hoskuldsson, A. (1988). PLS regression methods. Journal of Chemometrics, 2, 211–228.

    Article  Google Scholar 

  • Hotelling, H. (1931). The generalization of student’s ratio. The Annals of Mathematical Statistics, 2(3), 360–378.

    Article  MATH  Google Scholar 

  • Hotelling, H. (1947). Multivariate quality control, illustrated by the air testing of sample bomb-sights, techniques of statistical analysis (pp. 111–184). New York: McGraw-Hill.

    Google Scholar 

  • Hunter, J. S. (1986). Exponentially Weighted Moving Average. Journal Quality Technology, 18, 203–210.

    Google Scholar 

  • Jacob, A. L., & Pillai, S. K. (2003). Statistical process control to improve coding and code review. IEEE Software, 20(3), 50–55.

    Article  Google Scholar 

  • Jalote, P. (2000). CMM in practice: Processes for executing software projects at Infosys. Boston: Addison-Wesley.

    Google Scholar 

  • Kan, S. H. (2003). Metrics and models in software quality engineering. Boston: Addison-Wesley.

    Google Scholar 

  • Khoshgoftaar, T. M., Allen, E. B., Jones, W. D., & Hudepohl, J. P. (2000). Classification-tree models of software-quality over multiple releases. IEEE Transactions on Reliability, 49(1), 4–11.

    Article  Google Scholar 

  • Kourti, T., & MacGregor, J. F. (1995). Process analysis, monitoring and diagnosis using multivariate projection methods. Chemometrics and Intelligent Laboratory Systems, 28, 3–21.

    Article  Google Scholar 

  • Lane, J. A., & Zubrow, D. (1997). Integrating measurement with improvement: An action-oriented approach experience report. In Proc. of the ICSE 97. Boston, MA.

  • Lavazza, L. (2000). Providing automated support for the GQM measurement process. IEEE Software, 17(3), 56–62.

    Article  Google Scholar 

  • Lawler, J., & Kitchenham, B. (2003). Measurement modeling technology. IEEE Software, 12(3), 68–75.

    Article  Google Scholar 

  • Leszak, M., Perry, D. E., & Stoll, D. (2002). Classification and evaluation of defect in a project retrospective. The Journal of System and Software, 61(3), 173–187.

    Article  Google Scholar 

  • Lipke, W. (2003). Schedule is different The Measurable News, Mar., 10–15.

  • Lipsey, M. W. (1990). Design sensitivity: Statistical power for experimental research. Newbury Park, CA: SAGE.

    Google Scholar 

  • Lowry, C. A., Woodall, W. H., Champ, C. W., & Rigdon, S. E. (1992). A multivariate exponentially weighted moving average control chart. Technometrics, 34(1), 46–53.

    Article  MATH  Google Scholar 

  • MacGregor, J. F. (1990). A different view of the funnel experiment. Journal of Quality Technology, 22(4), 255–259.

    Google Scholar 

  • Mahalanobis, P. C. (1936). On the generalized distance in statistics. In Proceedings of National Institute of Science India 12, Sec. 7, pp. 49–55.

  • Mason, R. L., Tracy, N. D., & Young, J. C. (1997). A practical approach for interpreting multivariate T2 control chart signals. Journal of Quality Technology, 29, 396–406.

    Google Scholar 

  • McGarry, J., Card, D., Jones, C., Layman, B., Clark, E., Dean, J., & Hall, F. (2001). Practical software measurement: Objective information for decision makers. Addison-Wesley Pub Co.

  • Meyer, A. D., Loch, C. H., & Pich, M. T. (2002). Managing project uncertainty: From variation to chaos. Operations Management and Research, 43(2), 60–67.

    Google Scholar 

  • Mohapatra, S., & Mohanty, B. (2001). Defect prevention through defect prediction: A case study at Infosys. In Proc. of 2001 International Conference on Software Maintenance (ICSM 2001) (pp. 260–272). Florence, Italy.

  • Nijhuis, A., de Jong, S., & Vandeginste, B. G. M. (1997). Multivariate statistical process control in chromatography. Chemometrics and Intelligent Laboratory Systems, 38, 51–62.

    Article  Google Scholar 

  • Nomikos, P., & MacGregor, J. F. (1995). Multivariate SPC charts for monitoring batch process. Technometrics, 37(1), 41–59.

    Article  MATH  Google Scholar 

  • Ortiz-Estarelles, O., Martín-Biosca, Y., Medina-Hernández, M. J., & Sagrado, S. (2001). On the internal multivariate quality control of analytical laboratories. A case study: The quality of drinking water. Chemometrics and Intelligent Laboratory System, 56, 93–103.

    Article  Google Scholar 

  • Page, E. S. (1954). Continuous inspection schemes. Biometrika, 41(1/2), 100–115.

    Article  MATH  MathSciNet  Google Scholar 

  • Pressman, R. S. (2001). Software engineering: A practitioner’s approach. New York: McGraw-Hill.

    Google Scholar 

  • Rencher, A. C. (1998). Multivariate statistical inference and applications. New York, NY: John Wiley.

    MATH  Google Scholar 

  • Rubin, H. (1993). Software process maturity: Measuring its impact on productivity and quality. IEEE International Software Metrics Symposium (ICSE 93’). Baltimore, Maryland, United States, pp. 468–476.

  • Shaffer, J. P. (1995). Multiple hypothesis testing. Annual Review of Psychology, 46, 561–584.

    Article  Google Scholar 

  • Shewhart, W. A. (1939). Statistical method from the viewpoint of quality control. New York, USA: Dover Publications, ISBN 0486652327.

  • Tayntor, C. B. (2002). Six sigma software development. Auerbach Publications.

  • Tracy, N. D., Young, J. C., & Mason, R. L. (1992). Multivariate control charts for individual observations. Journal of Qualify Technology, 24, 88–95.

    Google Scholar 

  • Walpole, R. E., Myers, R. H., Myers, S. L., & Ye, K. (2002). Probability and statistics for engineers and scientists. New Jersey: Prentice Hall.

    Google Scholar 

  • Weller, E. F. (2000). Practical applications of statistical process control. IEEE Software, 17(3), 48–55.

    Article  Google Scholar 

  • Wilks, S. S. (1962). Mathematical statistics. New York: Wiley.

    MATH  Google Scholar 

  • Wold, H. (1966). Estimation of principal components and related models by iterative least squares. In Proceedings of International Symposium in Dayton (pp. 391–402). Academic Press.

  • Yin, R. K. (2002). Case study research: Design and methods, 3 ed. London: Sage Publications.

    Google Scholar 

Download references

Acknowledgements

This work is partially supported by the National Science Council of Taiwan, R.O.C., under grant NSC-92-2213-E-309-005, and partially sponsored by the Ministry of Economic Affairs of Taiwan, under grant 93-EC-17-A-02-S1-029.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ching-Pao Chang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chang, CP., Chu, CP. Improvement of causal analysis using multivariate statistical process control. Software Qual J 16, 377–409 (2008). https://doi.org/10.1007/s11219-007-9042-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11219-007-9042-3

Keywords

Navigation