Abstract
Statistical process control (SPC) is a conventional means of monitoring software processes and detecting related problems, where the causes of detected problems can be identified using causal analysis. Determining the actual causes of reported problems requires significant effort due to the large number of possible causes. This study presents an approach to detect problems and identify the causes of problems using multivariate SPC. This proposed method can be applied to monitor multiple measures of software process simultaneously. The measures which are detected as the major impacts to the out-of-control signals can be used to identify the causes where the partial least squares (PLS) and statistical hypothesis testing are utilized to validate the identified causes of problems in this study. The main advantage of the proposed approach is that the correlated indices can be monitored simultaneously to facilitate the causal analysis of a software process.
Similar content being viewed by others
Notes
The range of variable X can be treated as a variable and expressed as W = w/σ w (standardized range), in which σ w denotes the standard deviation of W. The E(Wk) represents the k-th moments of W, and the value of E(Wk) depends on n and k, in which the n denotes the subgroup size. The d 2 is defined as the first moment (or expectation) of W. Therefore, the value of d 2 depends on n, and can be treated as a control chart constant for given n.
The d 3 is defined as the standard deviation of the range and can be expressed as \(E[(\hbox{W}-\upeta_{\rm w})^{2}]\) , where \(\upeta_{{\rm w}}\) is the mean of W. The value of d 3 can be calculated using the second moment of W and d 2, as follows: \(d_{3}=E[(\hbox{W}-\upeta_{\rm w})^{2}]=E[\hbox{W}^{2}-2\hbox{W}\upeta_{{\rm w}}+\upeta_{{\rm w}}^{2}]=E(\hbox{W}^{2})-2E(\hbox{W})\upeta_{w}+\upeta_{w}^{2}= E(\hbox{W}^{2})-2\upeta_{{\rm w}}^{2}+\upeta_{{\rm w}}^{2}= E(\hbox{W}^{2})-\upeta_{{\rm w}}^{2}=E(\hbox{W}^{2})-d_{2}^{2}.\) The values of d 2 and d 3 are tabulated for different n.
This is because \(\sigma_{\overline{\rm X}}=(1.6555-1.5189)/3=0.0455,\) and the z value of the sixth point is (1.67 − 1.5189)/0.0455≈3.32. Therefore, the probability of P(Z < 3.32) is about 0.9995.
The \(\hbox{UCL}_{T^{2}}\) is calculated using α = 0.02, m = 18 and k = 2, and obtains B 0.99(1,7.5) = 0.4588. Therefore, the \(\hbox{UCL}_{T^{2}}\) is about 16.0556 × 0.4583 = 7.37 and \(\hbox{LCL}_{T^{2}}\) is about 0.02.
References
Antoniol, G., Gradara, S., & Venturi, G. (2004). Methodological issues in a CMM level 4 implementation. Software Process: Improvement and Practice, 9(1), 33–50.
Baldassarre, M. T., Boffoli, N., Caivano, D., & Visaggio, G. (2005). Improving dynamic calibration through statistical process control. International Conference on Software Maintenance (ICSM’2005), Budapest, Hungary, pp. 273–282.
Basili, V. R., & Rombach, H. D. (1988). The TAME project: towards improvement-oriented software environments. IEEE Transactions on Software Engineering, 14(6), 758–773.
Benbasat, I., Goldstein, D. K., & Mead, M. (1987). The case research strategy in studies of information systems. MIS Quarterly, 11(3), 369–386.
Boehm, B. (1981). Software engineering economics. New Jersey: Prentice-Hill.
Boehm, B. (1993). Value-based software engineering. ACM SIGSOFT, 28(2):1–12.
Box, G. E. P., & Cox, D. R. (1964). An analysis of transformations. Journal of the Royal Statistical Society: Series B (Methodological), 26(2), 211–252.
Briand, L. C., Morasca, S., & Basili, V. R. (2002). An operational process for goal-driven definition of measures. IEEE Transactions on Software Engineering, 28(12), 1106–1125.
Brodman, J. G., & Johnson, D. L. (1996). Return on investment from software process improvement as measured by U.S. industry, Crosstalk, STSC, Hill Air Force Base, Utah pp. 23–29.
Card, D. (1993). Defect-causal analysis drives down error rates. IEEE Software, 10(4), 98–99.
Chillarege, R., Bhandari, I. S., Chaar, J. K., Halliday, M. J., Moebus, D. S., Ray, B. K., & Wong, M.-Y. (1992). Orthogonal defect classification—a concept for in-process measurements. IEEE Transactions on Software Engineering, 18(11), 943–956.
Chrissis, M. B., Konrad, M., & Shrum, S. (2003). CMMI guidelines for process integration and product improvement (pp. 143–155). Addison-Wesley, MA.
Christensen, D. S. (1993). Determining an accurate estimate at completion. National Contract Management Journal, 25, 17–25.
Cohen, J. (1988). Statistical power analysis for the behavioural sciences, 2nd ed. Hillsdale, NJ: Erlbaum.
Cohen, J. (1996). Exploring psychological statistics. Pacific Grove, CA: Brooks/Cole.
Crosier, R. B. (1988). Multivariate generations of cumulative sum quality-control schema. Technometrics, 30(3), 291–303.
D’Agostino, R. B., & Stephens, M. A. (1986). Goodness-of-fit techniques. NY, USA: Marcel Dekker.
Doppke, J. C., Heimbigner, D., & Wolf, A. L. (1998). Software process modeling and execution within virtual environments. ACM Transactions on Software Engineering and Methodology, 7(1), 1–40.
Fenton, N., Krause, P., & Neil, M. (2002). Software measurement: Uncertainty and causal modeling. IEEE Software, 19(4), 116–122.
Fenton, N., & Ohlsson, N. (2000). Quantitative analysis of faults and failure in a complex software system. IEEE Transactions on Software Engineering, 26(8), 797–814.
Fenton, N., & Pfleeger, S. L. (1996). Software metrics: A rigorous and practical approach. International Thomson Publishing Company.
Fleming, Q. W. (1988). Cost/schedule control system criteria: The management guide to C/SCSC. Chicago: Probus.
Fleming, Q. W., & Koppelman, J. M. (1998). Earned value project management: A powerful tool for software projects, CROSSTALK. The Journal of Defense Software Engineering, 10(7), 19–23.
Florac, W. A., & Carleton, A. D. (1999). Measuring the software process. Boston: Addison Wesley.
Geladi, P., & Kowalski, B. (1986). Partial least-squares regression: A tutorial. Analytica Chemica Acta, 185, 1–17.
Harter, H. L. (1960). Tables of range and studentized range. The Annals of Mathematical Statistics, 31(4), 1122–1147.
Helland, I. S. (1990). PLS regression and statistical models. Scandivian Journal of Statistics, 17, 94–114.
Henderson, K. (2003). Earned schedule: A breakthrough extension to earned value theory? A retrospective analysis of real project data. The Measurable News, Summer 2003.
Hihn, J., & Habib-Agahi, H. (1991). Cost estimation of software intensive projects: A survey of current practices. International Conference on Software Engineering. Los Alamitor, CA, USA: IEEE Computer Society Press, pp. 276–287.
Hoskuldsson, A. (1988). PLS regression methods. Journal of Chemometrics, 2, 211–228.
Hotelling, H. (1931). The generalization of student’s ratio. The Annals of Mathematical Statistics, 2(3), 360–378.
Hotelling, H. (1947). Multivariate quality control, illustrated by the air testing of sample bomb-sights, techniques of statistical analysis (pp. 111–184). New York: McGraw-Hill.
Hunter, J. S. (1986). Exponentially Weighted Moving Average. Journal Quality Technology, 18, 203–210.
Jacob, A. L., & Pillai, S. K. (2003). Statistical process control to improve coding and code review. IEEE Software, 20(3), 50–55.
Jalote, P. (2000). CMM in practice: Processes for executing software projects at Infosys. Boston: Addison-Wesley.
Kan, S. H. (2003). Metrics and models in software quality engineering. Boston: Addison-Wesley.
Khoshgoftaar, T. M., Allen, E. B., Jones, W. D., & Hudepohl, J. P. (2000). Classification-tree models of software-quality over multiple releases. IEEE Transactions on Reliability, 49(1), 4–11.
Kourti, T., & MacGregor, J. F. (1995). Process analysis, monitoring and diagnosis using multivariate projection methods. Chemometrics and Intelligent Laboratory Systems, 28, 3–21.
Lane, J. A., & Zubrow, D. (1997). Integrating measurement with improvement: An action-oriented approach experience report. In Proc. of the ICSE 97. Boston, MA.
Lavazza, L. (2000). Providing automated support for the GQM measurement process. IEEE Software, 17(3), 56–62.
Lawler, J., & Kitchenham, B. (2003). Measurement modeling technology. IEEE Software, 12(3), 68–75.
Leszak, M., Perry, D. E., & Stoll, D. (2002). Classification and evaluation of defect in a project retrospective. The Journal of System and Software, 61(3), 173–187.
Lipke, W. (2003). Schedule is different The Measurable News, Mar., 10–15.
Lipsey, M. W. (1990). Design sensitivity: Statistical power for experimental research. Newbury Park, CA: SAGE.
Lowry, C. A., Woodall, W. H., Champ, C. W., & Rigdon, S. E. (1992). A multivariate exponentially weighted moving average control chart. Technometrics, 34(1), 46–53.
MacGregor, J. F. (1990). A different view of the funnel experiment. Journal of Quality Technology, 22(4), 255–259.
Mahalanobis, P. C. (1936). On the generalized distance in statistics. In Proceedings of National Institute of Science India 12, Sec. 7, pp. 49–55.
Mason, R. L., Tracy, N. D., & Young, J. C. (1997). A practical approach for interpreting multivariate T2 control chart signals. Journal of Quality Technology, 29, 396–406.
McGarry, J., Card, D., Jones, C., Layman, B., Clark, E., Dean, J., & Hall, F. (2001). Practical software measurement: Objective information for decision makers. Addison-Wesley Pub Co.
Meyer, A. D., Loch, C. H., & Pich, M. T. (2002). Managing project uncertainty: From variation to chaos. Operations Management and Research, 43(2), 60–67.
Mohapatra, S., & Mohanty, B. (2001). Defect prevention through defect prediction: A case study at Infosys. In Proc. of 2001 International Conference on Software Maintenance (ICSM 2001) (pp. 260–272). Florence, Italy.
Nijhuis, A., de Jong, S., & Vandeginste, B. G. M. (1997). Multivariate statistical process control in chromatography. Chemometrics and Intelligent Laboratory Systems, 38, 51–62.
Nomikos, P., & MacGregor, J. F. (1995). Multivariate SPC charts for monitoring batch process. Technometrics, 37(1), 41–59.
Ortiz-Estarelles, O., Martín-Biosca, Y., Medina-Hernández, M. J., & Sagrado, S. (2001). On the internal multivariate quality control of analytical laboratories. A case study: The quality of drinking water. Chemometrics and Intelligent Laboratory System, 56, 93–103.
Page, E. S. (1954). Continuous inspection schemes. Biometrika, 41(1/2), 100–115.
Pressman, R. S. (2001). Software engineering: A practitioner’s approach. New York: McGraw-Hill.
Rencher, A. C. (1998). Multivariate statistical inference and applications. New York, NY: John Wiley.
Rubin, H. (1993). Software process maturity: Measuring its impact on productivity and quality. IEEE International Software Metrics Symposium (ICSE 93’). Baltimore, Maryland, United States, pp. 468–476.
Shaffer, J. P. (1995). Multiple hypothesis testing. Annual Review of Psychology, 46, 561–584.
Shewhart, W. A. (1939). Statistical method from the viewpoint of quality control. New York, USA: Dover Publications, ISBN 0486652327.
Tayntor, C. B. (2002). Six sigma software development. Auerbach Publications.
Tracy, N. D., Young, J. C., & Mason, R. L. (1992). Multivariate control charts for individual observations. Journal of Qualify Technology, 24, 88–95.
Walpole, R. E., Myers, R. H., Myers, S. L., & Ye, K. (2002). Probability and statistics for engineers and scientists. New Jersey: Prentice Hall.
Weller, E. F. (2000). Practical applications of statistical process control. IEEE Software, 17(3), 48–55.
Wilks, S. S. (1962). Mathematical statistics. New York: Wiley.
Wold, H. (1966). Estimation of principal components and related models by iterative least squares. In Proceedings of International Symposium in Dayton (pp. 391–402). Academic Press.
Yin, R. K. (2002). Case study research: Design and methods, 3 ed. London: Sage Publications.
Acknowledgements
This work is partially supported by the National Science Council of Taiwan, R.O.C., under grant NSC-92-2213-E-309-005, and partially sponsored by the Ministry of Economic Affairs of Taiwan, under grant 93-EC-17-A-02-S1-029.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chang, CP., Chu, CP. Improvement of causal analysis using multivariate statistical process control. Software Qual J 16, 377–409 (2008). https://doi.org/10.1007/s11219-007-9042-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11219-007-9042-3