Skip to main content
Log in

Joint modeling of classification and regression for improving faulty wafer detection in semiconductor manufacturing

  • Published:
Journal of Intelligent Manufacturing Aims and scope Submit manuscript

Abstract

In the semiconductor manufacturing process, it is important to identify wafers on which faults have occurred or will occur to avoid unnecessary and costly further processing and physical inspections. This issue can be addressed by formulating the faulty wafer detection problem as a predictive modeling task, in which the process parameters/measurements and subsequent inspection results concerning the faults comprise the input and output variables at the wafer level, respectively. To achieve improved predictive performance, this paper presents a joint modeling method that incorporates classification and regression tasks into a single prediction model. Given the output variables in both binary and continuous forms, the prediction model simultaneously considers both the classification and regression tasks to complement each other, where each task predicts the binary and continuous output variables, respectively. The outputs from these two tasks are combined to predict whether a wafer is faulty. The entire model is implemented as a neural network, and is trained by optimizing a single objective function. The effectiveness of the model is demonstrated with a case study using real-world data from a semiconductor manufacturer.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  • Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., et al. (2016). TensorFlow: Large-scale machine learning on heterogeneous systems. In Proceedings of the 12th USENIX symposium on operating systems design and implementation (pp. 265–283).

  • Bai, Y., Sun, Z., Zeng, B., Long, J., Li, L., de Oliveira, J. V., et al. (2018). A comparison of dimension reduction techniques for support vector machine modeling of multi-parameter manufacturing quality prediction. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-017-1388-1.

    Google Scholar 

  • Balaguer, B., Erinc, G., & Carpin S. (2012). Combining classification and regression for WiFi localization of heterogeneous robot teams in unknown environments. In Proceedings of the 2012 IEEE/RSJ international conference on intelligent robots and systems (pp. 3496–3503).

  • Baly, R., & Hajj, H. (2012). Wafer classification using support vector machines. IEEE Transactions on Semiconductor Manufacturing, 25(3), 373–383.

    Google Scholar 

  • Bradley, A. P. (1997). The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30(7), 1145–1159.

    Google Scholar 

  • Caruana, R. (1997). Multitask learning. Machine Learning, 28(1), 41–75.

    Google Scholar 

  • Chang, C. Y., Li, C. H., Chang, Y. C., & Jeng, M. (2011). Wafer defect inspection by neural analysis of region features. Journal of Intelligent Manufacturing, 22(6), 953–964.

    Google Scholar 

  • Chen, P., Wu, S., Lin, J., Ko, F., Lo, H., Wang, J., et al. (2005). Virtual metrology: A solution for wafer to wafer advanced process control. In Proceedings of the 2005 IEEE international symposium on semiconductor manufacturing (pp. 155–157).

  • Chien, C. F., & Chuang, S. C. (2014). A framework for root cause detection of sub-batch processing system for semiconductor manufacturing big data analytics. IEEE Transactions on Semiconductor Manufacturing, 27(4), 475–488.

    Google Scholar 

  • Chien, C. F., Hsu, C. Y., & Chen, P. N. (2013). Semiconductor fault detection and classification for yield enhancement and manufacturing intelligence. Flexible Services and Manufacturing Journal, 25(3), 367–388.

    Google Scholar 

  • Chien, C. F., Liu, C. W., & Chuang, S. C. (2017). Analysing semiconductor manufacturing big data for root cause detection of excursion for yield enhancement. International Journal of Production Research, 55(17), 5095–5107.

    Google Scholar 

  • Chien, C. F., Wang, W. C., & Cheng, J. C. (2007). Data mining for yield enhancement in semiconductor manufacturing and an empirical study. Expert Systems with Applications, 33(1), 192–198.

    Google Scholar 

  • García, V., Sánchez, J. S., Rodríguez-Picón, L. A., Méndez-González, L. C., & de Jesús Ochoa-Domínguez, H. (2018). Using regression models for predicting the product quality in a tubing extrusion process. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-018-1418-7.

    Google Scholar 

  • Glocker, B., Pauly, O., Konukoglu, E., & Criminisi, A. (2012). Joint classification-regression forests for spatially structured multi-object segmentation. In Proceedings of the 2012 European conference on computer vision (pp. 870–881). Springer.

  • He, H., & Garcia, E. A. (2009). Learning from imbalanced data. IEEE Transactions on Knowledge & Data Engineering, 21(9), 1263–1284.

    Google Scholar 

  • He, Q. P., & Wang, J. (2007). Fault detection using the k-nearest neighbor rule for semiconductor manufacturing processes. IEEE Transactions on Semiconductor Manufacturing, 20(4), 345–354.

    Google Scholar 

  • He, S. G., He, Z., & Wang, G. A. (2013). Online monitoring and fault identification of mean shifts in bivariate processes using decision tree learning techniques. Journal of Intelligent Manufacturing, 24(1), 25–34.

    Google Scholar 

  • Hervás, C., Gutierrez, P. A., Silva, M., & Serrano, J. M. (2007). Combining classification and regression approaches for the quantification of highly overlapping capillary electrophoresis peaks by using evolutionary sigmoidal and product unit neural networks. Journal of Chemometrics, 21(12), 567–577.

    Google Scholar 

  • Hsu, C. C., & Chen, M. S. (2016). Intelligent maintenance prediction system for LED wafer testing machine. Journal of Intelligent Manufacturing, 27(2), 335–342.

    Google Scholar 

  • Huang, J., & Ling, C. X. (2005). Using AUC and accuracy in evaluating learning algorithms. IEEE Transactions on Knowledge and Data Engineering, 17(3), 299–310.

    Google Scholar 

  • Huang, S. H., & Pan, Y. C. (2015). Automated visual inspection in the semiconductor industry: A survey. Computers in Industry, 66, 1–10.

    Google Scholar 

  • Japkowicz, N., & Stephen, S. (2002). The class imbalance problem: A systematic study. Intelligent Data Analysis, 6(5), 429–449.

    Google Scholar 

  • Joshi, K. D., Chauhan, V., & Surgenor, B. (2018). A flexible machine vision system for small part inspection based on a hybrid SVM/ANN approach. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-018-1438-3.

    Google Scholar 

  • Kang, S. (2018). On effectiveness of transfer learning approach for neural network-based virtual metrology modeling. IEEE Transactions on Semiconductor Manufacturing, 31(1), 149–155.

    Google Scholar 

  • Kang, S., & Kang, P. (2018). Locally linear ensemble for regression. Information Sciences, 432, 199–209.

    Google Scholar 

  • Khan, A. A., Moyne, J. R., & Tilbury, D. M. (2007). An approach for factory-wide control utilizing virtual metrology. IEEE Transactions on Semiconductor Manufacturing, 20(4), 364–375.

    Google Scholar 

  • Kim, D., Kang, P., Cho, S., joo Lee, H., & Doh, S. (2012). Machine learning-based novelty detection for faulty wafer detection in semiconductor manufacturing. Expert Systems with Applications, 39(4), 4075–4083.

    Google Scholar 

  • Kingma, D. P., & Ba, J. (2015). Adam: A method for stochastic optimization. In Proceedings of the 3rd international conference on learning representations.

  • Kumar, N., Kennedy, K., Gildersleeve, K., Abelson, R., Mastrangelo, C. M., & Montgomery, D. C. (2006). A review of yield modelling techniques for semiconductor manufacturing. International Journal of Production Research, 44(23), 5019–5036.

    Google Scholar 

  • Kyeong, K., & Kim, H. (2018). Classification of mixed-type defect patterns in wafer bin maps using convolutional neural networks. IEEE Transactions on Semiconductor Manufacturing, 31(3), 395–402. https://doi.org/10.1109/TSM.2018.2841416.

    Article  Google Scholar 

  • LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444.

    Google Scholar 

  • Lee, K. B., & Kim, C. O. (2018). Recurrent feature-incorporated convolutional neural network for virtual metrology of the chemical mechanical planarization process. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-018-1437-4.

    Google Scholar 

  • Li, C., Sanchez, R. V., Zurita, G., Cerrada, M., Cabrera, D., & Vásquez, R. E. (2015). Multimodal deep support vector classification with homologous features and its application to gearbox fault diagnosis. Neurocomputing, 168, 119–127.

    Google Scholar 

  • Li, D. C., Chen, W. C., Liu, C. W., & Lin, Y. S. (2012). A non-linear quality improvement model using SVR for manufacturing TFT-LCDs. Journal of Intelligent Manufacturing, 23(3), 835–844.

    Google Scholar 

  • Li, Y., Lan, C., Xing, J., Zeng, W., Yuan, C., & Liu, J. (2016). Online human action detection using joint classification-regression recurrent neural networks. In Proceedings of the 2016 European conference on computer vision (pp. 203–220). Springer.

  • Liu, L., Tian, S., Xue, D., Zhang, T., & Chen, Y. (2018). Industrial feedforward control technology: A review. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-018-1399-6.

    Google Scholar 

  • Miao, Q., Zhang, X., Liu, Z., & Zhang, H. (2017). Condition multi-classification and evaluation of system degradation process using an improved support vector machine. Microelectronics Reliability, 75, 223–232.

    Google Scholar 

  • Moyne, J., Samantaray, J., & Armacost, M. (2016). Big data capabilities applied to semiconductor manufacturing advanced process control. IEEE Transactions on Semiconductor Manufacturing, 29(4), p283–291.

    Google Scholar 

  • Pan, J. C. H., & Tai, D. H. E. (2011). A new strategy for defect inspection by the virtual inspection in semiconductor wafer fabrication. Computers & Industrial Engineering, 60(1), 16–24.

    Google Scholar 

  • Park, C., Kim, Y., Park, Y., & Kim, S. B. (2018). Multitask learning for virtual metrology in semiconductor manufacturing systems. Computers & Industrial Engineering, 123, 209–219.

    Google Scholar 

  • Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.

    Google Scholar 

  • Plastino, A., Gonçalves, E. C., da Silva, P. N., Carneiro, G., & de Vasconcellos Azeredo, R. B. (2017). Combining classification and regression for improving permeability estimations from 1H NMR relaxation data. Journal of Applied Geophysics, 146, 95–102.

    Google Scholar 

  • Schmidhuber, J. (2015). Deep learning in neural networks: An overview. Neural Networks, 61, 85–117.

    Google Scholar 

  • Schulter, S., Leistner, C., Wohlhart, P., Roth, P. M., & Bischof, H. (2014). Accurate object detection with joint classification-regression random forests. In Proceedings of the 2014 IEEE conference on computer vision and pattern recognition (pp. 923–930).

  • Seera, M., Lim, C. P., & Loo, C. K. (2016). Motor fault detection and diagnosis using a hybrid FMM-CART model with online learning. Journal of Intelligent Manufacturing, 27(6), 1273–1285.

    Google Scholar 

  • Shin, C. K., & Park, S. C. (2000). A machine learning approach to yield management in semiconductor manufacturing. International Journal of Production Research, 38(17), 4261–4271.

    Google Scholar 

  • Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. (2014). Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15, 1929–1958.

    Google Scholar 

  • Su, A. J., Jeng, J. C., Huang, H. P., Yu, C. C., Hung, S. Y., & Chao, C. K. (2007). Control relevant issues in semiconductor manufacturing: Overview with some new results. Control Engineering Practice, 15(10), 1268–1279.

    Google Scholar 

  • Tong, L. I., & Chao, L. C. (2008). Novel yield model for integrated circuits with clustered defects. Expert Systems with Applications, 34(4), 2334–2341.

    Google Scholar 

  • Uzsoy, R., Lee, C. Y., & Martin-Vega, L. A. (1992). A review of production planning and scheduling models in the semiconductor industry part I: System characteristics, performance evaluation and production planning. IIE Transactions, 24(4), 47–60.

    Google Scholar 

  • van Heerden, C., Barnard, E., Davel, M., van der Walt, C., van Dyk, E., Feld, M., et al. (2010). Combining regression and classification methods for improving automatic speaker age recognition. In Proceedings of the 2010 IEEE international conference on acoustics, speech and signal processing (pp. 5174–5177).

  • Wolpert, D. H. (1996). The lack of a priori distinctions between learning algorithms. Neural Computation, 8(7), 1341–1390.

    Google Scholar 

  • Wu, L., & Zhang, J. (2010). Fuzzy neural network based yield prediction model for semiconductor manufacturing system. International Journal of Production Research, 48(11), 3225–3243.

    Google Scholar 

  • Wu, M. J., Jang, J. S. R., & Chen, J. L. (2015). Wafer map failure pattern recognition and similarity ranking for large-scale data sets. IEEE Transactions on Semiconductor Manufacturing, 28(1), 1–12.

    Google Scholar 

  • Wu, Q., Ding, K., & Huang, B. (2018). Approach for fault prognosis using recurrent neural network. Journal of Intelligent Manufacturing. https://doi.org/10.1007/s10845-018-1428-5.

  • Yung-Cheng, J. C., & Cheng, F. T. (2005). Application development of virtual metrology in semiconductor industry. In Proceedings of the 32nd annual conference of IEEE industrial electronics society (pp. 124–129).

  • Zhang, Y., & Yang, Q. (2017). A survey on multi-task learning. arXiv preprint arXiv:1707.08114.

  • Zhu, X., Suk, H. I., & Shen, D. (2014). A novel matrix-similarity based loss function for joint regression and classification in AD diagnosis. NeuroImage, 100, 91–105.

    Google Scholar 

  • Zhu, X., Suk, H. I., Wang, L., Lee, S. W., Shen, D., & Alzheimer’s Disease Neuroimaging Initiative. (2017). A novel relational regularization feature selection method for joint regression and classification in AD diagnosis. Medical Image Analysis, 38, 205–214.

Download references

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korea government (MSIT; Ministry of Science and ICT) (No. NRF-2017R1C1B5075685).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Seokho Kang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kang, S. Joint modeling of classification and regression for improving faulty wafer detection in semiconductor manufacturing. J Intell Manuf 31, 319–326 (2020). https://doi.org/10.1007/s10845-018-1447-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10845-018-1447-2

Keywords