Abstract
The classification of defect patterns in wafer maps is a crucial task in the semiconductor industry. By accurately identifying and classifying these patterns, manufacturers can take necessary measures to improve the quality of their wafers and optimize their production processes. To address this challenge, we propose a stacking ensemble method and utilizes data augmentation. In the first step, to address the issue of data imbalance resulting from difficulties in collecting data in real-world environments, we apply data augmentation techniques with convolutional autoencoder to expand the training dataset. This helps to minimize the impact of minority sample class and improve the generalization ability of the classification model. In the second step, we leverage the stacking ensemble method, which combines multiple classification models to make more accurate predictions. By combining handcrafted features with deep features in training deep convolutional models, this approach has been shown to improve the classification performance and provide valuable support for fault diagnosis in the semiconductor industry. Experimental evaluation is performed with stacking models with fivefold cross-validation, evaluating and comparing them in terms of accuracy, precision, recall, and F1-score. The proposed method achieves an average classification accuracy of 0.9818, a micro-F1-score of 0.9818, and a macro-F1-score of 0.9318.













Similar content being viewed by others
Data availability
No datasets were generated or analysed during the current study.
References
Liu CW, Chien CF (2013) An intelligent system for wafer bin map defect diagnosis: An empirical study for semiconductor manufacturing. Eng Appl Artif Intel 26(5–6):1479–1486. https://doi.org/10.1016/j.engappai.2012.11.009
Chien CF, Hsu CY, Chen PN (2013) Semiconductor fault detection and classification for yield enhancement and manufacturing intelligence. Flex Serv Manuf J 25(3):367–388. https://doi.org/10.1007/s10696-012-9161-4
Wu MJ, Jang JSR, Chen JL (2015) Wafer map failure pattern recognition and similarity ranking for large-scale data sets. IEEE Trans Semicond Manuf 28(1):1–12. https://doi.org/10.1109/tsm.2014.2364237
Cunningham SP, Spanos CJ, Voros K (1995) Semiconductor yield improvement: results and best practices. IEEE Trans Semicond Manuf 8(2):103–109. https://doi.org/10.1109/66.382273
Chien CF, Wang WC, Cheng JC (2007) Data mining for yield enhancement in semiconductor manufacturing and an empirical study. Expert Syst Appl 33(1):192–198. https://doi.org/10.1016/j.eswa.2006.04.014
Shankar NG, Zhong ZW (2005) Defect detection on semiconductor wafer surfaces. Microelectron Eng 77(3–4):337–346. https://doi.org/10.1016/j.mee.2004.12.003
Batool U, Shapiai MI, Tahir M, Ismail ZH, Zakaria NJ, Elfakharany A (2021) A systematic review of deep learning for silicon wafer defect recognition. IEEE Access 9:116572–116593. https://doi.org/10.1109/access.2021.3106171
Wei Y, Wang H (2022) Mixed-type wafer defect pattern recognition framework based on multifaceted dynamic convolution. IEEE Trans Instrum Meas 71:1–11. https://doi.org/10.1109/tim.2022.3178498
Deng G, Wang H (2024) Efficient mixed-type wafer defect pattern recognition based on light-weight neural network. Micromachines 15(7):836. https://doi.org/10.3390/mi15070836
Kang S (2020) Rotation-invariant wafer map pattern classification with convolutional neural networks. IEEE Access 8:170650–170658. https://doi.org/10.1109/access.2020.3024603
Tziolas T et al (2022) Wafer map defect pattern recognition using imbalanced datasets. In: 2022 13th International Conference on Information, Intelligence, Systems & Applications (IISA), Corfu, Greece, pp 1–8, https://doi.org/10.1109/IISA56318.2022.9904402
Saqlain M, Abbas Q, Lee JY (2020) A deep convolutional neural network for wafer defect identification on an imbalanced dataset in semiconductor manufacturing processes. IEEE Trans Semicond Manuf 33(3):436–444. https://doi.org/10.1109/tsm.2020.2994357
Alqudah R, Al-Mousa AA, Abu Hashyeh Y, Alzaibaq OZ (2023) A systemic comparison between using augmented data and synthetic data as means of enhancing wafermap defect classification. Computers in Industry 145:103809. https://doi.org/10.1016/j.compind.2022.103809
Khalifa NE, Loey M, Mirjalili S (2022) A comprehensive survey of recent trends in deep learning for digital images augmentation. Artif Intell Rev 55(3):2351–2377. https://doi.org/10.1007/s10462-021-10066-4
Shorten C, Khoshgoftaar TM (2019) A survey on image data augmentation for deep learning. J Big Data 6(1):60. https://doi.org/10.1186/s40537-019-0197-0
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507. https://doi.org/10.1126/science.1127647
Shawon A, Faruk MO, Habib MB and Khan AM (2019) Silicon wafer map defect classification using deep convolutional neural network with data augmentation. In: 2019 IEEE 5th International Conference on Computer and Communications (ICCC), Chengdu, China, 2019, pp 1995–1999, https://doi.org/10.1109/ICCC47050.2019.9064029
Wang FK, Chou JH, Amogne ZE (2022) A deep convolutional neural network with residual blocks for wafer map defect pattern recognition. Qual Reliab Eng Int 38(1):343–357. https://doi.org/10.1002/qre.2983
Yu N, Chen H, Xu Q, Hasan MM, Sie O (2022) Wafer map defect patterns classification based on a lightweight network and data augmentation. CAAI Trans Intel Technol 8(3):1029–1042. https://doi.org/10.1049/cit2.12126
Tsai TH, Lee YC (2020) A light-weight neural network for wafer map classification based on data augmentation. IEEE Trans Semicond Manuf 33(4):663–672. https://doi.org/10.1109/TSM.2020.3013004
Zhang Y et al (2024) Class imbalance wafer defect pattern recognition based on shared-database decentralized federated learning framework. IEEE Trans Instrum Meas 73:1–17. https://doi.org/10.1109/tim.2024.3395316
Mengying F, Qin W and van der Waal B (2016) Wafer defect patterns recognition based on OPTICS and multi-label classification. In: 2016 IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), Xi’an, China, pp 912-915, https://doi.org/10.1109/IMCEC.2016.7867343
Saqlain M, Jargalsaikhan B, Lee JY (2019) A voting ensemble classifier for wafer map defect patterns identification in semiconductor manufacturing. IEEE Trans Semicond Manuf 32(2):171–182. https://doi.org/10.1109/tsm.2019.2904306
Kang H, Kang S (2021) A stacking ensemble classifier with handcrafted and convolutional features for wafer map pattern classification. Comput Ind 129:103450. https://doi.org/10.1016/j.compind.2021.103450
Yi Z et al (2024) Large-margin extreme learning machines with hybrid features for wafer map defect recognition. IEEE Trans Instrum Meas 73:1–10. https://doi.org/10.1109/tim.2024.3374295
Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444. https://doi.org/10.1038/nature14539
Ishida T, Nitta I, Fukuda D and Kanazawa Y (2019) Deep learning-based wafer-map failure pattern recognition framework. In: 20th International Symposium on Quality Electronic Design (ISQED), 6–7 March 2019, pp 291–297, https://doi.org/10.1109/ISQED.2019.8697407
Shin E, Yoo CD (2023) Efficient convolutional neural networks for semiconductor wafer bin map classification. Sensors 23(4):1926. https://doi.org/10.3390/s23041926
Chen S, Huang Z, Wang T, Hou X, Ma J (2023) Wafer map defect recognition based on multi-scale feature fusion and attention spatial pyramid pooling. J Intel Manuf 36(1):271–284. https://doi.org/10.1007/s10845-023-02231-z
Zhou ZH (2012) Ensemble methods: foundations and algorithms. Chapman & Hall/CRC, Boca Raton
Aggarwal CC (2014) Data classification: algorithms and applications. Chapman & Hall/CRC, Boca Raton
Fei X, Zhang Q, Ling Q (2019) Vehicle exhaust concentration estimation based on an improved stacking model. IEEE Access 7:179454–179463. https://doi.org/10.1109/ACCESS.2019.2958703
Rawat W, Wang ZH (2017) Deep convolutional neural networks for image classification: a comprehensive review. Neural Comput 29(9):2352–2449. https://doi.org/10.1162/neco_a_00990
Simonyan K and Zisserman A (2014) Very deep convolutional networks for large-scale image recognition, CoRR, vol abs/1409.1556
He K, Zhang X, Ren S and Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, pp 770–778, https://doi.org/10.1109/CVPR.2016.90
Huang G, Liu Z, Van Der Maaten L and Weinberger KQ (2017) Densely connected convolutional networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, pp 2261–2269, https://doi.org/10.1109/CVPR.2017.243
Russakovsky O et al (2015) ImageNet large scale visual recognition challenge. Int J Comput Vision 115(3):211–252. https://doi.org/10.1007/s11263-015-0816-y
Author information
Authors and Affiliations
Contributions
Chiung-Jung Yang wrote the main manuscript. Sun-Yuan Hsieh checked the correctness of the proposed algorithms and experiments.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yang, CJ., Chen, YH. & Hsieh, SY. Enhanced wafer map defect pattern classification through stacking ensemble method and data augmentation integration. J Supercomput 81, 643 (2025). https://doi.org/10.1007/s11227-025-07113-0
Accepted:
Published:
DOI: https://doi.org/10.1007/s11227-025-07113-0