Abstract
Key performance indicators of complex industrial process such as production quality and pollutant emissions concentration are difficult to be measured online due to limited detection technology and high economical cost. Their modeling samples have high dimension, strong uncertainty, and small sample, which cannot satisfy the needs of traditional machine learning algorithms. A virtual sample generation method based on generative adversarial fuzzy neural network (GAFNN) is proposed to address the abovementioned problems. First, an adaptive feature selection algorithm based on random forest is used to reduce input feature for the original real samples. Second, candidate virtual samples are generated by GAFNN to alleviate the problems of uncertainty and small sample. Third, the virtual samples are screened by a multi-constrained selection mechanism to improve the quality of virtual samples. Finally, a deep forest classification model is constructed on the basis of the mixed samples in terms of the original real and selected virtual samples. The effectiveness of the proposed method is verified on benchmark and real industrial data.


















Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Data availability statement
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Zhang T, Chen J, Xie J, Pan T (2021) SASLN: signals augmented self-taught learning networks for mechanical fault diagnosis under small sample condition. IEEE Trans Instrum Meas 70:1–11
Pan TY, Chen JL, Zhang TC, Liu S, He SL, Lv HX (2021) Generative adversarial network in mechanical fault diagnosis under small sample: A systematic review on applications and future perspectives. ISA Trans 128(Part B):1–10
Yin S, Ding XS, Xie XC, Luo H (2014) A review on basic data-driven approaches for industrial process monitoring. IEEE Trans Ind Electron 61(11):6418–6428
Zhu JL, Ge ZQ, Song ZH, Gao FR (2018) Review and big data perspectives on robust data mining approaches for industrial process modeling with outliers and missing data. Annu Rev Control 46:107–133
Chen Y, Xu P, Chu Y et al (2017) Short-term electrical load forecasting using the support vector regression (SVR) model to calculate the demand response baseline for office buildings. Appl Energy 195:659–670
Pang TY, Yu TX, Song BF (2021) A Bayesian network model for fault diagnosis of a lock mechanism based on degradation data. Eng Fail Anal 122:1–21
Ding YF, Jia MP, Miao QH, Huang P (2021) Remaining useful life estimation using deep metric transfer learning for kernel regression. Reliab Eng Syst Saf 212:1–11
Yuan XF, Gu YJ, Wang YL, Yang CH, Gui WH (2019) A deep supervised learning framework for data-driven soft sensor modeling of industrial processes. IEEE Trans Neural Netw Learn Syst 31(11):4737–4746
Hu XB, Niu PF, Wang JM, Zhang XX (2020) Multi-objective prediction of coal-fired boiler with a deep hybrid neural networks. Atmos Pollut Res 11(7):1084–1090
Liu XJ, Zhang H, Kong XB, Lee KY (2020) Wind speed forecasting using deep neural network with feature selection. Neurocomputing 397:393–403
Hemanth DJ, Deperlioglu O, Kose U (2020) An enhanced diabetic retinopathy detection and classification approach using deep convolutional neural network. Neural Comput Appl 32(3):707–721
Kromp F, Fischer L, Bozsaky E et al (2021) Evaluation of deep learning architectures for complex immunofluorescence nuclear image segmentation. IEEE Trans Med Imaging 40(7):1934–1949
Liang Y, Li BB, Jiao B (2020) A deep learning method for motor fault diagnosis based on a capsule network with gate-structure dilated convolutions. Neural Comput Appl 33(5):1401–1418
Wen L, Li XY, Gao L (2019) A transfer convolutional neural network for fault diagnosis based on ResNet-50. Neural Comput Appl 32(10):6111–6124
Tang J, Xia H, Zhang J, Qiao JF, Yu W (2021) Deep forest regression based on cross-layer full connection. Neural Comput Appl 33:9307–9328
Xia H, Tang J, Qiao JF, Zhang J, Yu W (2022) DF classification algorithm for constructing a small sample size of data-oriented DF regression model. Neural Comput Appl 34:2785–2810
Gong HF, Chen ZS, Zhu QX, He YL (2017) A Monte Carlo and PSO based virtual sample generation method for enhancing the energy prediction and energy optimization on small data problem: an empirical study of petrochemical industries. Appl Energy 197:405–455
Zhu QX, Chen ZS, Zhang HX et al (2020) Dealing with small sample size problems in process industry using virtual sample generation: a Kriging-based approach. Soft Comput 24:6889–6902
Zhu QX, Liu DP, Xu Y, He YL (2021) Novel space projection interpolation based virtual sample generation for solving the small data problem in developing soft sensor. Chemom Intell Lab Syst 217:1–13
Tang J, Qiao JF, Chai TY, Liu Z, Wu ZW (2018) Modeling multiple components mechanical signals by means of virtual sample generation technique. Acta Automatica Sinica 44(9):1569–1589
Tang J, Xia H, Aljerf L, Wang DD, Ukaogo OP (2022) Prediction of dioxin emission from municipal solid waste incineration based on expansion, interpolation, and selection for small samples. J Environ Chem Eng 10:108314
Li CA, Lin LS (2014) Generating information for small data sets with a multi-modal distribution. Decis Support Syst 66:71–81
Bennin EK, Keung JW, Monden A (2019) On the relative value of data resampling approaches for software defect prediction. Empir Softw Eng 24(2):602–636
Xie YX, Qiu M, Zhang HB, Peng LZ, Chen ZX (2020) Gaussian distribution based oversampling for imbalanced data classification. IEEE Trans Knowl Data Eng 34(2):667–679
He YL, Hua Q, Zhu QX, Lu S (2022) Enhanced virtual sample generation based on manifold features: applications to developing soft sensor using small data. ISA Trans 126:398–406
Li DC, Wu SC, Tsai TI, Lina YS (2007) Using mega-trend-diffusion and artificial samples in small data set learning for early flexible manufacturing system scheduling knowledge. Comput Oper Res 34(4):966–982
Li DC, Chen CC, Chang CJ, Lin WK (2012) A tree-based-trend-diffusion prediction procedure for small sample sets in the early stages of manufacturing systems. Expert Syst Appl 39(1):1575–1581
Goodfellow IJ, Pouget-Abadie J, Mirza M et al (2014) Generative adversarial networks. Adv Neural Inf Process Syst 3:2672–2680
Lian J, Jia WK, Zareapoor M et al (2020) Deep-learning-based small surface defect detection via an exaggerated local variation-based generative adversarial network. IEEE Trans Ind Inf 16(2):1343–1351
Li YB, Zou WT, Jiang L (2022) Fault diagnosis of rotating machinery based on combination of Wasserstein generative adversarial networks and long short term memory fully convolutional network. Measurement 191:1–16
Tang J, Cui CL, Xia H, Wang DD, Qiao JF (2022) Dioxin emission risk warning model in MSWI process based on GAN with active learning mechanism. J Beijing Univ Technol 1–14 (Accepted)
Tang J, Liu Z, Zhang J, Wu ZW, Chai TY, Yu W (2016) Kernel latent features adaptive extraction and selection method for multi-component non-stationary signal of industrial mechanical device. Neurocomputing 216:296–309
Wan YT, Ma AL, Zhong YF, Hu X, Zhang LP (2020) Multiobjective hyperspectral feature selection based on discrete sine cosine algorithm. IEEE Trans Geosci Remote Sens 58(5):3601–3618
Bao S, Zhang L, Han XS et al (2022) Feature selection method for nonintrusive load monitoring with balanced redundancy and relevancy. IEEE Trans Ind Appl 58(1):163–172
Xia H, Tang J, Aljerf L (2022) Dioxin emission prediction based on improved deep forest regression for municipal solid waste incineration process. Chemosphere 294:1–13
Zadeh LA (1973) Outline of a new approach to the analysis of complex systems and decision processes. IEEE Trans Syst Man Cybern 3(1):28–44
Jang JSR (1993) ANFIS: adaptive-network-based fuzzy inference system. IEEE Trans Syst Man Cybern 23(3):665–685
Zhang RD, Tao JL (2018) A nonlinear fuzzy neural network modeling approach using an improved genetic algorithm. IEEE Trans Ind Electron 65(7):5882–5892
Wang GQ, Chen X, Li YX (2019) Fuzzy neural network analysis on gray cast iron with high tensile strength and thermal conductivity. China Foundry 16(3):190–197
Qiao JF, Quan LM, Yang CL (2020) Design of modeling error PDF based fuzzy neural network for effluent ammonia nitrogen prediction. Appl Soft Comput J 91:1–11
Juang CF, Chen YW (2022) Automatic hitting-duration estimation of a rechargeable impact wrench using a fuzzy neural network to reach target toques. Int J Fuzzy Syst. https://doi.org/10.1007/s40815-022-01387-9
Adeleke O, Akinlabi S, Jen TC, Adedeji PA, Dunmade I (2022) Evolutionary-based neuro-fuzzy modelling of combustion enthalpy of municipal solid waste. Neural Comput Appl 34:7419–7436
Lee SC, Lee ET (1975) Fuzzy neural networks. Math Biosci 23(1–2):151–177
Wan P, Sun DH, Zhao M, Huang S (2019) multistability for almost-periodic solutions of Takagi-Sugeno fuzzy neural networks with nonmonotonic discontinuous activation functions and time-varying delays. IEEE Trans Fuzzy Syst 29(2):400–414
Zhou ZH, Feng J (2019) Deep forest. Natl Sci Rev 6:74–86
Fan W, Wang H (2013) Is random model better? On its accuracy and efficiency. In: ICDM’03: proceedings of the third IEEE international conference on data mining, pp 51–58
Tang J, Guo ZH, Qiao JF (2022) Dioxin emission concentration soft measurement based on multi-source latent feature selective ensemble modeling for municipal solid waste incineration process. Acta Automatica Sinica 48(01):223–238
Tang J, Wang DD, Guo ZH, Qiao JF (2021) Prediction of dioxin emission concentration in municipal solid waste incineration process based on optimal selection of virtual samples. J Beijing Univ Technol 47(05):431–443
Acknowledgements
This work was financially supported by National Natural Science Foundation of China (62073006 and 62021003), Beijing Natural Science Foundation (4212032 and 4192009), National Key Research and Development Program of China (2021ZD0112301 and 2021ZD0112302).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest. This article is considered for publication on the understanding that the article has neither been published nor will be published anywhere else before being published in Neural Computing and Applications.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Cui, C., Tang, J., Xia, H. et al. Virtual sample generation method based on generative adversarial fuzzy neural network. Neural Comput & Applic 35, 6979–7001 (2023). https://doi.org/10.1007/s00521-022-08104-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-022-08104-5