Skip to main content
Log in

Virtual sample generation method based on generative adversarial fuzzy neural network

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Key performance indicators of complex industrial process such as production quality and pollutant emissions concentration are difficult to be measured online due to limited detection technology and high economical cost. Their modeling samples have high dimension, strong uncertainty, and small sample, which cannot satisfy the needs of traditional machine learning algorithms. A virtual sample generation method based on generative adversarial fuzzy neural network (GAFNN) is proposed to address the abovementioned problems. First, an adaptive feature selection algorithm based on random forest is used to reduce input feature for the original real samples. Second, candidate virtual samples are generated by GAFNN to alleviate the problems of uncertainty and small sample. Third, the virtual samples are screened by a multi-constrained selection mechanism to improve the quality of virtual samples. Finally, a deep forest classification model is constructed on the basis of the mixed samples in terms of the original real and selected virtual samples. The effectiveness of the proposed method is verified on benchmark and real industrial data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

Data availability statement

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

  1. Zhang T, Chen J, Xie J, Pan T (2021) SASLN: signals augmented self-taught learning networks for mechanical fault diagnosis under small sample condition. IEEE Trans Instrum Meas 70:1–11

    Google Scholar 

  2. Pan TY, Chen JL, Zhang TC, Liu S, He SL, Lv HX (2021) Generative adversarial network in mechanical fault diagnosis under small sample: A systematic review on applications and future perspectives. ISA Trans 128(Part B):1–10

    Google Scholar 

  3. Yin S, Ding XS, Xie XC, Luo H (2014) A review on basic data-driven approaches for industrial process monitoring. IEEE Trans Ind Electron 61(11):6418–6428

    Google Scholar 

  4. Zhu JL, Ge ZQ, Song ZH, Gao FR (2018) Review and big data perspectives on robust data mining approaches for industrial process modeling with outliers and missing data. Annu Rev Control 46:107–133

    MathSciNet  Google Scholar 

  5. Chen Y, Xu P, Chu Y et al (2017) Short-term electrical load forecasting using the support vector regression (SVR) model to calculate the demand response baseline for office buildings. Appl Energy 195:659–670

    Google Scholar 

  6. Pang TY, Yu TX, Song BF (2021) A Bayesian network model for fault diagnosis of a lock mechanism based on degradation data. Eng Fail Anal 122:1–21

    Google Scholar 

  7. Ding YF, Jia MP, Miao QH, Huang P (2021) Remaining useful life estimation using deep metric transfer learning for kernel regression. Reliab Eng Syst Saf 212:1–11

    Google Scholar 

  8. Yuan XF, Gu YJ, Wang YL, Yang CH, Gui WH (2019) A deep supervised learning framework for data-driven soft sensor modeling of industrial processes. IEEE Trans Neural Netw Learn Syst 31(11):4737–4746

    Google Scholar 

  9. Hu XB, Niu PF, Wang JM, Zhang XX (2020) Multi-objective prediction of coal-fired boiler with a deep hybrid neural networks. Atmos Pollut Res 11(7):1084–1090

    Google Scholar 

  10. Liu XJ, Zhang H, Kong XB, Lee KY (2020) Wind speed forecasting using deep neural network with feature selection. Neurocomputing 397:393–403

    Google Scholar 

  11. Hemanth DJ, Deperlioglu O, Kose U (2020) An enhanced diabetic retinopathy detection and classification approach using deep convolutional neural network. Neural Comput Appl 32(3):707–721

    Google Scholar 

  12. Kromp F, Fischer L, Bozsaky E et al (2021) Evaluation of deep learning architectures for complex immunofluorescence nuclear image segmentation. IEEE Trans Med Imaging 40(7):1934–1949

    Google Scholar 

  13. Liang Y, Li BB, Jiao B (2020) A deep learning method for motor fault diagnosis based on a capsule network with gate-structure dilated convolutions. Neural Comput Appl 33(5):1401–1418

    Google Scholar 

  14. Wen L, Li XY, Gao L (2019) A transfer convolutional neural network for fault diagnosis based on ResNet-50. Neural Comput Appl 32(10):6111–6124

    Google Scholar 

  15. Tang J, Xia H, Zhang J, Qiao JF, Yu W (2021) Deep forest regression based on cross-layer full connection. Neural Comput Appl 33:9307–9328

    Google Scholar 

  16. Xia H, Tang J, Qiao JF, Zhang J, Yu W (2022) DF classification algorithm for constructing a small sample size of data-oriented DF regression model. Neural Comput Appl 34:2785–2810

    Google Scholar 

  17. Gong HF, Chen ZS, Zhu QX, He YL (2017) A Monte Carlo and PSO based virtual sample generation method for enhancing the energy prediction and energy optimization on small data problem: an empirical study of petrochemical industries. Appl Energy 197:405–455

    Google Scholar 

  18. Zhu QX, Chen ZS, Zhang HX et al (2020) Dealing with small sample size problems in process industry using virtual sample generation: a Kriging-based approach. Soft Comput 24:6889–6902

    Google Scholar 

  19. Zhu QX, Liu DP, Xu Y, He YL (2021) Novel space projection interpolation based virtual sample generation for solving the small data problem in developing soft sensor. Chemom Intell Lab Syst 217:1–13

    Google Scholar 

  20. Tang J, Qiao JF, Chai TY, Liu Z, Wu ZW (2018) Modeling multiple components mechanical signals by means of virtual sample generation technique. Acta Automatica Sinica 44(9):1569–1589

    Google Scholar 

  21. Tang J, Xia H, Aljerf L, Wang DD, Ukaogo OP (2022) Prediction of dioxin emission from municipal solid waste incineration based on expansion, interpolation, and selection for small samples. J Environ Chem Eng 10:108314

    Google Scholar 

  22. Li CA, Lin LS (2014) Generating information for small data sets with a multi-modal distribution. Decis Support Syst 66:71–81

    Google Scholar 

  23. Bennin EK, Keung JW, Monden A (2019) On the relative value of data resampling approaches for software defect prediction. Empir Softw Eng 24(2):602–636

    Google Scholar 

  24. Xie YX, Qiu M, Zhang HB, Peng LZ, Chen ZX (2020) Gaussian distribution based oversampling for imbalanced data classification. IEEE Trans Knowl Data Eng 34(2):667–679

    Google Scholar 

  25. He YL, Hua Q, Zhu QX, Lu S (2022) Enhanced virtual sample generation based on manifold features: applications to developing soft sensor using small data. ISA Trans 126:398–406

    Google Scholar 

  26. Li DC, Wu SC, Tsai TI, Lina YS (2007) Using mega-trend-diffusion and artificial samples in small data set learning for early flexible manufacturing system scheduling knowledge. Comput Oper Res 34(4):966–982

    MATH  Google Scholar 

  27. Li DC, Chen CC, Chang CJ, Lin WK (2012) A tree-based-trend-diffusion prediction procedure for small sample sets in the early stages of manufacturing systems. Expert Syst Appl 39(1):1575–1581

    Google Scholar 

  28. Goodfellow IJ, Pouget-Abadie J, Mirza M et al (2014) Generative adversarial networks. Adv Neural Inf Process Syst 3:2672–2680

    Google Scholar 

  29. Lian J, Jia WK, Zareapoor M et al (2020) Deep-learning-based small surface defect detection via an exaggerated local variation-based generative adversarial network. IEEE Trans Ind Inf 16(2):1343–1351

    Google Scholar 

  30. Li YB, Zou WT, Jiang L (2022) Fault diagnosis of rotating machinery based on combination of Wasserstein generative adversarial networks and long short term memory fully convolutional network. Measurement 191:1–16

    Google Scholar 

  31. Tang J, Cui CL, Xia H, Wang DD, Qiao JF (2022) Dioxin emission risk warning model in MSWI process based on GAN with active learning mechanism. J Beijing Univ Technol 1–14 (Accepted)

  32. Tang J, Liu Z, Zhang J, Wu ZW, Chai TY, Yu W (2016) Kernel latent features adaptive extraction and selection method for multi-component non-stationary signal of industrial mechanical device. Neurocomputing 216:296–309

    Google Scholar 

  33. Wan YT, Ma AL, Zhong YF, Hu X, Zhang LP (2020) Multiobjective hyperspectral feature selection based on discrete sine cosine algorithm. IEEE Trans Geosci Remote Sens 58(5):3601–3618

    Google Scholar 

  34. Bao S, Zhang L, Han XS et al (2022) Feature selection method for nonintrusive load monitoring with balanced redundancy and relevancy. IEEE Trans Ind Appl 58(1):163–172

    Google Scholar 

  35. Xia H, Tang J, Aljerf L (2022) Dioxin emission prediction based on improved deep forest regression for municipal solid waste incineration process. Chemosphere 294:1–13

    Google Scholar 

  36. Zadeh LA (1973) Outline of a new approach to the analysis of complex systems and decision processes. IEEE Trans Syst Man Cybern 3(1):28–44

    MathSciNet  MATH  Google Scholar 

  37. Jang JSR (1993) ANFIS: adaptive-network-based fuzzy inference system. IEEE Trans Syst Man Cybern 23(3):665–685

    Google Scholar 

  38. Zhang RD, Tao JL (2018) A nonlinear fuzzy neural network modeling approach using an improved genetic algorithm. IEEE Trans Ind Electron 65(7):5882–5892

    Google Scholar 

  39. Wang GQ, Chen X, Li YX (2019) Fuzzy neural network analysis on gray cast iron with high tensile strength and thermal conductivity. China Foundry 16(3):190–197

    Google Scholar 

  40. Qiao JF, Quan LM, Yang CL (2020) Design of modeling error PDF based fuzzy neural network for effluent ammonia nitrogen prediction. Appl Soft Comput J 91:1–11

    Google Scholar 

  41. Juang CF, Chen YW (2022) Automatic hitting-duration estimation of a rechargeable impact wrench using a fuzzy neural network to reach target toques. Int J Fuzzy Syst. https://doi.org/10.1007/s40815-022-01387-9

    Article  Google Scholar 

  42. Adeleke O, Akinlabi S, Jen TC, Adedeji PA, Dunmade I (2022) Evolutionary-based neuro-fuzzy modelling of combustion enthalpy of municipal solid waste. Neural Comput Appl 34:7419–7436

    Google Scholar 

  43. Lee SC, Lee ET (1975) Fuzzy neural networks. Math Biosci 23(1–2):151–177

    MathSciNet  MATH  Google Scholar 

  44. Wan P, Sun DH, Zhao M, Huang S (2019) multistability for almost-periodic solutions of Takagi-Sugeno fuzzy neural networks with nonmonotonic discontinuous activation functions and time-varying delays. IEEE Trans Fuzzy Syst 29(2):400–414

    Google Scholar 

  45. Zhou ZH, Feng J (2019) Deep forest. Natl Sci Rev 6:74–86

    Google Scholar 

  46. Fan W, Wang H (2013) Is random model better? On its accuracy and efficiency. In: ICDM’03: proceedings of the third IEEE international conference on data mining, pp 51–58

  47. Tang J, Guo ZH, Qiao JF (2022) Dioxin emission concentration soft measurement based on multi-source latent feature selective ensemble modeling for municipal solid waste incineration process. Acta Automatica Sinica 48(01):223–238

    Google Scholar 

  48. Tang J, Wang DD, Guo ZH, Qiao JF (2021) Prediction of dioxin emission concentration in municipal solid waste incineration process based on optimal selection of virtual samples. J Beijing Univ Technol 47(05):431–443

    Google Scholar 

Download references

Acknowledgements

This work was financially supported by National Natural Science Foundation of China (62073006 and 62021003), Beijing Natural Science Foundation (4212032 and 4192009), National Key Research and Development Program of China (2021ZD0112301 and 2021ZD0112302).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jian Tang.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest. This article is considered for publication on the understanding that the article has neither been published nor will be published anywhere else before being published in Neural Computing and Applications.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cui, C., Tang, J., Xia, H. et al. Virtual sample generation method based on generative adversarial fuzzy neural network. Neural Comput & Applic 35, 6979–7001 (2023). https://doi.org/10.1007/s00521-022-08104-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-022-08104-5

Keywords

Navigation