Abstract
Fault diagnosis is an important yet challenging task. Because of the powerful feature representation capabilities of deep model, intelligent fault diagnosis on deep learning becomes a research hotspot in the field. Although many deep models as sparse autoencoder, deep belief network is developed for fault diagnosis with encouraging performance, integrating the merits of deep learning into fault diagnosis still has a long way to go. In this paper, we propose a novel method, namely generalized transformer. Compared to previous deep models, generalized transformer excavates relations among inputs and nonlinearity between inputs and outputs by attention mechanism. To deal with structured data, generalized transformer further borrows the idea from graph attention network. By replacing dot product between query and key information in transformer, we introduce a forward network with learned weight vector to compute the similarity. Through limiting the similarity calculations in a neighbor region, prior knowledge can be injected into generalized transformer. On Tennessee Eastman process dataset, our new model can produce high performance, which is better or competitive to state-of-the-art models. Extensive ablation studies validate the effectiveness of the proposed model.





Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Korbicz J, Koscielny JM, Kowalczuk Z, Cholewa W (2004) Fault diagnosis. Models, artificial intelligence, applications. Springer, Berlin
Ku W, Storer RH, Georgakis C (1995) Disturbance detection and isolation by dynamic principal component analysis. Chemom Intell Lab Syst 30(1):179–196
Mansouri M, Nounou M, Nounou H, Karim N (2016) Kernel pca-based glrt for nonlinear fault detection of chemical processes. J Loss Prev Process Indust 40((Supplement C)):334–347
Yin S, Ding SX, Xie X, Luo H (2014) A review on basic data-driven approaches for industrial process monitoring. IEEE Trans Industr Electron 61(11):6418–6428
Chen Z, Li H (2017) Multisensor feature fusion for bearing fault diagnosis using sparse autoencoder and deep belief network. IEEE Trans Instrum Meas 66(7):1693–1702
Wu H, Zhao J (2018) Deep convolutional neural network model based chemical process fault diagnosis. Comput Chem Eng 115(12):185–197
Wang Y, Pan Z, Yuan X, Yang C, Gui W (2020) A novel deep learning based fault diagnosis approach for chemical process with extended deep belief network. ISA Trans 96:457–467
Li X, Zhang W, Ding Q (2019) Understanding and improving deep learning-based rolling bearing fault diagnosis with attention mechanism. Signal Process 161:136–154
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: CVPR, pp 770–778
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: NIPS, pp 5998–6008
Veličković P, Cucurull G, Casanova A, Romero A, Lió P, Bengio Y (2018) Graph attention networks. In ICLR, pages 1–12
Turner CR, Fuggetta A, Lavazza L, Wolf AL (1999) A conceptual basis for feature engineering. J Syst Softw 49(1):3–15
Bengio Y, Courville A, Vincent P (2013) Representation learning: A review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35(8):1798–1828
Sun Q, Ge Z (2021) A survey on deep learning for data-driven soft sensors. IEEE Trans Indust Inform 17(9):5853–5866
Chiang LH, Russell EL, Braatz RD (2000) Fault diagnosis in chemical processes using fisher discriminant analysis, discriminant partial least squares, and principal component analysis. Chemom Intell Lab Syst 50:243–252
Yin S, Ding SX, Haghani A, Hao H, Zhang P (2012) A comparison study of basic data-driven fault diagnosis and process monitoring methods on the benchmark tennessee eastman process. J Process Control 22:1567–1581
Tax DM, Duin RP (1999) Support vector domain description. Pattern Recognit Lett 20:1191–1199
Ge Z, Zhang M, Song Z (2010) Nonlinear process monitoring based on linear subspace and bayesian inference. J Process Control 20(5):676–688
Zhang J, Chen H, Chen S, Hong X (2017) An improved mixture of probabilistic pca for nonlinear data-driven process monitoring. IEEE Trans Cybern (Spec) 49:198–210
Wang Y, Sun F, Li B (2018) Multiscale neighborhood normalization-based multiple dynamic pca monitoring method for batch processes with frequent operations. IEEE Trans Autom Sci Eng 17(9):1053–1064
Gao X, Hou J (2016) An improved svm integrated gs-pca fault diagnosis approach of tennessee eastman process. Neurocomputing 174:906–911
You D, Gao X, Katayama S (2015) Wpd-pca-based laser welding process monitoring and defects diagnosis by using fnn and svm. IEEE Trans Industr Electron 62(1):628–636
Deng X, Zhang Z (2020) Nonlinear chemical process fault diagnosis using ensemble deep support vector data description. Sensors 20(4599):1–19
Guo C, Hu W, Yang F, Huang D (2020) Deep learning technique for process fault detection and diagnosis in the presence of incomplete data. Chin J Chem Eng 28(9):2358–2367
Zhao H, Sun S, Jin B (2018) Sequential fault diagnosis based on lstm neural network. IEEE Access 6:12929–12939
Zhang Z, Zhao J (2017) A deep belief network based fault diagnosis model for complex chemical processes. Comput Chem Eng 107:395–407
Cheng F, He QP, Zhao J (2019) A novel process monitoring approach based on variational recurrent autoencoder. Comput Chem Eng 129:106515106515
Wang Z, Su Y, Shen W, Jin S, Clark JH, Ren J, Zhang X (2019) Predictive deep learning models for environmental properties: the direct calculation of octanol water partition coefficients from molecular graphs. Green Chem 16:4555–4565
Bahdanau D, Cho K, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In: ICLR
Gehring J, Auli M, Grangier D, Dauphin Y. N (2017) A convolutional encoder model for neural machine translation. In: ACL
Downs JJ, Vogel EF (1993) A plant-wide industrial process control problem. Comput Chem Eng 17(3):245–255
Kingma D. P, Ba J (2014) Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980
Zhang L, Du Y, Li X, Zhen X (2020) Calibrated multivariate regression networks. IEEE Trans Circuits Syst Video Technol 30(11):4222–4231
Author information
Authors and Affiliations
Contributions
ZP and ZS were involved in conceptualization. LZ contributed to writing and experiments. ZP and QZ contributed to methodology and supervision. Thanks for the support from National Natural Science Foundation of China (61933013), Project of Educational Commission of Guangdong province of China (2018KCXTD019), and Natural Science Foundation of Guangdong Province of China (2021A1515011846).
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhang, L., Song, Z., Zhang, Q. et al. Generalized transformer in fault diagnosis of Tennessee Eastman process. Neural Comput & Applic 34, 8575–8585 (2022). https://doi.org/10.1007/s00521-021-06711-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-06711-2