Quality prediction of multi-stage batch process based on integrated ConvBiGRU with attention mechanism

Liu, Kai; Zhao, Xiaoqiang; Mou, Miao; Hui, Yongyong

doi:10.1007/s10489-024-06002-y

Quality prediction of multi-stage batch process based on integrated ConvBiGRU with attention mechanism

Published: 10 December 2024

Volume 55, article number 123, (2025)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Kai Liu^1,2,
Xiaoqiang Zhao ORCID: orcid.org/0000-0001-5687-942X^1,2,3,
Miao Mou^1,2 &
…
Yongyong Hui^1,2,3

123 Accesses
Explore all metrics

Abstract

It is important for quality prediction and monitoring to ensure the safe operation of the process. When constructing a prediction model, it is crucial to choose appropriate input variables to influence the online prediction performance and quality monitoring. Data-driven techniques have been widely used for prediction and monitoring of quality variables, but there are some difficulties in the application of batch processes, three-dimensional characteristics of data, different initial conditions, and multi-stage characteristics within batches. Therefore, we propose a quality prediction model of multi-stage batch process based on integrated ConvBiGRU with attention mechanism (MI-ConvBiGRU-AM). Firstly, Firstly, the original 3D data are expanded into 2D time slices by the batch-variable expansion method. Secondly, the 2D time slices are clustered to complete stage identification using the improved affine propagation clustering method based on the design of the Markov chain similarity matrix. At each stage, we select product quality-related modeling variables using the Maximum Relevance Minimum Redundancy (mRMR). Then, the selected variables are used to train a convolutional bi-directional gated recurrent unit with an attention mechanism (ConvBiGRU-AM). Finally, ConvBiGRU-AM model for each stage is integrated together a whole prediction model for the entire process to accomplish quality prediction, and the prediction residuals are utilized for quality monitoring. The validity of the proposed method was verified by Industrial-scale fed-batch fermentation (IFBF) process and the Hot strip mill (HSM) process. For the IFBF process, the model achieved an FDR of 99.73%, FAR of 0.54%, MAE of 0.0043, RMSE of 0.0396, MAPE of 0.0121, and R² of 0.9971. For the HSM process, the results were an FDR of 99.95%, FAR of 0.25%, MAE of 0.0053, RMSE of 0.0111, MAPE of 0.1539, and R² of 0.9990. These results demonstrate that the proposed method significantly improves prediction accuracy and achieves better quality monitoring compared to existing methods, highlighting its effectiveness for industrial applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 11

Novel Multi-flow Multi-scale Convolutional Neural Network Developed for Quality Prediction of Batch Processes to Fuse Data With Different Sampling Frequencies

Article 26 April 2024

Attention-Based Convolutional Aggregation: An Efficient Model for Off-Gas Profile Forecasting and Dynamic Pre-Control of BOF Steelmaking

Article Open access 23 December 2024

Hybrid static-sensory data modeling for prediction tasks in basic oxygen furnace process

Article 11 November 2022

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Fujita H, Fournier-Viger P, Sasaki J, Ali M (2021) Advances in theory and applications of artificial intelligence. AI Mag 42(1):86–87
MATH Google Scholar
Chandrasekar A, Radhika T, Zhu Q (2022) Further results on input-to-state stability of stochastic Cohen–Grossberg BAM neural networks with probabilistic time-varying delays. Neural Process Lett 1–23
Radhika T, Chandrasekar A, Vijayakumar V, Zhu Q (2023) Analysis of Markovian jump stochastic Cohen-Grossberg BAM neural networks with time delays for exponential input-to-state stability. Neural Process Lett 55(8):11055–11072
Article MATH Google Scholar
Tamil Thendral M, Ganesh Babu TR, Chandrasekar A, Cao Y (2022) Synchronization of Markovian jump neural networks for sampled data control systems with additive delay components: analysis of image encryption technique, Mathematical methods in the applied sciences
Ji C, Ma F, Wang J, Sun W (2023) Profitability related industrial-scale batch processes monitoring via deep learning based soft sensor development. Comput Chem Eng 170:108125
Article Google Scholar
Peng C, ChunHao D (2022) Monitoring multi-domain batch process state based on fuzzy broad learning system. Expert Syst Appl 187:115851
Article MATH Google Scholar
Sansana J, Rendall R, Joswiak MN, Castillo I, Miller G, Chiang LH, Reis MS (2023) a functional data-driven approach to monitor and analyze equipment degradation in multiproduct batch processes. Process Safety Environ Protect
Zhang Y, Cao J, Zhao X, Hui Y (2023) Nonlinear multiphase batch process monitoring and quality prediction using multi-way concurrent locally weighted projection regression. Chemom Intell Lab Syst 240:104922
Article MATH Google Scholar
Yu Y (2012) Intelligent quality prediction using weighted least square support vector regression. Phys Procedia 24:1392–1399
Article MATH Google Scholar
Yuan X, Ge Z, Song Z (2014) Locally weighted kernel principal component regression model for soft sensing of nonlinear time-variant processes. Ind Eng Chem Res 53(35):13736–13749
Article MATH Google Scholar
Yu J (2012) Multiway Gaussian mixture model based adaptive kernel partial least squares regression method for soft sensor estimation and reliable quality prediction of nonlinear multiphase batch processes. Ind Eng Chem Res 51(40):13227–13237
Article MATH Google Scholar
Rong M, Shi H, Tan S (2019) Large-scale supervised process monitoring based on distributed modified principal component regression. Ind Eng Chem Res 58(39):18223–18240
Article Google Scholar
Gins G, Van Impe JF, Reis MS (2018) Finding the optimal time resolution for batch-end quality prediction: MRQP–A framework for multi-resolution quality prediction. Chemom Intell Lab Syst 172:150–158
Article MATH Google Scholar
Hinton GE, Osindero S, Teh Y-W (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554
Article MathSciNet MATH Google Scholar
Yamashita R, Nishio M, Do RKG, Togashi K (2018) Convolutional neural networks: an overview and application in radiology. Insights Imaging 9:611–629
Article Google Scholar
Jiang K, Han Q, Du X, Ni P (2021) A decentralized unsupervised structural condition diagnosis approach using deep auto-encoders. Computer-Aided Civil Infrastruct Eng 36(6):711–732
Article MATH Google Scholar
Yu Y, Si X, Hu C, Zhang J (2019) A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput 31(7):1235–1270
Article MathSciNet MATH Google Scholar
Zhao R, Wang D, Yan R, Mao K, Shen F, Wang J (2017) Machine health monitoring using local feature-based gated recurrent unit networks. IEEE Trans Industr Electron 65(2):1539–1548
Article MATH Google Scholar
Yao L, Ge Z (2023) Causal variable selection for industrial process quality prediction via attention-based GRU network. Eng Appl Artif Intell 118:105658
Article MATH Google Scholar
Ma L, Wang M, Peng K (2022) A novel bidirectional gated recurrent unit-based soft sensor modeling framework for quality prediction in manufacturing processes. IEEE Sens J 22(19):18610–18619
Article MATH Google Scholar
Li J, Yang C, Li Y, Xie S (2021) A context-aware enhanced GRU network with feature-temporal attention for prediction of silicon content in hot metal. IEEE Trans Industr Inf 18(10):6631–6641
Article MATH Google Scholar
Sun K, Liu J, Kang J-L, Jang S-S, Wong DS-H, Chen D-S (2014) Development of a variable selection method for soft sensor using artificial neural network and nonnegative garrote. J Process Control 24(7):1068–1075
Article MATH Google Scholar
Fujiwara K, Kano M (2015) Efficient input variable selection for soft-senor design based on nearest correlation spectral clustering and group Lasso. ISA Trans 58:367–379
Article MATH Google Scholar
Yao L, Ge Z (2018) Variable selection for nonlinear soft sensor development with enhanced binary differential evolution algorithm. Control Eng Practice 72:68–82
Article MATH Google Scholar
Zhao C (2014) Concurrent phase partition and between-mode statistical analysis for multimode and multiphase batch process monitoring. AIChE J 60(2):559–573
Article MATH Google Scholar
Luo L, Bao S, Mao J, Tang D (2016) Phase partition and phase-based process monitoring methods for multiphase batch processes with uneven durations. Ind Eng Chem Res 55(7):2035–2048
Article MATH Google Scholar
Peng K, Li Q, Zhang K, Dong J (2016) Quality-related process monitoring for dynamic non-Gaussian batch process with multi-phase using a new data-driven method. Neurocomputing 214:317–328
Article MATH Google Scholar
Liu J, Liu T, Chen J (2018) Sequential local-based Gaussian mixture model for monitoring multiphase batch processes. Chem Eng Sci 181:101–113
Article MATH Google Scholar
Peng C, Lu R, Kang O, Kai W (2020) Batch process fault detection for multi-stage broad learning system. Neural Netw 129:298–312
Article MATH Google Scholar
Zhao X, Liu K, Hui Y (2023) Fault monitoring of batch process based on multi-stage optimization regularized neighborhood preserving embedding algorithm. Trans Inst Meas Control 45(1):89–103
Article MathSciNet MATH Google Scholar
Frey BJ, Dueck D (2007) Clustering by passing messages between data points. Science 315(5814):972–976
Lerm S, Saeedi A, Rahm E (2021) Extended affinity propagation clustering for multi-source entity resolution
Wei Z, He D, Jin Z, Liu B, Shan S, Chen Y, Miao J (2023) Density-based affinity propagation tensor clustering for intelligent fault diagnosis of train bogie bearing. IEEE Trans Intell Transp Syst 24(6):6053–6064
Article MATH Google Scholar
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Article MATH Google Scholar
Cho K, Van Merriënboer B, Bahdanau D, Bengio Y (2014) On the properties of neural machine translation: encoder-decoder approaches, arXiv preprint arXiv:1409.1259
Zhang X, Tang L, Chen J (2021) Fault diagnosis for electro-mechanical actuators based on STL-HSTA-GRU and SM. IEEE Trans Instrum Meas 70:1–16
Article MATH Google Scholar
Xia M, Shao H, Ma X, De Silva CW (2021) A stacked GRU-RNN-based approach for predicting renewable energy and electricity load for smart grid operation. IEEE Trans Industr Inf 17(10):7050–7059
Article MATH Google Scholar
Zhao H (2018) Dynamic graph embedding for fault detection. Comput Chem Eng 117:359–371
Article MATH Google Scholar
Gu X, Guo J, Xiao L, Li C (2022) Conditional mutual information-based feature selection algorithm for maximal relevance minimal redundancy. Appl Intell 52(2):1436–1447
Article MATH Google Scholar
Niu Z, Zhong G, Yu H (2021) A review on the attention mechanism of deep learning. Neurocomputing 452:48–62
Article MATH Google Scholar
Goldrick S, Ştefan A, Lovett D, Montague G, Lennox B (2015) The development of an industrial-scale fed-batch fermentation simulation. J Biotechnol 193:70–82
Article MATH Google Scholar
Ding SX, Yin S, Peng K, Hao H, Shen B (2012) A novel scheme for key performance indicator prediction and diagnosis with application to an industrial hot strip mill. IEEE Trans Industr Inf 9(4):2239–2247
Article MATH Google Scholar
Mears L, Stocks SM, Sin G, Gernaey KV (2017) A review of control strategies for manipulating the feed rate in fed-batch fermentation processes. J Biotechnol 245:34–46
Article Google Scholar
Nadal-Rey G, McClure DD, Kavanagh JM, Cassells B, Cornelissen S, Fletcher DF, Gernaey KV (2021) Development of dynamic compartment models for industrial aerobic fed-batch fermentation processes. Chem Eng J 420:130402
Article Google Scholar
Mourchid Y, Slama R (2023) D-STGCNT: a dense spatio-temporal graph Conv-GRU Network based on transformer for assessment of patient physical rehabilitation. Comput Biol Med 165:107420
Article MATH Google Scholar

Download references

Acknowledgements

This research work has been awarded by the National Natural Science Foundation of China (62263021, 62163023), Industrial Support Project of Education Department of Gansu Province (2023CYZC-24), the Open Fund project of Gansu Provincial Key Laboratory of Advanced Control for Industrial Process (2022KX07).

Author information

Authors and Affiliations

College of Electrical and Information Engineering, Lanzhou University of Technology, Lanzhou, China
Kai Liu, Xiaoqiang Zhao, Miao Mou & Yongyong Hui
Gansu Key Laboratory of Advanced Control for Industrial Processes, Lanzhou, China
Kai Liu, Xiaoqiang Zhao, Miao Mou & Yongyong Hui
National Experimental Teaching Centre of Electrical and Control Engineering, Lanzhou University of Technology, Lanzhou, China
Xiaoqiang Zhao & Yongyong Hui

Authors

Kai Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoqiang Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Miao Mou
View author publications
You can also search for this author in PubMed Google Scholar
Yongyong Hui
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaoqiang Zhao.

Ethics declarations

Conflict of interest

There is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Liu, K., Zhao, X., Mou, M. et al. Quality prediction of multi-stage batch process based on integrated ConvBiGRU with attention mechanism. Appl Intell 55, 123 (2025). https://doi.org/10.1007/s10489-024-06002-y

Download citation

Accepted: 30 September 2024
Published: 10 December 2024
DOI: https://doi.org/10.1007/s10489-024-06002-y

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Quality prediction of multi-stage batch process based on integrated ConvBiGRU with attention mechanism

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Novel Multi-flow Multi-scale Convolutional Neural Network Developed for Quality Prediction of Batch Processes to Fuse Data With Different Sampling Frequencies

Attention-Based Convolutional Aggregation: An Efficient Model for Off-Gas Profile Forecasting and Dynamic Pre-Control of BOF Steelmaking

Hybrid static-sensory data modeling for prediction tasks in basic oxygen furnace process

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Quality prediction of multi-stage batch process based on integrated ConvBiGRU with attention mechanism

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Novel Multi-flow Multi-scale Convolutional Neural Network Developed for Quality Prediction of Batch Processes to Fuse Data With Different Sampling Frequencies

Attention-Based Convolutional Aggregation: An Efficient Model for Off-Gas Profile Forecasting and Dynamic Pre-Control of BOF Steelmaking

Hybrid static-sensory data modeling for prediction tasks in basic oxygen furnace process

Explore related subjects

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation