Abstract
Heat load prediction is essential to discover blast furnace (BF) anomalies in time and take measures in advance to reduce erosion in the ironmaking process. However, owing to the redundancy of the high dimensional data and the multi-granularity features of the state monitoring data, the general prediction model is hard to accurately predict the heat load, especially the rapid change caused by physical and chemical reactions. Therefore, this paper puts forward an attention-based one-dimension convolution neural network (1DCNN) combined with a bidirectional long short-term memory (BiLSTM) network for heat load prediction. Firstly, the two-stage data pre-processing realizes dimension reduction and key variable selection. Secondly, fine-grained features are extracted by 1DCNN, and the BiLSTM extracts the coarse-grained features for prediction output. Moreover, an attention branch is added to the 1DCNN to extract the critical fine-grained features when the heat load changes rapidly. Finally, experiments are carried out with the actual industrial data from a BF ironmaking process. The efforts show that the proposed prediction model presents better performances in the result of different metrics and has higher accuracy than the traditional prediction algorithms.













Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Acheampong, F. A., Nunoo-Mensah, H., & Chen, W. (2021). Transformer models for text-based emotion detection: a review of BERT-based approaches. Artif Intell Rev, 54(8), 5789–5829. https://doi.org/10.1007/s10462-021-09958-2
Chen, X. H., Zhang, B. K., & Gao, D. (2020). Bearing fault diagnosis base on multi-scale CNN and LSTM model. J Intell Manuf. https://doi.org/10.1007/s10845-020-01600-2
Deng, W. H., & Wang, G. Y. (2017). A novel water quality data analysis framework based on time-series data mining. J Environ Manage, 196, 365–375. https://doi.org/10.1016/j.jenvman.2017.03.024
Dong Z, Xiang W, Xue X, Chen S, Wang X (2011) On-line identification of thermal process using a modified TS-type neuro-fuzzy system. In: 2011 2nd International Conference on Artificial Intelligence, Management Science and Electronic Commerce (AIMSEC)
Eren, L., Ince, T., & Kiranyaz, S. (2019). A generic intelligent bearing fault diagnosis system using compact adaptive 1D CNN classifier. J Signal Process Syst Signal Image Video Technol, 91(2), 179–189. https://doi.org/10.1007/s11265-018-1378-3
Hou, Z., Ma, K., Wang, Y., Yu, J., Ji, K., Chen, Z., & Abraham, A. (2021). Attention-based learning of self-media data for marketing intention detection. Eng Appl Artif Intell. https://doi.org/10.1016/j.engappai.2020.104118
Hu, R., Wen, S. P., Zeng, Z. G., & Huang, T. W. (2017). A short-term power load forecasting model based on the generalized regression neural network with decreasing step fruit fly optimization algorithm. Neurocomputing, 221, 24–31. https://doi.org/10.1016/j.neucom.2016.09.027
Hyndman, R. J., & Koehler, A. B. (2006). Another look at measures of forecast accuracy. Int J Forecast, 22(4), 679–688. https://doi.org/10.1016/j.ijforecast.2006.03.001
Kok, C., Jahmunah, V., Oh, S. L., Zhou, X., Gururajan, R., Tao, X., Cheong, K. H., Gururajan, R., Molinari, F., & Acharya, U. R. (2020). Automated prediction of sepsis using temporal convolutional network. Comput Biol Med, 127, 103957. https://doi.org/10.1016/j.compbiomed.2020.103957
Kurunov, I. F., Loginov, V. N., & Tikhonov, D. N. (2006). Methods of extending a blast-furnace campaign. Metallurgist, 50(11–12), 605–613. https://doi.org/10.1007/s11015-006-0131-5
Lecun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proc IEEE, 86(11), 2278–2324. https://doi.org/10.1109/5.726791
Li, H. Z., Guo, S., Li, C. J., & Sun, J. Q. (2013). A hybrid annual power load forecasting model based on generalized regression neural network with fruit fly optimization algorithm. Knowl-Based Syst, 37, 378–387. https://doi.org/10.1016/j.knosys.2012.08.015
Li, C. K., Hou, Y. H., Wang, P. C., & Li, W. Q. (2017). Joint distance maps based action recognition with convolutional neural networks. IEEE Signal Process Lett, 24(5), 624–628. https://doi.org/10.1109/Lsp.2017.2678539
Li, J., Hua, C., & Yang, Y. (2021). Output space transfer-based MIMO RVFLNs modeling for estimation of blast furnace molten iron quality with missing indexes. IEEE Trans Instrum Measure, 70, 1–10. https://doi.org/10.1109/tim.2021.3063200
Li, J., Hua, C., Yang, Y., & Guan, X. (2022). Data-driven Bayesian-based Takagi-Sugeno fuzzy modeling for dynamic prediction of hot metal silicon content in blast furnace. IEEE Trans Syst Man Cybern Syst, 52(2), 1087–1099. https://doi.org/10.1109/tsmc.2020.3013972
Liao, S., Zhu, Q., Qian, Y., & Lin, G. (2018). Multi-granularity feature selection on cost-sensitive data with measurement errors and variable costs. Knowl Based Syst, 158, 25–42. https://doi.org/10.1016/j.knosys.2018.05.020
Lv, Y. L., Ji, Q. H., Liu, Y., & Zhang, J. (2020). Data-driven sensitivity analysis of contact resistance to assembly errors for proton-exchange membrane fuel cells. Measure Control, 53(7–8), 1354–1363. https://doi.org/10.1177/0020294020926604
Lv, Y., Zhou, Q., Li, Y., & Li, W. (2021). A predictive maintenance system for multi-granularity faults based on AdaBelief-BP neural network and fuzzy decision making. Adv Eng Inform, 49. https://doi.org/10.1016/j.aei.2021.101318
Pan, Y. J., Yang, C. J., An, R. Q., & Sun, Y. X. (2018). Robust principal component pursuit for fault detection in a blast furnace process. Indus Eng Chem Res, 57(1), 283–291. https://doi.org/10.1021/acs.iecr.7b03338
Qin, W., Zha, D. Y., & Zhang, J. (2020). An effective approach for causal variables analysis in diesel engine production by using mutual information and network deconvolution. J Intell Manuf, 31(7), 1661–1671. https://doi.org/10.1007/s10845-018-1397-8
Ribeiro, G. T., Mariani, V. C., Coelho, L., & DS. (2019). Enhanced ensemble structures using wavelet neural networks applied to short-term load forecasting. Eng Appl Artif Intell, 82, 272–281. https://doi.org/10.1016/j.engappai.2019.03.012
Semenov, Y. S., Shumel’chik, E. I., Gorupakha, V. V., Nasledov, A. V., Kuznetsov, A. M., & Zubenko, A. V. (2017). Monitoring blast furnace lining condition during five years of operation. Metallurgist, 61(3–4), 291–297. https://doi.org/10.1007/s11015-017-0491-z
Taler, J., Zima, W., Oclon, P., Gradziel, S., Taler, D., Cebula, A., Jaremkiewicz, M., Korzen, A., Cisek, P., Kaczmarski, K., & Majewski, K. (2019). Mathematical model of a supercritical power boiler for simulating rapid changes in boiler thermal loading. Energy, 175, 580–592. https://doi.org/10.1016/j.energy.2019.03.085
Tian, H. X., Ren, D. X., Li, K., & Zhao, Z. (2020). An adaptive update model based on improved long short term memory for online prediction of vibration signal. J Intell Manuf. https://doi.org/10.1007/s10845-020-01556-3
Wang, J. L., Zhang, J., & Wang, X. X. (2018a). Bilateral LSTM: a two-dimensional long short-term memory model with multiply memory units for short-term cycle time forecasting in re-entrant manufacturing systems. IEEE Trans Indus Inform, 14(2), 748–758. https://doi.org/10.1109/Tii.2017.2754641
Wang, L., Yang, C. J., Sun, Y. X., Zhang, H. F., & Li, M. L. (2018b). Effective variable selection and moving window HMM-based approach for iron-making process monitoring. J Process Control, 68, 86–95. https://doi.org/10.1016/j.jprocont.2018.04.008
Wang, J. L., Yang, Z. L., Zhang, J., Zhang, Q. H., & Chien, W. T. K. (2019). AdaBalGAN: an improved generative adversarial network with imbalanced learning for wafer defective pattern recognition. IEEE Trans Semicond Manuf, 32(3), 310–319. https://doi.org/10.1109/Tsm.2019.2925361
Wang, J. L., Xu, C. Q., Yang, Z. L., Zhang, J., & Li, X. O. (2020). Deformable convolutional networks for efficient mixed-type wafer defect pattern recognition. IEEE Trans Semicond Manuf, 33(4), 587–596. https://doi.org/10.1109/Tsm.2020.3020985
Wang, X., Hu, T., & Tang, L. (2021). A multiobjective evolutionary nonlinear ensemble learning with evolutionary feature selection for silicon prediction in blast furnace. IEEE Trans Neural Netw Learn Syst. https://doi.org/10.1109/TNNLS.2021.3059784
Wang HW, Yang GK, Pan CC, Gong QS (2015) Prediction of hot metal silicon content in blast furnace based on EMD and DNN. In: 2015 34th Chinese Control Conference (Ccc), pp 8214–8218
Wang PC, Li WQ, Gao ZM, Zhang YY, Tang C, Ogunbona P (2017) Scene flow to action map: a new representation for RGB-D based action recognition with convolutional neural networks. In: 30th IEEE Conference on Computer Vision and Pattern Recognition (Cvpr 2017), pp 416–425. https://doi.org/10.1109/Cvpr.2017.52
Xie, L. (2017). The heat load prediction model based on BP neural network-markov model. Adv Inform Commun Technol, 107, 296–300. https://doi.org/10.1016/j.procs.2017.03.108
Xing, X., Rogers, H., Zhang, G. Q., Hockings, K., Zulli, P., Deev, A., Mathieson, J., & Ostrovski, O. (2017). Effect of charcoal addition on the properties of a coke subjected to simulated blast furnace conditions. Fuel Process Technol, 157, 42–51. https://doi.org/10.1016/j.fuproc.2016.11.009
Xu, L. J., Khan, J. A., & Chen, Z. H. (2000). Thermal load deviation model for superheater and reheater of a utility boiler. Appl Thermal Eng, 20(6), 545–558. https://doi.org/10.1016/S1359-4311(99)00049-6
Xu, H. W., Zhang, J., Lv, Y. L., & Zheng, P. (2020). Hybrid feature selection for wafer acceptance test parameters in semiconductor manufacturing. IEEE Access, 8, 17320–17330. https://doi.org/10.1109/Access.2020.2966520
Xu, H. W., Qin, W., Lv, Y. L., & Zhang, J. (2022). Data-driven adaptive virtual metrology for yield prediction in multi-batch wafers. IEEE Trans Indus Inform. https://doi.org/10.1109/TII.2022.3162268
Yukun L, Peng X, Zhao K (2011) Hybrid modeling optimization of thermal efficiency and NO_x emission of utility boiler. Proc CSEE. https://doi.org/10.13334/j.0258-8013.pcsee.2011.26.00
Zhang, J. J., Wang, P., Yan, R. Q., & Gao, R. X. (2018). Long short-term memory for machine remaining life prediction. J Manuf Syst, 48, 78–86. https://doi.org/10.1016/j.jmsy.2018.05.011
Zhang, F. L., Yan, J. X., Fu, P. L., Wang, J. J., & Gao, R. X. (2020). Ensemble sparse supervised model for bearing fault diagnosis in smart manufacturing. Robot Comput-Integr Manuf. https://doi.org/10.1016/j.rcim.2019.101920
Zhao, J. J., Peng, Y. X., & He, X. T. (2020). Attribute hierarchy based multi-task learning for fine-grained image classification. Neurocomputing, 395, 150–159. https://doi.org/10.1016/j.neucom.2018.02.109
Zhao H, Wang PH (2009) Modeling and optimization of efficiency and NOx emission at a coal-fired utility boiler. In: 2009 Asia-Pacific Power and Energy Engineering Conference (Appeec), vols 1–7, pp 2832–2835
Zhong WL, Jiang LF, Zhang T, Ji JS, Xiong HL (2018) A multi-part convolutional attention network for fine-grained image recognition. In: 2018 24th International Conference on Pattern Recognition (ICPR), pp 1857–1862
Zhou, P., Lv, Y. B., Wang, H., & Chai, T. Y. (2017). Data-driven robust RVFLNs modeling of a blast furnace iron-making process using cauchy distribution weighted M-estimation. IEEE Trans Indus Electr, 64(9), 7141–7151. https://doi.org/10.1109/Tie.2017.2686369
Zhou, P., Guo, D. W., & Chai, T. Y. (2018). Data-driven predictive control of molten iron quality in blast furnace ironmaking using multi-output LS-SVR based inverse system identification. Neurocomputing, 308, 101–110. https://doi.org/10.1016/j.neucom.2018.04.060
Acknowledgements
This work was supported by the National Key Research and Development Program of China (No. 2020YFB1711100, No. 2019YFB1704401). Furthermore, the authors wish to thank the Editor and anonymous referees for their constructive comments and recommendations, which have significantly improved the presentation of this paper.
Funding
This work was supported by the National Key Research and Development Program of China (No. 2020YFB1711100, No. 2019YFB1704401).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
No potential conflict of interest was reported by the author(s).
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xu, HW., Qin, W., Sun, YN. et al. Attention mechanism-based deep learning for heat load prediction in blast furnace ironmaking process. J Intell Manuf 35, 1207–1220 (2024). https://doi.org/10.1007/s10845-023-02106-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10845-023-02106-3