Abstract
The use of gene expression information for disease prediction in the cultivation of rice offers a novel approach to improve agricultural output and health. In order to predict future epidemics, it involves examining how genes are expressed based on different illnesses. Early identification of illness by gene expression might be achieved even before signs appear, allowing for timely treatments to reduce crop loss and stop the propagation of the illness. However, because gene expression information is so complicated and requires advanced quantitative and bioinformatic knowledge, understanding it may be quite difficult. This paper leverages cutting-edge technological solutions by developing a system based on deep learning and machine learning for disease prediction in rice crops using gene expression data. The initial step involves gathering gene expression data from reputable sources to ensure a diverse and comprehensive dataset. Once collected, variable-length gene data are filled by padding to standardize data input formats. Following this, repetitions in gene data are identified using multi-similarity methods, ensuring the uniqueness and relevance of the data for analysis. The obtained data are further used for optimal gene selection using the enhanced red-tailed hawk algorithm (ERTH), which ensures superior performance. Finally, the prediction is performed using an innovative approach named adaptively optimized residual long short-term memory with multilayer perception (AO-RLSTM-MLP) that combines residual long short-term memory (RLSTM) and multilayer perception (MLP). During this phase, the parameters of the AO-RLSTM-MLP are optimally tuned using the same ERTH approach. Experimental analysis is conducted on this deep learning-based model to validate its effectiveness in disease prediction in rice crops using gene expression data. This analysis demonstrates the potential of this robust solution in the field, highlighting its capabilities to predict disease outbreaks effectively and contribute to sustainable agriculture practices.






Similar content being viewed by others
Data availability
No datasets were generated or analysed during the current study.
References
Jiang, J., Xing, F., Zeng, X., Zou, Q.: Investigating maize yield-related genes in multiple omics interaction network data. IEEE Trans. NanoBiosci. 19(1), 142–151 (2020)
Chang, Y.-J., Lai, M.-H., Wang, C.-H., Huang, Y.-S., Lin, J.: Target-aware yield prediction (TAYP) model used to improve agriculture crop productivity. IEEE Trans. Geosci. Remote Sens. 62, 1–11 (2024)
Sharma, R.P., Ramesh, D., Pal, P., Tripathi, S., Kumar, C.: IoT-Enabled IEEE 802.15.4 WSN monitoring infrastructure-driven fuzzy-logic-based crop pest prediction. IEEE Internet Things J. 9(4), 3037–3045 (2022)
Wen, H.-G., Zhao, J.-H., Zhang, B.-S., Gao, F., Xue-Ming, W., Yan, Y.-S., Zhang, J., Guo, H.-S.: Microbe-induced gene silencing boosts crop protection against soil-borne fungal pathogens. Nat. Plants 9, 1409–1418 (2023)
Ju, X., Lian, F., Ge, H., Jiang, Y., Zhang, Y., Xu, D.: Identification of rice varieties and adulteration using gas chromatography-ion mobility spectrometry. IEEE Access 9, 18222–18234 (2021)
Chen, W.-L., Lin, Y.-B., Ng, F.-L., Liu, C.-Y., Lin, Y.-W.: RiceTalk: rice blast detection using internet of things and artificial intelligence technologies. IEEE Internet Things J. 7(2), 1001–1010 (2020)
Zhang, H.: GERWR: identifying the key pathogenicity—associated sRNAs of magnaporthe oryzae infection in rice based on graph embedding and random walk with restart. IEEE/ACM Trans. Comput. Biol. Bioinform. 21(2), 227–239 (2024)
Islam, T., Kim, C.H., Iwata, H., Shimono, H., Kimura, A.: DeepCGP: a deep learning method to compress genome-wide polymorphisms for predicting phenotype of rice. IEEE/ACM Trans. Comput. Biol. Bioinform. 20(3), 2078–2088 (2023)
Gao, Y., Zhou, Q., Luo, J., Xia, C., Zhang, Y., Yue, Z.: Crop-GPA: an integrated platform of crop gene-phenotype associations. npj Syst. Biol. Appl. 10, 15 (2024)
Xiao, X., Lu, Y., Huang, X., Chen, T.: Temporal series crop classification study in rural china based on sentinel-1 SAR data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 14, 2769–2780 (2021)
Zhang, H., Xu, C., Wang, J.: Fertilizer strength prediction model based on shape characteristics. IEEE Access 9, 87007–87023 (2021)
Garg, P.: Environmental and soil parameters for germination of leaf spot disease in the groundnut plant using IoT-enabled sensor system. IEEE Sens. Lett. 7(12), 1–4 (2023)
Pham, D., Tan, X., Xu, J., Grice, L.F., Lam, P.Y., Raghubar, A., Vukovic, J., Ruitenberg, M.J. and Nguyen, Q.: stLearn: integrating spatial location, tissue morphology, and gene expression to find cell types, cell-cell interactions and spatial trajectories within undissociated tissues. BioRxiv, pp. 2020–05 (2020)
Delnevo, G., Girau, R., Ceccarini, C., Prandi, C.: A deep learning and social IoT approach for plants disease prediction toward a sustainable agriculture. IEEE Internet Things J. 9(10), 7243–7250 (2022)
Reis, A.C., Salis, H.M.: An automated model test system for systematic development and improvement of gene expression models. ACS Synth. Biol. 9(11), 3145–3156 (2020)
Liu, Z., Bashir, R.N., Iqbal, S., Shahid, M.M.A., Tausif, M., Umer, Q.: Internet of things (IoT) and machine learning model of plant disease prediction-blister blight for tea plant. IEEE Access 10, 44934–44944 (2022)
Patle, K.S., Saini, R., Kumar, A., Palaparthy, V.S.: Field evaluation of smart sensor system for plant disease prediction using LSTM network. IEEE Sens. J. 22(4), 3715–3725 (2022)
Zrimec, J., Börlin, C.S., Buric, F., Muhammad, A.S., Chen, R., Siewers, V., Verendel, V., Nielsen, J., Töpel, M., Zelezniak, A.: Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure. Nat. Commun. (2020). https://doi.org/10.1038/s41467-020-19921-4
Saini, R., Patle, K.S., Kumar, A., Surya, S.G., Palaparthy, V.S.: Attention-based multi-input multi-output neural network for plant disease prediction using multisensor system. IEEE Sens. J. 22(24), 24242–24252 (2022)
Zhao, Y., Cai, H., Zhang, Z., Tang, J., Li, Y.: Learning interpretable cellular and gene signature embeddings from single-cell transcriptomic data. Nat. Commun. (2021). https://doi.org/10.1038/s41467-021-25534-2
Menden, K., Marouf, M., Oller, S., Dalmia, A., Magruder, D.S., Kloiber, K., Heutink, P., Bonn, S.: Deep learning–based cell composition analysis from tissue expression profiles. Sci. Adv. (2020). https://doi.org/10.1126/sciadv.aba2619
Avsec, Ž, Agarwal, V., Visentin, D., Ledsam, J.R., Grabska-Barwinska, A., Taylor, K.R., Assael, Y., Jumper, J., Kohli, P., Kelley, D.R.: Effective gene expression prediction from sequence by integrating long-range interactions. Nat. Methods 18(10), 1196–1203 (2021)
Poirion, O.B., Jing, Z., Chaudhary, K., Huang, S., Garmire, L.X.: DeepProg: an ensemble of deep-learning and machine-learning models for prognosis prediction using multi-omics data. Genome Med. 13, 1–15 (2021)
Kumar, M., Kumar, A., Palaparthy, V.S.: Soil sensors-based prediction system for plant diseases using exploratory data analysis and machine learning. IEEE Sens. J. 21(16), 17455–17468 (2021)
Kumar, R., Khatri, A., Acharya, V.: Deep learning uncovers distinct behavior of rice network to pathogens response. Science (2022). https://doi.org/10.2139/ssrn.4014762
Bijoy, M.H., Hasan, N., Biswas, M., Mazumdar, S., Jimenez, A., Ahmed, F., Rasheduzzaman, M., Momen, S.: Towards sustainable agriculture: a novel approach for rice leaf disease detection using dCNN and enhanced dataset. IEEE Access 12, 34174–34191 (2024)
Chen, J., Chen, W., Zeb, A., Yang, S., Zhang, D.: Lightweight inception networks for the recognition and detection of rice plant diseases. IEEE Sens. J. 22(14), 14628–14638 (2022)
Mohanty, J.K., Jha, U.C., Dixit, G.P., Bharadwaj, C., Parida, S.K.: eQTL-seq: a rapid genome-wide integrative genetical genomics strategy to dissect complex regulatory architecture of gene expression underlying quantitative trait variation in crop plants. Plant Mol. Biol. Rep. 42, 218 (2023)
Patil, R.R., Kumar, S.: Rice transformer: a novel integrated management system for controlling rice diseases. IEEE Access 10, 87698–87714 (2022)
Joshi, P., Das, D., Udutalapally, V., Pradhan, M.K., Misra, S.: RiceBioS: identification of biotic stress in rice crops using edge-as-a-service. IEEE Sens. J. 22(5), 4616–4624 (2022)
Luo, D., Huguet-Tapia, J.C., Taylor Raborn, R., White, F.F., Brende, V.P., Yang, B.: The Xa7 resistance gene guards the rice susceptibility gene SWEET14 against exploitation by the bacterial blight pathogen. Plant Commun. 2(3), 100164 (2021)
Senthilkumar, T.P., Prabhusundhar, P.: Prediction of rice disease using modified feature weighted fuzzy clustering (MFWFC) based segmentation and hybrid classification model. Int. J. Syst. Assur. Eng. Manag. (2023). https://doi.org/10.1007/s13198-022-01835-7
Chen, S., Wang, C., Yang, J., Chen, B., Wang, W., Jing, S., Feng, A., Zeng, L., Zhu, X.: Identification of the novel bacterial blight resistance gene Xa46(t) by mapping and expression analysis of the rice mutant H120. Sci. Rep. 10, 12642 (2020)
Chen, X., Liu, P., Mei, L., He, X., Chen, L., Liu, H., Shen, S., Ji, Z., Zheng, X., Zhang, Y., Gao, Z., Zeng, D., Qian, Q., Ma, B.: Xa7, a new executor R gene that confers durable and broad-spectrum resistance to bacterial blight disease in rice. Plant Commun. 2(3), 10014 (2021)
Yang, X., Xin, Gu., Ding, J., Yao, L., Gao, X., Zhang, M., Meng, Q., Wei, S.: Gene expression analysis of resistant and susceptible rice cultivars to sheath blight after inoculation with Rhizoctonia solani. BMC Genomics 23, 278 (2022)
Patil, R.R., Kumar, S.: Rice-fusion: a multimodality data fusion framework for rice disease diagnosis. IEEE Access 10, 5207–5222 (2022)
Ferahtia, S., Houari, A., Rezk, H., Djerioui, A., Machmoum, M., Motahhir, S., Ait-Ahmed, M.: Red-tailed hawk algorithm for numerical optimization and real-world problems. Sci. Rep. 13, 12950 (2023)
Liu, F., Lu, Y., Cai, M.: A hybrid method with adaptive sub-series clustering and attention-based stacked residual LSTMs for multivariate time series forecasting. IEEE Access 8, 62423–62438 (2020)
Yan, H., Jiang, Y., Zheng, J., Peng, C., Li, Q.: A multilayer perceptron-based medical decision support system for heart disease diagnosis. Expert Syst. Appl. 30(2), 272–281 (2006)
Bi, L., Hu, G., Raza, M.M., Kandel, Y., Leandro, L., Mueller, D.: A gated recurrent units (GRU)-based model for early detection of soybean sudden death syndrome through time-series satellite imagery. Remote Sens. 12, 3621 (2020)
Song, J., Zhu, A., Tu, Y., Huang, H., Arif, M.A., Shen, Z., Cao, G.: Effects of different feature parameters of sEMG on human motion pattern recognition using multilayer perceptrons and LSTM neural networks. Appl. Sci. 10, 3358 (2020)
Pan, J.-S., Zhang, L.-G., Wang, R.-B., Snášel, V., Chu, S.-C.: Gannet optimization algorithm: a new metaheuristic algorithm for solving engineering optimization problems. Math. Comput. Simul 202, 343–373 (2022)
Montazeri, Z., Niknam, T., Aghaei, J., Malik, O.P., Dehghani, M., Dhiman, G.: Golf optimization algorithm: a new game-based metaheuristic algorithm and its application to energy commitment problem considering resilience. Biomimetics 8(5), 386 (2023)
Chahardoli, M., Eraghi, N.O., Nazari, S.: Namib beetle optimization algorithm: a new meta-heuristic method for feature selection and dimension reduction. Concurr. Comput. Pract. Exp. 34, 2022 (2022). https://doi.org/10.1002/cpe.6524
Acknowledgements
I would like to express my very great appreciation to the co-authors of this manuscript for their valuable and constructive suggestions during the planning and development of this research work.
Funding
This research did not receive any specific funding.
Author information
Authors and Affiliations
Contributions
Vijaya Lakshmi Adluri and Dr. Raju Bhukya designed the model, computational framework and carried out the implementation. Vijaya Lakshmi Adluri performed the calculations and wrote the manuscript with all the inputs. Vijaya Lakshmi Adluri and Dr. Raju Bhukya discussed the results and contributed to the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Informed consent
Not Applicable.
Ethical approval
Not Applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Adluri, V.L., Raju Bhukya An intelligent framework of heuristic approach-aided optimal gene selection and residual LSTM with MLP for disease prediction in rice crop using gene expression data. SIViP 19, 307 (2025). https://doi.org/10.1007/s11760-025-03859-5
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11760-025-03859-5