Abstract
Acquired immunodeficiency syndrome (AIDS) continues to be a significant cause of mortality, disability, and economic repercussions, especially in underdeveloped countries. Extensive research has been conducted to develop effective therapies for human immunodeficiency virus (HIV) infection, including the prediction of HIV-1 protease cleavage sites. Accurate prediction of these sites can expedite the discovery of new HIV-1 protease inhibitors. Motivated by this, we propose a novel approach for HIV-1 protease cleavage site prediction using numerical descriptors based on octapeptide sequences. Our method incorporates multi-view feature extraction, combining sequence order effects of amino acids with physicochemical features. To capture important information, we utilize a convolutional neural network for feature extraction. For the classification task, we employ a fuzzy rank-based ensemble method, utilizing Random Forest, Logistic Regression, and Support Vector Machine as base classifiers. The ensemble combines their predictions to make the final prediction. Experimental evaluation on benchmark datasets demonstrates the effectiveness of our approach, achieving average Accuracy, AUC, Precision, Recall, and F-measure of 0.93, 0.95, 0.85, 0.76, and 0.80, respectively. Comparisons with existing studies confirm the potential of our proposed technique. The source code can be downloaded from Github: https://github.com/SusmitaPalmal/RC_CNN_HIV.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arya, N., Saha, S.: Multi-modal classification for human breast cancer prognosis prediction: proposal of deep-learning based stacked ensemble model. IEEE/ACM Trans. Comput. Biol. Bioinf. 19(2), 1032–1041 (2020)
Brik, A., Wong, C.H.: HIV-1 protease: mechanism and drug discovery. Organ. Biomolecul. Chem. 1(1), 5–14 (2003)
Gök, M., Özcerit, A.T.: A new feature encoding scheme for HIV-1 protease cleavage site prediction. Neural Comput. Appl. 22, 1757–1761 (2013)
Hu, L., Hu, P., Luo, X., Yuan, X., You, Z.H.: Incorporating the coevolving information of substrates in predicting HIV-1 protease cleavage sites. IEEE/ACM Trans. Comput. Biol. Bioinf. 17(6), 2017–2028 (2019)
Hu, L., Li, Z., Tang, Z., Zhao, C., Zhou, X., Hu, P.: Effectively predicting HIV-1 protease cleavage sites by using an ensemble learning approach. BMC Bioinformatics 23(1), 447 (2022)
Kawashima, S., Ogata, H., Kanehisa, M.: Aaindex: amino acid index database. Nucleic Acids Res. 27(1), 368–369 (1999)
Li, F., et al.: Deepcleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites. Bioinformatics 36(4), 1057–1065 (2020)
Li, Z., Hu, L., Tang, Z., Zhao, C.: Predicting hiv-1 protease cleavage sites with positive-unlabeled learning. Front. Genet. 12, 658078 (2021)
Manna, A., Kundu, R., Kaplun, D., Sinitca, A., Sarkar, R.: A fuzzy rank-based ensemble of CNN models for classification of cervical cytology. Sci. Rep. 11(1), 14538 (2021)
Miller, R.J., Cairns, J.S., Bridges, S., Sarver, N.: Human immunodeficiency virus and aids: insights from animal lentiviruses. J. Virol. 74(16), 7187–7195 (2000)
Nanni, L.: Comparison among feature extraction methods for HIV-1 protease cleavage site prediction. Pattern Recogn. 39(4), 711–713 (2006)
Nanni, L., Lumini, A., Gupta, D., Garg, A.: Identifying bacterial virulent proteins by fusing a set of classifiers based on variants of chou’s pseudo amino acid composition and on evolutionary information. IEEE/ACM Trans. Comput. Biol. Bioinf. 9(2), 467–475 (2011)
Onah, E., et al.: Prediction of HIV-1 protease cleavage site from octapeptide sequence information using selected classifiers and hybrid descriptors. BMC Bioinformatics 23(1), 1–20 (2022)
Palmal, S., Arya, N., Saha, S., Tripathy, S.: A Multi-modal graph convolutional network for predicting human breast cancer prognosis. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) Neural Information Processing: 29th International Conference, ICONIP 2022, Virtual Event, 22–26 November 2022, Proceedings, Part VII, pp. 187–198. Springer, Singapore (2023). https://doi.org/10.1007/978-981-99-1648-1_16
Palmal, S., Arya, N., Saha, S., Tripathy, S.: Breast cancer survival prognosis using the graph convolutional network with choquet fuzzy integral. Sci. Rep. 13(1), 14757 (2023)
Palmal, S., Saha, S., Tripathy, S.: HIV-1 protease cleavage site prediction using stacked autoencoder with ensemble of classifiers. In: 2022 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2022)
Palmal, S., Saha, S., Tripathy, S.: Multi-objective optimization with majority voting ensemble of classifiers for prediction of HIV-1 protease cleavage site. Soft Comput. 1–11 (2023)
Rögnvaldsson, T., You, L., Garwicz, D.: State of the art prediction of HIV-1 protease cleavage sites. Bioinformatics 31(8), 1204–1210 (2015)
Shen, H.B., Chou, K.C.: HIVcleave: a web-server for predicting human immunodeficiency virus protease cleavage sites in proteins. Anal. Biochem. 375(2), 388–390 (2008)
Acknowledgment
Dr. Sriparna Saha gratefully acknowledges the Young Faculty Research Fellowship (YFRF) Award, supported by Visvesvaraya Ph.D. Scheme for Electronics and IT, Ministry of Electronics and Information Technology (sMeitY), Government of India, being implemented by Digital India Corporation (formerly Media Lab Asia) for carrying out this research.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Palmal, S., Saha, S., Tripathy, S. (2024). Integrating Multi-view Feature Extraction and Fuzzy Rank-Based Ensemble for Accurate HIV-1 Protease Cleavage Site Prediction. In: Luo, B., Cheng, L., Wu, ZG., Li, H., Li, C. (eds) Neural Information Processing. ICONIP 2023. Communications in Computer and Information Science, vol 1964. Springer, Singapore. https://doi.org/10.1007/978-981-99-8141-0_36
Download citation
DOI: https://doi.org/10.1007/978-981-99-8141-0_36
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8140-3
Online ISBN: 978-981-99-8141-0
eBook Packages: Computer ScienceComputer Science (R0)