Abstract
Accurate localization of proteins from fluorescence microscopy images is challenging due to the inter-class similarities and intra-class disparities introducing grave concerns in addressing multi-class classification problems. Conventional machine learning-based image prediction pipelines rely heavily on pre-processing such as normalization and segmentation followed by handcrafted feature extraction to identify useful, informative, and application-specific features. Here, we demonstrate that deep learning-based pipelines can effectively classify protein images from different datasets. We propose an end-to-end Protein Localization Convolutional Neural Network (PLCNN) that classifies protein images more accurately and reliably. PLCNN processes raw imagery without involving any pre-processing steps and produces outputs without any customization or parameter adjustment for a particular dataset. Experimental analysis is performed on five benchmark datasets. PLCNN consistently outperformed the existing state-of-the-art approaches from traditional machine learning and deep architectures. This study highlights the importance of deep learning for the analysis of fluorescence microscopy protein imagery. The proposed deep pipeline can better guide drug designing procedures in the pharmaceutical industry and open new avenues for researchers in computational biology and bioinformatics.
Similar content being viewed by others
Availability of data and material
Publicly available datasets are used in this study.
Notes
Code available at https://github.com/saeed-anwar/PLCNN.
References
Yang F, Xu YY, Wang ST, Shen HB (2014) Image-based classification of protein subcellular location patterns in human reproductive tissue by ensemble learning global and local features. Neurocomputing 131:113–123
Chong YT, Koh JL, Friesen H, Duffy SK, Cox MJ, Moses A, Moffat J, Boone C, Andrews BJ (2015) Yeast proteome dynamics from single cell imaging and automated analysis. Cell 161(6):1413–1424
Pärnamaa T, Parts L (2017) Accurate classification of protein subcellular localization from high-throughput microscopy images using deep learning. G3 Genes Genomes Genet 7(5):1385–1392
Shao W, Ding Y, Shen HB, Zhang D (2017) Deep model-based feature extraction for predicting protein subcellular localizations from bio-images. Front Comput Sci 11(2):243–252
Glory E, Murphy RF (2007) Automated subcellular location determination and high-throughput microscopy. Dev Cell 12(1):7–16
Xu YY, Yao LX, Shen HB (2018) Bioimage-based protein subcellular location prediction: a comprehensive review. Front Comput Sci 12(1):26–39
Tahir M, Khan A, Majid A (2011) Protein subcellular localization of fluorescence imagery using spatial and transform domain features. Bioinformatics 28(1):91–97
Itzhak DN, Tyanova S, Cox J, Borner GH (2016) Global, quantitative and dynamic mapping of protein subcellular localization. Elife 5:e16950
Xiang S, Liang Q, Hu Y, Tang P, Coppola G, Zhang D, Sun W (2019) Amc-net: asymmetric and multi-scale convolutional neural network for multi-label hpa classification. Comput Methods Programs Biomed 178:275–287
Kreft M, Milisav I, Potokar M, Zorec R (2004) Automated high through-put colocalization analysis of multichannel confocal images. Comput Methods Programs Biomed 74(1):63–67
Tahir M, Khan A (2016) Protein subcellular localization of fluorescence microscopy images: employing new statistical and texton based image features and svm based ensemble classification. Inf Sci 345:65–80
Boland MV, Markey MK, Murphy RF (1998) Automated recognition of patterns characteristic of subcellular structures in fluorescence microscopy images. Cytom J Int Soc Anal Cytol 33(3):366–375
Murphy RF, Boland MV, Velliste M et al (2000) Towards a systematics for protein subcellular location: quantitative description of protein localization patterns and automated analysis of fluorescence microscope images. In: ISMB, pp 251–259
Huang K, Murphy RF (2004) Boosting accuracy of automated classification of fluorescence microscope images for location proteomics. BMC Bioinform 5(1):78
Hamilton NA, Pantelic RS, Hanson K, Teasdale RD (2007) Fast automated cell phenotype image classification. BMC Bioinform 8(1):110
Chebira A, Barbotin Y, Jackson C, Merryman T, Srinivasa G, Murphy RF, Kovačević J (2007) A multiresolution approach to automated classification of protein subcellular location images. BMC Bioinform 8(1):210
Nanni L, Lumini A (2008) A reliable method for cell phenotype image classification. Artif Intell Med 43(2):87–97
Nanni L, Lumini A, Brahnam S (2010) Local binary patterns variants as texture descriptors for medical image analysis. Artif Intell Med 49(2):117–125
Li C, Wang Xh, Zheng L, Huang Jf (2013) Automated protein subcellular localization based on local invariant features. Protein J 32(3):230–237
Zhang B, Pham TD (2011) Phenotype recognition with combined features and random subspace classifier ensemble. BMC Bioinform 12(1):128
Tahir M, Jan B, Hayat M, Shah SU, Amin M (2018) Efficient computational model for classification of protein localization images using extended threshold adjacency statistics and support vector machines. Comput Methods Programs Biomed 157:205–215
Nanni L, Ghidoni S, Brahnam S (2017) Handcrafted vs. non-handcrafted features for computer vision classification. Pattern Recogn 71:158–172
Xiao M, Shen X, Pan W (2019) Application of deep convolutional neural networks in classification of protein subcellular localization with microscopy images. Genet Epidemiol 43(3):330–341
Kraus OZ, Grys BT, Ba J, Chong Y, Frey BJ, Boone C, Andrews BJ (2017) Automated analysis of high-content microscopy data with deep learning. Mol Syst Biol 13(4):924
Le NQK, Huynh TT, Yapp EKY, Yeh HY (2019) Identification of clathrin proteins by incorporating hyperparameter optimization in deep learning and pssm profiles. Comput Methods Programs Biomed 177:81–88
Zhang L, Yu G, Xia D, Wang J (2019) Protein-protein interactions prediction based on ensemble deep neural networks. Neurocomputing 324:10–19
Guo Y, Shang X, Li Z (2019) Identification of cancer subtypes by integrating multiple types of transcriptomics data with deep learning in breast cancer. Neurocomputing 324:20–30
Gu J, Wang Z, Kuen J, Ma L, Shahroudy A, Shuai B, Liu T, Wang X, Wang G, Cai J et al (2018) Recent advances in convolutional neural networks. Pattern Recogn 77:354–377
Dürr O, Sick B (2016) Single-cell phenotype classification using deep convolutional neural networks. J Biomol Screen 21(9):998–1003
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B (Methodol) 58(1):267–288
Godinez WJ, Hossain I, Lazic SE, Davies JW, Zhang X (2017) A multi-scale convolutional neural network for phenotyping high-content cellular images. Bioinformatics 33(13):2010–2019
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: Proceedings of the international conference on learning representations
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Szegedy C, Ioffe S, Vanhoucke V, Alemi AA (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. In: AAAI, vol 4, p 12
Sabour S, Frosst N, Hinton GE (2017) Dynamic routing between capsules. In: NIPS, pp 3856–3866
Lao Q, Fevens T (2019) Cell phenotype classification using deep residual network and its variants. Int J Pattern Recognit Artif Intell 33(11):1940017
Zagoruyko S, Komodakis N (2016) Wide residual networks. In: Proceedings of the British machine vision conference (BMVC). BMVA Press, pp 1–12
Xie S, Girshick R, Dollár P, Tu Z, He K (2017) Aggregated residual transformations for deep neural networks. In: CVPR. IEEE, pp 5987–5995
Han D, Kim J, Kim J (2017) Deep pyramidal residual networks. In: CVPR. IEEE, pp 6307–6315
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2016) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708
Anwar S, Barnes N (2020) Densely residual Laplacian super-resolution. IEEE Trans Pattern Anal Mach Intell
Boland MV, Murphy RF (2001) A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of hela cells. Bioinformatics 17(12):1213–1223
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: CVPR, pp 1–9
Ruder S (2016) An overview of gradient descent optimization algorithms. arXiv preprint arXiv:1609.04747
Ljosa V, Sokolnicki KL, Carpenter AE (2012) Annotated high-throughput microscopy image sets for validation. Nat Methods 9(7):637–637
Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D (2017) Grad-cam: visual explanations from deep networks via gradient-based localization. In: ICCV, pp 618–626
Nanni L, Brahnam S, Alessandra L (2010) Selecting the best performing rotation invariant patterns in local binary/ternary patterns. In: International conference on IP, computer vision, and pattern recognition, pp 369–375
Lin CC, Tsai YS, Lin YS, Chiu TY, Hsiung CC, Lee MI, Simpson JC, Hsu CN (2007) Boosting multiclass learning with repeating codes and weak detectors for protein subcellular localization. Bioinformatics 23(24):3374–3381
Nanni L, Brahnam S, Ghidoni S, Lumini A (2018) Bioimage classification with handcrafted and learned features. IEEE/ACM Trans Comput Biol Bioinf 16(3):874–885
Nanni L, Brahnam S, Ghidoni S, Maguolo G (2019) General purpose (genp) bioimage ensemble of handcrafted and learned features with data augmentation. arXiv preprint arXiv:1904.08084
Anwar S, Barnes N (October 2019) Real image denoising with feature attention. In: ICCV
Funding
This research did not receive any specific funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethics approval
Since no experiments are performed on humans or animals (dead or alive) in this research, therefore, Ethical approval is not required.
Code availability
Code available at https://github.com/saeed-anwar/PLCNN
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tahir, M., Anwar, S., Mian, A. et al. Deep localization of subcellular protein structures from fluorescence microscopy images. Neural Comput & Applic 34, 5701–5714 (2022). https://doi.org/10.1007/s00521-021-06715-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-06715-y