Abstract
Benefitting from the breakthrough development of microscopy imaging techniques, various bio-microscopic images have accumulated rapidly for the past decade. Using computer vision and machine learning methods, biological activities and molecular functions can be interpreted from these images, thus image analysis has become more and more important in current life science research. A prominent difficulty in biological image analysis is the lack of annotation, and the test set even contains data from unseen classes, i.e. the open-set issue. The image-based protein subcellular localization is a typical open-set problem. There are tens of subcellular compartments in cells, while the labeled data may only consist of proteins from several major organelles. Till now, the open-set problem has been rarely studied for biomedical image data. The main goal of this study is to train a few-shot learning model for the recognition of protein subcellular localization from immunofluorescence images. We conduct experiments on a data set collected from Human Protein Atlas (HPA) and the results show that the introduced system can provide accurate results even with a small handful of images for an unknown class in a multi-instance learning scenario.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Briesemeister, S., et al.: Yloc-an interpretable web server for predicting subcellular localization. Nucleic Acids Res. 38(suppl\(\_\)2), W497–W502 (2010)
Deng, J., Guo, J., Xue, N., Zafeiriou, S.: Arcface: additive angular margin loss for deep face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4690–4699 (2019)
Emanuelsson, O., Nielsen, H., Brunak, S., Von Heijne, G.: Predicting subcellular localization of proteins based on their n-terminal amino acid sequence. J. Mol. Biol. 300(4), 1005–1016 (2000)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Koch, G., et al.: Siamese neural networks for one-shot image recognition. In: ICML Deep Learning Workshop, vol. 2. Lille (2015)
Kumar, A., et al.: Automated analysis of immunohistochemistry images identifies candidate location biomarkers for cancers. Proc. Natl. Acad. Sci. 111(51), 18249–18254 (2014)
Liu, W., et al.: Sphereface: deep hypersphere embedding for face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 212–220 (2017)
Long, W., Yang, Y., Shen, H.B.: Imploc: a multi-instance deep learning model for the prediction of protein subcellular localization based on immunohistochemistry images. Bioinformatics 36(7), 2244–2250 (2020)
Newberg, J., Murphy, R.F.: A framework for the automated analysis of subcellular patterns in human protein atlas images. J. Proteome Res. 7(6), 2300–2308 (2008)
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986)
Uhlen, M., et al.: Towards a knowledge-based human protein atlas. Nat. Biotechnol. 28(12), 1248–1250 (2010)
Wang, H., et al.: Cosface: large margin cosine loss for deep face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5265–5274 (2018)
Xu, Y.Y., Fan, Y., Shen, H.B.: Incorporating organelle correlations into semi-supervised learning for protein subcellular localization prediction. Bioinformatics (14), btw219 (2016)
Xu, Y.Y., Yang, F., Zhang, Y., Shen, H.B.: An image-based multi-label human protein subcellular localization predictor (i locator) reveals protein mislocalizations in cancer tissues. Bioinformatics 29(16), 2032–2040 (2013)
Xu, Y.Y., Yang, F., Zhang, Y., Shen, H.B.: Bioimaging-based detection of mislocalized proteins in human cancers by semi-supervised learning. Bioinformatics (Oxford, England) 31, November 2014. https://doi.org/10.1093/bioinformatics/btu772
Zhou, H., Yang, Y., Shen, H.B.: Hum-mploc 3.0: prediction enhancement of human protein subcellular localization through modeling the hidden correlations of gene ontology and functional domain features. Bioinformatics 33(6), 843–853 (2017)
Funding
This work was supported by the National Natural Science Foundation of China (No. 61972251).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Arcamone, F., Tu, Y., Yang, Y. (2021). Prediction of Protein Subcellular Localization from Microscopic Images via Few-Shot Learning. In: Wei, Y., Li, M., Skums, P., Cai, Z. (eds) Bioinformatics Research and Applications. ISBRA 2021. Lecture Notes in Computer Science(), vol 13064. Springer, Cham. https://doi.org/10.1007/978-3-030-91415-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-030-91415-8_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91414-1
Online ISBN: 978-3-030-91415-8
eBook Packages: Computer ScienceComputer Science (R0)