Abstract
Automatic classification systems are required to support medical literature databases like PubMedCentral, which allow an easy access to millions of articles. FHDO Biomedical Computer Science Group (BCSG) participated at the ImageCLEF 2016 Subfigure Classification Task to improve existing approaches for classifying figures from medical literature. In this work, a data analysis is conducted in order to improve image preprocessing for deep learning approaches. Evaluations on the dataset show better ensemble classification accuracies using only visual information with an optimized training, in comparison to the mixed feature approaches of BCSG at ImageCLEF 2016. Additionally, a self-training approach is investigated to generate more labeled data in the medical domain.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
https://www.ncbi.nlm.nih.gov/pmc/ (last access: 19.04.2017).
- 2.
https://openi.nlm.nih.gov/ (last access: 19.04.2017).
- 3.
https://ceb.nlm.nih.gov/ridem/iti.html (last access: 24.04.2017).
- 4.
https://github.com/NVIDIA/DIGITS (last access: 20.04.2017).
- 5.
https://github.com/tensorflow/models/blob/master/slim (last access: 09.03.2017).
- 6.
https://www.tensorflow.org (last access: 09.03.2017).
- 7.
https://github.com/razorx89/imageclef-med-2016-follow-up-research (last access: 03.05.2017).
- 8.
https://www.ncbi.nlm.nih.gov/pmc/tools/openftlist/ (last access: 20.04.2017).
References
Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G.S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I.J., Harp, A., Irving, G., Isard, M., Jia, Y., Józefowicz, R., Kaiser, L., Kudlur, M., Levenberg, J., Mané, D., Monga, R., Moore, S., Murray, D.G., Olah, C., Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P.A., Vanhoucke, V., Vasudevan, V., Viégas, F.B., Vinyals, O., Warden, P., Wattenberg, M., Wicke, M., Yu, Y., Zheng, X.: TensorFlow: large-scale machine learning on heterogeneous distributed systems. CoRR abs/1603.04467 (2016). http://arxiv.org/abs/1603.04467
Csurka, G. (ed.): Domain Adaption in Computer Vision Applications. Advances in Computer Vision and Pattern Recognition, 1 edn. Springer International Publishing (2017). http://www.springer.com/de/book/9783319583464
Demner-Fushman, D., Antani, S., Simpson, M., Thoma, G.R.: Design and development of a multimodal biomedical information retrieval system. J. Comput. Sci. Eng. 6(2), 168–177 (2012)
Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: DeCAF: a deep convolutional activation feature for generic visual recognition. In: Jebara, T., Xing, E.P. (eds.) Proceedings of the 31st International Conference on Machine Learning (ICML 2014), pp. 647–655, JMLR Workshop and Conference Proceedings (2014)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016
García Seco de Herrera, A., Kalpathy-Cramer, J., Demner Fushman, D., Antani, S., Müller, H.: Overview of the ImageCLEF 2013 medical tasks. In: Working Notes of CLEF 2013 (Cross Language Evaluation Forum), CEUR Workshop Proceedings, vol. 1179, September 2013
García Seco de Herrera, A., Schaer, R., Antani, S., Müller, H.: Using crowdsourcing for multi-label biomedical compound figure annotation. In: Carneiro, G., et al. (eds.) LABELS/DLMIA -2016. LNCS, vol. 10008, pp. 228–237. Springer, Cham (2016). doi:10.1007/978-3-319-46976-8_24
García Seco de Herrera, A., Schaer, R., Bromuri, S., Müller, H.: Overview of the ImageCLEF 2016 medical task. In: CLEF2016 Working Notes, CEUR Workshop Proceedings, CEUR-WS.org, Évora, Portugal (2016). http://ceur-ws.org
Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM International Conference on Multimedia (MM 2014), NY, USA, pp. 675–678. ACM, New York (2014)
Koitka, S., Friedrich, C.M.: Traditional feature engineering and deep learning approaches at medical classification task of ImageCLEF 2016. In: Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, Èvora, Portugal, 5–8 September, 2016. CEUR-WS Proceedings Notes, vol. 1609, pp. 304–317, July 2016
Kumar, A., Kim, J., Lyndon, D., Fulham, M., Feng, D.: An ensemble of fine-tuned convolutional neural networks for medical image classification. IEEE J. Biomed. Health Inform. 21(1), 31–40 (2017)
Lê Cao, K.A., Boitard, S., Besse, P.: Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems. BMC Bioinform. 12(1), 253 (2011)
Pelka, O., Friedrich, C.M.: FHDO biomedical computer science group at medical classification task of imageclef 2015. In: Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum, Toulouse, France, 8–11 September 2015. CEUR-WS Proceedings Notes, vol. 1391 (2015)
Pelka, O., Friedrich, C.M.: Modality prediction of biomedical literature images using multimodal feature representation. GMS Med. Inform. Biometry Epidemiol. (MIBE) 12(1), Doc4 (2016)
Personnaz, L., Guyon, I., Dreyfus, G.: Collective computational properties of neural networks: new learning mechanisms. Phys. Rev. A (General Physics) 34(5), 4217–4228 (1986)
Ried, K., Frank, O.R., Stocks, N.P., Fakler, P., Sullivan, T.: Effect of garlic on blood pressure: a systematic review and meta-analysis. BMC Cardiovasc. Disorders 8(1), 13 (2008)
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., Berg, A.C., Fei-Fei, L.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015)
Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.A.: Inception-v4, Inception-ResNet and the impact of residual connections on learning. In: ICLR 2016 Workshop (2016)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015
Valavanis, L., Stathopoulos, S., Kalamboukis, T.: IPL at CLEF 2016 medical task. In: Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, Èvora, Portugal, 5–8 September 2016. CEUR-WS Proceedings Notes, vol. 1609, pp. 413–420 (2016). http://ceur-ws.org/Vol-1609/
Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: How transferable are features in deep neural networks? In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27. pp. 3320–3328. Curran Associates, Inc. (2014)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Koitka, S., Friedrich, C.M. (2017). Optimized Convolutional Neural Network Ensembles for Medical Subfigure Classification. In: Jones, G., et al. Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2017. Lecture Notes in Computer Science(), vol 10456. Springer, Cham. https://doi.org/10.1007/978-3-319-65813-1_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-65813-1_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65812-4
Online ISBN: 978-3-319-65813-1
eBook Packages: Computer ScienceComputer Science (R0)