Abstract
Automatic annotation of images is considered to be an important research problem in image retrieval. Traditional methods are computationally complex and fail to annotate correctly when the number of image classes is large and related. This paper proposes a novel approach, an autoencoder hashing, to categorize images of large-scale image classes. The intra bin classifiers are trained to classify the query image, and the tag weight and tag frequency are computed to achieve a more effective annotation of the query image. The proposed approach has been compared with other existing approaches in the literature using performance measures, such as precision, accuracy, mean average precision (MAP), and F1 score. The experimental results indicate that our proposed approach outperforms the existing approaches.








Similar content being viewed by others
References
Liu, P., Choo, K.-K. R., Wang, L., and Huang, F., SVM or deep learning? A comparative study on remote sensing image classification. Soft Computing 21(23):7053–7065, 2017.
Wang, L., Zhang, J., Liu, P., Choo, K.-K. R., and Huang, F., Spectral-spatial multi-feature based deep learning for hyperspectral remote sensing image classification. Soft Computing 21(1):213–221, 2017.
D’Orazio, C., and Choo, K.-K. R., An adversary model to evaluate DRM protection of video contents on iOS devices. Computers & Security 56:94–110, 2016.
Anbarasan, K., and Chitrakala, S., Clustering-based color image segmentation using local maxima. International Journal of Intelligent Information Technologies 14(1):28–47, 2018.
Bouramoul, A., Gravizor: A graphical tool for the visualization of web search engines results with multi-agent modeling. International Journal of Intelligent Information Technologies 13(3):37–56, 2017.
Raghuveera, T., Vidhushini, S., and Swathi, M., Comparative study of CAMSHIFT and RANSAC methods for face and eye tracking in real-time video. International Journal of Intelligent Information Technologies 13(2):63–75, 2017.
Chen, H. Y., Choo, K.-K. R., and Chen, W. U., Tamper detection and image recovery for BTC-compressed images. Multimedia Tools and Applications 76(14):15435–15463, 2017.
Mohammadi, R., and Javidan, R., An intelligent traffic engineering method over software defined networks for video surveillance systems based on artificial bee colony. International Journal of Intelligent Information Technologies 12(4):45–62, 2016.
Bagiwa, M. A., Wahab, A. W. A., Idris, M. Y. I., Khan, S., and Choo, K.-K. R., Chroma key background detection for digital video using statistical correlation of blurring artifact. Digital Investigation 19:29–43, 2016.
Priyatharshini, R., and Chitrakala, S., An efficient coronary disease diagnosis system using dual-phase multi-objective and embedded feature selection. International Journal of Intelligent Information Technologies 13(3):15–36, 2017.
Li, J., and Wang, J. Z., Automatic linguistic indexing of pictures by a statistical modelling approach. IEEE Tr. On Pattern Analysis & Machine Intelligence 25(9):1075–1088, 2003. https://doi.org/10.1109/TPAMI.2003.1227984.
Luo, J., Savakis, A. E., and Singhal, A., A Bayesian network-based framework for semantic image understanding. Pattern Recognition 38(6):919–934, 2005. https://doi.org/10.1016/j.patcog.2004.11.00.
Christos, D., George, S., Panagiotis, P., Christos, P., Nikos, D., and Anastasios, D., Large scale concept detection in multimedia data using small training sets and cross domain concept fusion. IEEE Tr. on Circuits and Systems for Video Technology 20(12):1808–1822, 2010. https://doi.org/10.1109/TCSVT.2010.2087814.
Cabral, R., De la Torre, F., Costeira, J. P., and Bernardino, A., Matrix completion for weakly-supervised multi-label image classification. IEEE Transactions on Pattern Analysis and Machine Intelligence 37(1):121–135, 2015. https://doi.org/10.1109/TPAMI.2014.2343234.
Galar, M., Fernandez, A., Barrenechea, E., Bustince, H., and Herrera, F., An overview of ensemble methods for binary classifiers in multi-class problems: An experimental study on one-vs-one and one-vs-all schemes. Pattern Recognition, Elsevier 44(8):1761–1776, 2011. https://doi.org/10.1016/j.patcog.2011.01.017.
Hsu, C. W., and Lin, C. J., A comparison of methods for multiclass support vector machines. IEEE Transactions on Neural Networks 13(2):415–425, 2002.
Polat, K., and Gune, S., A novel hybrid intelligent method based on C4.5 decision tree classifier and one-against-all approach for multi-class classification problems. Expert Systems with Applications 36(2):1587–1592, 2009.
Yu, X., Aloimonos, Y., Attribute-based transfer learning for object categorization with zero/one training example. European Conference on Computer Vision pp 127–140. doi:https://doi.org/10.1007/978-3-642-15555-0_10, 2010.
Miao, Q., Liu, R., Zhao, P., Li, Y., and Sun, E., A semi-supervised image classification model based on improved ensemble projection algorithm. IEEE Access 6:1372–1379, 2017. https://doi.org/10.1109/ACCESS.2017.2778881.
Miller, G., Wordnet: A lexical database for English. Communications of the ACM 38(11):39–41, 1995.
Fellbaum, C., Wordnet. Theory and applications of ontology: Computer applications pp 231–243, 2010.
Cilibrasi, R., and Vitanyi, P., The Google similarity distance. IEEE Transactions on Knowledge and Data Engineering 19(3):370–383, 2007.
Fan, J., Shen, Y., Yang, C., and Zhou, N., Structured max-margin learning for inter-related classifier training and multilabel image annotation. IEEE Transactions on Image Processing 20(3):837–854, 2011.
Dong, P., Mei, K., Zheng, N., Lei, H., and Fan, J., Training inter-related classifiers for automatic image classification and annotation. Pattern Recognition, Elsevier 46(5):1382–1395, 2012.
Torralba, A., Murphy, K., and Freeman, W., Sharing features: Efficient boosting procedures for multiclass object detection. IEEE computer society conference on computer vision and Pattern Recognition 2:762–769, 2004.
Kuang, Z., Li, Z., Zhao, T., Fan, J., Deep multi-task learning for large-scale image classification. IEEE 3rd International Conference on Multimedia Big Data. doi:10.1109/BigMM.2017.72, 2017.
Chen, T., Lu, S., Fan, J., S-CNN: Subcategory-aware convolutional networks for object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence doi:10.1109/TPAMI.2017.2756936, 1, 2017.
Fan, J., Gao, Y., Luo, H., and Jain, R., Mining multilevel image semantics via hierarchical classification. IEEE Transactions on Multimedia 10(2):167–187, 2008. https://doi.org/10.1109/TMM.2007.911775.
Fan, J., Gao, Y., and Luo, H., Integrating concept ontology and multitask learning to achieve more effective classifier training for multilevel image annotation. IEEE Transactions on Image Processing 17(3):407–426, 2008.
Marszalek, M., Schmid, C., Constructing category hierarchies for visual recognition. European Conference on Computer Vision, Springer pp 479–491. doi:10.1007/978-3-540-88693-8_35, 2008.
Marszalek, M., Schmid, C., Semantic hierarchies for visual object recognition. IEEE Conference on Computer Vision and Pattern Recognition pp 1–7. doi:10.1109/CVPR.2007.383272, 2007.
Tousch, A., Herbin, S., and Audibert, J., Semantic hierarchies for image annotation: A survey. Pattern Recognition, Elsevier 45(1):333–345, 2011. https://doi.org/10.1016/j.patcog.2011.05.017.
Zhu, S., Jin, D., Liang, Z., Wang, Q., Sun, Y., and Xu, G., Integration of semantic and visual hashing for image retrieval. Journal of Visual Communication & Image Representation 44:229–235, 2016. https://doi.org/10.1016/j.jvcir.2016.08.013.
Zhao, W., Luo, H., Peng, J., and Fan, J., Spatial pyramid deep hashing for large-scale image retrieval. Neurocomputing 243:166–173, 2017. https://doi.org/10.1016/j.neucom.2017.03.021.
Lu, Y., Zhang, L., Liu, J., and Tian, Q., Constructing concept lexica with small semantic gap. IEEE Transaction on Multimedia 12(4):288–299, 2010. https://doi.org/10.1109/TMM.2010.2046292.
Jing, X. Y., Wu, F., Li, Z., Hu, R., and Zhang, D., Multi-label dictionary learning for image annotation. IEEE Transactions on Image Processing 25(6):2712–2725, 2016.
Wang, R., Xie, Y., Yang, J., Xue, L., Hu, M., and Zhang, Q., Large scale automatic image annotation based on convolutional neural network. Journal of Visual Communication & Image Representation 49:213–224, 2017. https://doi.org/10.1016/j.jvcir.2017.07.004.
Dimitrovskia, I., Kocevb, D., Kitanovskia, I., Loskovskaa, S., and Dzeroski, S., Improved medical image modality classification using a combination of visual and textual features. Computerized Medical Imaging and Graphics, Elsevier 39:14–26, 2014. https://doi.org/10.1016/j.compmedimag.2014.06.005.
Arias, J., Martinez-Gomez, J., Gamez, J. A., Seco de Herrera, A. G., and Muller, H., Medical images modality classification using discrete Bayesian networks. Computer Vision and Image Understanding, Elsevier 151:61–71, 2016. https://doi.org/10.1016/j.cviu.2016.04.002.
Tang, Q., Liu, Y., and Liu, H., Medical image classification via multiscale representation learning. Artificial Intelligence in Medicine, Elsevier 79:71–78, 2017. https://doi.org/10.1016/j.artmed.2017.06.009.
Liu, G. H., Li, Z. Y., Zhang, L., and Xu, Y., Image retrieval based on micro-structure descriptor. Pattern Recognition 44(9):2123–2133, 2011. https://doi.org/10.1016/j.patcog.2011.02.003.
Wang, J., Kumar, S., and Chang, S., Semi-supervised hashing for large scale search. IEEE Transactions on Pattern Analysis and Machine Intelligence 34(12):2393–2406, 2012. https://doi.org/10.1109/TPAMI.2012.48.
Liong, V. E., Lu, J., Wang, G., Moulin, P., Zhou, J., Deep hashing for compact binary codes learning. IEEE Conference on Computer Vision and Pattern Recognition pp 2475–2483. doi:10.1109/CVPR.2015.7298862, 2015.
Ivasic-Kos, M., Pobar, M., and Ribaric, S., Two-tier image annotation model based on a multi-label classifier and fuzzy-knowledge representation scheme. Elsevier-Pattern Recognition 52:287–305, 2016.
Acknowledgements
Dr. Sugumaran’s research has been supported by a 2018 School of Business Administration Spring/Summer Research Fellowship from Oakland University.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Mercy Rajaselvi Beaulah P. declares that she has no conflict of interest. Manjula D. declares that she has no conflict of interest. Vijayan Sugumaran declares that he has no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
This article is part of the Topical Collection on Image & Signal Processing
Rights and permissions
About this article
Cite this article
Mercy Rajaselvi Beaulah, P., Manjula, D. & Sugumaran, V. Categorization of Images Using Autoencoder Hashing and Training of Intra Bin Classifiers for Image Classification and Annotation. J Med Syst 42, 132 (2018). https://doi.org/10.1007/s10916-018-0986-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10916-018-0986-6