Accelerating Bag-of-Words with SOM

Chen, Jian-Hui; Wang, Zuo-Ren; Liu, Cheng-Lin

doi:10.1007/978-3-030-36718-3_48

Accelerating Bag-of-Words with SOM

Jian-Hui Chen^11,12,13,14,
Zuo-Ren Wang^11,13,14 &
Cheng-Lin Liu^12,13,14

Conference paper
First Online: 09 December 2019

2736 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11955))

Abstract

We propose a fast Bag-of-Words (BoW) method for image classification, inspired by the mechanism that arrangement of neurons in visual cortex can preserve the topology of mapping from inputs, and the fact that human brain can retrieve information almost instantly. We propose algorithms for accelerating both Self-Organizing Map (SOM) training and BoW coding. First, we modify the traditional SOM based on the matrix factorization form of K-means. Utilizing the topology-preserving property of dictionary learned by SOM, the coding process of BoW can be accelerated by fast search of k-nearest neighbor codewords in the grid of SOM dictionary. We evaluate the proposed method in different coding scenarios for image classification task on MNIST and CIFAR-10 datasets. The results show that the proposed method accelerates BoW classification greatly with little loss of classification accuracy.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Bauckhage, C.: K-means clustering is matrix factorization. arXiv preprint arXiv:1512.07548 (2015)
Brendel, W., Bethge, M.: Approximating CNNs with bag-of-local-features models works surprisingly well on imagenet. arXiv preprint arXiv:1904.00760 (2019)
Coates, A., Ng, A.Y.: Learning feature representations with k-means. In: Montavon, G., Orr, G.B., Müller, K.-R. (eds.) Neural Networks: Tricks of the Trade. LNCS, vol. 7700, pp. 561–580. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35289-8_30
Chapter Google Scholar
Coates, A., Ng, A.Y., Lee, H.: An analysis of single-layer networks in unsupervised feature learning. In: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011, Fort Lauderdale, USA, 11–13 April 2011, pp. 215–223 (2011)
Google Scholar
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, Prague, vol. 1, pp. 1–2 (2004)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2005), pp. 886–893 (2005)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)
Article Google Scholar
Huang, Y., Wu, Z., Wang, L., Tan, T.: Feature coding in image classification: a comprehensive study. IEEE Trans. Pattern Anal. Mach. Intell. 36(3), 493–506 (2014)
Article Google Scholar
Iyyer, M., Manjunatha, V., Boyd-Graber, J., Daumé III, H.: Deep unordered composition rivals syntactic methods for text classification. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Long Papers, vol. 1, pp. 1681–1691 (2015)
Google Scholar
Jiang, Z., Zhang, G., Davis, L.S.: Submodular dictionary learning for sparse coding. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2012), pp. 3418–3425 (2012)
Google Scholar
Kandel, E.R., et al.: Principles of Neural Science, vol. 4. McGraw-Hill, New York (2000)
Google Scholar
Khan, F.S., Van De Weijer, J., Anwer, R.M., Bagdanov, A.D., Felsberg, M., Laaksonen, J.: Scale coding bag of deep features for human attribute and action recognition. Mach. Vis. Appl. 29(1), 55–71 (2018)
Article Google Scholar
Kohonen, T.: Self-organized formation of topologically correct feature maps. Biol. Cybern. 43(1), 59–69 (1982)
Article Google Scholar
Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images. Technical report, Citeseer (2009)
Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P., et al.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
Marszalek, M., Schmid, C., Harzallah, H., van de Weijer, J.: Learning representations for visual object class recognition. In: Proceedings of the PASCAL Visual Object Classes Challenge 2007 (2007)
Google Scholar
Novikov, A.: PyClustering: data mining library. J. Open Source Softw. 4(36), 1230 (2019)
Article Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Yang, J., Yu, K., Gong, Y., Huang, T.S.: Linear spatial pyramid matching using sparse coding for image classification. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2009), pp. 1794–1801 (2009)
Google Scholar
Yin, H.: The self-organizing maps: background, theories, extensions and applications. In: Fulcher, J., Jain, L.C. (eds.) Computational Intelligence: A Compendium. SCI, vol. 115, pp. 715–762. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-78293-3_17
Chapter Google Scholar
Zhou, H., Wei, L., Lim, C.P., Creighton, D., Nahavandi, S.: Robust vehicle detection in aerial images using bag-of-words and orientation aware scanning. IEEE Trans. Geosci. Remote Sens. 99, 1–12 (2018)
Google Scholar

Download references

Acknowledgements

Supported by the Major Project for New Generation of AI Grant No. 2018AAA0100400, the National Natural Science Foundation of China (NSFC) Grant No. 61721004, the Strategic Priority Research Program of Chinese Academy of Science, Grant No. XDB32010300, Shanghai Municipal Science and Technology Major Project (Grant No. 2018SHZDZX05).

Author information

Authors and Affiliations

State Key Laboratory of Neuroscience, Institute of Neuroscience, Chinese Academy of Sciences, Shanghai, People’s Republic of China
Jian-Hui Chen & Zuo-Ren Wang
NLPR, Institute of Automation, Chinese Academy of Sciences, Beijing, People’s Republic of China
Jian-Hui Chen & Cheng-Lin Liu
University of Chinese Academy of Sciences, Beijing, People’s Republic of China
Jian-Hui Chen, Zuo-Ren Wang & Cheng-Lin Liu
CAS Center for Excellence of Brain Science and Intelligence Technology, Chinese Academy of Sciences, Beijing, People’s Republic of China
Jian-Hui Chen, Zuo-Ren Wang & Cheng-Lin Liu

Authors

Jian-Hui Chen
View author publications
You can also search for this author in PubMed Google Scholar
Zuo-Ren Wang
View author publications
You can also search for this author in PubMed Google Scholar
Cheng-Lin Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Cheng-Lin Liu .

Editor information

Editors and Affiliations

Australian National University, Canberra, ACT, Australia
Tom Gedeon
Murdoch University, Murdoch, WA, Australia
Kok Wai Wong
Kyungpook National University, Daegu, Korea (Republic of)
Minho Lee

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, JH., Wang, ZR., Liu, CL. (2019). Accelerating Bag-of-Words with SOM. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Lecture Notes in Computer Science(), vol 11955. Springer, Cham. https://doi.org/10.1007/978-3-030-36718-3_48

Download citation

DOI: https://doi.org/10.1007/978-3-030-36718-3_48
Published: 09 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36717-6
Online ISBN: 978-3-030-36718-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics