DOI: 10.1145/3009977.3009978
research-article

A differential excitation based rotational invariance for convolutional neural networks

Published: 18 December 2016

ABSTRACT

Deep Learning (DL) methods extract complex sets of features using architectures built from a hierarchy of layers. The features so learned have high discriminative power and thus represent the input to the network efficiently. Convolutional Neural Networks (CNNs), one such deep learning architecture, extract structural features with a little invariance to small translations, scaling, and other forms of distortion. In this paper, we explore the learning capabilities of CNNs with the aim of improving the rotational invariance of the architecture. We propose a new CNN architecture, called RICNN, with an additional layer formed by differential excitation against distance to improve rotational invariance. We show that the proposed method is more invariant to rotations than the original CNN architecture (training samples with different orientations are not considered), without disturbing the invariance to small translations, scaling, and other forms of distortion. Training time, testing time, and accuracy are evaluated at different percentages of training data to compare the performance of the proposed configuration with that of the original configuration.
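The abstract does not define "differential excitation" or how the additional layer uses distance. As a rough illustration only, the sketch below assumes the term refers to the differential excitation of the Weber Local Descriptor (WLD), which for a centre pixel x_c with 8 neighbours x_i is arctan(sum_i (x_i - x_c) / x_c). The function names, the `eps` guard, and the 3x3 test patch are hypothetical and are not taken from the paper. Because the measure sums over all neighbours, rotating the patch by 90 degrees leaves the value at the centre unchanged, which hints at why such a quantity could help with rotational invariance.

```python
import math


def differential_excitation(img, eps=1e-6):
    """WLD-style differential excitation for each interior pixel of a 2-D list.

    xi(x_c) = atan( sum_i (x_i - x_c) / x_c ) over the 8 neighbours x_i.
    eps is an assumption of this sketch; it avoids division by zero.
    Border pixels have no full neighbourhood and are left at 0.0.
    """
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for r in range(1, h - 1):
        for c in range(1, w - 1):
            center = img[r][c]
            neighbour_sum = sum(
                img[r + dr][c + dc]
                for dr in (-1, 0, 1)
                for dc in (-1, 0, 1)
                if (dr, dc) != (0, 0)
            )
            # sum_i (x_i - x_c) equals neighbour_sum - 8 * x_c
            out[r][c] = math.atan((neighbour_sum - 8 * center) / (center + eps))
    return out


def rotate90(img):
    """Rotate a 2-D list by 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]


# Hypothetical 3x3 patch; only the centre pixel has a full neighbourhood.
patch = [
    [10, 20, 30],
    [40, 25, 60],
    [70, 80, 90],
]
xi = differential_excitation(patch)[1][1]
xi_rot = differential_excitation(rotate90(patch))[1][1]
print(xi, xi_rot)  # the two values agree: the neighbour sum ignores ordering
```

This invariance is exact only for rotations that map the pixel grid onto itself (multiples of 90 degrees); how RICNN handles arbitrary angles and the "against distance" part is not recoverable from the abstract.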


            Published in

            ICVGIP '16: Proceedings of the Tenth Indian Conference on Computer Vision, Graphics and Image Processing
            December 2016
            743 pages
            ISBN: 9781450347532
            DOI: 10.1145/3009977

            Copyright © 2016 ACM


            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Acceptance Rates

            ICVGIP '16 paper acceptance rate: 95 of 286 submissions, 33%. Overall acceptance rate: 95 of 286 submissions, 33%.
