skip to main content
10.1145/2425836.2425882acmotherconferencesArticle/Chapter ViewAbstractPublication PagesivcnzConference Proceedingsconference-collections
poster

An optimal parameter analysis and GPU acceleration of the image receptive fields neural network approach

Published:26 November 2012Publication History

ABSTRACT

The Image Receptive Fields Neural Networks (IRFNN) algorithm is a recent approach for image classification that is as accurate and an order of magnitude faster than using a traditional feed-forward neural network (multi-layer perceptron), with a linear input layer, non-linear hidden layer and linear output layer. This paper investigates the algorithm's optimal parameter configuration along with a GPU implementation, further extending the performance of the algorithm. Optimization of classification is achieved through a deep search of potential configurations with respect to the number of neurons in the hidden layer and receptive field placement within the image plane. Second stage refinement is achieved through a search for optimal Gaussian receptive field size and shape in 2D. These processes guarantee an optimal network configuration. Secondly, a GPU acceleration of the feed-forward processing of images into the network is implemented. Receptive fields are uploaded to the GPU and all computations take place on the GPU resulting in a large performance increase. Analysis of both improvements are described in the paper.

References

  1. AT&T Laboratories Cambridge. Our Database of Faces (formerly 'The ORL Database of Faces'), 2011. http://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html, visited on October 9, 2011.Google ScholarGoogle Scholar
  2. P. Daum, J.-L. Buessler, and J.-P. Urban. Image receptive fields neural networks for object recognition. In T. Honkela, W. Duch, M. A. Girolami, and S. Kaski, editors, ICANN (2), volume 6792 of Lecture Notes in Computer Science, pages 95--102. Springer, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Eric Antonelo, the site administrator. Reservoir Computing: Shaping Dynamics into Information, 2011. http://reservoir-computing.org, visited on October 12, 2011.Google ScholarGoogle Scholar
  4. A. Ghani, T. M. McGinnity, L. Maguire, L. McDaid, and A. Belatreche. Neuro-Inspired Speech Recognition Based on Reservoir Computing, Advances in Speech Recognition, Noam Shabtai (Ed.). InTech, 2010.Google ScholarGoogle Scholar
  5. B. J. Grzyb, E. Chinellato, G. M. Wojcik, and W. A. Kaminski. Facial expression recognition based on liquid state machines built of alternative neuron models. IEEE International Joint Conference on Neural Networks, (1): 1011--1017, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. Jaeger. The "echo state" approach to analysing and training recurrent neural networks. Technical Report 148, GMD - German National Research Institute for Computer Science, 2001.Google ScholarGoogle Scholar
  7. H. Jaeger. Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the "echo state network" approach. Technical report, Fraunhofer Institute AIS, St. Augustin-Germany, 2002.Google ScholarGoogle Scholar
  8. Y. LeCun and Y. Bengio. Convolutional networks for images, speech, and time-series. In M. A. Arbib, editor, The Handbook of Brain Theory and Neural Networks. MIT Press, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. W. Maass, T. Natschlaeger, and H. Markram. Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural Computation, 14(11): 2531--2560, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. R. Madane, W. Banu, P. S., and S. C. R. Madane. BImplementation of High Speed Face Recognition Based on Karhunen Loeve Transform and FisherÕs Discriminant, Radial Basis Function of Echo State Neural Network. International Journal of Soft Computing, 3(3): 248--253, 2008.Google ScholarGoogle Scholar
  11. E. Niv and S. J. Weddell. A comparative study of random hidden node networks for pattern recognition. In Proceedings of the Image and Vision Computing New Zealand Conference (IVCNZ), pages 311--314, New Zealand, 2011.Google ScholarGoogle Scholar
  12. S. Scherer, M. Oubbati, F. Schwenker, and G. Palm. Real-time emotion recognition from speech using echo state networks. Artificial Neural Networks in Pattern Recognition, 5064: 205--216, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. C. Y. Tsai, X. Dutoit, K. T. Song, H. V. Brussel, and M. Nuttin. Robust face tracking control of a mobile robot using self-tuning Kalman filter and echo state network. Asian Journal of Control, 12(4): 488--509, 2010.Google ScholarGoogle Scholar
  14. A. Woodward and T. Ikegami. A reservoir computing approach to image classification using coupled echo state and back-propagation neural networks. In Proceedings of the Image and Vision Computing New Zealand Conference (IVCNZ), pages 453--458, New Zealand, 2011.Google ScholarGoogle Scholar

Index Terms

  1. An optimal parameter analysis and GPU acceleration of the image receptive fields neural network approach

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Other conferences
              IVCNZ '12: Proceedings of the 27th Conference on Image and Vision Computing New Zealand
              November 2012
              547 pages
              ISBN:9781450314732
              DOI:10.1145/2425836

              Copyright © 2012 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 26 November 2012

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • poster

              Acceptance Rates

              Overall Acceptance Rate55of74submissions,74%
            • Article Metrics

              • Downloads (Last 12 months)3
              • Downloads (Last 6 weeks)0

              Other Metrics

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader