skip to main content
10.1145/3078155.3078160acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiwoclConference Proceedingsconference-collections
extended-abstract

Accelerated Machine Learning Using TensorFlow and SYCL on OpenCL Devices

Published:16 May 2017Publication History

ABSTRACT

Machine learning is being used in more and more artificial intelligence applications. While existing machine learning frameworks mostly support NVIDIA CUDA GPUs, there has been little research dedicated to targeting other devices through open standards such as OpenCL. In this paper, we explain how machine learning applications can harness the power of OpenCL using open standards and how, by using SYCL, TensorFlow can be extended to include customized operations running on OpenCL devices.

References

  1. Martín Abadi and others. 2016. TensorFlow: A system for large-scale machine learning. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI). Savannah, Georgia, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Rami Al-Rfou and others. 2016. Theano: A Python framework for fast computation of mathematical expressions. arXiv preprint arXiv:1605.02688 (2016).Google ScholarGoogle Scholar
  3. Anelia Angelova, Alex Krizhevsky, and Vincent Vanhoucke. 2015. Pedestrian detection with a large-field-of-view deep network. In Robotics and Automation (ICRA), 2015 IEEE International Conference on. IEEE, 704--711.Google ScholarGoogle ScholarCross RefCross Ref
  4. Tianqi Chen and others. 2015. Mxnet: A flexible and efficient machine learning library for heterogeneous distributed systems. arXiv preprint arXiv:1512.01274 (2015).Google ScholarGoogle Scholar
  5. Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, and Evan Shelhamer. 2014. cudnn: Efficient primitives for deep learning. arXiv preprint arXiv:1410.0759 (2014).Google ScholarGoogle Scholar
  6. Ronan Collobert and others. 2002. Torch: a modular machine learning software library. Technical Report. Idiap.Google ScholarGoogle Scholar
  7. Mehdi Goli. 2016. VisionCPP: A SYCL-based Computer Vision Framework. In Proceedings of the 4th International Workshop on OpenCL. ACM, 6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Mehdi Goli. 2017. SYCL backend for Eigen. Technical peresentation in 1st workshop on Distributed and Heterogeneous Programming in C and C++(DHPCC++17)- To be appear in May 2017 (2017).Google ScholarGoogle Scholar
  9. Khronos OpenCL Working Group. 2008. The OpenCL Specification. (2008).Google ScholarGoogle Scholar
  10. OpenACC Working Group and others. 2011. The OpenACC Application Programming Interface. (2011).Google ScholarGoogle Scholar
  11. Junli Gu, Yibing Liu, Yuan Gao, and Maohua Zhu. 2016. OpenCL caffe: Accelerating and enabling a cross platform machine learning framework. In Proceedings of the 4th International Workshop on OpenCL. ACM, 8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Gael Guennebaud, Benoit Jacob, and others. 2014. Eigen: a C++ linear algebra library. (2014).Google ScholarGoogle Scholar
  13. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 770--778.Google ScholarGoogle ScholarCross RefCross Ref
  14. Yangqing Jia and others. 2014. Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the 22nd ACM international conference on Multimedia. ACM, 675--678. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Norman P. Jouppi and others. 2017. In-Datacenter Performance Analysis of a Tensor Processing Unit. (2017).Google ScholarGoogle Scholar
  16. Khronos Group 2014. The SPIR Specification. Khronos Group.Google ScholarGoogle Scholar
  17. Microsoft. 2013. C++ AMP: Language and Programming Model. (2013).Google ScholarGoogle Scholar
  18. CUDA Nvidia. 2010. Programming guide. (2010).Google ScholarGoogle Scholar
  19. ARB OpenMP. 2011. OpenMP Application Programming Interface. (2011).Google ScholarGoogle Scholar
  20. Ralph Potter, Paul Keir, Russell J Bradford, and Alastair Murray. 2015. Kernel composition in SYCL. In Proceedings of the 3rd International Workshop on OpenCL. ACM, 11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Olga Russakovsky and others. 2015. Imagenet large scale visual recognition challenge. International Journal of Computer Vision 115, 3 (2015), 211--252. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Ben Sander, Greg Stoner, Siu-chi Chan, WH Chung, and Robin Maffeo. 2015. HCC: A C++ Compiler For Heterogeneous Computing. HSA Foundation, Tech. Rep. (2015).Google ScholarGoogle Scholar
  23. Khronos OpenCL Working Group SYCL subgroup. 2015. SYCL Specification. (2015).Google ScholarGoogle Scholar

Index Terms

  1. Accelerated Machine Learning Using TensorFlow and SYCL on OpenCL Devices
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          IWOCL '17: Proceedings of the 5th International Workshop on OpenCL
          May 2017
          135 pages
          ISBN:9781450352147
          DOI:10.1145/3078155

          Copyright © 2017 Owner/Author

          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 16 May 2017

          Check for updates

          Qualifiers

          • extended-abstract
          • Research
          • Refereed limited

          Acceptance Rates

          IWOCL '17 Paper Acceptance Rate15of29submissions,52%Overall Acceptance Rate84of152submissions,55%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader