Abstract
Video analytics frameworks often rely on Neural Networks to perform their tasks. For example, a “You Only Look Once” object detection algorithm applies a single neural network to each image, divides the image into regions, and predicts bounding boxes (weighted by the predicted probabilities) with probabilities for each region. Those algorithms often run more efficiently on hardware accelerators. Libraries which use CUDA enabled GPUs can achieve tremendous advances in speed for those functionalities. Frequently, video analytic researchers develop large solutions to allow them to solve problems with complex setup procedures for other researchers to be able to duplicate their efforts. Here we present a software solution that can be run on multiple computer environments without having to customize systems and software, and support the measurement of the performance of machine learning algorithms on disparate datasets. In this publication, we introduce a common base container that provides GPU-optimized access to common Computer Vision (CV) and Machine Learning (ML) libraries, and can be used as the building container (think Docker FROM) for complex analytics to be interactively designed and tested, and as the base for Docker container images that can be shared between analytics researchers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Docker CE installation instructions can be found at https://docs.docker.com/install/linux/docker-ce/ubuntu/.
- 2.
The Github repository for Nvidia Docker (v2) is available at https://github.com/NVIDIA/nvidia-docker.
- 3.
(Open Source Neural Networks in C https://pjreddie.com/darknet/.
- 4.
YOLO v3 https://pjreddie.com/darknet/yolo/.
- 5.
- 6.
Because we are using older tagged version of tensorflow container, which is built from an older ubuntu:16.04 base container, some security patches have not been applied to that older containers.
References
George A., et al.: TRECVID 2016: evaluating video search, video event detection, localization, and hyperlinking. In: Proceedings of TRECVID 2016. NIST, USA (2016). https://www-nlpir.nist.gov/projects/tvpubs/tv16.papers/tv16overview.pdf
Guan, H., et al.: MFC datasets: large-scale benchmark datasets for media forensic challenge evaluation. In: 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), pp. 63–72. IEEE (2019). https://www.nist.gov/publications/mfc-datasets-large-scale-benchmark-datasets-media-forensic-challenge-evaluation
Mariotti, K.: SARUS: an OCI-compliant container runtime for HPC. HPC-AI Advisory Council, April 3rd 2019. http://hpcadvisorycouncil.com/events/2019/swiss-workshop/pdf/030419/K_Mariotti_CSCS_SARUS_OCI_ContainerRuntime_04032019.pdf
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Michel, M., Burnett, N. (2019). Enabling GPU-Enhanced Computer Vision and Machine Learning Research Using Containers. In: Weiland, M., Juckeland, G., Alam, S., Jagode, H. (eds) High Performance Computing. ISC High Performance 2019. Lecture Notes in Computer Science(), vol 11887. Springer, Cham. https://doi.org/10.1007/978-3-030-34356-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-34356-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34355-2
Online ISBN: 978-3-030-34356-9
eBook Packages: Computer ScienceComputer Science (R0)