Parallelizing Multimodal Background Modeling on a Low-Power Integrated GPU

Azmat, Shoaib; Wills, Linda; Wills, Scott

doi:10.1007/s11265-016-1111-z

Parallelizing Multimodal Background Modeling on a Low-Power Integrated GPU

Published: 13 February 2016

Volume 88, pages 43–53, (2017)
Cite this article

Journal of Signal Processing Systems Aims and scope Submit manuscript

Shoaib Azmat¹,
Linda Wills² &
Scott Wills²

335 Accesses
3 Citations
Explore all metrics

Abstract

Background modeling techniques for embedded computer vision applications must balance accuracy, speed, and power. Basic background modeling techniques run quickly, but their accuracy is not sufficient for computer vision problems involving dynamic background. In contrast, adaptive background modeling techniques are more robust, but run more slowly. Due to its high inherent fine-grain parallelism, robust adaptive background modeling has been implemented on GPUs with significant performance improvements over CPUs. However, these implementations are infeasible in embedded applications due to the high power ratings of the targeted general-purpose GPU platforms. This paper focuses on exploiting fine-grain data parallelism and optimizing memory access patterns to target a low-cost adaptive background modeling algorithm multimodal mean (MMM) to a low-power GPU with thermal design power (TDP) of only 12 watts. The algorithm has comparable accuracy with the Gaussian mixture model (GMM) algorithm, but less computational and memory cost. It achieves a frame rate of 392 fps with a full VGA resolution (640x480) frame on the low-power integrated GPU NVIDIA ION. This is a 20x speed-up of the MMM algorithm compared to the embedded CPU platform Intel Atom of comparable TDP. In addition, the MMM algorithm attains a 5-6x speed up over the GMM implementation on the ION GPU platform.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Test images for wallflower paper (1999). http://research.microsoft.com/en-us/um/people/jckrumm/wallflower/testimages.htm.
PETS 2009 benchmark data (2009). http://www.cvg.reading.ac.uk/PETS2009/a.html#s111.
Intellio ILC-BL series smart cameras. http://www.videoline-tvcc.com/upload/pdf/ILC-BL_series_datasheet_ENG.pdf. accessed November 2015.
Matrox IRIS-GT smart camera. http://www.matrox.com/imaging/media/pdf/products/iris_gt_da/iris_gt_da.pdf. accessed November 2015.
Sony XCISX100C-XP smart camera. http://pro.sony.com/bbsc/ssr/cat-camerasindustrial/cat-cismartcameras/product-XCISX100C%2FXP/. accessed November 2015.
NI-177x series smart cameras. http://sine.ni.com/ds/app/doc/p/id/ds-370/lang/en. accessed November 2015.
Apewokin, S., Valentine, B., Forsthoefel, D., Wills, L., Wills, S., & Gentile, A. (2010). Embedded real-time surveillance using multimodal mean background modeling. In kisacanin, B., Bhattacharyya, S., & Chai, S. (Eds.) Embedded computer vision, (pp. 163–175.): Springer.
Azmat, S., Wills, L., & Wills, S. (2012). Accelerating adaptive background modeling on low-power integrated GPUs. In International workshop on embedded multicore systems (ICPP-EMS 2012), held in conjunction with the 41st IEEE international conference on parallel processing (pp. 568–573).
Carr, P. (2008). GPU accelerated multimodal background subtraction. In Digital image computing: Techniques and applications (DICTA) (pp. 279–286).
Fabiàn, T., & Gaura, J. (2008). Parallel implementation of recursive background modeling technique in CUDA for tracking moving objects in video traffic surveillance. In 4th Doctoral Workshop on Mathematical and Engineering Methods in Computer Science. http://www.fi.muni.cz/memics07/2008/pres/fabian_cuda.pdf.
Horprasert, T., Harwood, D., & Davis, L.S. (1999). A statistical approach for real-time robust background subtraction and shadow detection. In IEEE International conferecne on computer vision (ICCV), (Vol. 99 pp. 1–19).
Hsieh, K.Y., Lai, C.H., Lai, S.H., & Lee, J.K. (2012). Parallelization of belief propagation on cell processors for stereo vision. ACM Transactions on Embedded Computing Systems (TECS), 11(1), 13.
Google Scholar
Kirk, D.B., & Wen-mei, W.H. (2012). Programming massively parallel processors: a hands-on approach Newnes.
Liu, Y., & Hu, J. (2011). GPU-based parallelization for fast circuit optimization. ACM Transactions on Design Automation of Electronic Systems (TODAES), 16(3), 24.
Article Google Scholar
NVIDIA Corporation: NVIDIA Compute Unified Device Architecture C Programming Guide v6.5. http://docs.nvidia.com/cuda/pdf/CUDA_C_Programming_Guide.pdf. accessed January 2015.
NVIDIA Corporation: NVIDIA Compute Unified Device Architecture C Best Practices Guide v6.5. http://docs.nvidia.com/cuda/pdf/CUDA_C_Best_Practices_Guide.pdf. accessed January 2015.
Pham, V., Vo, P., & Hung, V.T. (2010). GPU implementation of extended gaussian mixture model for background subtraction. In 2010 IEEE RIVF International conference on computing and communication technologies, research, innovation, and vision for the future (pp. 1–4).
Poremba, M., Xie, Y., & Wolf, M. (2010). Accelerating adaptive background subtraction with GPU and CBEA architecture. In 2010 IEEE Workshop on signal processing systems (SIPS) (pp. 305–310).
Scogland, T.R.W., Lin, H., & Feng, W. (2010). A first look at integrated GPUs for green high-performance computing. Computer Science - Research and Development, 25(3–4), 125–134.
Article Google Scholar
Sen-Ching, S.C., & Kamath, C. (2004). Robust techniques for background subtraction in urban traffic video. In Electronic imaging 2004. International society for optics and photonics (pp. 881–892).
Stauffer, C., & Grimson, W.E.L. (2000). Learning patterns of activity using real-time tracking. In IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), (Vol. 22 pp. 747–757).
Volkov, V. (2010). Better performance at lower occupancy (Presentation in GPU Technology Conference. http://www.cs.berkeley.edu/~volkov/volkov10-GTC.pdf.
Williams, S., Shalf, J., Oliker, L., Kamil, S., Husbands, P., & Yelick, K. (2006). The potential of the cell processor for scientific computing. In Proceedings of the 3rd conference on computing frontiers, ACM (pp. 9–20).
Zhu, Y., Wang, B., & Deng, Y. (2011). Massively parallel logic simulation with GPUs. ACM Transactions on Design Automation of Electronic Systems (TODAES), 16(3), 29.
Article Google Scholar
Zivkovic, Z., & van der Heijden, F. (2006). Efficient adaptive density estimation per image pixel for the task of background subtraction. Pattern Recognition Letters, 27(7), 773–780.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, Comsats Institute of Information Technology, Abbottabad, Pakistan
Shoaib Azmat
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
Linda Wills & Scott Wills

Authors

Shoaib Azmat
View author publications
You can also search for this author in PubMed Google Scholar
Linda Wills
View author publications
You can also search for this author in PubMed Google Scholar
Scott Wills
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shoaib Azmat.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Azmat, S., Wills, L. & Wills, S. Parallelizing Multimodal Background Modeling on a Low-Power Integrated GPU. J Sign Process Syst 88, 43–53 (2017). https://doi.org/10.1007/s11265-016-1111-z

Download citation

Received: 11 May 2015
Revised: 13 November 2015
Accepted: 28 January 2016
Published: 13 February 2016
Issue Date: July 2017
DOI: https://doi.org/10.1007/s11265-016-1111-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Parallelizing Multimodal Background Modeling on a Low-Power Integrated GPU

Abstract

Access this article

Similar content being viewed by others

GPU Accelerated Non-Parametric Background Subtraction

Performance and Scalability Improvement of GMM Background Segmentation Algorithm on Multi-core Parallel Platforms

Poisson Mixture Model for High Speed and Low-Power Background Subtraction

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Parallelizing Multimodal Background Modeling on a Low-Power Integrated GPU

Abstract

Access this article

Similar content being viewed by others

GPU Accelerated Non-Parametric Background Subtraction

Performance and Scalability Improvement of GMM Background Segmentation Algorithm on Multi-core Parallel Platforms

Poisson Mixture Model for High Speed and Low-Power Background Subtraction

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation