Abstract
Background modeling techniques for embedded computer vision applications must balance accuracy, speed, and power. Basic background modeling techniques run quickly, but their accuracy is not sufficient for computer vision problems involving dynamic background. In contrast, adaptive background modeling techniques are more robust, but run more slowly. Due to its high inherent fine-grain parallelism, robust adaptive background modeling has been implemented on GPUs with significant performance improvements over CPUs. However, these implementations are infeasible in embedded applications due to the high power ratings of the targeted general-purpose GPU platforms. This paper focuses on exploiting fine-grain data parallelism and optimizing memory access patterns to target a low-cost adaptive background modeling algorithm multimodal mean (MMM) to a low-power GPU with thermal design power (TDP) of only 12 watts. The algorithm has comparable accuracy with the Gaussian mixture model (GMM) algorithm, but less computational and memory cost. It achieves a frame rate of 392 fps with a full VGA resolution (640x480) frame on the low-power integrated GPU NVIDIA ION. This is a 20x speed-up of the MMM algorithm compared to the embedded CPU platform Intel Atom of comparable TDP. In addition, the MMM algorithm attains a 5-6x speed up over the GMM implementation on the ION GPU platform.
Similar content being viewed by others
References
Test images for wallflower paper (1999). http://research.microsoft.com/en-us/um/people/jckrumm/wallflower/testimages.htm.
PETS 2009 benchmark data (2009). http://www.cvg.reading.ac.uk/PETS2009/a.html#s111.
Intellio ILC-BL series smart cameras. http://www.videoline-tvcc.com/upload/pdf/ILC-BL_series_datasheet_ENG.pdf. accessed November 2015.
Matrox IRIS-GT smart camera. http://www.matrox.com/imaging/media/pdf/products/iris_gt_da/iris_gt_da.pdf. accessed November 2015.
Sony XCISX100C-XP smart camera. http://pro.sony.com/bbsc/ssr/cat-camerasindustrial/cat-cismartcameras/product-XCISX100C%2FXP/. accessed November 2015.
NI-177x series smart cameras. http://sine.ni.com/ds/app/doc/p/id/ds-370/lang/en. accessed November 2015.
Apewokin, S., Valentine, B., Forsthoefel, D., Wills, L., Wills, S., & Gentile, A. (2010). Embedded real-time surveillance using multimodal mean background modeling. In kisacanin, B., Bhattacharyya, S., & Chai, S. (Eds.) Embedded computer vision, (pp. 163–175.): Springer.
Azmat, S., Wills, L., & Wills, S. (2012). Accelerating adaptive background modeling on low-power integrated GPUs. In International workshop on embedded multicore systems (ICPP-EMS 2012), held in conjunction with the 41st IEEE international conference on parallel processing (pp. 568–573).
Carr, P. (2008). GPU accelerated multimodal background subtraction. In Digital image computing: Techniques and applications (DICTA) (pp. 279–286).
Fabiàn, T., & Gaura, J. (2008). Parallel implementation of recursive background modeling technique in CUDA for tracking moving objects in video traffic surveillance. In 4th Doctoral Workshop on Mathematical and Engineering Methods in Computer Science. http://www.fi.muni.cz/memics07/2008/pres/fabian_cuda.pdf.
Horprasert, T., Harwood, D., & Davis, L.S. (1999). A statistical approach for real-time robust background subtraction and shadow detection. In IEEE International conferecne on computer vision (ICCV), (Vol. 99 pp. 1–19).
Hsieh, K.Y., Lai, C.H., Lai, S.H., & Lee, J.K. (2012). Parallelization of belief propagation on cell processors for stereo vision. ACM Transactions on Embedded Computing Systems (TECS), 11(1), 13.
Kirk, D.B., & Wen-mei, W.H. (2012). Programming massively parallel processors: a hands-on approach Newnes.
Liu, Y., & Hu, J. (2011). GPU-based parallelization for fast circuit optimization. ACM Transactions on Design Automation of Electronic Systems (TODAES), 16(3), 24.
NVIDIA Corporation: NVIDIA Compute Unified Device Architecture C Programming Guide v6.5. http://docs.nvidia.com/cuda/pdf/CUDA_C_Programming_Guide.pdf. accessed January 2015.
NVIDIA Corporation: NVIDIA Compute Unified Device Architecture C Best Practices Guide v6.5. http://docs.nvidia.com/cuda/pdf/CUDA_C_Best_Practices_Guide.pdf. accessed January 2015.
Pham, V., Vo, P., & Hung, V.T. (2010). GPU implementation of extended gaussian mixture model for background subtraction. In 2010 IEEE RIVF International conference on computing and communication technologies, research, innovation, and vision for the future (pp. 1–4).
Poremba, M., Xie, Y., & Wolf, M. (2010). Accelerating adaptive background subtraction with GPU and CBEA architecture. In 2010 IEEE Workshop on signal processing systems (SIPS) (pp. 305–310).
Scogland, T.R.W., Lin, H., & Feng, W. (2010). A first look at integrated GPUs for green high-performance computing. Computer Science - Research and Development, 25(3–4), 125–134.
Sen-Ching, S.C., & Kamath, C. (2004). Robust techniques for background subtraction in urban traffic video. In Electronic imaging 2004. International society for optics and photonics (pp. 881–892).
Stauffer, C., & Grimson, W.E.L. (2000). Learning patterns of activity using real-time tracking. In IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), (Vol. 22 pp. 747–757).
Volkov, V. (2010). Better performance at lower occupancy (Presentation in GPU Technology Conference. http://www.cs.berkeley.edu/~volkov/volkov10-GTC.pdf.
Williams, S., Shalf, J., Oliker, L., Kamil, S., Husbands, P., & Yelick, K. (2006). The potential of the cell processor for scientific computing. In Proceedings of the 3rd conference on computing frontiers, ACM (pp. 9–20).
Zhu, Y., Wang, B., & Deng, Y. (2011). Massively parallel logic simulation with GPUs. ACM Transactions on Design Automation of Electronic Systems (TODAES), 16(3), 29.
Zivkovic, Z., & van der Heijden, F. (2006). Efficient adaptive density estimation per image pixel for the task of background subtraction. Pattern Recognition Letters, 27(7), 773–780.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Azmat, S., Wills, L. & Wills, S. Parallelizing Multimodal Background Modeling on a Low-Power Integrated GPU. J Sign Process Syst 88, 43–53 (2017). https://doi.org/10.1007/s11265-016-1111-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11265-016-1111-z