Skip to main content

GPU Accelerated Image Processing for Lip Segmentation

  • Conference paper
Parallel Processing and Applied Mathematics (PPAM 2011)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7203))

Abstract

This paper presents the problem of lip segmentation in parallel environment using computational capabilities of GPUs and CUDA. The presented implementation of lip segmentation is based on image processing methods using the most popular transformations such as morphological operations and convolution filters. The obtained experimental results for the parallel implementation on GPU indicate significant speedup in comparison to its sequential counterpart. Consequently, the use of popular graphics cards provides a very promising possibility of quick lips segmentation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Parallel programming and computing platform CUDA NVIDIA, http://www.nvidia.com/object/cuda_home_new.html

  2. Aleksic, P.S., Williams, J.J., Wu, Z., Katsaggelos, A.K.: Audio-Visual speech recognition using MPEG-4 compliant visual features. EURASIP Journal on Advances in Signal Processing 2002, 1213–1227 (2002)

    Article  MATH  Google Scholar 

  3. Desouza, G.N., Kak, A.C.: Vision for mobile robot navigation: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 237–267 (2002)

    Article  Google Scholar 

  4. Farrugia, J.P., Horain, P., Guehenneux, E., Alusse, Y.: GPUCV: a framework for image processing acceleration with graphics processors. In: 2006 IEEE International Conference on Multimedia and Expo., pp. 585–588 (2006)

    Google Scholar 

  5. Forsyth, D.A., Ponce, J.: Computer Vision: A Modern Approach. Prentice Hall (2002)

    Google Scholar 

  6. Fung, J., Mann, S.: OpenVIDIA: parallel GPU computer vision. In: MULTIMEDIA 2005: Proceedings of the 13th Annual ACM International Conference on Multimedia, pp. 849–852 (2005)

    Google Scholar 

  7. Guan, Y.: Automatic extraction of lips based on multi-scale wavelet edge detection. IET Computer Vision 2, 23 (2008)

    Article  Google Scholar 

  8. Hopf, M., Ertl, T.: Accelerating 3D convolution using graphics hardware. In: Proceedings of the Visualization 1999, pp. 471–564 (1999)

    Google Scholar 

  9. Hopf, M., Ertl, T.: Accelerating morphological analysis with graphics hardware. In: Workshop on Vision, Modeling, and Visualization, VMV 2000, vol. 337, pp. 337–345 (2000)

    Google Scholar 

  10. Hurlbert, A., Poggio, T.: Synthesizing a color algorithm from examples. Science 239, 482–485 (1998)

    Article  Google Scholar 

  11. Liew, A.W., Wang, S.: Visual speech recognition: lip segmentation and mapping. Idea Group Inc. (IGI) (2009)

    Google Scholar 

  12. Lucey, S., Sridharan, S., Chandran, V.: Adaptive mouth segmentation using chromatic features. Pattern Recognition Letters (2002)

    Google Scholar 

  13. McConnell, J.: Computer Graphics: Theory Into Practice. Jones & Bartlett Pub. (2005)

    Google Scholar 

  14. Neti, C., Potamianos, G., Luettin, J., Matthews, I., Glotin, H., Vergyri, D., Sison, J., Mashari, A., Zhou, J.: Audio-visual speech recognition. In: Final Workshop 2000 Report, vol. 764 (2000)

    Google Scholar 

  15. Shapiro, L.G., Stockman, G.C.: Computer Vision. Prentice Hall (2001)

    Google Scholar 

  16. Shin, J., Lee, J., Kim, D.: Real-time lip reading system for isolated korean word recognition. Pattern Recognition 44, 559–571 (2011)

    Article  MATH  Google Scholar 

  17. Shirley, P.: Fundamentals of Computer Graphics. CRC Press (2002)

    Google Scholar 

  18. Sinha, S.N., Frahm, J., Pollefeys, M., Genc, Y.: Feature tracking and matching in video using programmable graphics hardware. Machine Vision and Applications 22, 207–217 (2007)

    Article  Google Scholar 

  19. Strzodka, R., Garbe, C.: Real-time motion estimation and visualization on graphics cards. In: IEEE Visualization 2004, pp. 545–552 (2004)

    Google Scholar 

  20. Strzodka, R., Telea, A.: Generalized distance transforms and skeletons in graphics hardware. In: Proceedings of EG/IEEE TCVG Symposium on Visualization (VisSym 2004), pp. 221–230 (2004)

    Google Scholar 

  21. Strzodka, R., Ihrke, I., Magnor, M.: A graphics hardware implementation of the generalized hough transform for fast object recognition, scale, and 3D pose detection. In: Proceedings of IEEE International Conference on Image Analysis and Processing (ICIAP 2003), pp. 188–193 (2003)

    Google Scholar 

  22. Ujaldon, M., Ruiz, A., Guil, N.: On the computation of the circle hough transform by a GPU rasterizer. Pattern Recogn. Lett. 29, 309–318 (2008)

    Article  Google Scholar 

  23. Wang, S., Lau, W., Liew, A.W., Leung, S.: Robust lip region segmentation for lip images with complex background. Pattern Recognition 40, 3481–3491 (2007)

    Article  MATH  Google Scholar 

  24. Yang, M., You, Z., Shih, Y.: Lip contour extraction for language learning in VEC3D. Machine Vision and Applications 21, 33–41 (2008)

    Article  Google Scholar 

  25. Zhao, W., Chellappa, R., Phillips, P.J., Rosenfeld, A.: Face recognition: A literature survey. ACM Comput. Surv. 35, 399–458 (2003)

    Article  Google Scholar 

  26. Zhi, Q., Kaynak, M.N.N., Sengupta, K., Cheok, A.D., Ko, C.C.: A study of the modeling aspects in bimodal speech recognition. In: IEEE International Conference on Multimedia and Expo., ICME 2001 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Adrjanowicz, L., Kubanek, M., Tomas, A. (2012). GPU Accelerated Image Processing for Lip Segmentation. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2011. Lecture Notes in Computer Science, vol 7203. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31464-3_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31464-3_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31463-6

  • Online ISBN: 978-3-642-31464-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics