Use of Facial Landmarks for Adaptive Compression of Videos on Mobile Devices

Chhikara, Garima; Banerjee, Ruchika; Naik, Vinayak; Subramanyam, A. V.; Dey, Kuntal

doi:10.1007/978-3-030-10659-1_4


Part of the book series: Lecture Notes in Computer Science (LNCCN, volume 11227)

Included in the following conference series:

International Conference on Communication Systems and Networks


Abstract

Challenges such as resource requirements, limited storage space on devices, and limited mobile bandwidth inhibit unconstrained and ubiquitous video consumption. We propose a first-of-its-kind methodology for compressing videos that stream human faces. We detect facial landmarks on the fly and compress the video by storing a sequence of distinct frames extracted from it, chosen so that the facial landmarks of each pair of successively stored frames differ significantly. A dynamic thresholding technique decides whether a difference is significant, and we store meta-information for reconstructing the missing frames. To reduce glitches in the decompressed video, we use a morphing technique that smooths the transition between successive frames. We evaluate the technique objectively by measuring the time taken to compress, the entropy per frame, peak signal-to-noise ratio (PSNR), structural similarity index (SSIM), and compression ratio. For subjective analysis, we conduct a user study that observes user satisfaction at different compression ratios. We also extend the technique to handle videos with multiple faces. Our approach is complementary to existing compression techniques, e.g., JPEG; combining the two further improves the compression ratio without compromising quality.
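The frame-selection idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the landmark distance or the dynamic thresholding rule, so everything here is an assumption made for illustration: the function names `landmark_distance` and `select_keyframes`, the mean Euclidean distance between corresponding landmarks, and a threshold that scales with the running mean of frame-to-frame landmark motion (parameters `scale` and `floor`). It is not the authors' method.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def landmark_distance(a: Sequence[Point], b: Sequence[Point]) -> float:
    """Mean Euclidean distance between corresponding landmarks (assumed metric)."""
    if len(a) != len(b) or not a:
        raise ValueError("landmark sets must be non-empty and of equal size")
    return sum(math.dist(p, q) for p, q in zip(a, b)) / len(a)


def select_keyframes(
    landmarks_per_frame: Sequence[Sequence[Point]],
    scale: float = 1.5,   # hypothetical: how far above average motion counts as significant
    floor: float = 1.0,   # hypothetical: minimum threshold, in pixels
) -> List[int]:
    """Return indices of frames to store.

    A frame is stored when its landmarks differ from those of the last
    stored frame by more than a threshold. The threshold adapts to the
    video: it is `scale` times the running mean of frame-to-frame
    landmark motion seen so far, but never below `floor`.
    """
    if not landmarks_per_frame:
        return []

    kept = [0]                         # always store the first frame
    last_kept = landmarks_per_frame[0]
    previous = landmarks_per_frame[0]
    mean_motion = 0.0
    steps = 0

    for index in range(1, len(landmarks_per_frame)):
        current = landmarks_per_frame[index]

        # Update the running mean of motion between consecutive frames.
        steps += 1
        step_motion = landmark_distance(previous, current)
        mean_motion += (step_motion - mean_motion) / steps

        threshold = max(floor, scale * mean_motion)
        if landmark_distance(last_kept, current) > threshold:
            kept.append(index)
            last_kept = current
        previous = current

    return kept
```

Under these assumptions, the compression ratio in frames is `len(landmarks_per_frame) / len(select_keyframes(landmarks_per_frame))`. A decompressor would still need the stored meta-information and a morphing step to regenerate the skipped frames, which this sketch does not cover.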



Author information

Authors and Affiliations

IIIT Delhi, New Delhi, India
Garima Chhikara, Ruchika Banerjee & A. V. Subramanyam
BITS Pilani, Sancoale, Goa, India
Vinayak Naik
IBM Research, New Delhi, India
Kuntal Dey


Corresponding author

Correspondence to Vinayak Naik.

Editor information

Editors and Affiliations

Michigan State University, East Lansing, MI, USA
Subir Biswas
Indian Institute of Technology, Kharagpur, India
Animesh Mukherjee
National University of Singapore, Singapore, Singapore
Mun Choon Chan
Indian Institute of Technology, Kharagpur, India
Sandip Chakraborty
Indian Institute of Technology Hyderabad, Kandi, India
Abhinav Kumar
Qualcomm Inc., San Diego, CA, USA
Giridhar Mandyam
Tata Consultancy Services, Bangalore, India
Rajeev Shorey


About this paper

Cite this paper

Chhikara, G., Banerjee, R., Naik, V., Subramanyam, A.V., Dey, K. (2019). Use of Facial Landmarks for Adaptive Compression of Videos on Mobile Devices. In: Biswas, S., et al. Communication Systems and Networks. COMSNETS 2018. Lecture Notes in Computer Science, vol 11227. Springer, Cham. https://doi.org/10.1007/978-3-030-10659-1_4


DOI: https://doi.org/10.1007/978-3-030-10659-1_4
Published: 31 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-10658-4
Online ISBN: 978-3-030-10659-1
eBook Packages: Computer Science; Computer Science (R0)
