Skip to main content
Log in

Mobile video communication based on augmented reality

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Augmented Reality (AR)–based video telephony service can allow mobile users a better user experience (UX) since it allows participants to place and transmit augmented objects on video frames to a peer. However, there are quite a few AR-based mobile video communication models today, yet the existing models are limited and insufficient in supporting technical service such as real-time object detection, dynamic data selection, and discrimination between local data augmentation and remote data augmentation. This paper presents an enhanced AR–based mobile video telephony scheme, in which the object of interest can be dynamically combined with a video frame through real-time object detection, and users can immediately share their experience with their friend during a video call. In order to evaluate the effectiveness and feasibility of the proposed scheme, an application has been implemented on the mobile system and the computational time has been measured. Experimental results show that the proposed system can give customers better UX with small increase of computational time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Chillet D, Eiche A, Pillement S, Sentieys O (2011) Real-time scheduling on heterogeneous system-on-chip architectures using an optimized artificial neural network. J Syst Archit 57(4):340–353

    Article  Google Scholar 

  2. Chuen-Horng L, Yu-Jhuang S (2010) Fast segmentation of porcelain images based on texture features original research article. J Visual Commun Image Recogn 21(7):707–721

    Article  Google Scholar 

  3. Daping W, Michael B (2012) Augmented reality: service construction via a 4D communication model. IEEE Commun Mag 50(3):26–31

    Article  Google Scholar 

  4. Duanggate C, Uyyanonvara B, Makhanov S, Barman S, Williamson T (2011) Object detection with feature stability over scale space. J Visual Commun Image Recogn 22(4):345–352

  5. Dunko GA (2009) Enhanced video telephony through augmented reality, US Patent 20090231413A1

  6. Freund Y, Schapire RE (1997) A Decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst 55:119–139

    Article  MathSciNet  MATH  Google Scholar 

  7. Fukayama A, Takamiya S, Nakagawa J, Arakawa N, Kanamaru N, Uchida N (2011) Architecture and prototype of augmented reality videophone service. In Proc. Of the 15th International Conference on Intelligence in Next Generation Networks 80–85

  8. ITU (1996) Rec. H.263: video coding for low bit rate communication, telecommunication standardization sector of ITU

  9. Jain AK (1989) Fundamentals of digital image processing. Prentice-Hall

  10. Jana S, Baik E, Pande A, Mohapatra P (2014) Improving mobile video telephony. In Proc. Of the 2014 Eleventh Annual IEEE International Conference on Sensing, Communication, and Networking (SECON), pp.495–503

  11. Kim DW, Gil GT, Kim DH (2011) A handover decision strategy with a novel modified load-based adaptive hysteresis adjustment in 3GPP LTE system. IEICE Trans Inf Syst E94-D(6):1130–1136

    Article  Google Scholar 

  12. Lanitis A, Taylor CJ, Cootes TF (1995) An autoatic face identification system using flexible appearance models. Image Vis Comput 13(5):393–401

    Article  Google Scholar 

  13. Lee D-H, Park J (2007) An accessible and collaborative tourist guide based on augmented reality and mobile devices. In Proc. of the Second Workshop on Digital Media and its Application in Museum & Heritages 379–382

  14. Manpreet K, Jasdeep K, Jappreet K (2011) Survey of contrast enhancement techniques based on histogram equalization. Int J Adv Comput Sci Appl 2(7):137–141

    Google Scholar 

  15. McKenna SJ, Gong S, Raja Y (1998) Modeling facial colour and identity with gaussian mixtures. Pattern Recogn 31(12):1883–1892

    Article  Google Scholar 

  16. Mirzaei MR, Ghorshi S, Mortazavi M (2013) Audio-visual speech recognition techniques in augmented reality environments. J Visual Comput 30(3):245–257

    Article  Google Scholar 

  17. Osuna E, Freund R, Girosi F (1997) Training support vector machines: an application to face detection. Proc IEEE Conf Comput Vis Pattern Recogn 130–136

  18. Popov A, Dimitrova D (2008) A new approach for finding face features in color images. In Proc. Of the 4th International Conference on Intelligent Systems 1233–1237

  19. Rajagopalan A, Kumar K, Karlekar J, Manivasakan R, Patil M, Desai U, Poonacha P, Chaudhuri S (1998) Finding faces in photographs. Proc Sixth IEEE Int Conf Comput Vis 640–645

  20. Rosenberg J, Schulzrinne H, Camarillo G, Johnston A, Peterson J, Sparks R, Handley H, Schooler E (2002) SIP: Session Initiation Protocol (RFC 3261), Internet Engineering Task Force Network Working Group

  21. Samanchuen T, Kiattisin S (2014) implementation and quality evaluation of video telephony using session initiation protocol. In Proc. of the 2014 Annual Summit and Conference on Asia-Pacific Signal and Information Processing Association (APSIPA), pp. 1–4

  22. Schneiderman H, Kanade T (1998) Probabilistic modeling of local appearance and spatial relationships for object recognition. Proc IEEE Conf Comput Vis Pattern Recogn 45–51

  23. Sirawongphatsara P, Wuttidittachotti P, Daengsi T (2015) Comparison of video telephony: a case study of LINE and Tango over 3G in Bangkok. In Proc. Of the 2015 International Conference on Information Networking (ICOIN) 205–209

  24. Smith JR, Jabri MA (2004) The 3G-324 M protocols for conversational video telephony. IEEE Multimedia 11(3):102–105

    Article  Google Scholar 

  25. Su GM, Lai YC, Kwasinski A, Wang H (2011) 3D video communications: challenges and opportunities. Int J Commun Syst 24(10):1261–1281

    Article  Google Scholar 

  26. Sung KK, Poggio T (1998) Example-based learning for view-based human face detection. IEEE Trans Pattern Anal Mach Intell 20(1):39–51

    Article  Google Scholar 

  27. Taragay O, Mikhail S, Vlad B, Supun S, Rakesh K (2015) Augmented reality binoculars. IEEE TVCG 21(5):611–623

    Google Scholar 

  28. Turk M, Pentland A (1991) Eigenfaces for recognition. J Cogn Neurosci 3(1):71–86

    Article  Google Scholar 

  29. Viola P, Jone MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154

    Article  Google Scholar 

  30. Wu-Chih H, Chao-Ho C, Deng-Yuan H, Yan-Ting Y (2012) Video object segmentation in rainy situations based on difference scheme with object structure and color analysis. J Visual Commun Image Recogn 23(2):303–312

    Article  Google Scholar 

  31. Yu Y-C (2013) Design of a mobile telephony system for social interaction. In Proc. of the 2013 I.E. and Internet of Things (iThings/CPSCom). IEEE Int Conf IEEE Cyber Phys Soc Comput 1006–1012

Download references

Acknowledgments

This paper was supported by Research Fund, Kumoh National Institute of Technology.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Young-Woong Ko.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jang, S.B., Kim, Y.G. & Ko, YW. Mobile video communication based on augmented reality. Multimed Tools Appl 76, 16893–16909 (2017). https://doi.org/10.1007/s11042-016-3627-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-016-3627-4

Keywords

Navigation