skip to main content
10.1145/3170427.3188596acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
abstract

ParaPara: Synthesizing Pseudo-2.5D Content from Monocular Videos for Mixed Reality

Published:20 April 2018Publication History

ABSTRACT

In this work, we propose a simple yet effective method for synthesizing a pseudo-2.5D scene from a monocular video for mixed reality (MR) content. We also propose the ParaPara system, which applies this method. Most previously proposed systems convert real-world objects into 3D graphic models using expensive equipment; this is a barrier for individuals or small groups to create MR content. ParaPara uses four points in an image and their manually estimated distances to synthesize MR content by applying deep neural networks and simple image processing techniques to monocular videos. The synthesized content can be observed through an MR head-mounted display, and spatial mapping and spatial sound are applied to support the interaction between the real-world and MR content. The proposed system is expected to reduce the entry barriers to create MR content because it can create such content from a large number of previously captured videos.

Skip Supplemental Material Section

Supplemental Material

lbw1459-file3.mp4

mp4

8.2 MB

References

  1. T. Kanade, P. Rander, and P. J. Narayanan. 1997. Virtualized reality: constructing virtual worlds from real scenes. IEEE MultiMedia 4, 1 (Jan. 1997), 34-- 47. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Canon Global. Canon announces development of the Free Viewpoint Video System virtual camera system that creates an immersive viewing experience. (Sep. 2017). Retrieved Jan 10, 2018 from http://global.canon/en/news/2017/20170921.htmlGoogle ScholarGoogle Scholar
  3. Youichi Horry, Ken-Ichi Anjyo, and Kiyoshi Arai. 1997. Tour into the Picture: Using a Spidery Mesh Interface to Make Animation from a Single Image. In Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH '97). ACM Press/Addison-Wesley Publishing Co., New York, NY, USA, 225--232. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Itaru Kitahara, Yuichi Ohta, Hideo Saito, Shinji Akimichi, Tooru Ono, and Takeo Kanade. 2002. Recording of multiple videos in a large-scale space for large-scale virtualized reality. Journal of the Institute of Image Information and Television Engineers 56, 8 (Aug. 2002), 1328--1333.Google ScholarGoogle ScholarCross RefCross Ref
  5. Y. Kameda, T. Koyama, Y. Mukaigawa, F. Yoshikawa, and Y. Ohta. 2004. Free viewpoint browsing of live soccer games. In 2004 IEEE International Conference on Multimedia and Expo (ICME '04). 747--750.Google ScholarGoogle Scholar
  6. P. Goorts, S. Maesen, M. Dumont, S. Rogmans, and P. Bekaert. 2014. Free Viewpoint Video for Soccer using Histogram-Based Validity Maps in Plane Sweeping. In Proceedings of the Ninth International Conference on Computer Vision Theory and Applications (VISAPP '14). 378--386.Google ScholarGoogle Scholar
  7. N. Inamoto and H. Saito. 2005. Free viewpoint video synthesis and presentation from multiple sporting videos. In 2005 IEEE International Conference on Multimedia and Expo (ICME '05). 40--49.Google ScholarGoogle Scholar
  8. Alvaro Collet, Ming Chuang, Pat Sweeney, Don Gillett, Dennis Evseev, David Calabrese, Hugues Hoppe, Adam Kirk, and Steve Sullivan. 2015. Highquality Streamable Free-viewpoint Video. ACM Trans. Graph. 34, 4, Article 69 (July. 2015), 13 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Sergio Orts-Escolano et al. 2016. Holoportation: Virtual 3D Teleportation in Real-time. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology (UIST '16). ACM, New York, NY, USA, 741--754. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. N. Dalal and B. Triggs. 2005. Histograms of oriented gradients for human detection. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '05). 886-- 893. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Z. Cao, T. Simon, S. E. Wei, and Y. Sheikh. 2017. Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR '17). 1302--1310.Google ScholarGoogle Scholar
  12. Zoran Zivkovic and Ferdinand van der Heijden. 2006. Efficient adaptive density estimation per image pixel for the task of background subtraction. Pattern Recognition Letters 27, 7 (May. 2006), 773--780. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Tomas Akenine-Möller, Eric Haines, and Naty Hoffman. 2008. Real-Time Rendering (3rd. ed.). A. K. Peters, Ltd., Natick, MA, USA.Google ScholarGoogle Scholar
  14. Michael S. Landy and J. Anthony Movshon (Eds.). 1991. Computational Models of Visual Processing. Massachusetts Institute of Technology Press, Cambridge, MA, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. ParaPara: Synthesizing Pseudo-2.5D Content from Monocular Videos for Mixed Reality

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CHI EA '18: Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems
      April 2018
      3155 pages
      ISBN:9781450356213
      DOI:10.1145/3170427

      Copyright © 2018 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 April 2018

      Check for updates

      Qualifiers

      • abstract

      Acceptance Rates

      CHI EA '18 Paper Acceptance Rate1,208of3,955submissions,31%Overall Acceptance Rate6,164of23,696submissions,26%

      Upcoming Conference

      CHI '24
      CHI Conference on Human Factors in Computing Systems
      May 11 - 16, 2024
      Honolulu , HI , USA

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader