skip to main content
research-article

SmartShots: An Optimization Approach for Generating Videos with Data Visualizations Embedded

Published:04 March 2022Publication History
Skip Abstract Section

Abstract

Videos are well-received methods for storytellers to communicate various narratives. To further engage viewers, we introduce a novel visual medium where data visualizations are embedded into videos to present data insights. However, creating such data-driven videos requires professional video editing skills, data visualization knowledge, and even design talents. To ease the difficulty, we propose an optimization method and develop SmartShots, which facilitates the automatic integration of in-video visualizations. For its development, we first collaborated with experts from different backgrounds, including information visualization, design, and video production. Our discussions led to a design space that summarizes crucial design considerations along three dimensions: visualization, embedded layout, and rhythm. Based on that, we formulated an optimization problem that aims to address two challenges: (1) embedding visualizations while considering both contextual relevance and aesthetic principles and (2) generating videos by assembling multi-media materials. We show how SmartShots solves this optimization problem and demonstrate its usage in three cases. Finally, we report the results of semi-structured interviews with experts and amateur users on the usability of SmartShots.

REFERENCES

  1. [1] E-tailing Group. 2013. How Consumers Shop with Video: Based on a 4Q 2012 Research Study of 1000 Consumers. Technical Report.Google ScholarGoogle Scholar
  2. [2] GitHub. 2019. G2. Retrieved March 31, 2019 from https://github.com/antvis/g2.Google ScholarGoogle Scholar
  3. [3] OpenCV. 2019. OpenCV. Retrieved March 31, 2019 from https://opencv.org/.Google ScholarGoogle Scholar
  4. [4] React. 2019. React. Retrieved March 31, 2019 from https://reactjs.org/.Google ScholarGoogle Scholar
  5. [5] Abadi Martín, Barham Paul, Chen Jianmin, Chen Zhifeng, Davis Andy, Dean Jeffrey, Devin Matthieu, et al. 2016. TensorFlow: A system for large-scale machine learning. In Proceedings of the USENIX Symposium on Operating Systems Design and Implementation. 265283.Google ScholarGoogle Scholar
  6. [6] Amazon. 2005. Amazon Mechanical Turk. Retrieved March 31, 2019 from https://www.mturk.com/.Google ScholarGoogle Scholar
  7. [7] Amini Fereshteh, Riche Nathalie Henry, Lee Bongshin, Hurter Christophe, and Irani Pourang. 2015. Understanding data videos: Looking at narrative visualization through the cinematography lens. In Proceedings of ACM Conference on Human Factors in Computing Systems. 14591468.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Amini Fereshteh, Riche Nathalie Henry, Lee Bongshin, Leboe-McGowan Jason, and Irani Pourang. 2018. Hooked on data videos: Assessing the effect of animation and pictographs on viewer engagement. In Proceedings of the Working Conference on Advanced Visual Interfaces. 19.Google ScholarGoogle Scholar
  9. [9] Amini Fereshteh, Riche Nathalie Henry, Lee Bongshin, Monroy-Hernandez Andres, and Irani Pourang. 2017. Authoring data-driven videos with dataclips. IEEE Transactions on Visualization and Computer Graphics 23, 1 (2017), 501510.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Balinsky Helen Y., Wiley Anthony J., and Roberts Matthew C.. 2009. Aesthetic measure of alignment and regularity. In Proceedings of ACM Symposium on Document Engineering. 5665.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Barker Ronald C. and Schuler Chester L.. 1985. Video composition method and apparatus. US Patent 4,538,188.Google ScholarGoogle Scholar
  12. [12] Bierman Gavin, Abadi Martín, and Torgersen Mads. 2014. Understanding typescript. In Proceedings of the European Conference on Object-Oriented Programming. 257281.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Bostock Michael, Ogievetsky Vadim, and Heer Jeffrey. 2011. D\( ^3 \): Data-driven documents. IEEE Transactions on Visualization and Computer Graphics 17, 12 (2011), 23012309.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Brophy Jere. 2003. Using Video in Teacher Education. Emerald Group Publishing Limited.Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Chen Zhutian, Su Yijia, Wang Yifang, Wang Qianwen, Qu Huamin, and Wu Yingcai. 2020. MARVisT: Authoring glyph-based visualization in mobile augmented reality. IEEE Transactions on Visualization and Computer Graphics 26, 8 (2020), 26452658.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. [16] Ngo Chong-Wah, Ma Yu-Fei, and Zhang Hong-Jiang. 2003. Automatic video summarization by graph modeling. In Proceedings of IEEE Conference on Computer Vision. 104109.Google ScholarGoogle Scholar
  17. [17] Coelho D. and Mueller K.. 2020. Infomages: Embedding data into thematic images. Computer Graphics Forum 39, 3 (2020), 593606.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Cohen-Or Daniel, Sorkine Olga, Gal Ran, Leyvand Tommer, and Xu Ying-Qing. 2006. Color harmonization. ACM Transactions on Graphics 25, 3 (2006), 624630.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Cormen Thomas H., Leiserson Charles E., Rivest Ronald L., and Stein Clifford. 2009. Introduction to Algorithms (3rd ed.). MIT Press, Cambridge, MA.Google ScholarGoogle Scholar
  20. [20] Food and Nations Agriculture Organization of the United. 2018. The State of World’s Forests. Retrieved March 31, 2019 from http://www.fao.org/state-of-forests/en.Google ScholarGoogle Scholar
  21. [21] Ge T., Zhao Y., Lee B., Ren D., Chen B., and Wang Y.. 2020. Canis: A high-level language for data-driven chart animations. Computer Graphics Forum 39, 3 (2020), 607617.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Guo Jinlian, Mei Tao, Liu Falin, and Hua Xian-Sheng. 2009. AdOn: An intelligent overlay video advertising system. In Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval. 628629.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Harrabin Roger. 2019. China and India Help Make Planet Leafier. Retrieved March 31, 2019 from https://www.bbc.com/news/science-environment-47210849.Google ScholarGoogle Scholar
  24. [24] Harrower Mark and Brewer Cynthia A.. 2003. ColorBrewer.org: An online tool for selecting colour schemes for maps. Cartographic Journal 40, 1 (2003), 27–37.Google ScholarGoogle Scholar
  25. [25] Heer Jeffrey and Robertson George G.. 2007. Animated transitions in statistical data graphics. IEEE Transactions on Visualization and Computer Graphics 13, 6 (2007), 12401247.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Hegde Srinidhi, Maurya Jitender, Kalkar Aniruddha, and Hebbalaguppe Ramya. 2020. SmartOverlays: A visual saliency driven label placement for intelligent human-computer interfaces. In Proceedings of the IEEE Winter Conference on Applications of Computer Vision. 11211130.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Hu Yongtao, Kautz Jan, Yu Yizhou, and Wang Wenping. 2015. Speaker-following video subtitles. ACM Transactions on Multimedia Computing, Communications, and Applications 11, 2 (2015), 117.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Huang Jonathan, Rathod Vivek, Sun Chen, Zhu Menglong, Korattikara Anoop, Fathi Alireza, Fischer Ian, et al. 2017. Speed/accuracy trade-offs for modern convolutional object detectors. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 73107311.Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Kosara Robert and Mackinlay Jock. 2013. Storytelling: The next step for visualization. IEEE Computer 46, 5 (2013), 4450.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Kurzhals Kuno, Göbel Fabian, Angerbauer Katrin, Sedlmair Michael, and Raubal Martin. 2020. A view on the viewer: Gaze-adaptive captions for videos. In Proceedings of ACM Conference on Human Factors in Computing Systems. 112.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. [31] Lee Bongshin, Riche Nathalie Henry, Isenberg Petra, and Carpendale Sheelagh. 2015. More than telling a story: Transforming data into visually shared stories. IEEE Computer Graphics and Applications 35, 5 (2015), 8490.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Lidwell William, Holden Kritina, and Butler Jill. 2010. Proximity. In Universal Principles of Design. Rockport Publishers, Beverly, MA, 196–197.Google ScholarGoogle Scholar
  33. [33] Lok Simon, Feiner Steven, and Ngai Gary. 2004. Evaluation of visual balance for automated layout. In Proceedings of the ACM Conference on Intelligent User Interfaces. 101108.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Lu Junhua, Wang Jie, Ye Hui, Gu Yuhui, Ding Zhiyu, Xu Mingliang, and Chen Wei. 2020. Illustrating changes in time-series data with data video. IEEE Computer Graphics and Applications 40, 2 (2020), 1831.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. [35] McFee Brian, Raffel Colin, Liang Dawen, Ellis Daniel P. W., McVicar Matt, Battenberg Eric, and Nieto Oriol. 2015. librosa: Audio and music signal analysis in Python. In Proceedings of the Python in Science Conference. 1825.Google ScholarGoogle Scholar
  36. [36] McNamara Ann, Boyd Katherine, George Joanne, Jones Weston, Oh Somyung, and Suther Annie. 2019. Information placement in virtual reality. In Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces. IEEE, Los Alamitos, CA, 17651769.Google ScholarGoogle Scholar
  37. [37] Mei Tao, Hua Xian-Sheng, Yang Linjun, and Li Shipeng. 2007. VideoSense: Towards effective online video advertising. In Proceedings of the ACM International Conference on Multimedia. 10751084.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. [38] Munzner Tamara. 2014. Visualization Analysis and Design. CRC Press, Boca Raton, FL.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] O’Donovan Peter, Agarwala Aseem, and Hertzmann Aaron. 2014. Learning layouts for single-page graphic designs. IEEE Transactions on Visualization and Computer Graphics 20, 8 (2014), 12001213.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] O’Donovan Peter, Agarwala Aseem, and Hertzmann Aaron. 2015. DesignScape: Design with interactive layout suggestions. In Proceedings of the ACM Conference on Human Factors in Computing Systems. 12211224.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Pavlik John. 2000. The impact of technology on journalism. Journalism Studies 1, 2 (2000), 229237.Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Purvis Lisa, Harrington Steven, O’Sullivan Barry, and Freuder Eugene C.. 2003. Creating personalized documents: An optimization approach. In Proceedings of the ACM Symposium on Document Engineering. 6877.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Ren Shaoqing, He Kaiming, Girshick Ross, and Sun Jian. 2017. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 6 (2017), 11371149.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. [44] Rosling Hans. 2009. Gapminder. Gapminder Foundation. Retrieved March 31, 2019 from http://www.gapminder.org.Google ScholarGoogle Scholar
  45. [45] Segel Edward and Heer Jeffrey. 2010. Narrative visualization: Telling stories with data. IEEE Transactions on Visualization and Computer Graphics 16, 6 (2010), 11391148.Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. [46] Tang Tan, Rubab Sadia, Lai Jiewen, Cui Weiwei, Yu Lingyun, and Wu Yingcai. 2019. iStoryline: Effective convergence to hand-drawn storylines. IEEE Transactions on Visualization and Computer Graphics 25, 1 (2019), 769778.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. [47] Tang Tan, Tang Junxiu, Hong Jiayi, Yu Lingyun, Ren Peiran, and Wu Yingcai. 2020. Design guidelines for augmenting short-form videos using animated data visualizations. Journal of Visualization 23 (2020), 707720.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. [48] Tang Tan, Tang Junxiu, Lai Jiewen, Ying Lu, Ren Peiran, Yu Lingyun, and Wu Yingcai. 2020. SmartShots: Enabling automatic generation of videos with data visualizations embedded. In Proceedings of the 28th ACM International Conference on Multimedia. 45094511.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. [49] Mei Tao, Hua Xian-Sheng, and Li Shipeng. 2009. VideoSense: A contextual in-video advertising system. IEEE Transactions on Circuits and Systems for Video Technology 19, 12 (2009), 18661879.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. [50] Todorovic Dejan. 2008. Gestalt principles. Scholarpedia 3, 12 (2008), 5345.Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Tokumaru Masataka, Muranaka Noriaki, and Imanishi Shigeru. 2002. Color design support system considering color harmony. In Proceedings of the IEEE Conference on Fuzzy Systems. 378383.Google ScholarGoogle ScholarCross RefCross Ref
  52. [52] Krevelen D. Van and Poelman R.. 2010. A survey of augmented reality: Technologies, applications, and limitations. International Journal of Virtual Reality 9, 2 (2010), 1.Google ScholarGoogle ScholarCross RefCross Ref
  53. [53] Wang Jinjun, Chng Engsiong, and Xu Changsheng. 2006. Fully and semi-automatic music sports video composition. In Proceedings of the IEEE Conference on Multimedia and Expo. 18971900.Google ScholarGoogle ScholarCross RefCross Ref
  54. [54] Wang Jinjun, Xu Changsheng, Chng Engsiong, Duan Lingyu, Wan Kongwah, and Tian Qi. 2005. Automatic generation of personalized music sports video. In Proceedings of the ACM Conference on Multimedia. 735744.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. [55] Wang Yunhai, Chen Xin, Ge Tong, Bao Chen, Sedlmair Michael, Fu Chi-Wing, Deussen Oliver, and Chen Baoquan. 2019. Optimizing color assignment for perception of class separability in multiclass scatterplots. IEEE Transactions on Visualization and Computer Graphics 25, 1 (2019), 820829.Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. [56] Wang Yunhai, Han Fubo, Zhu Lifeng, Deussen Oliver, and Chen Baoquan. 2018. Line graph or scatter plot? Automatic selection of methods for visualizing trends in time series. IEEE Transactions on Visualization and Computer Graphics 24, 2 (2018), 11411154.Google ScholarGoogle ScholarCross RefCross Ref
  57. [57] Wikipedia. 2019. Vlog. Retrieved March 31, 2019 from https://en.wikipedia.org/wiki/Vlog.Google ScholarGoogle Scholar
  58. [58] Willett Wesley, Jansen Yvonne, and Dragicevic Pierre. 2017. Embedded data representations. IEEE Transactions on Visualization and Computer Graphics 23, 1 (2017), 461470.Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. [59] Yadati Karthik, Katti Harish, and Kankanhalli Mohan. 2014. CAVVA: Computational affective video-in-video advertising. IEEE Transactions on Multimedia 16, 1 (2014), 1523.Google ScholarGoogle ScholarCross RefCross Ref
  60. [60] Yang Xuyong, Mei Tao, Xu Ying-Qing, Rui Yong, and Li Shipeng. 2016. Automatic generation of visual-textual presentation layout. ACM Transactions on Multimedia Computing, Communications, and Applications 12, 2 (2016), 122.Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. [61] Zhang Jiajing, Yu Jinhui, Zhang Kang, Zheng Xianjun Sam, and Zhang Junsong. 2017. Computational aesthetic evaluation of logos. ACM Transactions on Applied Perception 14, 3 (2017), 121.Google ScholarGoogle ScholarDigital LibraryDigital Library
  62. [62] Zhang Jiayi Eris, Sultanum Nicole, Bezerianos Anastasia, and Chevalier Fanny. 2020. DataQuilt: Extracting visual elements from images to craft pictorial visualizations. In Proceedings of the ACM Conference on Human Factors in Computing Systems. 113.Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. [63] Zhang Yunke, Hu Kangkang, Ren Peiran, Yang Changyuan, Xu Weiwei, and Hua Xian-Sheng. 2017. Layout style modeling for automating banner design. In Proceedings of the ACM Conference on Multimedia Thematic Workshops. 451459.Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. [64] Zhao Ying, Jiang Haojin, Chen Qi’an, Qin Yaqi, Wu Yitao, Liu Shixia, Zhou Zhiguang, Xia Jiazhi, and Zhou Fangfang. 2021. Preserving minority structures in graph sampling. IEEE Transactions on Visualization and Computer Graphics 27 (2021), 16981708.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. SmartShots: An Optimization Approach for Generating Videos with Data Visualizations Embedded

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Interactive Intelligent Systems
        ACM Transactions on Interactive Intelligent Systems  Volume 12, Issue 1
        March 2022
        206 pages
        ISSN:2160-6455
        EISSN:2160-6463
        DOI:10.1145/3505196
        Issue’s Table of Contents

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 4 March 2022
        • Accepted: 1 August 2021
        • Revised: 1 May 2021
        • Received: 1 August 2020
        Published in tiis Volume 12, Issue 1

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      View Full Text

      HTML Format

      View this article in HTML Format .

      View HTML Format