Abstract
In computer graphics and multimedia, automatically synthesizing a new image sequence from an existing one to create a realistic video or animation of a human activity is a research challenge. Traditionally, such animation and similar visual media content are created manually, which is tedious. Recent advances in deep learning have made promising progress toward automating this type of media creation. This work is motivated by the idea of synthesizing a temporally coherent image sequence (e.g., a video) of a person performing an activity from a video or image set of a different person performing a similar activity. To achieve this, our approach uses the cycle-consistent adversarial network (CycleGAN). We present a new approach for learning to transfer a human activity from a source domain to a target domain without any complicated pose detection or extraction method. Our objective is to learn a mapping between two sequences of consecutive images drawn from two domains representing two different activities, and to use that mapping to transfer the activity from one domain to the other, synthesizing an entirely new sequence of consecutive images that can be combined into a video of a new human activity. We also present and analyze qualitative results generated by our method.
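For context, a minimal sketch of the standard CycleGAN objective (Zhu et al.) that underlies this approach: G : X -> Y and F : Y -> X are the two generators between the source and target domains, D_X and D_Y are the discriminators, and lambda weights the cycle-consistency term. This notation follows the cited CycleGAN paper, not this abstract.

L_cyc(G, F) = E_{x ~ p_data(x)} [ || F(G(x)) - x ||_1 ] + E_{y ~ p_data(y)} [ || G(F(y)) - y ||_1 ]

L(G, F, D_X, D_Y) = L_GAN(G, D_Y, X, Y) + L_GAN(F, D_X, Y, X) + lambda * L_cyc(G, F)

The cycle term penalizes a translated image that cannot be mapped back to its original, which is what allows training without paired examples from the two activity domains.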
All authors contributed equally to this work.
References
Shutterstock: Stock images, photos, vectors, video, and music. https://www.shutterstock.com/
Chan, C., Ginosar, S., Zhou, T., Efros, A.A.: Everybody dance now. arXiv preprint arXiv:1808.07371 (2018)
Goodfellow, I., et al.: Generative adversarial nets. In: Advances in Neural Information Processing Systems, pp. 2672–2680 (2014)
Liu, J., Kuipers, B., Savarese, S.: Recognizing human actions by attributes. In: CVPR 2011, pp. 3337–3344. IEEE (2011)
Villegas, R., Yang, J., Ceylan, D., Lee, H.: Neural kinematic networks for unsupervised motion retargetting. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8639–8648 (2018)
Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2223–2232 (2017)
Cite this paper
Khan, F.H., de Silva, A., Yetukuri, J., Norouzi, N. (2019). Sequential Image Synthesis for Human Activity Video Generation. In: Karray, F., Campilho, A., Yu, A. (eds.) Image Analysis and Recognition. ICIAR 2019. Lecture Notes in Computer Science, vol. 11663. Springer, Cham. https://doi.org/10.1007/978-3-030-27272-2_11