skip to main content
10.1145/3208159.3208170acmotherconferencesArticle/Chapter ViewAbstractPublication PagescgiConference Proceedingsconference-collections
research-article

Voxelized Facial Reconstruction Using Deep Neural Network

Published: 11 June 2018 Publication History

Abstract

This paper presents an approach to predicting variation tendency of human faces with regard to cranium changes based on deep learning. Our work focuses on generating individual customized facial models with high plausibility. Inspired by the performance of encoder-decoder convolutional neural network, the core trainable predicting engine of our learning network is designed for three-dimension voxelized data representation as the encoder-decoder structure and the encoder part is similar to the 7 layers of VGG16 network. To take full consideration of the cranium changes and features of original human face, a novel formation of channeled volumetric data structure is presented, and also the corresponding sub and up-sampling strategies for volume data. Our encoder-decoder neural network consumes discrete 3-channel volume data and generates 1-channel volume data as predicted post-variation human face. This framework is quantified with clinical dataset and it shows that its' performance improves in comparison with the state-of-the-art technologies.

References

[1]
Vijay Badrinarayanan, Alex Kendall, and Roberto Cipolla. 2017. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 12 (2017), 2481--2495.
[2]
Pengfei Dou, Shishir K. Shah, and Ioannis A. Kakadiaris. 2017. End-to-End 3D Face Reconstruction with Deep Neural Networks. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, Piscataway, NJ, USA, 1503--1512.
[3]
Rohit Girdhar, David F. Fouhey, Mikel Rodriguez, and Abhinav Gupta. 2016. Learning a Predictable and Generative Vector Representation for Objects. In European Conference on Computer Vision. Springer International Publishing, Cham, 484--499.
[4]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. In IEEE International Conference on Computer Vision. IEEE, Piscataway, NJ, USA, 1026--1034.
[5]
Yongli Hu, Fuqing Duan, Baocai Yin, Mingquan Zhou, Yanfeng Sun, Zhongke Wu, and Guohua Geng. 2013. A Hierarchical Dense Deformable Model for 3D Face Reconstruction from Skull. Multimedia Tools and Applications 64, 2 (2013), 345--364.
[6]
Biao Leng, Xiangyang Zhang, Ming Yao, and Zhang Xiong. 2015. A 3D Model Recognition Mechanism Based on Deep Boltzmann Machines. Neurocomputing 151 (2015), 593--602.
[7]
Yangyan Li, Soeren Pirk, Hao Su, Charles R Qi, and Leonidas J Guibas. 2016. FPNN: Field Probing Neural Networks for 3D Data. In Advances in Neural Information Processing Systems 29, D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett (Eds.). Curran Associates, Inc., 307--315.
[8]
Charles R. Qi, Hao Su, Kaichun Mo, and Leonidas J. Guibas. 2017. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, Piscataway, NJ, USA.
[9]
Charles Ruizhongtai Qi, Hao Su, Matthias Nießner, Angela Dai, Mengyuan Yan, and Leonidas J. Guibas. 2016. Volumetric and Multi-View CNNs for Object Classification on 3D Data. CoRR abs/1604.03265 (2016).
[10]
Gernot Riegler, Ali Osman Ulusoy, and Andreas Geiger. 2017. OctNet: Learning Deep 3D Representations at High Resolutions. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11]
Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR abs/1409.1556 (2014). arXiv:1409.1556 http://arxiv.org/abs/1409.1556
[12]
Hang Su, Subhransu Maji, Evangelos Kalogerakis, and Erik Learned-Miller. 2015. Multi-View Convolutional Neural Networks for 3D Shape Recognition. In IEEE International Conference on Computer Vision. IEEE Computer Society, Washington, DC, USA, 945--953.
[13]
P. Vanezis, M. Vanezis, G. McCombe, and T. Niblett. 2000. Facial reconstruction using 3-D computer graphics. Forensic Science International 108, 2 (2000), 81--95.
[14]
Zhirong Wu, Shuran Song, Aditya Khosla, Xiaoou Tang, and Jianxiong Xiao. 2014. 3D ShapeNets for 2.5D Object Recognition and Next-Best-View Prediction. CoRR abs/1406.5670 (2014).

Cited By

View all
  • (2024)SHC: soft-hard correspondences framework for simplifying point cloud registrationEURASIP Journal on Advances in Signal Processing10.1186/s13634-023-01104-02024:1Online publication date: 17-Jan-2024
  • (2024)Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a surveyThe Visual Computer10.1007/s00371-024-03641-7Online publication date: 26-Sep-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CGI 2018: Proceedings of Computer Graphics International 2018
June 2018
284 pages
ISBN:9781450364010
DOI:10.1145/3208159
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. 3D Reconstruction
  2. Deep Learning
  3. Facial Reconstruction

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Interdisciplinary Program of Shanghai Jiao Tong University
  • National Natural Science Foundation of China
  • National High-tech R&D Program of China (863 Program)
  • Research Grants Council of Hong Kong
  • Key Program for International S&T Cooperation Project of China

Conference

CGI 2018
CGI 2018: Computer Graphics International 2018
June 11 - 14, 2018
Island, Bintan, Indonesia

Acceptance Rates

CGI 2018 Paper Acceptance Rate 35 of 159 submissions, 22%;
Overall Acceptance Rate 35 of 159 submissions, 22%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)SHC: soft-hard correspondences framework for simplifying point cloud registrationEURASIP Journal on Advances in Signal Processing10.1186/s13634-023-01104-02024:1Online publication date: 17-Jan-2024
  • (2024)Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a surveyThe Visual Computer10.1007/s00371-024-03641-7Online publication date: 26-Sep-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media